Abstract
Students may not express their creativity uniformly, across all settings such as home, school, the natural environment (Abdulla Alabbasi, Runco, et al., 2024; Runco et al., 2023) and under all conditions (Abraham, 2022; Amabile, 1996). Creativity is sensitive to expected evaluations, for example, and likely to vary when the individual is alone, in a small group, or in a large group (Amabile, 1990; Rubenson & Runco, 1995). Quite a bit of research has demonstrated that certain conditions can be created to support creative thinking by students (Acar et al., 2020; Runco, 1986; Said-Metwaly et al., 2020; Wallach & Kogan, 1965). Even the physical characteristics of the setting can have an impact on divergent thinking (DT) and creativity test performances (Deng et al., 2016; Lee & Lee, 2023; McCoy & Evans, 2002). The present study focused on two key factors that may impact DT performance: classroom setting and time of the day. The rationale for focusing on these two factors instead of numerous other possible conditions (e.g., lighting, colors, window view, presence of plants, daylight; see Lee & Lee, 2023 for a review on the relationship between physical environment and creativity) was that the effects of such factors were investigated before (Küller et al., 2006; Lee & Lee, 2023; Lucius & Damberg, 2024). The impact of classroom setting and time of the day was empirically examined in the current study, which also considered the influence of giftedness.
Classroom Setting
Extant research showed that immediate setting may have an influence on creativity and DT performance. Batey et al. (2021) showed that the presence of a large nature poster where testing was conducted resulted in a greater DT performance, evaluated by raters who were blind to the study design than in a setting without such a poster even when openness personality trait and mood were controlled. In another study by Ichimura (2023), the perceived environment was manipulated by the use of virtual reality (VR) to examine the differences in performance on the Alternative Uses Test (AUT). In Experiment 1, participants viewing a 360° video with immersive head-mounted displays scored higher on the AUT than those viewing the same video on a computer screen or viewing a real-world laboratory. In Experiment 2, participants viewing a 360° video of a visually open coast (i.e., Okinawan coast) scored higher on the AUT than those viewing a visually closed laboratory. The findings suggest that visually open VR environments viewed through head-mounted displays promote DT. More broadly, these studies show that changes in the immediate environment can influence DT performance. However, they did not focus on classroom settings. There is also research that compared the influence of different settings on children’s DT performance. In one such study, Guegan et al. (2020) exposed 96 children (grades 1 through 5) to one of three virtual settings (a replica of the headmistress’s office, a replica of their schoolyard, and a dreamlike environment) while completing the DT tests. They found that children generated more original ideas in the dreamlike and playful environments compared to the headmistress’s office. This finding of the benefit of a playful environment is consistent with earlier research showing that DT was more likely in “game-like” rather than “test-like” assessment conditions (Paek et al., 2021; Wallach & Kogan, 1965).
There is also research that considered real classroom or educational settings. Dudek et al. (1993) examined the impact of “classroom atmosphere” on students’ performances on the Torrance Test of Creative Thinking (TTCT). The classroom atmosphere had a significant impact. A related line of research shows the impact of classroom atmosphere on students’ creativity is the difference between students’ performance on DT in traditional (formal schools) vs non-traditional (open schools) (Day, 1974; Lewis, 1980; Thomas & Berk, 1981). Lewis (1980) reported that kindergarten children from open classrooms scored higher in Wallach and Kogan’s (1965) verbal and figural DT tests than their peers in non-open classrooms. The same conclusion was reached by Thomas and Berk (1981), where second-grade children scored higher in an intermediate formal-informal continuum on the figural form of the TTCT.
More recent studies identified specific elements of the school learning environment associated with students’ creativity. These include the physical environment, sense of place, learner engagement, perceived naturalness, and the classroom’s learning climate (Blomberg & Kallio, 2022; Piya-Amornphan et al., 2020; Richardson & Mishra, 2018; Yeh et al., 2022; Zhang et al., 2023). In their scoping review on the association between the physical environment and creativity, which included 33 articles, Lee and Lee (2023) listed 15 physical attributes (e.g., light, furniture, indoor plants, and sounds) and seven spatial types (e.g., open spaces, mixed-use spaces, and production labs) that can support creativity. Related to the current study, Shah and Gustafsson (2021) compared the performances of 7 to 11-year-old students when they had received a verbal DT test (i.e., AUT) and a figural test, namely the Test for Creative Thinking–Drawing Production (TCT-DP) in either an art room or a formal classroom. They reported that verbal originality and total TCT-DP scores were higher in the art room than in the classroom. Based on the abovementioned findings, we hypothesized that DT performance would differ based on classroom setting (i.e., art room vs. regular classroom).
Time of the Day
Time of day was examined in the present research because it may have an impact on creative thinking. This follows from Breslin’s (2019) report that the time of the day influenced group creativity exercises. Across two studies with 36 and 18 groups, respectively, where performance was compared between morning and afternoon, Breslin (2019) found midday to be the optimal time for fluency. In contrast, originality was not influenced by the time of the day. Aiello (2022) found similar results: DT performance was higher around lunchtime and afternoon than morning for both originality and effectiveness. Explanations for the effect of time of day typically focus on cognitive or mental fatigue (Boksem et al., 2005; Goldstein et al., 2007; Mullette-Gillman et al., 2015), and sometimes attention as it may suffer after students have worked for several hours, doing their academic work (Lufi et al., 2011; Sievertsen et al., 2016). There are practical implications, if time of day is indeed associated with performance on tests of creative potential. If performances are in fact better during the first half of the school day, for example, and if there is an interest in determining students’ maximal performances, examiners could schedule accordingly.
The Current Study
The present research was modeled after Shah and Gustafsson’s (2021) study. In addition to examining the setting (art room vs. regular classroom), this investigation compared (a) gifted and nongifted students and (b) early in the school day vs. late in the school day in terms of the impact of setting.
Shah and Gustafsson found that verbal originality and total TCT-DP scores were higher in the art room than in the classroom. These are interesting findings, but there are several limitations because they collected data in a British Catholic Primary School, for example, and their sample size was only moderate. There were 111 students placed in one of two groups (art classroom or regular classroom). Although Shah and Gustafsson used both Urban’s TCT-DP and Guilford’s AUT tests, the latter only contained one item (a ping pong ball). Second, Shah and Gustafsson (2021) study did not use equal sample sizes as this is one of the univariate analysis of variance (ANOVA) assumptions. Related limitations include (a) the use of a univariate ANOVA when the observations are not independent (the same measures were administered once in the art room and then in the classroom) and (b) using univariate ANOVA when there is more than one independent variable. This investigation considered an equal sample size in each condition: (a) art room vs. regular classroom, and (b) first vs. second half of the school day (see Table 1 for the study design). The main four advantages of the current study are: (a) use of repeated measure ANOVA rather than separate univariate ANOVAs, (b) controlling for fluency confound, (c) use of two tasks from each test (i.e., the verbal and the figural DT tests), and (d) scoring DT for figural fluency, flexibility, and originality rather than relying on a composite figural test score (i.e., TCT-DP). Aside from these improvements, replication studies are very informative, especially when conducted in different cultures (Makel & Plucker, 2014).
A Description of the Study Design.
Research Hypotheses
The logic for (b) is much like that used by Abdulla Alabbasi, Runco, et al. (2024) in their research comparing creativity at home and school. They found that nongifted students performed better when they were at home than when they were at school. Gifted students, on the other hand, performed more equivalently at home and at school. In a different study, Rayneri et al. (2006) investigated the relationship between classroom environment, learning styles, and academic achievement. They found that the classroom environment did not influence gifted students’ academic performance even when it was compatible with their learning styles.
Method
Participants
The current sample consisted of 160 female students (80 gifted and 80 nongifted) in grades 10 to 12. It was not possible to collect data from male students since education is segregated in Saudi Arabia, and the author who collected the data is a female. The mean age for the total sample was 15.84 ( A student who has extraordinary aptitudes and abilities or outstanding performance from the rest of their peers in one or more of the areas valued by society, especially in the areas of mental excellence, educational achievement, creativity and innovation, and special skills and abilities, who are selected according to the specific scientific bases and standards used for this purpose (Alfaiz et al., 2022, p. 3).
The participants (i.e., gifted and nongifted) were randomly selected from one of the high schools in the Eastern region of Saudi Arabia. According to the National Center for Assessment in Saudi Arabia, the first step in the identification process for selecting gifted students is a self or a teacher nomination. The following assessments, which were developed and normed for Saudi students, are administered in the final selection of gifted learners: (a) a mental flexibility test, (b) a scientific and mechanical reasoning test, (c) a mathematical and spatial reasoning test, and (d) a linguistic reasoning and reading comprehension test. Those who score at or above the 95th percentile in at least two tests (and above the 90th percentile in the third) are being selected for the gifted program (for more details on the identification process in Saudi Arabia, see Abdulla Alabbasi, Acar, et al., 2024; Alfaiz et al., 2022). Both gifted and nongifted students in the present research were from the same school. Gifted students participated in a pull-out program that focuses on three areas: (a) scientific research, (b) an enrichment program, and (c) a counseling program.
Design and Procedure
After (a) obtaining the approval for conducting the current study from the Ministry of Education in Saudi Arabia and (b) receiving the consent forms from the students and their parents, participants were randomly divided into four equal groups: (a) Group 1, consisting of 40 gifted students, took two DT tasks in the classroom in the first half of the school day and two more DT tasks in the art room in the first half of the school day (on another day), (b) Group 2, consisting of 40 nongifted students, took two DT tasks in the classroom in the first half of the school day and two more DT tasks in the art room in the first half of the school day (on another day), (c) group 3, consisting of 40 gifted students two took two DT tasks in the classroom in the second half of the school day and two more DT tasks in the art room in the second half of the school day (on another day), and (d) Group 4, consisting of 40 nongifted students two took two DT tasks in the classroom in the second half of the school day and two more DT tasks in the art room in the second half of the school day (on another day; see Table 1).
All DT tasks were administered under a game-like condition with explicit instructions on being fluent and creative. The first half of the school day was sampled by collected data in the second class (8:15 a.m.–9:00 a.m.). Similarly, the second half of the school day was represented by collecting data during the fifth class (11:35 a.m.–12:20 p.m.). This structured approach ensured a consistent testing environment for all participants.
The third author collected the data with the assistance of two classroom teachers. She visited the school twice to familiarize herself with the study participants. After the two introductory visits, she administered the DT tasks. Each group administered the DT tasks in two different rooms (i.e., the regular classroom and the art room; see Figures 1 and 2) to ensure that there was enough space between study participants. The capacity of each class is around 35 seats and tables. This procedure (i.e., dividing each group) ensured that students did not copy or share their responses. In all sessions, two teachers assisted the third author. Before starting the session, the third author asked all students if they had any questions about the tasks. Moreover, she asked them to read the instructions for each task carefully. The administration time was calculated using a stopwatch. The mean time for completing each session ranged between 18 and 36 min. Students were asked to raise their hands after completing the two tasks.

Regular classroom.

Regular art room.
Instruments
Alternative Uses Test (AUT)
The AUT was administered to assess participants’ verbal fluency, flexibility, and originality. Two items of the AUT were administered: (a) the alternative uses for a spoon and (b) the alternative uses for a toothbrush. Previous research that used the AUT in Saudi Arabia, Bahrain, Kuwait, and the United Arab Emirates (i.e., Gulf Cooperative Council; GCC) showed a good reliability coefficient (Cronbach’s Alpha) ranging from .61 to .93 for fluency, .84 to .85 for flexibility, and .61 to .89 for originality (Abdulla Alabbasi, Reiter-Palmon et al., 2021; Abdulla Alabbasi, Runco, et al., 2021; Acar et al., 2019; AlSaleh et al., 2021). In the current study, an interclass correlation between two raters indicated good reliability for fluency (.91), flexibility (.80), and originality (.87). For fluency, a score of 1 was given for each response related to the stimuli that was not repeated. Regarding flexibility, a score of 1 was given for each shift/move from one response category to another. Two authors created a list of categories for the two verbal and the two figural tasks. For the spoon task, there were 11 different categories for flexibility in addition to the “others” category, and for the toothbrush task, there were nine categories for flexibility in addition to “others.” Finally, originality was scored based on statistical infrequency with a 1% cutoff. Each participant was awarded 1 point if the response was not above the 1% cutoff. The participants generated 164 ideas for the spoon task and 96 different ideas for the toothbrush task. The verbatim instructions were as follows: Now, in this game, I am going to name an object—any kind of object, like a light bulb or the floor—and it will be your job to tell me many, different, and unique ways that the object could be used. Any object can be used in a lot of different ways. For example, think about string. What are some of the ways you can think of that you might use string? Let’s begin now. And remember, think of all the different and original ways you could use the object that you will see on the next page (Wallach & Kogan, 1965, p. 31).
Figural Divergent Thinking Test
The authors used two figural DT tasks from the Runco Creativity Assessment Battery (r-CAB; http://creativitytestingservice.com) that have been used extensively in previous research: (a) the Lines task and (b) the Squires task. Those figural tasks resemble Wallach and Kogan’s (1965) Line Meaning Test. Again, previous research that used the figural DT in Saudi Arabia and other GCC countries showed a good reliability coefficient ranging between .86 to .93 for fluency, .81 to .85 for flexibility, and .78 to .89 for originality (Abdulla Alabbasi, Reiter-Palmon et al., 2021; Abdulla Alabbasi, Runco, et al., 2021; Acar et al., 2019; AlSaleh et al., 2021). In the current study, an interclass correlation between two raters indicated good reliability for fluency (.93), flexibility (.79), and originality (.88). These tasks do not require participants to draw; they ask them to see the stimuli and tell everything it might be. Like the AUT test, the figural DT was scored for fluency, flexibility, and originality using the same scoring method. For the first figural task (Lines), there were 15 categories for flexibility in addition to “others,” and the same number of categories were for the second figural task (Squires). Participants generated 299 different ideas for the Lines task and 330 different ideas for the Squires tasks. The verbatim instructions for the figural DT were as follows: Look at the figure below. What do you see? List as many things as possible that this figure might be or represent. This is NOT a test. Think of this as a game and have fun with it! The more ideas you list, the better.
Results
Table 2 presents descriptive statistics for the DT performance by giftedness and setting, while Table 3 shows descriptive statistics as a function of giftedness status and time of the day in the classroom and art room settings.
Divergent Thinking Performance by Giftedness and Setting.
Descriptive Statistics by Giftedness and Time of the Day in Classroom and Art Room Settings.
Analyses focused on two main research questions. The first research question focused on the DT performance as it varied as a function of setting (art room vs regular classroom). The second examined the association of test performance as a function of time of the school day.
DT Performance in Art Rooms Versus Classroom
DT scores collected while students were in the art room were compared to those collected when students were in formal classrooms. Giftedness status was included in the model as an independent variable. A repeated measures ANOVA with two independent variables (setting and giftedness status) showed students performed higher in

Differences in DT figural fluency performance between classroom and art room.
Analyses with verbal fluency and originality showed a similar pattern to those of figural fluency and originality.

Differences in DT figural flexibility performance between classroom and art room.
Finally, the mean scores in Table 2 showed a pattern: students, in general, scored higher in the art room than in the classroom, and when fluency was not controlled, the magnitude of the effect sizes for the various DT indices were medium to large (.31 <
Effect Sizes (Cohen’s
DT Performance by Time of the School Day
DT performance was then examined as it varied by time of day. As seen in Table 5, time of the day was a significant factor only in analyses that used DT verbal originality. Performance was higher in the second half (
Descriptive Statistics of Divergent Thinking Performance.

DT performance in verbal originality by time of the school day.
A repeated measures ANOVA analysis with time also included gifted status. There was no significant interaction of giftedness and time in DT verbal or figural tasks. There was no significant difference by giftedness in figural DT tasks, either. In verbal tasks, gifted students had a higher score on verbal fluency (
Discussion
There is evidence that different environmental dimensions can, in fact, affect students’ creativity (Dul, 2019; Plucker et al., 2017; Runco et al., 2023; Taylor, 1975). In the present research, several repeated measures ANOVAs, using both verbal and figural DT scores, indicated that performance was higher in the art room than in the classroom. Further, the difference in verbal flexibility performance between the two settings was larger for nongifted students than gifted students. The differences between the settings in figural and verbal originality were not statistically significant.
The findings contribute to the body of knowledge describing how constructivist education can enhance learning for all students. Moreover, the repeated measures ANOVA uncovered significant differences in verbal and figural fluency, where the students’ performance was higher in the art room, regardless of giftedness status. Shah and Gustafsson (2021) reported nonsignificant differences in verbal fluency, flexibility, and elaboration between the two settings, and they reported a significant difference in verbal originality in the art room. This is quite important because fluency can sometimes confound originality scores (Acar et al., 2023; Hocevar, 1980; Runco & Albert, 1985).
Certainly, the specific environmental features of regular classrooms and art rooms in Saudi Arabia might be different from those in England, where Shah and Gustafsson (2021) collected their data. Further, the participants in the current research were in high school, while those who participated in Shah and Gustafsson’s investigation were in elementary school. The figural test used by Shah and Gustafsson (i.e., TCT-DP) is different from the figural test used in the current study. Nonetheless, various repeated measures analyses of variance, which examined each of the DT indices and compared gifted and nongifted students, were consistent with previous findings where DT performances were higher in open classrooms than in traditional ones (Hyman, 1978; Lewis, 1980; Thomas & Berk, 1981).
The second objective of the current study was to test whether or not there was a difference in DT performance related to the time of the school day (first vs. second half). Previous studies using various cognitive tests have uncovered such differences, and it was possible that DT would also differ (Boksem et al., 2005; Goldstein et al., 2007; Mullette-Gillman et al., 2015; Sievertsen et al., 2016). Interestingly, Sievertsen et al. (2016, p. 2621) reported that “for every hour later in the day, test performance decreases by 9% when administering standardized tests.” This was not the case for the DT tests, at least in the current study, except for verbal originality in the regular classroom during the second half of the school day. Differences in verbal fluency, verbal flexibility, figural fluency, figural flexibility, and figural originality were small and not statistically significant. Finally, the current study’s findings were in line with Breslin’s (2019) reporting that creative thinking performance was higher in the afternoon compared with morning, but only the fluency index. In contrast, the originality index was not influenced by the time of the day.
There are limitations to the current investigation. First, the current sample did not include male students, although a recent meta-analysis study showed a very slight difference in DT between males and females in favor of females (Abdulla Alabbasi et al., 2022). As explained in the Method, this occurred because, in Saudi Arabia, boys and girls are segregated in schools. Second, the elaboration index of DT was not assessed in the current study, as it was in Shah and Gustafsson’s (2021) study. Third, the time of the day was compared within a restricted range, corresponding to early in the morning (8:15 a.m.–9:00 a.m.) and around midday (11:35 a.m.–12:20 p.m.) because the school day was set up as such. The impact of fatigue could be better observed if the second or another assessment was conducted late in the afternoon of a longer school day.
Further research should be conducted on the time of day in different cultures, school systems, and school types (i.e., private vs. government schools). Unlike place (regular classroom vs. art room), the mean scores for the time of the day (see Table 3) were inconsistent: some DT indices were higher in the first half of the school day, whereas others were higher in the second half of the school day. Comparisons of gifted and nongifted students indicated that the former had higher verbal DT scores, except for flexibility. Further research on classroom settings is recommended, but for now, we can conclude that (a) future research might compare different school places such as regular classrooms vs. resource rooms and natural environments (in the school garden, for example); (b) it is advisable to avoid regular classroom when administering DT test, at least for generating more ideas (i.e., fluency); (c) there is no difference in administering DT in different times of the school day; (d) researchers can compare morning vs. evening time for administering DT tests; and (e) it is recommended to compare government vs. private schools in terms of place and time of administering DT tests. More research such as that reported here is needed to improve education such that it leads direction to better student learning, higher test scores, and efficient communication skills.
