We explore the impact of the language of instruction on academic performance of Emirati students who are expected to learn mathematics and science in and through a second language. Using TIMSS 2019 standardized tests and employing propensity score technique we explain the differences in achievement in mathematics and science between two groups of young Emirati children living in an Arabic dialect-dominated environment who get instruction in English and who receive instruction in classical Arabic. Our findings indicate that students who receive instruction in English have lower odds of attaining higher achievement scores.
## I. INTRODUCTION
Since its founding in 1971, the United Arab Emirates (UAE) have transformed from a desolate desert into a bustling metropolis thanks to money earned from oil profits. The United Arab Emirates has attained international status in fields including business, tourism, aviation, and architecture by using international standards to gauge its objectives. Nonetheless, comparable educational advancements still lag behind the nation's social and economic growth (Harold, 2005). Despite strong government support for the education sector, the quality of education as measured by student performance in international evaluations such as Trends in International Mathematics and Science Study (TIMSS) remains below international standards.
The language capital is an aspect that may affect student performance. Language capital, or more simply "the mother tongue," is defined in the literature as the set of skills that are acquired during childhood with no particular effort and strengthened in school (Chiswick and Miller, 1995). Arab students in UAE speak two varieties of Arabic; the classical Arabic known as « fushaa » and the different dialects spoken at home (the mother tongue). In such "dialect-dominated" environment, students are unlikely to be fluent in Arabic. Thus, language acquisition efficiency will be significantly limited (Boutieri, 2012). It has been shown that learning a first language requires at least 12 years of experience (Collier, 1989). By the time they reach this age, arabic-speaking pupils have not finished this process, and it might take them much longer to learn the fundamentals of the language.
In addition, a "new school model" has been announced by the government of Abu Dhabi in the United Arab Emirates for state kindergarten and early primary grades with effect from the academic year 2010-2011, and extending annually thereafter to cover all grade levels. Amongst a range of ongoing pedagogic, curricular and school leadership reforms, a major new departure is the introduction of English as an additional medium of instruction alongside the existing medium of Arabic. The reason to introduce English as a medium of learning for all Emirati children at such a young age is the belief that learning a second language at a younger age is beneficial. Despite this widespread view, international research on the issue is "surprisingly ambiguous" (Saville-Troike, 2006), with some review studies showing that adolescents are better language learners than young children, and others showing the opposite (Marinova-Todd et al., 2000). Cummins' (1984) asserts that there is an interdependence between the first language (L1), or the mother tongue, and the second language (L2), which also influences academic achievement (Cummins, 1978). According to the language interdependence hypothesis, L2 development depends in part on the developmental level of the L1, and students with lower levels of L1 and L2 proficiency are more likely to experience academic difficulties in school (Cummins, 1984).
Bilingual education in the Arab world has received little attention (Al-Khatib, 2006), and there has consequently been little research on language-in-education issues in the region. This paper aims to bridge this gap. Using the UAE as a case study, we want to explain the disparity in academic achievements of two groups of young students living in a dialect-dominated environment. The first is taught in English, while the second is taught in classical Arabic. More specifically, building on Cummins' theoretical background, we aim to provide (a) evidence of language interdependence in the Emirati context, and (b) its association with academic performance as measured by the standardized TIMSS 2019 test.
Our work adds to the empirical literature linking academic performance to school language. It was implemented with three objectives in mind. The first is to expand research on the variables that influence academic performance. The second is to test the language interdependence hypothesis in the Emirati context. The third is to consider future language policies in a multilingual context.
## II. CONTEXT OF THE RESEARCH STUDY
Since the early 1990s, the UAE has witnessed remarkable transformation in all aspects of socioeconomic and political life in a short period of time and this rapid development has been made possible by an influx of an expatriate population. Martin (2003) reports that in 1968, nationals of the United Arab Emirates (UAE) comprised $63\%$ of the population, but by 1975, after the discovery of oil, nationals comprised only $36\%$. Along with the commercialization of oil and migration of expatriate laborers came the proliferation of the English language to the UAE. English has historically played an essential role for the modernized Arab world. However, English, and the values it represents, has become a controversial issue within a culture dominated by Islamic traditions (Al Mahrooqi & Denman, 2015).
English might act as a connecting bridge to the outside, international community, but as such it replaces Arabic as the primary means of communication among local people. In the UAE society today, English not only as a medium of instruction in education, but also in mass media and communication technology, such as satellite TV stations, computers, and the Internet. Knowledge of English has become the key to better jobs, especially in the private sector: "A good command of English is regarded as one way of gaining access into the private sector workforce"(Troudi & Al Hafidf, 2017). That is why everyone feels the need to learn English. In certain settings, such hospitals and private businesses, the usage of English is becoming so imperative that Arabic is now considered secondary. This important role of English explains the increasing focus on using English as a medium of instruction in UAE schools.
Most parents who can afford it are keen to send their children to the mushrooming private and international schools that start English instruction from kindergarten. People are more and more eager to have an English education which they believe is the prerequisite to professional and social success (Troudi and Jendli, 2011).
It is generally understood that individuals' attitudes towards foreign languages are influenced by social, economic, and political factors. In the UAE, the spread of English in education has been made more rapid as a consequence of decisions taken by the government, private institutions, and individuals. Because policy makers in the region tend to link English to modernization, it has been aggressively taught in schools at all levels (Hanani, 2009).
A major challenge facing the new school model is a linguistic one: Arabic diglossia which refers to two varieties of the same language being used alongside each other for different functions. Modern Standard Arabic (MSA) is used in reading, writing, and formal spoken situations, while colloquial Arabic is used for speaking. According to Badry (2004) "the linguistic situation in every Arabic speaking country is characterized by diglossia where literate speakers select from several varieties of Arabic depending on the setting, interlocutors, and occasion of the communication". Furthermore, Al Sharhan (2007), in his study of educational languages in the United Arab Emirates, showed that Emirati children need to master not only two but three Arabic languages (trilingual): Gulf Colloquial Arabic; Modern Standard Arabic which is the language of instruction in schools and the language of written media; and Classical Arabic, required for learning the Quran. Emirati schoolchildren learning their first language faced significant challenges in mastering all registers of the language (Abu-Libdeh, 1996).
This complex linguistic situation is likely to contribute to the low levels of first language literacy in UAE, and is thought to be an aggravating factor in poor general educational attainment levels (Maamouri, 1998).
## III. THEORETICAL BACKGROUND
Bilingual or multilingual people are those who use two or more languages in their everyday lives. In order for someone to be considered bilingual, he/she needs to have an appropriate functional ability in both languages. Such control of the two languages is referred to as balanced bilingualism. However, most bilinguals use each language for different purposes, in different circumstances, and with different people in their everyday lives (Fishman, 1972). Bilingualism comes in three flavors: sub-coordinate, coordinate, and compound. Compound bilinguals acquire two languages in the same setting, combining their two verbal expressions to form one thought. Coordinated bilinguals learn both languages in distinct settings, such as school and home, so their vocabulary is part of two distinct and autonomous systems. One language predominates in a sub-coordinate multilingual (D'aciero, 1990).
There is also a difference between subtractive and additive bilingualism (Cummins, 2000). When the first language continues to be developed and the first culture continues to be valued while the second language is added this is known as additive bilingualism. According to research cited by Cummins (1994), kids who study in additive bilingual environments tend to be more successful than those whose original language and culture are completely or partially devalued by their schools and society and are instead substituted by a powerful second language.
For these reasons scholars have called for the need to use the mother tongue in the primary stages of education in order to improve the quality of education as well as to preserve the language, and to provide a solid foundation in the students' native language. This approach prepares them better for learning a second language (Baker, 2001; Cummins, 1999, 2000). Cummins (1984) argues for the need to make a distinction between the academic (cognitive academic language proficiency; CALP) and the conversational (basic interpersonal communicative skills; BICS) dimensions of language proficiency. BICS refers to the development of conversational fluency in the second language, which is a language needed to interact socially with other people and is an earlier development. CALP describes the use of language in decontextualized academic situations which include listening, speaking, reading, and writing. The above hypothesis means that a student's ability to converse in a second language (BICS) is not an indicator of his/her ability to engage in academic skills (CALP) in that second language.
Cummins (2001) elaborates the relationship between L1 and L2 development in his Development Interdependence hypothesis. According to him, crosslingual proficiency can aid in the growth of intellectual and cognitive abilities. Cummins claims that if a minimal level of L1 academic and cognitive development is met, then cognitive and literacy abilities developed in the mother tongue, or L1, will translate between languages. Therefore, reading in L1 can help with both the continuation of first language development and the growth of literacy and knowledge in L2. This implies that the learner may struggle to become bilingual if the threshold of cognitive proficiency in L1 is not met. Research has shown that it usually takes at least five years for second language learners to catch up academically to their native English-speaking peers, but conversational fluency in English is often attained within two years of intensive exposure to the language (Cummins, 1999). Cummins (2001) further argues in his Threshold hypothesis that there are threshold levels of linguistic competence that bilingual children must achieve in both of their languages to take advantage of the benefits of bilingualism.
The findings of several investigations support Cummins's hypothesis (1979). According to Papanastasiou (2000), students who took TIMSS evaluations in a second language experienced lower achievements. Similarly, Herbert et al. (2002) showed that students who received their instruction in English (second language) instead of Chinese are more disadvantaged than native students. In the same vein, Brock-Utne (2007) emphasized that using English as a medium of instruction in Tanzania hinders students' learning. Likely, Samuelson and Freedman (2010) showed that using English as the only language of instruction, would not always allow students to participate successfully in the global economy because many of them will not develop a solid command of academic literacy in their mother tongue.
The Cummins (1979) theoretical framework has been beneficial for second language education from a theoretical standpoint. His main argument is that bilingualism can be advantageous for academic and cognitive purposes when LI abilities are sufficiently developed. In this paper, we will examine the linguistic interdependence in the setting of the United Arab Emirates by comparing the academic achievement of two groups of students whose mother tongue is "Arabi" (Dialect). One group is receiving instruction in English, while the other is receiving instruction in Arabic.
## IV. TIMSS DATA AND METHODS
### a) TIMSS Data
The primary aim of the TIMSS (Trends in International Mathematics and Science Study) assessments is to establish a reliable and valid measure of knowledge and skills in mathematics and science. These assessments hold significant value within the global education community and are designed to align with the curricula of participating countries. Introduced in 1994/1995, the initial implementation of TIMSS involved 45 countries, with Kuwait being the only Arab participant at that time. However, as time has progressed, the participation of Arab nations in TIMSS evaluations has increased, encompassing both fourth and eighth-grade levels.
In the latest TIMSS survey conducted in 2019, a total of 64 countries and 8 benchmarking participants took part. Notably, 9 Arab countries participated at the fourth-grade level, while 11 Arab countries participated at the eighth-grade level. The TIMSS 2019 assessment witnessed the involvement of 6 Gulf countries, namely Bahrain, Oman, Qatar, the United Arab Emirates, Saudi Arabia, and Kuwait. In addition, Dubai played a role as a benchmarking participant. To ensure meaningful comparisons across countries, the TIMSS assessments are strategically scheduled to take place at the end of the academic year. This timing allows for greater cross-country comparability by minimizing potential variations in the content and timing of instruction across different educational systems.
The choice to use the United Arab Emirates as a sample for our study investigating the impact of language of instruction on academic achievement for primary students is justified for multiple reasons. Firstly, UAE has demonstrated a strong commitment to education and have made significant investments in their educational systems. The UAE's Ministry of
Education (MOE) has started a number of educational reforms since the early 1990s, at a period of significant social and economic development in the country as well as growing influence from foreign cultures (UNESCO, 2016). In 2009, the Abu Dhabi Education Council (ADEC) which is the government organization in charge of managing public education in Abu Dhabi, developed the New School Model (NSM). Amongst a range of ongoing pedagogic, curricular and school leadership reforms, a major new departure is the introduction of English as an additional medium of instruction alongside the existing medium of Arabic. Furthermore, the UAE has a distinctive linguistic diversity due to the various ethnic groups that comprise its population (Syed, 2003). Because of this intricate language map, English has come to be used as the common language across the various ethnic groups. This English invasion has also been expedited by the United Arab Emirates' rapid advancements in business and communication technology.
### b) Sample Design and Exclusion
TIMSS is a globally recognized assessment program that employs meticulous sampling procedures to ensure the accuracy and reliability of its estimates, striving to provide an accurate representation of the larger population. In order to maintain consistency and fairness across different cycles of TIMSS, stringent exclusion criteria have been established at both the school and student levels, as documented by Martin et al. (2020). At the school level, certain criteria are employed to exclude specific schools from the dataset. Schools located in remote areas or with an extremely small student population, such as four or fewer students in the target grade, are excluded. Additionally, schools that deviate significantly from the mainstream educational system or exclusively cater to children with special needs are also excluded. These exclusion criteria ensure that the dataset primarily consists of schools that align with the typical educational context. Moreover, international guidelines are implemented to determine within-school exclusions. These guidelines take into account students with functional disabilities, students with intellectual disabilities, and students who are non-native language speakers. By excluding these specific student groups, the aim is to maintain a standardized assessment environment and ensure that the results accurately reflect the abilities of the target population. It is important to note that these exclusion criteria are designed to enhance the validity and reliability of the assessment outcomes. In our specific sample, the overall exclusion rate is $5.6\%$, demonstrating the adherence to these exclusion guidelines while still maintaining a robust and representative dataset for analysis (Table A-1).
The reason for selecting fourth grade students is that at this grade most uses of language in school is cognitively demanding and context-reduced, which means that students have to rely primarily on linguistic cues to meaning and may in some cases require suspending knowledge of the "real world" in order to appropriately interpret the logic of the communication (Cummins & Swain, 1986). Since our aim is to test the linguistic interdependence hypothesis and its link to students' performance, we define two groups of students: those who took the tests in Arabic known as the control group and those who took the tests in English known as the treatment group. In order to get more accurate results and knowing that only native speakers are allowed to participate in TIMSS (according to the students' exclusion criteria discussed above), we restricted the participants who took the test in English to those whose both parents were born in the country. By doing so, we excluded students who were born from mixed marriages (having a native parent), and speak Arabic and English at home. Our sample is then composed of 3092 students wehere the proportion of students who took the tests in English amounts to $32.47\%$.
## V. METHODS
### a) Empirical Model and Technique
OLS regression is first used to estimate the average treatment effect of the use of English as a medium of instruction. The variable treatment is introduced in Equation (1) as a dummy variable. Recall that the treated group consists of students receiving English instruction, while the control group consists of students receiving Arabic instruction.
$$
P _ {i, c, s} = \alpha_ {0} + \alpha_ {1} F _ {i, c, s} + \alpha_ {2} T _ {i, c, s} + \varepsilon_ {i, c, s} \tag {1}
$$
Where $P_{ics}$ is the score of student i in class c at school s. $F_{ics}$ is a vector of individual and family background characteristics. $T_{ics}$ is the treatment variable which is a binary variable that takes the value 1 for treated observations and 0 for control observations. To draw valid inferences, we use the students' sampling weights. The aim of OLS regression is to examine if there are any significant differences in performance between the treatment and control groups. To gain better understanding of performance differences, and because it is not straightforward to directly compare the outcomes for these two groups because those who choose English education may differ from those who choose Arabic, we use the propensity score matching technique. The propensity score matching (PSM) is a quasi-experimental method in which the researcher creates an artificial control group by matching each treated unit with a nontreated unit with similar characteristics. PSM, in particular, computes the probability of a unit enrolling in a program based on observed characteristics. This is the propensity score. Then, based on the propensity score, PSM assigns treated units to untreated units. PSM is based on the assumption that untreated units can be compared to treated units based on some observable characteristics, as if the treatment had been fully randomized (Rubin, 2001).
The use of the matching technique is to control for the potential confounding influence of pretreatment variables (individual and family variables). We utilize a logit model to predict children's propensity for the treatment group. Following that, we use the Nearest Neighbor Matching to pair cases in both groups based on their likelihood of experiencing a treatment. We use matching with replacement to identify neighbor cases (Frisco et al., 2007).
### b) Variables
This section describes the outcome variables used for the purpose of this study as well as the covariates.
Outcome variables: The dependent variables are overall performance in mathematics and science. In TIMSS, student achievement is represented by sets of five plausible values: Math 1, Math 2, Math 3, Math 4 and Math 5 for mathematics achievement and Science 1, Science 2, Science 3, Science 4 and Science 5 for science achievement. Plausible values are imputed values drawn from the estimated ability distributions (Martin et al., 2016; Mislevy, 1991). Plausible values are generated by making use of all available background data of the students. Plausible values are not intended to be estimates of individual student scores, but rather are imputed scores for like students- students with similar response patterns and background characteristics in the sampled population- that may be used to estimate population characteristics correctly (Martin et al., 2020). A detailed review of the plausible values methodology is given in Mislevy (1991).
Covariates: The core explanatory variables are individual and family background characteristics. These variables and their coding are listed in Table A2. Controlling for socioeconomic status (SES) and pre-education is important as they may influence the rate at which a second language is learned. Individual factors include the students' age, gender, and the number of years in pre-primary education. Family factors include parents' education and educational resources at home. The latter was constructed based on the data reported by students and their parents regarding the number of books and other study materials in their homes, the parents' levels of education (National Academies of Sciences, Engineering, and Medicine, 1997) and the parents' employment. Cut scores were used to define students into three categories: students with Many Resources, students with Some Resources and students with Few Resources. All the nominal variables were introduced in the model as dummies. For Sex, it takes the value 1 for females and zero for males. Parents' education level takes four categories: some primary, lower secondary, upper secondary, post secondary and university or higher. The category some primary is considered as a reference category. For home resources, students are assigned to three categories which are the following: students with many resources, students with some resources and students with few resources. The category few resources is considered as a reference category.
### c) Descriptive Statistics
In what follows, we provide a comprehensive overview of the descriptive statistics pertaining to the outcome variables (Table 1 and 2) and covariates employed (Table 3). For continuous variables, we compute the means to summarize the central tendency of the data. On the other hand, categorical variables, such as gender or socioeconomic status, are characterized by distinct categories or groups. In order to depict the distribution of these variables, we calculate proportions.
Table 1: Descriptive statistics for Mathematics Achievements
<table><tr><td>Variables</td><td>Mean of Total Sample</td><td>Mean of Control Group</td><td>Mean of Treatment Group</td><td>Effect Size (Cohen's d)</td></tr><tr><td>Math1</td><td>480.668</td><td>492.112</td><td>456.868</td><td>-0.396</td></tr><tr><td>Math 2</td><td>481.054</td><td>492.763</td><td>456.705</td><td>-0.409</td></tr><tr><td>Math 3</td><td>481.271</td><td>492.183</td><td>458.576</td><td>-0.386</td></tr><tr><td>Math 4</td><td>480.936</td><td>491.953</td><td>458.025</td><td>-0.386</td></tr><tr><td>Math 5</td><td>479.333</td><td>491.600</td><td>453.821</td><td>-0.426</td></tr><tr><td>Number of Observations</td><td>3092</td><td>2088</td><td>1004</td><td></td></tr></table>
Table 2: Descriptive statistics for Science Achievements
<table><tr><td>Variables</td><td>Mean of Total Sample</td><td>Mean of Control Group</td><td>Mean of Treatment Group</td><td>Effect Size (Cohen's d)</td></tr><tr><td>Science 1</td><td>480.616</td><td>503.133</td><td>433.787</td><td>-0.703</td></tr><tr><td>Science 2</td><td>479.929</td><td>503.619</td><td>430.66</td><td>-0.739</td></tr><tr><td>Science 3</td><td>478.889</td><td>500.441</td><td>434.0671</td><td>-0.675</td></tr><tr><td>Science 4</td><td>478.269</td><td>500.715</td><td>431.589</td><td>-0.690</td></tr><tr><td>Science 5</td><td>479.363</td><td>501.947</td><td>432.396</td><td>-0.691</td></tr><tr><td>Number of Observations</td><td>3.092</td><td>2088</td><td>1004</td><td></td></tr></table>
Within both the treated and control groups, the mean values of the outcome variables hover around the global average of approximately 500 points. The negative sign associated with the effect size signifies that learning in English is less effective than learning in
Arabic. The effect size in mathematics and science performance is relatively important (for further insights on the range of variation, refer to Borenstein, 2009, and Hattie, 2009).
Table 3: Descriptive statistics of covariates for all students
<table><tr><td>Variables</td><td>Mean/Proportion</td><td>Min</td><td>Max</td></tr><tr><td>Age</td><td>9.612</td><td>8.42</td><td>11</td></tr><tr><td>Gender</td><td>0.553</td><td>0</td><td>1</td></tr><tr><td>Pre-Education</td><td>1.566</td><td>0</td><td>4</td></tr><tr><td>Parents' education</td><td></td><td></td><td></td></tr><tr><td>Some primary</td><td>0.0268</td><td>0</td><td>1</td></tr><tr><td>Lower secondary</td><td>0.0252</td><td>0</td><td>1</td></tr><tr><td>Upper secondary</td><td>0.213</td><td>0</td><td>1</td></tr><tr><td>Post-secondary</td><td>0.177</td><td>0</td><td>1</td></tr><tr><td>University or higher</td><td>0.558</td><td>0</td><td>1</td></tr><tr><td>Resources</td><td></td><td></td><td></td></tr><tr><td>Few resources</td><td>0.0124</td><td>0</td><td>1</td></tr><tr><td>Some resources</td><td>0.890</td><td>0</td><td>1</td></tr><tr><td>Many resources</td><td>0.0976</td><td>0</td><td>1</td></tr><tr><td>Number of Observations</td><td colspan="3">3092</td></tr></table>
The students included in the sample exhibit an average age of 9.61 years. In addition, the sample is approximately evenly divided between boys and girls, ensuring a balanced representation of both sexes in the analysis. Students typically received two years of preprimary education. Furthermore, 55 percent of parents in UAE have a university degree or higher and more than 98 percent of students have access to some home educational resources.
## VI. RESULTS AND DISCUSSION
The following section provides a comprehensive analysis of the results obtained from both Ordinary Least Squares (OLS) regression and propensity score matching techniques. OLS regression analysis was conducted independently five times for each evaluation. To derive the final parameter of interest, Rubin's rules (Rubin, 1987) were applied, which enable the combination of results from multiple regression analyses to generate a comprehensive and reliable estimate. The findings from the OLS regression analysis are presented in Table 4, while Table 5 presents the results obtained from propensity score matching.
The OLS results reveal a significant negative association between taking the test in english and academic achievement. Students who took the tests in English have significantly lower achievement in mathematics and science than their peers who passed the tests in Arabic.
Consistent with previous research (Ammermüller et al., 2005; Chiu & Khoo, 2005), the analysis reveals that family variables have a significant positive impact on students' performance. These findings emphasize the influential role of family factors in shaping academic outcomes. Moreover, when examining individual student characteristics, the coefficient of gender demonstrates a significant influence in evaluations; Boys outperform girls in mathematics, however, girls outperform boys in science. Preschool education participation has a positive and significant influence on overall performance for both evaluations which is in line with previous studies (Holla et al., 2021).
Table 4: OLS Results
<table><tr><td>Variables</td><td>Mathematics Achievement</td><td>Science Achievement</td></tr><tr><td>Treat</td><td>-45,436***</td><td>-81,918***</td></tr><tr><td></td><td>(3,918)</td><td>(4,835)</td></tr><tr><td>Age</td><td>28,496***</td><td>37,453***</td></tr><tr><td></td><td>(4,180)</td><td>(4,7630)</td></tr><tr><td>Sex</td><td>-6,083*</td><td>5,848*</td></tr><tr><td></td><td>(3,280)</td><td>(3,609)</td></tr><tr><td>Pre Education</td><td>3,897***</td><td>2,466***</td></tr><tr><td></td><td>(1,379)</td><td>(1,539)</td></tr><tr><td>Parents' Education</td><td></td><td></td></tr><tr><td>Lower secondary</td><td>3,130</td><td>5,695</td></tr><tr><td></td><td>(14,713)</td><td>(16,004)</td></tr><tr><td>Upper secondary</td><td>-6,639</td><td>6,601</td></tr><tr><td></td><td>(11,745)</td><td>(12,884)</td></tr><tr><td>Resources</td><td></td><td></td></tr><tr><td rowspan="2">Some resources</td><td>27,306**</td><td>15,991</td></tr><tr><td>(13,593)</td><td>(16,735)</td></tr><tr><td rowspan="2">Many resources</td><td>42,786***</td><td>35,430**</td></tr><tr><td>(14,227)</td><td>(17,537)</td></tr><tr><td rowspan="2">Constant</td><td>177,0532***</td><td>90,805*</td></tr><tr><td>(42,794)</td><td>(48,426)</td></tr></table>
To further substantiate the finding between language of instruction and academic achievement, we employ a propensity score model. The results of this analysis are presented in Table 5.
Table 5: ATET Nearest Neighbor Results
<table><tr><td>ATE TREAT
(1 vs. 0)</td><td>Mathematics</td><td>Science</td></tr><tr><td>Overall Performance</td><td>-51,003***
(4,128)</td><td>-88,844***
(5,106)</td></tr></table>
Students who received English-language mathematics instruction (Treatment group) had their overall scores reduced by 51 points, respectively, when compared to students who passed the Arabic-language Mathematics test (Control group). Similarly, students who received science instruction in English (Treatment group) had their overall scores fall by 88 points, respectively, as compared to students in the control group. This suggests that performance at this grade level is lower when education is offered in English. All the differences are statistically significant at the $1\%$ level. Our findings reveal that students whose mother tongue is Arabic and with early English learning experience score significantly lower in TIMSS evaluations than their peers who take the tests in Arabic. The differences are more noticeable in science than in mathematics. The learning of mathematics and science requires a variety of linguistic skills that second language learners may not have mastered by the age of nine. Even though mathematics language seems to be abstract, the study of mathematics (especially at the primary level) begins with the study of real-world problems and requires the application of the language. Therefore, the language plays an important role in conveying mathematical knowledge to students and in knowing how abstraction is interpreted (Ferrari, 2003). Likely, the difference in science proficiency between the treatment and control group is manifest. This provides evidence that a first language enhances more science learning than a second language. At grade 4, science context domains are more linked to real-world situations and thus to "home language." Students need to develop a deep understanding of science concepts, make connections among concepts and apply concepts in explaining natural phenomena or real-world situations. Students in science classes must be engaged in science inquiry, have to negotiate ideas and justify claims based on evidence. It has been shown that sometimes, and for effective instruction in science, teachers focus on students' home language as an instructional support. They use students' home language to explain science terms (Goldenberg, 2013).
A key premise in the literature is that age is significant in child language acquisition, whether in L1 or L2 (Oliver and Azkarai, 2017). Given that the critical period of twelve years is required to be fully proficient in the first language (Collier, 1989), students in UAE at the age of nine (fourth grade) have not yet begun to complete full cognitive development in the first language and so do their peers in their native languages. Nonetheless, the difference is that English native speakers are approaching the completion of L1 acquisition, whereas students whose "mother tongue" is Arabic (Dialect) require additional years to improve their Arabic language skills. So they lag behind their peers in terms of years of L1 acquisition. The mismatch between the language of the home and the language of the school makes those students less proficient in "modern Arabic."
## VII. CONCLUSION AND RECOMMENDATIONS
In this study, we investigate the effect of the language of instruction (English in this case) on academic achievement of Emirati-speaking students who are expected to learn science and math in a second language (L2) before achieving a sufficient level of proficiency in it.
Based on Cummins's (1984) theoretical framework, which links language proficiency to academic achievement, we offer some insights regarding the connection between Emirati students' academic achievement in mathematics and science and their L1 and L2 proficiency. In order to explain the disparity in academic performance in science and mathematics between two groups of young children living in a dialect-dominated environment who got instruction in English and who received instruction in Arabic, we used the propensity score technique and the TIMSS 2019 standardized tests.
Our results demonstrate that the differences in performance between the two groups can be explained by the language used during instruction. Young children who started studying L2 before their L1 was fully matured showed overall performance losses as compared to children who started learning Arabic at the same age. The results of this study support the linguistic interdependence hypothesis developed by Cummins (1984). This study helps us to think about future language policy not only in the UAE, but also in Arab countries that are all multilingual.
Strengthening early exposure to the Arabic language (modern Arabic) will minimize the retardation produced by the mismatch between students' mother tongue and the school language. Arabic should be taught and learned in schools in a more practical manner. Additionally, Arabic learners should have access to audio-visual resources, which have the added benefit of advancing the language's cognitive and intellectual components in addition to its fluency. Finally, pupils who are fully proficient in both languages would benefit from the cognitive and academic advantages of bilingualism, while bilinguals who are only sufficiently proficient in one of their languages would not face any such academic drawbacks.
### Statements and Declarations:
Funding: There has been no significant financial support for this work that could have influenced its outcome.
Ethical Approval: Not applicable.
Informed Consent: This article does not contain any studies with human participants performed by any of the authors.
Author's Contribution: There is only one author for this article
Conflict of Interest: We know of no conflicts of interest associated with this publication.
Data Availability Statement: https://timss2019.org/international-database/
#### APPENDIX
Table A-1: Coverage and Exclusion Rates in UAE
<table><tr><td>Coverage</td><td>Overall exclusion</td><td>Schools</td><td>Students</td></tr><tr><td>100%</td><td>5.6%</td><td>2.6%</td><td>3%</td></tr></table>
Table A-2: Variables' List, coding and meaning
<table><tr><td>List of Variables</td><td>Coding</td><td>Meaning</td></tr><tr><td>Overall Performance Math1-Math5</td><td>asmat 01–05</td><td>The 1st to 5th plausible value of Overall Performance in Mathematics</td></tr><tr><td>Overall Performance Science1-Science5</td><td>assci 01–05</td><td>The 1st to 5th plausible value of Overall Performance in Science</td></tr><tr><td>Age</td><td>asdage</td><td>Quantitative variable which indicates student's age.</td></tr><tr><td>Sex</td><td>asbg01</td><td>Dummy variable which takes the value 1 for female and 0 for male.</td></tr><tr><td>Pre-Education</td><td>asbh04b</td><td>Quantitative variable which indicates the number of years in pre-primary education for each student.</td></tr><tr><td>Parents' Education</td><td>asdhedup</td><td>Categorical variable which reflects parents' education level as follows; some primary, upper secondary, postsecondary and university or higher. The category some primary is considered as reference category.</td></tr><tr><td>Resources</td><td>asdghrl</td><td>A score calculated based on the number of books and other study materials in the students' homes, their parents' level of education, and their parents' employment. Scores were used to define students into three categories: students with Many Resources, students with Some Resources and students with Few Resources. The category few resources is considered as a reference category.</td></tr></table>
Generating HTML Viewer...
References
28 Cites in Article
Michael Borenstein,Larry Hedges (2009). EFFECT SIZES FOR META-ANALYSIS.
The handbook of research synthesis and meta analysis.
Charis Boutieri (2012). IN TWO SPEEDS (<i>À DEUX VITESSES</i>): LINGUISTIC PLURALISM AND EDUCATIONAL ANXIETY IN CONTEMPORARY MOROCCO.
Barry Chiswick,Paul Miller (1995). The Endogeneity between Language and Earnings: International Analyses.
Ming Chiu,Lawrence Khoo (2005). Effects of Resources, Inequality, and Privilege Bias on Achievement: Country, School, and Student Level Analyses.
Virginia Collier (1989). How Long? A Synthesis of Research on Academic Achievement in a Second Language.
J Cummins (1978). Educational implications of mother tongue maintenance in minority-language groups.
James Cummins (1979). Linguistic Interdependence and the Educational Development of Bilingual Children.
J Cummins (1984). Wanted: A theoretical framework for relating language proficiency to academic achievement among bilingual students.
Jim Cummins (2008). BICS and CALP: Empirical and Theoretical Status of the Distinction.
Jim Cummins,Merrill Swain (1986). Bilingualism in Education.
Pier Ferrari (2003). Abstraction in mathematics.
B Fishbein,P Foy,L Yin (2021). TIMSS 2019 user guide for the international database.
Michelle Frisco,Chandra Muller,Kenneth Frank (2007). Parents’ Union Dissolution and Adolescents’ School Performance: Comparing Methodological Approaches.
B Dauleshyar,A Alishev (2020). Studying the Perspectives and Career Guidance of 8th-grade Students on Science, Physics, and Future Professions through the Integration of Robotics into Physics Education.
C Goldenberg (2013). Unlocking the research on English learners: What we know-and don't yet know-about effective instruction.
John Hattie (2009). Visible Learning.
W Herbert,T Hua,K Kong (2002). Multilevel causal ordering of academic self-concept and achievement: Influence of language of instruction (English compared with Chinese) for Hong Kong students.
Alaka Holla,Magdalena Bendini,Lelys Dinarte,Iva Trako (2021). Is Investment in Preprimary Education Too Low? Lessons from (Quasi) Experimental Evidence across Countries.
M Martin,I Mullis,P Foy,M Hooper (2016). TIMSS achievement methodology.
(2019). Methods and procedures: TIMSS.
Robert Mislevy (1991). Randomization-Based Inference about Latent Variables from Complex Samples.
(1997). Improving Schooling for Language-Minority Children.
Rhonda Oliver,Agurtzane Azkarai (2017). Review of Child Second Language Acquisition (SLA): Examining Theories and Research.
Constantinos Papanastasiou (2000). Internal and external factors affecting achievement in mathematics: Some findings from TIMSS.
Donald Rubin (1987). Multiple Imputation for Nonresponse in Surveys.
Donald Rubin (2001). Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation.
Beth Samuelson,Sarah Freedman (2010). Language policy, multilingual education, and power in Rwanda.
No ethics committee approval was required for this article type.
Data Availability
Not applicable for this article.
How to Cite This Article
Imen Hentati. 2026. \u201cExploring the Relationship Between Language of Instruction and Academic Achievement Among Primary School Students: Evidence from UAE\u201d. Global Journal of Human-Social Science - E: Economics GJHSS-E Volume 24 (GJHSS Volume 24 Issue E2): .
Explore published articles in an immersive Augmented Reality environment. Our platform converts research papers into interactive 3D books, allowing readers to view and interact with content using AR and VR compatible devices.
Your published article is automatically converted into a realistic 3D book. Flip through pages and read research papers in a more engaging and interactive format.
We explore the impact of the language of instruction on academic performance of Emirati students who are expected to learn mathematics and science in and through a second language. Using TIMSS 2019 standardized tests and employing propensity score technique we explain the differences in achievement in mathematics and science between two groups of young Emirati children living in an Arabic dialect-dominated environment who get instruction in English and who receive instruction in classical Arabic. Our findings indicate that students who receive instruction in English have lower odds of attaining higher achievement scores.
Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]
Thank you for connecting with us. We will respond to you shortly.