Sage Journals: Discover world-class research

Abstract

Technology-Enhanced Vocabulary Learning (TEVL) has become a central research area in applied linguistics and language education. While numerous meta-analyses and systematic reviews on TEVL have been conducted, a comprehensive science map illustrating the field’s intellectual structure has been lacking. Therefore, the present study aimed to examine the academic output in TEVL from 2014 to 2024 from a bibliometric perspective. Specifically, annual scientific production, influential documents and sources, key authors and countries, and thematic evolution were analyzed based on 3,307 documents from the Web of Science database. The findings reveal that: (a) The study of TEVL has developed rapidly over the past decade, with a significant acceleration in publications after 2020. (b) Research is concentrated in two primary hubs, the United States and China, with growing participation from countries like Turkey and Iran. (c) Thematic trends show a clear evolution from foundational computer- and mobile-assisted learning to emerging frontiers like artificial intelligence (AI), augmented reality (AR), and gamification. (d) Despite technological advancements, the field exhibits persistent methodological limitations and a notable gap between the implicit theoretical underpinnings and explicit research practice.

Plain Language Summary

A Critical Bibliometric Review of Technology-Enhanced Vocabulary Learning: Uncovering a Decade of Tensions and Trends

Technology-Enhanced Vocabulary Learning (TEVL) has become a central research area in applied linguistics and language education. While numerous meta-analyses and systematic reviews on TEVL have been conducted, a comprehensive science map illustrating the field’s intellectual structure has been lacking. Therefore, the present study aimed to examine the academic output in TEVL from 2014 to 2024 from a bibliometric perspective. Specifically, annual scientific production, influential documents and sources, key authors and countries, and thematic evolution were analyzed based on 3,307 documents from the Web of Science database. The findings reveal that: (1) The study of TEVL has developed rapidly over the past decade, with a significant acceleration in publications after 2020. (2) Research is concentrated in two primary hubs, the United States and China, with growing participation from countries like Turkey and Iran. (3) Thematic trends show a clear evolution from foundational computer- and mobile-assisted learning to emerging frontiers like artificial intelligence (AI), augmented reality (AR), and gamification. (4) Despite technological advancements, the field exhibits persistent methodological limitations and a notable gap between the implicit theoretical underpinnings and explicit research practice.

Keywords

technology-enhanced vocabulary learning (TEVL)bibliometrics mobile-assisted language learning digital game-based learning AI in education VOSviewer CiteSpace

Introduction

Vocabulary acquisition is fundamental to language learning, demanding both systematic instruction and extensive exposure for successful mastery (Nation, 2017; Schmitt, 2008). Building comprehensive lexical knowledge requires substantial cognitive effort, particularly in English as a Foreign Language (EFL) context, where learners must acquire not only word meanings but also their phonological, morphological, syntactic, and sociolinguistic properties (Laufer, 2009; Zhang & Koda, 2018). This multifaceted nature of vocabulary learning makes it one of the most persistent challenges faced by foreign language learners. The process is further complicated by the need for learners to notice and process new lexical items in meaningful contexts (Schmidt, 1990), while also avoiding cognitive overload when presented with dense or decontextualized input (Sweller, 1988).

In recent years, technological advancements have profoundly reshaped language education, establishing Technology-Enhanced Vocabulary Learning (TEVL) as a vibrant and critical field of inquiry. The research landscape has rapidly expanded to encompass a diverse array of approaches, ranging from mobile applications designed for ubiquitous learning (Harger, 2020) and digital games that enhance motivation (Zou et al., 2021), to collaborative social media platforms (Reinhardt, 2018) and, most recently, adaptive systems powered by artificial intelligence (Jia et al., 2022). While empirical studies consistently demonstrate the efficacy of these individual technologies (Hao et al., 2021), this rapid proliferation has created a rich but increasingly fragmented knowledge base. Such fragmentation makes it challenging for researchers and practitioners to discern the overarching intellectual structure, collaborative patterns, and evolutionary trends of the field as a whole.

Positioning the Current Study Within the Landscape of TEVL Research

To justify the present study, it is crucial to first critically map these existing reviews and identify the specific gap this bibliometric analysis aims to fill. Broadly, existing reviews can be categorized into two main streams.

The first stream consists of quantitative meta-analyses focused on effectiveness, which seek to answer the question: “What works?” These studies synthesize empirical results to calculate the effectiveness (i.e., effect size) of specific technological tools. For instance, meta-analyses by Lin and Lin (2019) and Hao et al. (2021) have established the moderate effectiveness of mobile-assisted and technology-assisted vocabulary learning, respectively. Similarly, Yu and Trainin (2022) confirmed the benefits of technology for long-term vocabulary retention. While these focused reviews provide invaluable, deep evidence on the efficacy of individual technologies (e.g., digital games, mobile apps), they inherently operate within “technological silos.” They are designed to assess the effectiveness of a pre-defined path, not to map all available paths or to illustrate how these different technological streams intersect, compete, or evolve together over time.

A second stream of systematic reviews addresses the “why and how” of TEVL, collating theoretical frameworks and critiquing research methodologies. These reviews have identified significant gaps, noting that many studies fail to articulate a clear theoretical foundation (Yang et al., 2021; Zhou et al., 2024). Furthermore, they consistently highlight prevalent methodological limitations, such as small sample sizes and brief intervention periods (Hwang & Fu, 2019; Vnucko & Klimova, 2023). While these reviews provide profound qualitative insights, their conclusions are typically drawn from a curated selection of studies. They can diagnose problems within the field but are not designed to provide a data-driven, macro-level map of its intellectual architecture, foundational pillars, or evolutionary trajectory based on thousands of publications.

While both streams of research are essential, a significant gap remains. Neither the siloed meta-analyses nor the qualitative systematic reviews can offer a holistic, structural view of the TEVL field as a whole. A science map that visualizes the entire territory of inquiry is currently lacking. Such a map would reveal the field’s key contributors, collaborative networks, thematic evolution, and emerging frontiers. By moving beyond evaluating a single intervention or qualitatively summarizing trends, it provides a data-driven, comprehensive knowledge map of the TEVL field over the last decade, complementing existing reviews by offering a perspective that is both panoramic and structurally deep.

Research Questions

This study aims to answer the following research questions:

RQ1: What are the key characteristics of TEVL research (2014–2024) in terms of productivity patterns, citation impact, leading contributors, publication venues, and influential studies across technological approaches?

RQ2: How has the intellectual structure of TEVL research evolved (2014–2024), and what emerging trends and future directions are revealed through thematic, theoretical, methodological, and technological analyses?

Materials and Methods

The study conducted a scientific and comprehensive analysis of TEVL via CiteSpace, VOSviewer and the Biblioshiny package in R, based on bibliometric data obtained from The Web of Science Core Collection database.

Data Sources and Extraction

This study employed the Web of Science (WoS) Core Collection database (2014–2024). The WoS Core Collection was chosen as the sole data source for this study due to its reputation for high-quality, peer-reviewed literature and its comprehensive citation data, which is essential for conducting robust co-citation and network analyses. While other databases like Scopus offer broader coverage, the data consistency and quality within WoS are considered highly reliable for mapping the intellectual structure of a scientific field (Aria & Cuccurullo, 2017). This choice prioritizes data quality and analytical rigor, though its limitations are acknowledged.

The search strategy utilized three interconnected keyword clusters: (a) vocabulary-related terms (“vocabulary” OR “word”), (b) technology-related terms (“technology” OR “AI” OR “artificial intelligence” OR “computer” OR “mobile” OR “digital” OR “online” OR “deep learning”), and (c) educational context terms (“learning” OR “education”). These clusters were combined using Boolean operators to ensure precise capture of relevant publications. The selection of keywords was based on considerations of the technological evolution trends in the research topic.

The search was confined to English-language publications indexed in four WoS indices (SCI-EXPANDED, SSCI, A&HCI, and ESCI) and restricted to articles, review articles, and early access publications. Further refinement focused on four WoS categories: Education Educational Research, Linguistics, Language Linguistics, and Education Special. The downloaded dataset comprised 3,316 records, retrieved in seven sequential batches due to platform export limitations (Figure 1). After removing of duplicates, the final dataset included for analysis contains 3,307 records. The bibliographic data, including authors, titles, abstracts, keywords, references, and citation metrics, were exported in plain text format and analyzed using Biblioshiny package in R, VOSviewer and CiteSpace.

Figure 1.

Flowchart for selection of documents for bibliometric analysis.

Bibliometric Analysis

The three complementary software tools employed in this study are selected for their own unique strengths. Biblioshiny is an R-based tool used for facilitating statistical analysis of publication trends, citation metrics, and author productivity, generating detailed reports on scientific production and key contributors (Aria & Cuccurullo, 2017). VOSviewer was chosen for its advanced visualization capabilities, mapping co-authorship, co-citation, and keyword co-occurrence networks, offering an intuitive representation of collaborative patterns and research trends (van Eck & Waltman, 2010, 2014). CiteSpace was used to analyze the temporal evolution of research themes, intellectual structure, and emerging trends. By detecting pivotal research fronts, burst keywords, and co-citation networks, it effectively identifies thematic shifts and influential scholarly clusters (Chen, 2006; Chen et al., 2010).

Results

Table 1 provides an overview of the data collection. The final dataset includes 3,307 documents published between 2014 and 2024, distributed across 651 sources. The corpus demonstrates a robust annual growth rate of 15.01%, indicating increasing scholarly attention to technology-enhanced vocabulary learning. Of these publications, the majority (2,910) are research articles, supplemented by 229 early access articles and 159 reviews, reflecting the field’s emphasis on empirical research.

Table 1.

Main Information About Data.

Descriptions	Results	Description	Results
Timespan	2014–2024	Document contents
Sources (journals, books, etc.)	651	Keywords plus	3,145
Documents	3,307	Author’s keywords	8,353
References	123,721	Authors collaboration
Document types		Single-authored docs	711
Article	2,910	Co-authors per doc	2.88
Article; early access	229	International co-authorships %	18.54
Article; proceedings paper	7	Annual growth rate %	15.01
Article; retracted publication	1	Average citations per doc	11.25
Editorial material; early access	1	Document average age	3.69
Review & early access review	159

The collection involves 8,005 authors, with a notable tendency toward collaborative scholarship as evidenced by the average of 2.88 co-authors per document. While 711 publications are single-authored, the majority are collaborative efforts, with 18.54% involving international co-authorships. This collaborative pattern suggests a growing global research network in the field. The research impact is substantial, with documents averaging 11.25 citations each and maintaining a relatively current knowledge base with an average document age of 3.69 years. The intellectual breadth of the field is reflected in 8,353 author-designated keywords and 3,154 system-generated keywords, indicating diverse research focus within the research domain. The substantial reference count of 123,721 demonstrates the field’s rich theoretical and empirical foundation.

Annual Scientific Production

Figure 2 depicts the annual scientific production in TEVL research from 2014 to 2024. The field exhibits a consistent upward trajectory, with publications increasing from 139 in 2014 to 563 in 2024. Three distinct growth phases are observable: (a) a steady growth period (2014–2019) with annual publications rising from 139 to 255; (b) an acceleration phase (2019–2021) showing a marked increase to 373 publications; and (c) a rapid expansion phase (2021–2024) reaching 563 publications. Notably, the post-2020 period demonstrates the steepest growth, reflecting not only the global catalyst of the COVID-19 pandemic but also, arguably, a maturation of the field itself, where research tools and methodologies became more accessible and established, enabling a wider range of scholars to contribute.

Figure 2.

Annual scientific productions.

Analysis of Cited Documents

Average Article Citations Per Year

Figure 3 presents the average annual citations of TEVL articles generated by Biblioshiny. Older publications generally display higher total citation counts due to their longer exposure, but their mean annual citations are more modest, as seen in 2014 (2.71) and 2015 (2.2). A notable increase occurs between 2018 and 2020, with mean annual citations peaking at 3.07 in 2020, likely driven by the pandemic’s acceleration of research on adaptive and remote learning technologies. Recent years, particularly 2023 (2.0) and 2024 (1.17), show declining averages primarily due to citation lag and the increasing number of publications diluting citations. However, exceptions exist where recent articles achieve high citation rates, driven by their alignment with emerging research priorities.

Figure 3.

Average article citations per year.

Most Highly Cited Documents

The most highly cited papers in TEVL research are examined in Biblioshiny using two important indexes: Global Citation Score (GCS), which shows the total number of citations in WoS at the time the data were retrieved, and Local Citation Score (LCS), which shows the number of citations in the current dataset. Beginning with a historiographical review to identify key literature, I will then analyze globally cited papers, as LCS cannot exceed GCS.

According to Table 2, the top three cited ones are all review articles (Burston, 2015; Golonka et al., 2014; Takacs et al., 2015). Other notable empirical works include Wu’s Designing a smartphone App to teach English (L2) vocabulary (2015; GCS: 88, LCS: 28) and Sandberg’s The added value of a gaming context and intelligent adaptation for a mobile learning application for vocabulary learning (2014; GCS: 89, LCS: 30), highlighting the potential of mobile applications for vocabulary acquisition and the role of gaming and intelligent adaptation in mobile-assisted vocabulary learning respectively. Overall results demonstrate the variety of research areas and the changing approaches in TEVL during the previous 10 years.

Table 2.

Historiography.

Author	Title	DOI	Year	LCS	GCS
Sandberg J	The added value of a gaming context and intelligent adaptation for a mobile learning application for vocabulary learning	10.1016/J.COMPEDU.2014.03.006	2014	30	89
Golonka EM	Technologies for foreign language learning: a review of technology types and their effectiveness	10.1080/09588221.2012.700315	2014	67	479
Smeets DJH	The interactive animated e-book as a word learning device for kindergartners	10.1017/S0142716413000556	2015	23	93
Takacs ZK	Benefits and pitfalls of multimedia and interactive features in technology-enhanced storybooks: A meta-analysis	10.3102/0034654314566989	2015	24	219
WU Q	Designing a smartphone app to teach English (L2) vocabulary	10.1016/J.COMPEDU.2015.02.013	2015	28	88
Burston J	Twenty years of MALL project implementation: A meta-analysis of learning outcomes	10.1017/S0958344014000159	2015	43	200
Duman G	Research trends in mobile assisted language learning from 2000 to 2012	10.1017/S0958344014000287	2015	29	140
Sundqvist P	Out-of-school digital gameplay and in-school L2 English vocabulary outcomes	10.1016/J.SYSTEM.2015.04.001	2015	31	123
Hwang GJ	Single loop or double loop learning: English vocabulary learning performance and behavior of students in situated computer games with different guiding strategies	10.1016/J.COMPEDU.2016.07.005	2016	25	67
Calvo-Ferrer JR	Educational games as stand-alone learning tools and their motivational effect on L2 vocabulary acquisition and perceived learning gains	10.1111/BJET.12387	2017	29	66

Figure 4 presents the 15 most globally cited documents in the field in the past decade. The most cited work by Golonka et al. (2014; GCS: 479) provided a comprehensive effectiveness framework for language learning technologies, catalyzing subsequent empirical investigations. Building on this foundation, Takacs et al. (2015; GCS: 219) and Burston (2015; GCS: 200) contributed crucial meta-analytic evidence, with the former establishing the efficacy of multimedia features in vocabulary acquisition and the latter critically examining methodological rigor in mobile-assisted language learning research.

Figure 4.

Most global cited documents.

The field’s theoretical maturation is evidenced by the emergence of more specialized investigations. Hung et al. (2018; GCS: 152) analysis of digital game-based learning and Hsu (2017; GCS: 154) examination of augmented reality applications exemplify this trend, offering empirical validation for specific technological approaches while refining pedagogical frameworks. Recent high-impact works by Lin and Lin (2019; GCS: 134) and Zou et al. (2021; GCS: 121) represent the field’s current trajectory toward integrated approaches. Their analyses synthesize previous findings while identifying emerging trends, particularly in adaptive learning systems and virtual reality applications. This evolution reflects the field’s progression from basic effectiveness studies to sophisticated implementations that combine multiple technological affordances for optimal vocabulary acquisition outcomes. The citation patterns show a linear theoretical progression, with each stage upholding methodological rigor while expanding and building upon earlier discoveries.

Co-citation Analysis of Papers

Co-citation analysis, introduced by Henry Small (1973), measures how frequently two documents are cited together in subsequent publications. This method reveals the intellectual structure of research fields by identifying pairs of works that researchers frequently reference together, suggesting these works share related concepts, methods, or theoretical foundations (McCain, 1990). Through network visualization, co-citation analysis can effectively map the core literature of a field, identify influential research clusters, and demonstrate how different research streams are interconnected, making it a valuable tool for understanding the evolution and structure of academic knowledge domains (White & McCain, 1998).

A co-citation network analysis was conducted using VOSviewer to examine the intellectual structure of the field (Figure 5). The visualization reveals five distinct clusters, representing key research domains and their interconnections through cited references. The prominence of Vygotsky (1978), a foundational theorist of social learning, alongside Schmitt (2008), a key figure in applied linguistics pedagogy, is particularly telling. This duality suggests that the intellectual heart of TEVL research is not purely technological, but rather a dynamic synthesis of sociocultural learning theory and concrete vocabulary teaching principles. This finding provides a data-driven confirmation for the theoretical underpinnings that will be explored in the Discussion section.

Figure 5.

Co-citation network of cited references.

Regarding closeness centrality, Kukulska-Hulme and Shield (2008) shows high measures in the mobile learning cluster (blue), with Liu and Chu (2010) and Sylvén and Sundqvist (2012) demonstrating similar prominence in technology integration research (yellow). The five clusters represent: theoretical foundations (red, centered on Vygotsky), mobile learning applications (blue), language teaching methodology (green), educational technology integration (yellow), and language assessment (purple).

Sources

Figure 6 presents the most influential sources in TEVL research over the past decade. Analysis of publication patterns reveals that the top 10 sources contributed 686 documents, accounting for 28.5% of the sampled literature, indicating concentrated scholarly attention in specific journals.

Figure 6.

Top 10 relevant journals.

The three most productive journals in this domain are Computer Assisted Language Learning (131 documents, h-index: 75), Education and Information Technologies (100 documents, h-index: 76) and Computers & Education (77 documents, h-index: 232). The notably high h-index of Computers & Education reflects its long-standing influence in educational technology research since 1976. The distribution of publications across educational technology journals (e.g., Computers & Education) and language learning-specific outlets (e.g., System) reflects the field’s dual focus on technological innovation and vocabulary acquisition theory.

Authors, Affiliations and Countries

Prolific Authors

Figure 7 presents the publication patterns of the top 20 most relevant authors in the TEVL field from 2014 to 2024. From the productivity visualization, several prominent research groups emerge.

Figure 7.

Top-authors’ production over the time.

The first group is led by Zou D. (N = 24) and Hwang GJ. (N = 29), who have focused extensively on mobile-assisted vocabulary learning and gamified vocabulary applications. Their consistent publication pattern shows strong activity particularly during 2020 to 2022, with multiple high-impact studies per year. The second group includes researchers like Shadriev R (N = 16) and Xie H. (N = 15), who have contributed significantly to adaptive vocabulary learning systems and personalized vocabulary instruction. Their publication trajectory shows steady growth from 2018 onwards, with notable peaks in productivity during 2021 to 2022. Zhang RF. (15) and Hwang WY. (14) represent another influential group, concentrating on vocabulary learning through augmented reality and multimedia applications. Their research output has been consistent throughout the period, with regular publications maintaining their influence in the field.

According to the visualization, the majority of productive writers continued to pursue active research agendas from 2018 to 2022, and the overlapping publication years show growing patterns of collaboration. The timeline’s bubble sizes highlight times of high productivity; multiple authors produced a lot of work between 2020 and 2022, indicating a rise in interest in TEVL strategies both during and after the global pandemic.

Figure 8 illuminates the intellectual structure of TEVL research through the interconnections among cited references (CR), authors (AU), and author keywords (DE). The bibliometric network reveals three distinct but interrelated components.

Figure 8.

Three-fields co-occurrence plot.

First, the theoretical foundations are anchored by seminal works, including Schmitt (2008), Nation (2017) and Vygotsky (1978), which established fundamental principles in vocabulary acquisition and cognitive development. Second, the author distribution identifies key contributors, with Hwang GJ, Shadiev R, and Zou D emerging as central figures in the field.

The network analysis shows that top researchers have different research paths. Particularly in studies on mobile-assisted vocabulary learning, Hwang GJ and Shadiev R show close ties to the areas of vocabulary acquisition and mobile learning. Zou D’s work emphasizes the integration of digital platforms and demonstrates significant connections to online learning and language instruction. Notably, research of Golonka et al. (2014) on computer-assisted language learning provides an essential link between vocabulary acquisition techniques and mobile learning applications. The keyword analysis also shows that higher education is a noteworthy study environment, with Zhang RF and Kennedy MJ concentrating mostly on tertiary-level TEVL.

Most Relevant Countries

The global cooperation tendencies in research visualized by VOSviewer are shown in Figure 9. Given its greatest node size and central location, the United States stands out as the primary hub, highlighting its crucial role in global research cooperation. The second most important node is China, which has close bilateral ties with the United States and acts as a link to other Asian nations, including South Korea and Japan. The network shows three different geographical clustering patterns: a Middle Eastern cluster that includes Iran, Iraq, and Indonesia; an Asian cluster that is led by China and includes Japan and South Korea; and a Western cluster that includes the United States, Canada, Germany, Spain, and France. The USA-China and USA-Germany partnerships have the strongest links, with the thickness of connecting lines signifying the level of cooperation. Although research collaboration is disseminated globally, this network structure indicates that it is dominated by collaborations between large research-producing nations, especially the USA and China, and is marked by significant regional links. This bipolar concentration of research output in the USA and China, while demonstrating the field’s dynamism in these regions, simultaneously raises critical questions about the global diversity of TEVL research. It suggests that the dominant narratives, research questions, and technological solutions in the field may be heavily shaped by the educational ecosystems of these two superpowers, a point that warrants deeper critical examination.

Figure 9.

International co-authorship network.

The USA (1,683 documents), China (1,179 documents), UK (379 documents), Australia (285 documents), and Spain (243 documents) held the top five spots among the top 10 nations that produced TEVL research output throughout the study period (Table 3). The significant contributions from Asian-Pacific nations, such as Japan (144 documents), Australia (285 documents), and other Asian countries, point to the increasing significance of TEVL research in areas where English is the primary language of instruction. While emerging economies like Turkey (221 documents) and Iran (185 documents) show growing research capacity in TEVL, European representation is noteworthy through nations like the UK (379 documents), Spain (243 documents), and Germany (168 documents). This reflects the global nature of this field and its relevance to a variety of educational contexts.

Table 3.

Top 30 Countries/Regions in TEVL Research Production.

Rank	Country/Region	Frequency	Rank	Country/Region	Frequency
1	USA	1,683	16	South Korea	100
2	China	1,179	17	Belgium	78
3	UK	379	18	India	74
4	Australia	285	19	Israel	71
5	Spain	243	20	Finland	67
6	Canada	234	21	Ukraine	67
7	Turkey	221	22	France	66
8	Iran	185	23	Sweden	66
9	Germany	168	24	Brazil	57
10	Japan	144	25	Thailand	57

Keywords and Research Trends

Keyword Co-occurrence Network

Three separate thematic clusters generated by VOSviewer in research on TEVL are shown in Figure 10. With phrases like “education,”“technology,”“students,” and “teachers,” the red cluster focuses on pedagogical features and emphasizes stakeholder viewpoints and educational implementation. Terms like “vocabulary,”“acquisition,”“English,” and “second-language” predominate in the green cluster, which focuses on language acquisition processes and highlights the theoretical underpinnings of vocabulary development. “Meta-analysis,”“comprehension,”“writing,” and “skills,” among other methodological and outcome-related elements, are represented by the blue cluster, which also denotes research methodologies and learning objectives. The word “vocabulary” connects all three clusters and serves as a bridging idea, showing up as a major node with strong betweenness centrality. The density and connectivity of the network point to a highly integrated field of study where linguistic, educational, and technical elements are intimately related. The size of the nodes, especially for phrases like “vocabulary,”“education,” and “students,” reflects how frequently they appear in the literature and how important they are.

Figure 10.

Keyword co-occurrence network.

Research Trends

Eleven primary research streams are depicted in the thematic evolution map by CiteSpace (Figure 11), where diverse research clusters are represented by different colors and the width of lines indicates the strength of thematic links. Several noteworthy tendencies are revealed by the investigation. Research first concentrated on basic elements such as vocabulary acquisition (#2) and literacy (#0; 2015–2018). Following that, the area shifted toward more technology-specific applications, with game-based learning (#3) and mobile learning (#1) being major themes in 2019 to 2021. More focus has been placed on advanced technical applications in the last several years (2022–2024), especially augmented reality (#6), machine learning (#7), and artificial intelligence (#8). This development is consistent with the publication trends of eminent writers like Hwang GJ and Zou D., who have made substantial contributions to the study of gamified and mobile-assisted vocabulary learning.

Figure 11.

Thematic evolution map.

Additionally, the visualization shows a move away from conventional computer-assisted vocabulary training and toward more advanced methods. Social media (#9) and gamification (#5) have emerged as separate clusters, indicating an increasing focus on social and interactive learning settings. According to the research interests of leading experts in the field, this tendency is consistent with the growing use of digital tools in language training. Furthermore, the fact that reading comprehension (#4) has remained relevant over time suggests that, despite advancements in technology, it remains a fundamental component of vocabulary acquisition. A growing understanding of the necessity of acquiring comprehensive digital literacy abilities in addition to vocabulary acquisition is indicated by the recent rise of media literacies (#10).

Discussion

This bibliometric analysis provides a structural map of the TEVL field, revealing not only its thematic evolution but also the underlying tensions that shape its trajectory. This section interprets two critical findings that represent the core contribution of this study: the theory-practice paradox and the innovation-inclusion gap.

The Theory-Practice Paradox: A Field Guided by an Invisible Hand

The second research question focused on the field’s intellectual structure and evolution. The analysis reveals a clear trajectory from early computer-assisted tools to a frontier of interactive and intelligent technologies like AI, AR, and social media (Figure 11). This evolution is not merely a technological trend; it reflects a deep, though often implicit, alignment with the principles of Vygotsky’s sociocultural theory.

Vygotsky’s (1978) theory posits that cognitive development originates from social interaction within the Zone of Proximal Development (ZPD), facilitated by a More Knowledgeable Other (MKO). The thematic map provides large-scale, data-driven evidence for this theoretical shift. The emergence of “social media” (#9) and “gamification” (#5) highlights a move toward digital ZPDs where learners collaborate with peers (acting as MKOs). The recent surge in “artificial intelligence” (#8) can be interpreted as the development of scalable, personalized MKOs that provide tailored scaffolding. The co-citation analysis (Figure 5), which places Vygotsky (1978) at the theoretical heart of the network, further confirms this connection.

However, this alignment reveals a profound paradox. While the field’s evolution is implicitly guided by a sociocultural “invisible hand,” the analysis, confirming findings from previous qualitative reviews (Yang et al., 2021), shows that explicit theoretical engagement in individual studies is weak. I argue that this theory-practice paradox is sustained by the field’s dominant methodological habits. The prevalent use of short-term, small-sample intervention designs is pragmatically convenient but inherently ill-suited to capture the longitudinal, social, and cultural dynamics central to Vygotskian theory. Consequently, the field’s methodological preferences may be inadvertently stunting its theoretical deepening, causing many studies to remain at the surface level of technological application without exploring the underlying social learning mechanisms.

The Innovation-Inclusion Gap: Is TEVL Reinforcing Global Inequities?

The first research question examined the key characteristics of TEVL research, including its contributors. The results uncover a second critical tension: a widening innovation-inclusion gap. The research frontier is clearly pushing toward sophisticated, resource-intensive technologies like AI and AR. However, the geographic and co-authorship analysis (Figure 9, Table 3) shows this innovation is overwhelmingly concentrated in two well-resourced hubs: the United States and China, and focused predominantly on English as the target language.

This bipolar concentration, while demonstrating dynamism, raises critical questions about the global diversity and equity of TEVL research. The dominant narratives, research questions, and technological solutions are likely shaped by the educational ecosystems of these two superpowers. This creates an innovation-inclusion gap, where the most advanced pedagogical solutions may be designed by and for a privileged subset of the world’s learners.

The rise of research from countries like Turkey and Iran signals a positive trend toward localization. However, the stark absence of research from the Global South (e.g., Africa, South America) in the dataset is a sobering reminder. The field’s technological progress, if unchecked, may inadvertently reinforce rather than dismantle existing educational inequities. This finding, made possible by the large-scale bibliometric approach, presents an ethical and scholarly imperative for the TEVL community to address.

Synthesizing the Field’s Trajectory and Methodological Profile

Viewed through the lenses of these two tensions, other findings gain deeper meaning. The prominence of review articles and meta-analyses among the most cited documents (Figure 4; Golonka et al., 2014; Takacs et al., 2015) suggests a field actively engaged in self-reflection. However, it also reflects a response to the fragmentation caused by the theory-practice paradox, as scholars attempt to synthesize findings from atheoretical studies. Similarly, the concentration of research in high-impact journals like Computers & Education and Computer Assisted Language Learning (Figure 6) indicates a maturing field. Critically, this concentration within the WoS database is itself a key dimension of the innovation-inclusion gap. These high-impact, primarily Anglophone venues represent the epicenter of influential scholarly discourse. The dominance of certain research contexts within them not only reflects existing global inequities but also perpetuates them by setting the research agenda and defining impact, thus marginalizing valuable work from less-resourced contexts which often remains outside this high-visibility ecosystem.

Future Directions

Based on the trends and gaps identified in this bibliometric analysis, future TEVL research should deepen and expand in the following areas to advance the field toward more personalized, interactive, and theoretically-driven approaches.

First, resolve the theory-practice paradox. To bridge this gap, future research must prioritize longitudinal and ecologically valid designs. Instead of asking if an AI tutor works in a one-hour session, studies should investigate how it mediates learning within a classroom ecosystem over a semester. This requires embracing qualitative and mixed-methods approaches to capture the rich social dynamics that the co-citation analysis shows are theoretically central to the field.

Second, close the innovation-inclusion gap. Addressing this is an ethical and scholarly imperative. Future research must consciously expand its scope to Less Commonly Taught Languages (LCTLs) and low-resource contexts. This is not just about geographic diversity; it’s about asking new questions. How can TEVL be designed for offline use? How can technology support vocabulary learning in multilingual, under-resourced classrooms? Answering these questions is crucial for the field’s global relevance.

Third, integrate technologies with pedagogical purpose. The analysis shows clusters for mobile learning, gamification, and AR. The next step is to move from studying these in isolation to integrating them with clear pedagogical goals derived from theory. For example, future projects could explore how AR (providing authentic context) can be combined with gamified social tasks (motivating peer interaction as MKOs) on mobile devices (enabling ubiquitous learning), explicitly testing a Vygotskian framework.

Conclusion

This study has mapped the knowledge landscape of Technology-Enhanced Vocabulary Learning (TEVL) from 2014 to 2024 through a bibliometric analysis of 3,307 documents. The core contribution of this research is that it moves beyond traditional effectiveness assessments (meta-analyses) and theoretical collations (systematic reviews) to provide a macro-level, structural perspective on how the field is organized. The findings reveal that the TEVL field has undergone significant growth and evolution, progressing from traditional computer-assisted tools to a complex ecosystem characterized by mobile, gamified, social, and intelligent technologies. This thematic evolution aligns conceptually with Vygotsky’s sociocultural theory of learning, even though the application of theoretical frameworks in research practice remains underdeveloped. Furthermore, the study uncovers a landscape of both concentration and expansion in global research patterns, alongside systematic limitations in existing research regarding methodology (e.g., short-term, small-sample studies) and scope (e.g., an overemphasis on English). By interpreting these findings within a theoretical framework and critically reflecting on its own methodological limitations, this study not only provides scholars and practitioners with a comprehensive “topographical map” of the TEVL field but also charts a course for future research: one that calls for stronger theoretical grounding, more diverse methodologies, broader linguistic and cultural contexts, and a deeper commitment to bridging the gap between research and practice.

This study’s findings should be interpreted in light of several methodological limitations that also point toward directions for future research. The exclusive reliance on the Web of Science database, while justified for its data quality, introduces a known coverage bias. This decision inevitably underrepresents literature from other significant databases like Scopus, as well as social sciences, humanities, non-journal literature (e.g., books, conference proceedings), and particularly non-Anglophone scholarship, which is a key constraint for a global field like TEVL. Furthermore, the results are shaped by the specific algorithms of the analytical tools; VOSviewer excels at intuitive network mapping but is weaker in time-series analysis, whereas CiteSpace’s strength in identifying emerging trends may overemphasize new topics over foundational research. Finally, the analysis is affected by a natural citation lag, meaning the impact of recent research from 2023 to 2024 is likely underestimated due to incomplete publication and citation data.

Footnotes

The authors would like to thank the anonymous reviewers for their insightful and valuable feedback.

ORCID iD

Dan Feng

Funding

The authors disclosed receipt of the following financial support for the research,authorship,and/or publication of this article: This work was supported by the Hubei Provincial Department of Education Teaching Reform Research Program for Undergraduate Universities under Grant No. 2023410.

Declaration of Conflicting Interests

The author declared no potential conflicts of interest with respect to the research,authorship,and/or publication of this article.

Data Availability Statement

The data that support the findings of this study are available upon reasonable request from the correspondence author,subject to ethical or legal restrictions.

References

Aria

Cuccurullo

(2017). Bibliometrix: An R-tool for comprehensive science mapping analysis. Journal of Informetrics, 11(4), 959–975. https://doi.org/10.1016/j.joi.2017.08.007

Burston

(2015). Twenty years of MALL project implementation: A meta-analysis of learning outcomes. ReCALL, 27(1), 4–20. https://doi.org/10.1017/S0958344014000159

Chen

(2006). CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. Journal of the American Society for Information Science and Technology, 57(3), 359–377. https://doi.org/10.1002/asi.20317

Chen

Ibekwe-SanJuan

Hou

(2010). The structure and dynamics of cocitation clusters: A multiple-perspective cocitation analysis. Journal of the American Society for Information Science and Technology, 61(7), 1386–1409. https://doi.org/10.1002/asi.21309

Golonka

E. M.

Bowles

A. R.

Frank

V. M.

Richardson

D. L.

Freynik

(2014). Technologies for foreign language learning: A review of technology types and their effectiveness. Computer Assisted Language Learning, 27(1), 70–105. https://doi.org/10.1080/09588221.2012.700315

Hao

Wang

Ardasheva

(2021). Technology-assisted vocabulary learning for EFL learners: A meta-analysis. Journal of Research on Educational Effectiveness, 14(3), 645–667. https://doi.org/10.1080/19345747.2021.1917028

Harger

(2020). The impact of the use of mobile computing on vocabulary learning in the language classroom [Graduate Research Papers. 1385]. https://scholarworks.uni.edu/grp/1385

Hsu

T. C.

(2017). Learning English with augmented reality: Do learning styles matter? Computers & Education, 106, 137–149. https://doi.org/10.1016/j.compedu.2016.12.007

Hung

H. T.

Yang

J. C.

Hwang

G. J.

Chu

H. C.

Wang

C. C.

(2018). A scoping review of research on digital game-based language learning. Computers & Education, 126, 89–104. https://doi.org/10.1016/j.compedu.2018.07.001

10.

Hwang

G. J.

Q. K.

(2019). Trends in the research design and application of mobile language learning: A review of 2007-2016 publications in selected SSCI journals. Interactive Learning Environments, 27(4), 567–581. https://doi.org/10.1080/10494820.2018.1486861

11.

Jia

Sun

Looi

C. K.

(2022). Developing an AI-based learning system for L2 learners’ authentic and ubiquitous learning in English language. Sustainability, 14(23), Article 15527. https://www.mdpi.com/2071-1050/14/23/15527

12.

Kukulska-Hulme

Shield

(2008). An overview of mobile assisted language learning: From content delivery to supported collaboration and interaction. ReCALL, 20(3), 271–289. https://doi.org/10.1017/S0958344008000335

13.

Laufer

(2009). Second language vocabulary acquisition from language input and from form-focused activities. Language Teaching, 42(3), 341–354. https://doi.org/10.1017/S0261444809005771

14.

Lin

J. J.

Lin

(2019). Mobile-assisted ESL/EFL vocabulary learning: A systematic review and meta-analysis. Computer Assisted Language Learning, 32(8), 878–919. https://doi.org/10.1080/09588221.2018.1541359

15.

Liu

T. Y.

Chu

Y. L.

(2010). Using ubiquitous games in an English listening and speaking course: Impact on learning outcomes and motivation. Computers & Education, 55(2), 630–643. https://doi.org/10.1016/j.compedu.2010.02.023

16.

McCain

K. W.

(1990). Mapping authors in intellectual space: A technical overview. Journal of the American Society for Information Science, 41(6), 433–443. https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<433::AID-ASI11>3.0.CO;2-Q

17.

Nation

(2017). How vocabulary is learned. Indonesian JELT: Indonesian Journal of English Language Teaching, 12(1), 1–14. https://doi.org/10.25170/ijelt.v12i1.1458

18.

Reinhardt

(2018). Social media in second and foreign language teaching and learning: Blogs, wikis, and social networking. Language Teaching, 52, 1–39. https://doi.org/10.1017/S0261444818000356

19.

Sandberg

Maris

Hoogendoorn

(2014). The added value of a gaming context and intelligent adaptation for a mobile learning application for vocabulary learning. Computers & Education, 76, 119–130. https://doi.org/10.1016/j.compedu.2014.03.006

20.

Schmidt

R. W.

(1990). The role of consciousness in second language learning. Applied Linguistics, 11(2), 129–158.

21.

Schmitt

(2008). Instructed second language vocabulary learning. Language Teaching Research, 12(3), 329–363. https://doi.org/10.1177/1362168808089921

22.

Small

(1973). Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science, 24(4), 265–269. https://EconPapers.repec.org/RePEc:bla:jamest:v:24:y:1973:i:4:p:265–269

23.

Sweller

(1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257–285.

24.

Sylvén

L. K.

Sundqvist

(2012). Gaming as extramural English L2 learning and L2 proficiency among young learners. ReCALL, 24(3), 302–321. https://doi.org/10.1017/S095834401200016X

25.

Takacs

Z. K.

Swart

E. K.

Bus

A. G.

(2015). Benefits and pitfalls of multimedia and interactive features in technology-enhanced storybooks: A meta-analysis. Review of Educational Research, 85(4), 698–739. https://doi.org/10.3102/0034654314566989

26.

van Eck

N. J.

Waltman

(2010). Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics, 84(2), 523–538. https://doi.org/10.1007/s11192-009-0146-3

27.

van Eck

N. J.

Waltman

(2014). Visualizing bibliometric networks. In Ding

Rousseau

Wolfram

(Eds.), Measuring scholarly impact: Methods and practice (pp. 285–320). Springer International Publishing. https://doi.org/10.1007/978-3-319-10377-8_13

28.

Vnucko

Klimova

(2023). Exploring the potential of digital game-based vocabulary learning: A systematic review. Systems, 11(2), Article 57. https://www.mdpi.com/2079-8954/11/2/57

29.

Vygotsky

L. S.

(1978). Mind in society: The development of higher psychological processes (Vol. 86). Harvard University Press.

30.

White

H. D.

McCain

K. W.

(1998). Visualizing a discipline: An author co-citation analysis of information science, 1972–1995. Journal of the American Society for Information Science, 49(4), 327–355.

31.

(2015). Designing a smartphone app to teach English (L2) vocabulary. Computers & Education, 85, 170–179. https://doi.org/10.1016/j.compedu.2015.02.013

32.

Yang

Kuo

L. J.

Eslami

Z. R.

Moody

S. M.

(2021). Theoretical trends of research on technology and L2 vocabulary learning: A systematic review. Journal of Computers in Education, 8, 465–483. https://doi.org/10.1007/s40692-021-00187-8

33.

Trainin

(2022). A meta-analysis examining technology-assisted L2 vocabulary learning. ReCALL, 34(2), 235–252. https://doi.org/10.1017/S0958344021000239

34.

Zhang

Koda

(2018). Vocabulary knowledge and morphological awareness in Chinese as a heritage language (CHL) reading comprehension ability. Reading and Writing, 31(1), 53–74. https://doi.org/10.1007/s11145-017-9773-x

35.

Zhou

Jin

Song

Huang

Lin

C. H.

(2024). The design of technology-enhanced vocabulary learning: A systematic review. Education and Information Technologies, 29(12), 14875–14897. https://doi.org/10.1007/s10639-023-12423-y

36.

Zou

Huang

Xie

(2021). Digital game-based vocabulary learning: Where are we and where are we going? Computer Assisted Language Learning, 34(5–6), 751–777. https://doi.org/10.1080/09588221.2019.1640745