1119 | Studies in English Language and Education, 12. , 1119-1132, 2025 Key Predictors of Proficiency in L2 Writing: Syntactic. Lexical Complexity, and Accuracy P-ISSN 2355-2794 E-ISSN 2461-0275 Shinjae Park * College of General Education. Kookmin University. Seoul 02707. KOREA Abstract While many studies have examined individual aspects of complexity, accuracy, and fluency (CAF) as indicators of language performance, only few have explored how syntactic complexity, lexical complexity, and accuracy function together to measure writing proficiency. This study aims to fill that gap by identifying reliable methods to measure 14 indices of syntactic complexity, five of lexical complexity, and nine of It additionally investigates how these indices contribute to holistic writing It further examines which specific indices best predict writing proficiency and determine writing quality. A total of 138 essays by Korean learners of English were analyzed using the L2 Syntactic Complexity Analyzer and Text Inspector. Multiple regression analyses were conducted to interpret the data. The results show that proficiency is best predicted by the number of coordinate phrases per clause and dependent clauses per T-unit . yntactic complexit. , sophistication . exical complexit. , and article usage . Lexical sophistication emerged as the strongest predictor of proficiency. The paper also discusses the Common European Framework of Reference for Languages (CEFR) rubrics and offers insights into selecting and categorizing writing proficiency measures. Implications highlight the need to prioritize lexical sophistication and syntactic complexity in L2 writing assessment and pedagogy. Keywords: Accuracy, assessment, lexical complexity, sophistication, syntactic INTRODUCTION The importance of Complexity. Accuracy and Fluency (CAF) in second language (L. speaking and writing has elicited interest from L2 linguists during the last decade . Barrot & Agdeppa, 2021. Khushik & Huhta, 2020. Tabari et al. , 2. From a theoretical perspective, * Corresponding author, email: muhando@kookmin. Citation in APA style: Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and Studies in English Language and Education, 12. , 1119-1132. Received November 4, 2024. Revised January 20, 2025. Accepted August 26, 2025. Published Online September 30, https://doi. org/10. 24815/siele. Copyright A 2025 by Authors, published by Studies in English Language and Education. This is an open-access article Creative Commons Attribution International License . ttps://creativecommons. org/licenses/by/4. Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and accuracy . 0 CAF measures have been considered key constructs in research on L2 speaking and writing and have been adopted to find valid and reliable indices of L2 learnersAo language development and global proficiency (Skehan, 2. From a practical standpoint, such measures can be used as frameworks to better understand L2 performance in the context of English as a Foreign Language (EFL) (Kuiken et al. , 2. and implement appropriate pedagogical interventions. Under a CAF framework, therefore, learners are expected to produce more complex, accurate and fluent spoken and written output. If this expectation challenges learners, calibrated pedagogical interventions can be initiated. Complexity has been examined extensively as an index of language performance, writing quality and writing proficiency in L2 (Barrot & Agdeppa, 2021. Park, 2022. For example, some studies have distinguished writing quality based on complexity measures and revealed their relationship with proficiency (Casal & Lee, 2019. Martynez, 2018. Park, 2022. Although these studies have dealt with one or more CAF measures as indices of proficiency, many have focused primarily on syntactic complexity measures. The insights that they derived point to the need for a more comprehensive scrutiny of the three dimensions of L2 writing performance in relation to language proficiency. Accuracy refers to the degree of deviation from a certain norm, with such deviations generally characterised as errors (Michel, 2017, pp. However, defining the scope of errors as a measure of accuracy remains a challenging issue because standards can vary among researchers or evaluators. This problem raises the question of whether to set the linguistic level demonstrated by an ideal native speaker of a target language as the norm or to expand criteria to include non-native usage patterns that are acceptable in certain social contexts or regional communities (Housen & Kuiken, 2009, p. As can be seen, measuring accuracy using valid and reliable methods is a crucial requirement. In contrast to complexity and accuracy, which may pertain to oral and written L2 performance, fluency is first and foremost a measure of spoken language (Michel, 2. Tavakoli and Skehan . formulated a regularly cited speech-oriented definition, which indicates fluency as consisting of three subdimensions: . speed or rate . , the number of words uttered per minut. , . silence or breakdown . , the number, location and duration of . , and . , false starts, repetitions and self-correction. A limited number of studies have suggested the potential for extending this work into the writing domain. These studies employed keystroke logging software to record online writing features, such as the number of characters typed between pauses and the ratio of characters produced during writing to that in the final text (Michel, 2. While fluency is a multifaceted construct with several subcomponents that make it a useful metric for assessing L2 production quality, it remains a challenging dimension to measure and conceptualize in L2 writing. Skehan . argued that one of the major omissions in performance measures is that of lexis, indicating the need for CAF competencies to be supplemented by the characteristics of lexical use (Park, 2023. This is important because empirical evidence suggests that the latter is a separate aspect of overall performance, and it and shows that lexical access and retrieval figure prominently in all models of language production (Park, 2023. Despite these requirements, however, few studies have examined integrated measures as indices of writing proficiency. This gap has limited the understanding of L2 writing proficiency in relation to necessary integrated measures, such as those revolving around CAF and lexis. To fill this gap, the three dimensions useful for assessing L2 writingAisyntactic complexity, lexical complexity, and accuracyAiwere thoroughly examined. Specifically, the three dimensions and the corresponding measures suitable for this analysis were selected, calculated using reliable methods, and subjected to appropriate statistical analysis. Additionally, the relationship between these measures and writing proficiency, as categorized by the Common European Framework of Reference for Languages (CEFR) standards, was analysed. This examination was guided by the following research questions: What variables of syntactic complexity affect L2 proficiency levels? What variables of lexical complexity affect L2 proficiency levels? What variables of accuracy affect L2 proficiency levels? 1121 | Studies in English Language and Education, 12. , 1119-1132, 2025 Which of the variables identified from the analyses have the greatest impact on L2 proficiency LITERATURE REVIEW Researchers in applied linguistics and instructed L2 acquisition have long been interested in measuring L2 performance. For example. Brumfit . highlighted accuracy and fluency as two important aspects of language use, and Skehan . later added complexity, introducing the CAF triad as the three fundamental dimensions characterising L2 performance. These three dimensions have since proven to be essential and useful measures for assessing various aspects of L2 proficiency (Skehan, 2. The three dimensions were defined provisionally by Ellis . as follows. Complexity refers to the extent to which the language produced by learners is elaborate and varied. Accuracy is defined as the degree to which the language produced by a learner conforms to target language Fluency pertains to the extent to which the language produced by a learner shows evidence of pausing, hesitation or reformulation. Meanwhile, fluency will only be briefly mentioned in the current section, as previously stated, this dimension is primarily a measure of spoken language and is challenging to conceptualise and measure in L2 writing (Michel, 2. Additionally, writing fluency is less reasonable to measure than oral fluency, as writing inherently allows for planning, monitoring and editing (Michel, 2. Given the importance of the three dimensions described above, the literature on syntactic complexity, lexical complexity, and accuracy in L2 writing was explored. These are key areas that have been extensively studied in relation to L2 proficiency. Accordingly, this study examines the extent to which these areas account for differences in L2 writing proficiency. Syntactic Complexity Syntactic complexity has been a focal point in L2 research, as it provides insights into learnersAo ability to use a wide range of grammatical forms and structures, reflecting their language sophistication (Barrot & Agdeppa, 2021. Skehan, 2. Such complexity was elaborated by Lu . as encompassing 14 indices, including the mean length of a sentence (MLS), the mean length of a T-unit (MLT), the mean length of a clause (MLC), the number of clauses per T-unit (CT) and the complex T-unit ratio (CTT). Martynez . compared the syntactic complexity of argumentative essays written by lower-intermediate and intermediate secondary education students in Spain. She used measures such as MLS, the simple sentence ratio, the compound sentence ratio, the complex sentence ratio, the compound-complex sentence ratio, the number of clauses per sentence (CS), the number of dependent clauses per clause (DCC) and the number of coordinate phrases per clause (CPC). Her findings revealed that intermediate students score higher on most measures, although the limited scope of the study may not have captured the full range of syntactic complexity. Kuiken and Vedder . investigated the syntactic complexity of argumentative texts using both overall ratio measures and fine-grained measures of subordination, coordination and phrasal complexity. Their study, which involved learners with Dutch. Italian and Spanish as their first languages (L. , indicated that higher-proficiency learners produce more complex syntactic structures, albeit this trend varies across students of different L1 backgrounds. Similar tendencies were found by Khushik and Huhta . , who examined the English argumentative essays produced by Pakistani and Finnish teenagers and found that proficiency levels influence subordination, phrasal density and the length of production units. In contrast. Park . identified complex nominals per T-unit (CNT) as the best predictor of proficiency among Korean L2 learners. Despite variations across studies, there is general agreement that more proficient L2 learners tend to produce longer and more complex syntactic structures (Casal & Lee, 2019. Martynez, 2. Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and accuracy . 2 Lexical Complexity Lexical complexity, which encompasses diversity, sophistication and density, has received less attention than syntactic complexity despite it being another critical dimension of language use (Read, 2. This assertion is supported by Skehan . , who emphasised the need to incorporate lexical measures into L2 research, given their importance in understanding task Lexical diversity is typically measured using D, a measure that calculates the extent to which a speaker avoids recycling the same set of words (Malvern & Richards, 2. A low D suggests considerable reliance on a limited vocabulary set, which implies minimal lexical diversity (Skehan, 2. Lexical sophistication refers to the use of less frequent, more advanced words, as measured by tools such as Laufer and NationAos . Lexical Frequency Profile. This tool is used to assess the frequency with which words are used in a given text, allowing researchers to gauge the extent to which a writer relies on more frequent versus less frequent terms. Finally, lexical density is employed to measure the ratio of lexical . words to the total number of words in a text (Ure, 1. Lexical words, such as nouns, adjectives, verbs and adverbs, carry the core meanings of a text, while function words . articles, preposition. serve grammatical purposes without adding significant meaning (Halliday, 1. Notwithstanding the importance of lexical complexity in L2 writing, research on this dimension been relatively sparse. Incorporating lexical measures is essential for a comprehensive understanding of L2 proficiency, as it offers insights into the richness and depth of a learnerAos vocabulary. Accuracy Accuracy in L2 writing refers to the extent to which a learnerAos language use aligns with the norms of a target language, encompassing both form and structure (Michel, 2017. Skehan. It was defined by Ellis . as a key dimension of language performance, highlighting its importance in evaluating L2 proficiency. Accuracy has been measured using various developed indices, ranging from holistic measures, such as error-free clauses and the number of errors per 100 words, to specific indices that target particular aspects of language use, such as errors in nounAeadjectiveAegender agreement (Michel, 2. The selection and categorisation of errors allow for comparisons across studies but require careful interpretation and categorisation based on a studyAos scope (Park, 2023. To address these challenges, the Center for English Corpus Linguistics at University Catholique de Louvain (Dagneaux et al. , 1. proposed an alternative error categorisation scheme, which comprises up to four levels of classification depending on error category. This scheme was adopted by S. Yoon et al. through the identification of 40 distinct error categories, covering aspects such as form, grammar, lexico-grammar, lexis, word use, register and style. Although much of the existing research has focused on syntactic complexity, accuracy remains a crucial component of L2 proficiency, as it reflects a learnerAos ability to produce language that conforms to target language norms. However, more explorations are needed to fully understand how accuracy interacts with other dimensions of L2 performance. In summary, syntactic and lexical complexity and accuracy are all critical for understanding L2 proficiency. While there has been substantial research on syntactic complexity, less attention has been given to lexical complexity and accuracy in relation to predicting proficiency levels. This review highlighted the necessity of more comprehensive investigations that incorporate all three dimensions to provide a more complete picture of L2 writing proficiency. This gap was addressed in the current study through an examination of the effects of syntactic complexity, lexical complexity and accuracy on L2 proficiency levels. METHOD This section explains how the study was conducted, starting with a description of the corpus data, including details about the participants and their writing tasks. It also introduces the tools and methods used to measure key linguistic features, such as syntactic complexity, lexical 1123 | Studies in English Language and Education, 12. , 1119-1132, 2025 richness, and accuracy. Together, these elements form the foundation for analyzing the factors that influence L2 writing proficiency. Samples and Data Collection The data analysed in this study were derived from 138 essays included in the Multilanguage Learner Corpus (MULC) (S. Yoon et al. , 2. The participants who produced part of the corpus were recruited through an open call issued at a university in South Korea between 2018 The majors pursued by the participants varied, and they had an average age of 20. Among them, 47. 1% were male, and 52. 9% were female (Table . They were asked to choose one of four topics on which to write an essay, which was later electronically stored in the MULC database. The topics of the essays are listed below: Should everyone get married? Is it essential to wear uniforms in middle and high schools? Should elementary, middle and high school students be allowed to carry mobile phones in Should any college student join a club? Each of the participantAos writing proficiency was assessed by a native English speaker who was specifically trained for this purpose, with the rubrics of the CEFR used as bases (Appendi. The evaluator holds a PhD in linguistics and is currently a professor at a university in Korea. The CEFR has been recognised as a standard for L2 language progression throughout Europe since 2001 and has gradually expanded in use worldwide (Glover, 2. It is divided into six levels of proficiency: A1. A2. B1. B2. C1 and C2 . ith A1 being the lowest and C2 being the highes. However, since the data used in this study were generated mostly by first- and second-year university students who may not have been accustomed to writing in English, none of the writings reached the C1 or C2 level. Therefore, their writing levels were categorised into four groups: A1. A2. B1 and B2 (Table . More specifically, the data were grouped as follows: A1 . = 12, or 7%). A2 . = 77, 55. 8%). B1 . = 40, 29. 0%) and B2 . = 9, 6. 5%). Table 1. Description of participants. Gender Male Female Proficiency Table 2 shows the descriptive statistics of the data. The total number of tokens was 32,128, with an average of 232. 81 words per essay (SD = 101. across the 138 pieces. The total number of sentences was 2,320, with an average of 16. 81 sentences per essay (SD = 6. Some of the essays had as many as 597 words and 41 sentences. Table 2. Description of corpus. Word Sentence Tokens 32,128 2,320 Mean Min. Max. Instruments and Data Analysis This section describes the tools and methods used to analyse the linguistic features of the corpus data, focusing on syntactic complexity, lexical complexity, and accuracy. Each dimension was measured using established computational tools, complemented by manual verification to ensure accuracy. The following subsections detail the specific measures and procedures employed for each dimension. Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and accuracy . 4 1 Syntactic complexity The L2 syntactic complexity analyser (L2SCA) was used to measure the syntactic complexity indices adopted in this study. More details on each index can be found in LuAos . The five major categories and the number of indices that they subsumed were as follows: the length of production units . = 3 indice. , overall sentence complexity . = 1 inde. , the number of subordination clauses . = 4 indice. , the number of coordination clauses . = 3 indice. and phrasal sophistication . = 3 indice. The specific linguistic features and indices gauged using the L2SCA are summarised in Table 3, which also presents the descriptive statistics of the categories. A comparative analysis based solely on the mean values of each measure is not meaningful, and only relative comparisons through the analysis of their relationship with proficiency are important. This issue is further addressed in the results section. Table 3. Syntactic complexity measures (Lu, 2. Length of production unit Sentence complexity Subordination Coordination Particular structures Categories Mean length of clause (MLC: words/claus. Mean length of sentence (MLS: words/sentenc. Mean length of T-unit (MLT: words/T-uni. Clauses per sentence (CS: clauses/sentenc. Clauses per T-unit (CT: clauses/T-uni. Complex T-unit per T-unit (CTT: complex T-units/T-uni. Dependent clauses per clause (DCC: dependent clauses/claus. Dependent clauses per T-unit (DCT: dependent clauses/T-uni. Coordinate phrases per clause (CPC: coordinate phrases/claus. Coordinate phrases per T-unit (CPT: coordinate phrases/T-uni. T-units per sentence (TS: T-units/sentenc. Complex nominals per clause (CNC: complex nominals/claus. Complex nominals per T-unit (CNT: complex nominals/T-uni. Verb phrases per T-unit (VPT: verb phrases/T-uni. M(SD) 863. 2 Lexical Complexity In L2 writing research, lexical complexity is a multidimensional construct consisting of various subconstructs that can be measured using specific metrics. A popular approach is to conceptualise lexical complexity as encompassing three main subconstructs: lexical sophistication/rarity, lexical diversity/variability and lexical density (Bui, 2021. Tabari et al. This multidimensional framework was adopted in the present work to analyse lexical complexity (Table . Lexical sophistication was estimated using the Academic Word List and Beyond-2000 scores based on the British National Corpus (BNC) (Park, 2023. To enquire into lexical sophistication, researchers employ frequency-based metrics to compare the lexical information in L2 writing samples against frequency data extracted from language corpora (Johnson, 2. The use of low-frequency lexis is considered indicative of L2 writing development (Laufer & Nation. Therefore, sophisticated lexical usage reflects a low overall lexical frequency (Crossley & McNamara, 2012. Yoon & Polio, 2. Lexical diversity refers to the variety of vocabulary used in a textAithe more varied the vocabulary, the higher the lexical diversity. For a text to be highly lexically diverse, a speaker or writer must use many different words with minimal repetition (Johansson, 2. Lexical diversity indicates the range of a learnerAos vocabulary as demonstrated in their language use (Lu. McCarthy & Jarvis, 2. To measure lexical diversity, this study used metrics such as Textual Lexical Diversity (MTLD) and Vocd-D (McCarthy & Jarvis, 2. These metrics were calculated using the Text Inspector, a tool prototyped by Stephen Bax and further developed by the software team at Versantus (Owen et al. , 2. 1125 | Studies in English Language and Education, 12. , 1119-1132, 2025 Lexical density is defined as the proportion of content words in a written text (Bui, 2. and reflects how densely lexical items are used within grammatical structures (Halliday & Martin. It was determined in the present research through a series of manual operations. Specifically, before an automated analysis, several processes were added. First, a cleaned text file was tagged for parts of speech (POS) using the Stanford tagger, which assigns each token in a language sample a label that indicates the POS category to which it belongs . noun, verb, adjective, adver. This tagging was necessary for the calculation of lexical density, which is crucial in distinguishing between content and function words. To reduce errors that might occur due to the automatic tagging process, a manual review by linguists, including the present author, was conducted. This approach was expected to considerably increase the accuracy of measurements compared with those performed in studies that rely solely on results obtained using a single automatic tagger. AntConc 3. 9 was used to advance the calculation of content words using the POS-tagged sample. Table 4 presents the measures, definitions and descriptive statistics of lexical complexity. The analysis of its relationship with proficiency is presented in the results Measures Diversity Density Sophistication Table 4. Lexical complexity measures. Definitions M(SD) VOCD A mathematical transformation of the standard type-token ratio (TTR), which reduces the intervening impacts of text length and indicates the degree of word repetition in a text 120. MTLD The average length of sequential word strings in a text which maintain a given TTR value 519. Verbal E. /S. Verbal word ratio 149. Noun E. /S. Noun word ratio 069. Beyond BNC . The Beyond-2000 values calculated by subtracting K1 and K2 ratios from 100% 287. 3 Accuracy In this study, the error category scheme proposed by the Center for English Corpus Linguistics at University Catholique de Louvain (Dagneaux et al. , 1. was used to divide errors into nine categories: morphology, articles, nouns, pronouns, verbs, spelling, lexis, style and word omissions and redundancies. These categories were selected because they represent the top nine in which the learners committed the most errors, as determined based on the frequency analysis of the corpus used in this study (Park, 2023. Other categories either had very few errors or were part of the same grammatical category but in different subcategories of errors. These were combined into a single category. For example, verb tense and verb voice were merged under the verb category. An excessively small number of tokens per category may lead to unreliable results. Thus. I divided the categories according to the frequency of errors in writing. For error annotation, the Universidad Autynoma de Madrid Corpus Tool 3. 0 (UAM) ttp://w. com/CorpusTool/) was used, which is designed specifically to support typical user workflows, with everything needed to perform annotation tasks included (Wang & Yu, 2022. Yoon et al. , 2. After automated annotation using the UAM, four linguists, including the present author, manually double-checked the categorisation of grammatical errors and held regular weekly meetings to reach a consensus on a common grammatical standard. Following the annotation, a native English speaker on the corpus team, who is also a qualified linguist, tagged the proficiency levels reflected in the essays according to the CEFR scheme (S. Yoon et al. , 2. The descriptions of each error category were excerpted from Dagneaux et al. and Granger et al. , as presented in Table 5. The analysis of the relationship between accuracy and proficiency is provided in the results section. This approach ensured reliable categorisation and interpretation of errors prior to statistical analysis. Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and accuracy . 6 Categories Table 5. Accuracy measures (Dagneaux et al. , 1996. Granger et al. , 2. Descriptions Inflectional errors: Misuse of grammatical morphemes . plurals, possessives, verb Morphology forms or adjective degree. Derivational errors: Incorrect affix added to an existing word Articles Issues with definite, indefinite or zero articles Misuse of the Saxon genitive Nouns Addition/omission of plural morphemes Errors with all pronoun categories: Demonstrative, interrogative, personal, possessive. Pronouns quantifying, reciprocal, reflexive, relative, indefinite Verb number: All errors of concord between a subject and its verb Verb morphology: Erroneous uses of existing verb forms . a simple past form instead of a past participle form, an infinitive instead of a past participl. Verbs Non-finite/finite verb forms Verb voice: Confusion of the passive and active voice Verb tense: Misuse of verb tense or aspect Misuse or omission of capital letters Non-standard word coinages Incorrect use of borrowings Spellings 4. Confusion between homophones Doubling of consonants/vowels Variations in American spelling Misuse/omission of hyphens or spaces in compound words Issues with errors involving the semantic . onceptual or collocationa. properties of Lexis words or phrases Long, foreign-sounding or clumsy text chunks that a native speaker would want to Style Word redundant: Unnecessary repetitions of words Word Word missing: Omission of words Word order: Problems of word order M(SD) RESULTS This study examined the extent to which measures in various domains can predict performance in L2 writing. The goal was to identify which factors among syntactic complexity, lexical sophistication, and accuracy have the most significant impact on L2 writing proficiency. To achieve this aim, multiple regression analyses were conducted, with proficiency as the dependent variable and these linguistic domains as the independent variables. Table 6 summarizes the results, showing the contributions of these measures, with certain factors reaching levels of statistical significance. Table 6. Results of multiple regression Analysis . ackward metho. Models Accuracy Lexis Complexity (Constan. Article (Constan. Sophistication (Constan. CPC <0. <0. <0. <0. <0. Adjusted DurbinAe Watson DCT Note. Criterion for removing independent variables: Probability of F Ou 0. DurbinAeWatson statistic: Ranges from 0 to 4, with values closer to 2 indicating more independent values 1127 | Studies in English Language and Education, 12. , 1119-1132, 2025 Syntactic Complexity With regard to syntactic complexity, when CPC and DCT served as independent variables, the model exhibited statistical significance . < 0. and explained 13. 5% of the variances in the L2 essays . djusted RA = 0. These results demonstrate that structural features such as coordination and subordination are predictive of proficiency. This finding suggests that the ability to construct more complex sentence structures is a meaningful indicator of learnersAo writing Lexical Complexity Regarding lexical complexity, the model with sophistication . se of advanced or less frequent word. as an independent variable was statistically significant . < 0. and explained 6% of the variances in the L2 writing pieces . djusted RA = 0. This indicates that lexical sophistication is the strongest predictor among the three domains. This result underscores the importance of advanced vocabulary uses in distinguishing higher-proficiency writers from their Accuracy In terms of accuracy, however, with article usage as the independent variable, the model was statistically non-significant . = 0. , but it accounted for 2. 7% of the variances in proficiency . djusted RA = 0. While accuracy measures, such as article usage, showed limited predictive power in this study, they remain relevant for understanding language conformity to norms, as discussed in prior research (Ellis, 2003. Michel, 2. These findings indicate that while accuracy alone explains only a small portion of proficiency differences, it still contributes useful information when considered with other dimensions. Comparative Impact To assess the impact of the most significant independent variables subsumed under the three major measures for proficiency, their standardised beta coefficients from the multiple regression analyses were compared. Article use under accuracy had a standardised beta coefficient 165, suggesting a modest effect on proficiency. Sophistication under lexis had a coefficient 562, indicating a substantial impact on proficiency, while CPC and DCT under complexity had a coefficient of 0. 327, showing a moderate effect. The largest absolute beta value was that of sophistication, suggesting that it significantly influences proficiency. Sophistication, represented by the use of Beyond-2000 words, is reflected by the occurrence of less frequently used words outside the 2000 most commonly occurring terms in the BNC. Such sophistication may lead to more complex writing, potentially impacting proficiency The relationship between lexical sophistication and proficiency is nuanced, suggesting that while sophisticated lexical use indicates advanced language skills, it also complicates writing and affects overall proficiency assessment. DISCUSSION This study explored the extent to which measures of syntactic complexity, lexical complexity, and accuracy can predict L2 writing proficiency. The multiple regression analyses generated key findings that addressed the research questions. The following subsections discuss these results in relation to each dimension and their implications for understanding L2 writing S. Park. Key predictors of proficiency in L2 writing: Syntactic, lexical complexity, and accuracy . 8 Syntactic Complexity in Relation to L2 Proficiency Levels The results revealed that among the various measures of syntactic complexity, the combination of CPC and DC was a significant predictor of L2 writing proficiency. The model incorporating these variables was statistically significant and explained 13. 5% of the variances in proficiency scores. This finding suggests that complex sentence structures, as evidenced by the presence of numerous clauses per sentence and dependent clauses, contributes positively to perceived proficiency in L2 writing. This aligns with previous research indicating that the ability to construct syntactically complex sentences is often associated with high proficiency levels (Casal & Lee, 2019. Martynez, 2. Lexical Complexity in Relation to L2 Proficiency Levels Lexical sophistication emerged as a particularly strong predictor of L2 writing proficiency, with the sophistication measure alone accounting for 31. 6% of the variances in proficiency. The use of advanced and low-frequency vocabulary was strongly associated with higher proficiency, indicating that lexical choices serve as a key marker of advanced language skills. Writers who utilise sophisticated vocabulary demonstrate not only a broader lexical repertoire but also an ability to adapt their language to contextually appropriate expressions. These findings are consistent with prior research, such as H. Yoon and Polio . , which underscores the importance of lexical richness in differentiating proficiency levels. Accuracy in Relation to L2 Proficiency Levels In contrast to syntactic and lexical complexity, accuracyAispecifically the use of articlesAi did not emerge as a statistically significant predictor of L2 proficiency Its influence was limited but close enough to the threshold to warrant consideration. These results suggests that while accuracy in article usage might play a role in proficiency, its impact is less pronounced than that of syntactic and lexical factors. They are consistent with the idea that although accuracy is important, it may not be as critical in determining overall proficiency levels as the ability to produce complex and sophisticated language (Housen & Kuiken, 2. Key Determinants of L2 Proficiency Levels Among the variables analysed, lexical sophistication stood out as the most impactful predictor of L2 proficiency. This finding not only underscores the importance of lexical choices but also highlights that among the various measures of complexity and accuracy, lexical sophistication is the key determinant of writing proficiency. The prominence of this variable suggests that in the context of L2 writing, enhancing vocabulary sophistication should be a primary focus for improving overall proficiency. It also implies that teaching strategies and assessment practices should prioritise the development of advanced vocabulary to elevate writing Syntactic complexity, as represented by CPC and DCT, had a significant impact on proficiency, though to a lesser extent, while the influence of accuracy, particularly article usage, was even less pronounced. This reinforces the idea that while grammatical complexity and accuracy are important, the richness and variety of vocabulary play a more decisive role in determining proficiency. These findings echo the observations of Crossley and McNamara . , who noted that proficient writers consistently demonstrate advanced vocabulary and diverse syntactic structures. Consequently. L2 writing instruction should focus on both grammatical complexity and the enhancement of learners' ability to use advanced vocabular. 1129 | Studies in English Language and Education, 12. , 1119-1132, 2025 CONCLUSION Overall, the findings contribute to our understanding of how different aspects of linguistic complexity and accuracy influence L2 writing proficiency. The strong predictive power of lexical sophistication suggests that teaching practices should prioritise enhancing learnersAo vocabulary, particularly by encouraging the use of more sophisticated lexical items. While syntactic complexity also plays a significant role, it is slightly less influential than lexical sophistication but still underscores the importance of developing studentsAo ability to construct complex sentence However, the relatively minor impact of accuracy measures, such as article usage, implies that while accuracy should not be neglected, greater emphasis should be placed on fostering syntactic complexity, particularly lexical sophistication, to improve overall proficiency. Moreover, this study underscores the importance of integrating multiple measuresAi syntactic complexity, lexical complexity, and accuracyAiinto the assessment of L2 writing Through the use of tools such as the L2SCA and Text Inspector, 138 essays from Korean learners of English were analysed with precision, employing multiple regression analyses to determine the most significant predictors of writing proficiency. The successful application of automated text analysers in this study highlights their potential as reliable tools for evaluating writing proficiency in both research and educational contexts. These tools can provide objective, scalable assessments that complement traditional evaluation methods. This study is limited by its focus on Korean learners of English and a specific set of linguistic features, which may not fully capture the range of factors influencing L2 proficiency across diverse contexts. Finally, the study points to several directions for future research. Expanding the range of linguistic features analysed and exploring different learner populations and writing contexts will help validate and refine the current findings. Additionally, further exploration of how automated tools can be integrated into classroom assessment practices can enhance the effectiveness and efficiency of L2 writing instruction. ACKNOWLEDGMENTS This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2022S1A5A8054. REFERENCES