ENHANCING SECOND LANGUAGE WRITING ASSESSMENT THROUGH NATURAL LANGUAGE PROCESSING: A CORPUS-BASED STUDY

Authors

  • Areeba Sajid MS Scholar (Applied Linguistics), National University of Computer and Emerging Sciences. Author
  • Bisma Amjad MS Scholar (English Language and Literature), University of Management and Technology Lahore. Author
  • Saadia Khan PhD Scholar, English (Linguistics), University of Education, Lahore. Author

DOI:

https://doi.org/10.63878/jalt1019

Keywords:

Natural Language Processing (NLP), Second Language Writing, DECOR Framework, EFCAMDAT Corpus, Automated Writing Assessment.

Abstract

This study investigates the role of Natural Language Processing (NLP) in enhancing second language (L2) writing assessment, with a specific focus on coherence and cohesion. Utilizing the DECOR (Detect, Explain, and Rewrite) framework, this research applies advanced NLP techniques to analyze and improve the logical flow of learner texts. A representative dataset is extracted from the EF-Cambridge Open Language Database (EFCAMDAT), which contains over one million texts written by English learners across CEFR levels. The corpus provides rich metadata and revision history, enabling a comprehensive analysis of L2 writing development. The study employs a hybrid methodology, combining automated coherence detection and rewriting using DECOR with corpus-based feature extraction and manual evaluation. Results are evaluated by comparing pre- and post-revision drafts, examining the effectiveness of NLP-generated feedback in improving coherence and lexical cohesion. This research not only highlights recurring coherence challenges faced by L2 learners but also demonstrates the pedagogical potential of integrating NLP tools into writing instruction. By aligning computational output with human assessment practices, the study offers insights into the evolving relationship between applied linguistics and artificial intelligence, advocating for data-driven, scalable, and learner-centered approaches to writing assessment in multilingual education contexts.

Downloads

Published

2025-07-20