APJCR_2025_6_1_25

Asia Pacific Journal of Corpus Research Vol. 6, No. 1, pp. 25-45
Abbreviation: APJCR
e-ISSN: 2733-8096
Publication date: 31 August 2025
Received: 25 May 2025 / Received in Revised Form: 5 August 2025 / Accepted: 17 August 2025
DOI: https://doi.org/10.22925/apjcr.2025.6.1.25

“I agree with this”: A multi-modal register analysis of lexical bundles in the ICNALE

Trevor Sitler (Kansai University), JAPAN; Martin Spivey (Akita University), JAPAN
Copyright 2025 APJCR This is an open access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

The purpose of this study was to examine the differences in lexical bundle use between the different registers of speaking and writing in a learner corpus. While various studies have examined multi-word items by foreign language learners, there is currently a lack of research on lexical bundle use in spoken corpora, particularly monologues. A comparative register analysis was designed to compare the lexical bundle output of L1 Japanese students of English in two corpora of argumentative written essays and spoken monologues, with two native speaker corpora being compared for reference. 4-word lexical bundles were extracted and analyzed through a Generalized Linear Mixed Model (GLMM) and categorized into two functional and structural taxonomies. Findings suggest bundle frequency decreases as proficiency level increases. Compositionally, it was found that both registers are structurally and functionally similar to each other, though the Japanese spoken monologue sub-corpus was the least varied. Furthermore, the use of prompt-specific language was found to be more influenced by register than L1 background, with both groups producing a significant amount of language that was dependent on the prompt. The register descriptions found in this study can inform pedagogical approaches to writing and monologue tasks.

Keywords

ICNALE, Learner Corpora, Lexical Bundles, Register Analysis, Sketch Engine

References

Ädel, A., & Erman, B. (2012). Recurrent word combinations in academic writing by native and non-native speakers of English: A lexical bundles approach. English for Specific Purposes, 31(2), 81–92. https://doi.org/10.1016/j.esp.2011.08.004

Anderson, F. (1993). The enigma of the college classroom: Nails that don’t stick up. In Wadden, P. (Ed.) A Handbook for Teaching English at Japanese Colleges and Universities. (pp. 101-110).  New York, NY: Oxford University Press.

Altenberg, B., & Granger, S. (2001). The grammatical and lexical patterning of MAKE in native and non-native student writing. Applied Linguistics, 22(2), 173–195. https://doi.org/10.1093/applin/22.2.173

Biber, D. (2010). Corpus-based and corpus-driven analyses of language variation and use. In Heine, B., & Narrog, H. (Eds.), The Oxford Handbook of Linguistic Analysis (1st ed., pp. 159–191). New York, NY: Oxford University Press.

Biber, D., Conrad, S., & Cortes, V. (2004). If you look at …: Lexical bundles in university teaching and textbooks. Applied Linguistics, 25(3), 371–405. https://doi.org/10.1093/applin/25.3.371

Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). The Longman Grammar of Spoken and Written English. London: Longman.

Bychkovska, T., & Lee, J. J. (2017). At the same time: Lexical bundles in L1 and L2 university student argumentative writing. Journal of English for Academic Purposes, 30, 38–52. https://doi.org/ 10.1016/j.jeap.2017.10.008

Chen, Y.-H., & Baker, P. (2010). Lexical bundles in L1 and L2 academic writing. Academic Writing, 14(2), 30–49.

Chen, Y.-H., & Baker, P. (2016). Investigating criterial discourse features across second language development: Lexical bundles in rated learner essays, CEFR B1, B2 and C1. Applied Linguistics, 37(6), 849–880. https://doi.org/10.1093/applin/amu065

Durrant, P., & Schmitt, N. (2009). To what extent do native and non-native writers make use of collocations? IRAL – International Review of Applied Linguistics in Language Teaching, 47(2). https://doi.org/10.1515/iral.2009.007

Ellis, R. (1991). Communicative competence and the Japanese learner. JALT Journal, 13, 103-127.

Friginal, E. (2018). Corpus Linguistics for English Teachers: New Tools, Online Resources, and Classroom Activities. New York, NY: Routledge. https://doi.org/10.4324/9781315649054

Gablasova, D., & Bottini, R. (2022). Spoken learner corpora for language teaching. In Jablonkai, R. R., & Csomay, E. (Eds.), The Routledge Handbook of Corpora and English Language Teaching and Learning (pp. 296–310). New York, NY: Routledge.

Gohel, D., & Skintzos, P. (2024). flextable: Functions for tabular reporting. R package version 0.9.7, https://davidgohel.github.io/flextable/, https://ardata-fr.github.io/flextable-book/.

Götz, S., & Granger, S. (2024). Learner corpus research for pedagogical purposes: An overview and some research perspectives. International Journal of Learner Corpus Research, 10(1), 1–38. https://doi.org/10.1075/ijlcr.00039.got

Granger, S., & Paquot, M. (2009). Lexical verbs in academic discourse: A corpus-driven study of learner use. In Charles, M., Hunston, S., & Pecorari, D. (Eds.), Academic Writing: At the Interface of Corpus and Discourse. London: Bloomsbury Academic. https://doi.org/10.5040/9781474211703

Hartig, F. (2024). DHARMa: Residual diagnostics for hierarchical (multi-level / mixed) regression models. R package version 0.4.7, https://github.com/florianhartig/dharma.

Hyland, K. (2008). As can be seen: Lexical bundles and disciplinary variation. English for Specific Purposes, 27(1), 4–21. https://doi.org/10.1016/j.esp.2007.06.001

Ishikawa, S. (2013). The ICNALE and sophisticated contrastive interlanguage analysis of asian learners of english. Learner Corpus Studies in Asia and the World, 1, 91–118.

Ishikawa, S. (2014). Design of the ICNALE spoken: A new database for multi-modal contrastive interlanguage analysis. Learner Corpus Studies in Asia and the World, 2, 63–76.

Ishikawa, S. (2023). The ICNALE Guide: An Introduction to a Learner Corpus Study on Asian Learners’ L2 English. New York, NY: Routledge.

Jeong, H., & Jiang, N. (2019). Representation and processing of lexical bundles: Evidence from word monitoring. System, 80, 188–198. https://doi.org/10.1016/j.system.2018.11.009

Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý, P., & Suchomel, V. (2014). The Sketch Engine: Ten years on. Lexicography, 1(1), 7–36.

Kim, S., & Kessler, M. (2022). Examining L2 english university students’ uses of lexical bundles and their relationship to writing quality. Assessing Writing, 51, 100589. https://doi.org/10.1016/ j.asw.2021.100589

Lee, K. R., & Zipagan, M. N. (2018). Korean english learners’ use of lexical bundles in speaking. The Journal of Asia TEFL, 15(2), 276–291. https://doi.org/10.18823/asiatefl.2018.15.2.2.276

McGillycuddy, M., Warton, D.I., Popovic, G., & Bolker, B.M. (2025). Parsimoniously fitting large multivariate random effects in glmmTMB. Journal of Statistical Software, 112(1), 1–19. doi:10.18637/jss.v112.i01.

Pan, F., Reppen, R., & Biber, D. (2020). Methodological issues in contrastive lexical bundle research: The influence of corpus design on bundle identification. International Journal of Corpus Linguistics, 25(2), 216–230. https://doi.org/10.1075/ijcl.19063.pan

R Core Team. (2024). R: A language and environment for statistical computing (Version 4.3.3) Computer software. R Foundation for Statistical Computing. https://www.R-project.org/

Ruan, Z. (2017). Lexical bundles in chinese undergraduate academic writing at an english medium university. RELC Journal, 48(3), 327–340. https://doi.org/10.1177/0033688216631218

Sánchez Hernández, P. (2013). Lexical bundles in three oral corpora of university students. Nordic Journal of English Studies, 12(S1), 187–209. https://doi.org/10.35360/njes.281

Sawaguchi, R. (2024). Potential of L1 and L2 corpora to identify target lexical bundles for argumentative essay writing. Asia Pacific Journal of Corpus Research, 5(1), 1–21. https://doi.org/10.22925/APJCR.2024.5.1.1

Shirato, J., & Stapleton, P. (2007). Comparing English vocabulary in a spoken learner corpus with a native speaker corpus: Pedagogical implications arising from an empirical study in Japan. Language Teaching Research, 11(4), 393–412. https://doi.org/10.1177/1362168807080960

Staples, S., Egbert, J., Biber, D., & McClair, A. (2013). Formulaic sequences and EAP writing development: Lexical bundles in the TOEFL iBT writing section. Journal of English for Academic Purposes, 12(3), 214–225. https://doi.org/10.1016/j.jeap.2013.05.002

Vo, S. (2019). Use of lexical features in non-native academic writing. Journal of Second Language Writing, 44, 1–12. https://doi.org/10.1016/j.jslw.2018.11.002

Wray, A., & Perkins, M. R. (2000). The functions of formulaic language: An integrated model. Language & Communication, 20(1), 1–28. https://doi.org/10.1016/S0271-5309(99)00015-4

Yan, H. (2019). I think we should…: Investigating lexical bundle use in the speech of english learners across proficiency levels. International Journal of Translation, Interpretation, and Applied Linguistics, 1(2), 1–16. https://doi.org/10.4018/IJTIAL.2019070105.

The Authors

Kevin Heffernan is a Professor at Kwansei Gakuin University. He has written many articles about the Japanese language on a wide range of topics, such as local identity, language contact, language change, the formation of irregular forms, vernacular Japanese, and cognitive processing. He has published two textbooks about linguistics.

The Authors’ Addresses

First and Corresponding Author
Trevor Sitler
Ph.D. Student
Kansai University
3-chōme-3-35 Yamatecho, Suita, Osaka, JAPAN
Email: sitlert@yahoo.comCo-author
Martin Spivey
Assistant Professor
Akita University
1-1 Tegatagakuenmachi, Akita, JAPAN
Email: mart.spiv@gmail.com

☞ How to submit your manuscript to APJCR.