Quantitative Analysis within Language Studies: An Analytical Views Based on the Bibliometrics Method
DOI:
https://doi.org/10.24903/sj.v9i2.1695Keywords:
analytical views, bibliometrics, language studies, quantitative analysisAbstract
Abstract
Background:
While qualitative research methods have provided insights into linguistic structures and processes, the recent decades have witnessed a growing emphasis on quantitative approaches.
Methodology:
This study utilizes a bibliometric analysis based on the PRISMA guidelines to explore research trends in “Quantitative Analysis within Language Studies” from 2014 to 2024 by analyzing the metadata of 53,575 journal articles retrieved from a reputable database, i.e., Dimensions.
Findings:
The findings show that (1) there was a steady rise in publications throughout the period; (2) the total citations increased by over 113,000, representing a nearly 100-fold growth (1,141 citations in 2014 compared to 41,203 citations in 2024); (3) according to the ANZSRC 2020 classification, the field of Language, Communication and Culture has the highest number of publications (53,575), followed by Linguistics (32,121) and Language Studies (18,272); (4) there was the top 10 journals from 2014 to 2024; (5) there was a detail into the publication productivity and citation impact of 20 leading researchers; and (6) it has been extracted 1,028 terms concerning the examined query that classified into eight clusters.
Conclusion:
By integrating the insights gleaned from publication trends, citation analysis, leading authors, journal distribution, map-based visualizations, and cluster analysis of terms, this study provides a comprehensive and detailed picture of research trends within the search query.
Originality:
While current several research in quantitative language studies tends to delve into specific subfields, offering piecemeal understandings of the overall landscape, this study takes a distinct approach. By examining 53,575 articles content and metadata, this study aims to provide a more comprehensive perspective compare to previous fragmented insights.
References
Biber, D. (2012). Corpus-Based and Corpus-driven Analyses of Language Variation and Use. In The Oxford Handbook of Linguistic Analysis. https://doi.org/10.1093/oxfordhb/9780199544004.013.0008
Biber, D. (2023). Representativeness in Corpus Design. In Practical Lexicography: A Reader. https://doi.org/10.1093/oso/9780199292332.003.0005
Biber, D., Reppen, R., & Friginal, E. (2012). Research in Corpus Linguistics. In The Oxford Handbook of Applied Linguistics, (2 Ed.). https://doi.org/10.1093/oxfordhb/9780195384253.013.0038
Biber, D., & Jones, J. K. (2009). Quantitative methods in corpus linguistics. In Corpus Linguistics: An International Handbook (Vol. 2, pp. 1286-1304). De Gruyter Mouton.
Botes, E., Dewaele, J., & Greiff, S. (2021). The Development and Validation of the Short Form of the Foreign Language Enjoyment Scale. Modern Language Journal, 105(4), 858–876. https://doi.org/10.1111/modl.12741
Chen, H., & Liu, H. (2014). Quantitative Aspects of Journal of Quantitative Linguistics. Journal of Quantitative Linguistics, 21(4), 299–340. https://doi.org/10.1080/09296174.2014.944327
Chen, H., Liang, J., & Liu, H. (2015). How does word length evolve in written Chinese?. PloS one, 10(9), e0138567. https://doi.org/10.1371/journal.pone.0138567
Chen, H., & Liu, H. (2016). How to Measure Word Length in Spoken and Written Chinese. Journal of Quantitative Linguistics, 23(1), 5–29. https://doi.org/10.1080/09296174.2015.1071147
Comanaru, R.-S., & Dewaele, J.-M. (2015). A bright future for interdisciplinary multilingualism research. International Journal of Multilingualism, 12(4), 404–418. https://doi.org/10.1080/14790718.2015.1071016
Crosthwaite, P., Ningrum, S., & Schweinberger, M. (2023). Research trends in corpus linguistics. International Journal of Corpus Linguistics, 28(3), 344–377. https://doi.org/10.1075/ijcl.21072.cro
Dewaele, J.-M. (2019). When Elephants Fly: The Lift-Off of Emotion Research in Applied Linguistics. Modern Language Journal, 103(2), 533–536. https://doi.org/10.1111/modl.12576
Dewaele, J.-M. (2018). Online Questionnaires. In The Palgrave Handbook of Applied Linguistics Research Methodology. https://doi.org/10.1057/978-1-137-59900-1_13
Dewaele, J.-M., Rolland, L., Cook, S., & Costa, B. (2023). Mixed Methods When Researching Sensitive Topics. In Supporting Research in Counselling and Psychotherapy: Qualitative, Quantitative, and Mixed Methods Research. https://doi.org/10.1007/978-3-031-13942-0_13
Dewaele, J.-M., Saito, K., & Halimi, F. (2022). How teacher behaviour shapes foreign language learners’ enjoyment, anxiety and attitudes/motivation: A mixed modelling longitudinal investigation. Language Teaching Research. https://doi.org/10.1177/13621688221089601
Donthu, N., Kumar, S., Mukherjee, D., Pandey, N., & Lim, W. M. (2021). How to conduct a bibliometric analysis: An overview and guidelines. Journal of Business Research, 133, 285–296. https://doi.org/10.1016/j.jbusres.2021.04.070
Eatough, V., & Tomkins, L. (2022). 8 Qualitative methods. In G. Schiewer, J. Altarriba & B. Ng (Ed.), Volume 1 (pp. 163-182). Berlin, Boston: De Gruyter Mouton. https://doi.org/10.1515/9783110347524-008
Egbert, J., & LaFlair, G. T. (2018). Statistics for Categorical, Nonparametric, and Distribution-Free Data. In The Palgrave Handbook of Applied Linguistics Research Methodology. https://doi.org/10.1057/978-1-137-59900-1_23
Egbert, J., Larsson, T., & Biber, D. (2020). Doing linguistics with a corpus: Methodological considerations for the everyday user. In Doing Linguistics with a Corpus: Methodological Considerations for the Everyday User. https://doi.org/10.1017/9781108888790
Gray, B. (2013). Interview with Douglas Biber. Journal of English Linguistics, 41(4), 359–379. https://doi.org/10.1177/0075424213502237
Greenhill, S. J., Heggarty, P., & Gray, R. D. (2020). Bayesian Phylolinguistics. In The Handbook of Historical Linguistics, Volume II. https://doi.org/10.1002/9781118732168.ch11
Gries, S. T. (2015a). Quantitative designs and statistical techniques. In The Cambridge Handbook of English Corpus Linguistics. https://doi.org/10.1007/9781139764377.004
Gries, S. T. (2015b). Statistics for learner corpus research. In The Cambridge Handbook of Learner Corpus Research. https://doi.org/10.1017/CBO9781139649414.008
Hou, R., & Huang, C.-R. (2020). Robust stylometric analysis and author attribution based on tones and rimes. Natural Language Engineering, 26(1), 49–71. https://doi.org/10.1017/S135132491900010X
Jiang, J., & Liu, H. (2019a). Interlanguage: a perspective of quantitative linguistic typology. Language Sciences, 74, 85–97. https://doi.org/10.1016/j.langsci.2019.04.004
Jiang, J., & Liu, H. (2019b). Does Scale-Free Syntactic Network Emerge in Second Language Learning? Frontiers in Psychology, 10, 925. https://doi.org/10.3389/fpsyg.2019.00925
Kato, M., Kawahara, S., & Idemaru, K. (2020). Speaking rate normalization across different talkers in the perception of Japanese stop and vowel length contrasts. Proceedings of the International Conference on Speech Prosody, 2020-May, 61–65. https://doi.org/10.21437/SpeechProsody.2020-13
Kawahara, S., & Kumagai, G. (2023). Lyman’s Law can count only up to two. Laboratory Phonology, 14(1). https://doi.org/10.16995/labphon.9335
Köhler, R. (2013). Statistical comparability: Methodological caveats. In Building and Using Comparable Corpora. https://doi.org/10.1007/978-3-642-20128-8_4
Kortmann, B. (2021). Reflecting on the quantitative turn in linguistics. Linguistics, 59(5), 1207–1226. https://doi.org/10.1515/ling-2019-0046
Kruk, M., Pawlak, M., Elahi Shirvan, M., Taherian, T., & Yazdanmehr, E. (2022). A longitudinal study of foreign language enjoyment and boredom: A latent growth curve modeling. Language Teaching Research. https://doi.org/10.1177/13621688221082303
LaFlair, G. T., Egbert, J., & Plonsky, L. (2015). LaFlair, G. T., Egbert, J., & Plonsky, L. (2015). A practical guide to bootstrapping descriptive statistics, correlations, t tests, and ANOVAs. In Advancing quantitative methods in second language research (pp. 46-77). Routledge.
Larson-Hall, J. (2012). Our statistical intuitions may be misleading us: Why we need robust statistics. Language Teaching, 45(4), 460–474. https://doi.org/10.1017/S0261444811000127
Larsson, T., Egbert, J., & Biber, D. (2022). On the status of statistical reporting versus linguistic description in corpus linguistics: a ten-year perspective. Corpora, 17(1), 137–157. https://doi.org/10.3366/cor.2022.0238
Lei, L. (2012). Quantitative Linguistic Computing with Perl [Review of the book Quantitative Linguistic Computing with Perl, by Fengxiang Fan, Yaochen Deng]. Literary and Linguistic Computing, 27(2), 227–230. https://doi.org/10.1093/llc/fqs012
Liao, S., & Lei, L. (2017). What we talk about when we talk about corpus: A bibliometric analysis of corpus-related research in linguistics (2000-2015). Glottometrics, 38, 1–20.
Lim, W. M., & Kumar, S. (2024). Guidelines for interpreting the results of bibliometric analysis: A sensemaking approach. Global Business and Organizational Excellence, 43(2), 17–26. https://doi.org/10.1002/joe.22229
List, J.-M., & Moran, S. (2013). An open source toolkit for quantitative historical linguistics. Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2013-Augus, 13–18.
Liu, H. (2011). Quantitative Properties of English Verb Valency. Journal of Quantitative Linguistics, 18(3), 207–233. https://doi.org/10.1080/09296174.2011.581849
Liu, H., & Xu, C. (2012). Quantitative typological analysis of Romance languages. Poznań Studies in Contemporary Linguistics, 48(4), 597-625. https://doi.org/10.1515/psicl-2012-0027
Lu, X. (2017). Automated measurement of syntactic complexity in corpus-based L2 writing research and implications for writing assessment. Language Testing, 34(4), 493–511. https://doi.org/10.1177/0265532217710675
Mersbergen, M. van, & Patrick, C. J. (2022). 10 Quantitative and physiological measures. In G. Schiewer, J. Altarriba & B. Ng (Ed.), Volume 1 (pp. 201-225). Berlin, Boston: De Gruyter Mouton. https://doi.org/10.1515/9783110347524-010
Miller, D., & Biber, D. (2015). Evaluating reliability in quantitative vocabulary studies: The influence of corpus design and composition. International Journal of Corpus Linguistics, 20(1), 30–53. https://doi.org/10.1075/ijcl.20.1.02mil
Mizumoto, A., & Plonsky, L. (2016). R as a Lingua Franca: Advantages of using r for quantitative research in applied linguistics. Applied Linguistics, 37(2), 284–291. https://doi.org/10.1093/applin/amv025
Mizumoto, A., Plonsky, L., & Egbert, J. (2020). Meta-analyzing Corpus Linguistic Research. In A Practical Handbook of Corpus Linguistics (pp. 663–688). Springer International Publishing. https://doi.org/10.1007/978-3-030-46216-1_27
Mukherjee, D., Lim, W. M., Kumar, S., & Donthu, N. (2022). Guidelines for advancing theory and practice through bibliometric research. Journal of Business Research, 148, 101–115. https://doi.org/10.1016/j.jbusres.2022.04.042
Mystkowska-Wiertelak, A., & Pawlak, M. (2016). Designing a Tool for Measuring the Interrelationships between L2 WTC, Confidence, Beliefs, Motivation, and Context. In Second Language Learning and Teaching. https://doi.org/10.1007/978-3-319-30373-4_2
Norris, J. M., Plonsky, L., Ross, S. J., & Schoonen, R. (2015). Guidelines for reporting quantitative methods and results in primary research. Language Learning, 65(2), 470–476. https://doi.org/10.1111/lang.12104
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … Moher, D. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. PLOS Medicine, 18(3), e1003583. https://doi.org/10.1371/journal.pmed.1003583
Paquot, M., & Plonsky, L. (2017). Quantitative research methods and study quality in learner corpus research. International Journal of Learner Corpus Research, 3(1), 61–94. https://doi.org/10.1075/ijlcr.3.1.03paq
Plonsky, L. (2012). Effect size. In The Routledge Encyclopedia of Second Language Acquisition.
Plonsky, L. (2014). Study quality in quantitative l2 research (1990-2010): A methodological synthesis and call for reform. Modern Language Journal, 98(1), 450–470. https://doi.org/10.1111/j.1540-4781.2014.12058.x
Plonsky, L. (2015a). Advancing quantitative methods in second language research. In Advancing Quantitative Methods in Second Language Research. https://doi.org/10.4324/9781315870908
Plonsky, L. (2015b). Quantitative considerations for improving replicability in CALL and applied linguistics. CALICO Journal, 32(2), 232–244. https://doi.org/10.1558/cj.v32i2.26857
Plonsky, L. (2017). Quantitative research methods. In The Routledge Handbook of Instructed Second Language Acquisition. https://doi.org/10.4324/9781315676968
Plonsky, L., & Gass, S. (2011). Quantitative Research Methods, Study Quality, and Outcomes: The Case of Interaction Research. Language Learning, 61(2), 325–366. https://doi.org/10.1111/j.1467-9922.2011.00640.x
Plonsky, L., & Oswald, F. L. (2014). How Big Is “Big”? Interpreting Effect Sizes in L2 Research. Language Learning, 64(4), 878–912. https://doi.org/10.1111/lang.12079
Rietveld, T., & van Hout, R. (2010). Statistics in Language Research: Analysis of Variance. In Statistics in Language Research: Analysis of Variance. https://doi.org/10.1515/9783110877809
Rietveld, T., & van Hout, R. (2011). Statistical techniques for the study of language and language behaviour. In Statistical Techniques for the Study of Language and Language Behaviour.
Rietveld, T., & van Hout, R. (2015). The t test and beyond: Recommendations for testing the central tendencies of two independent samples in research on speech, language and hearing pathology. Journal of Communication Disorders, 58, 158–168. https://doi.org/10.1016/j.jcomdis.2015.08.002
Shcherbakova, O., Gast, V., Blasi, D. E., Skirgård, H., Gray, R. D., & Greenhill, S. J. (2023). A quantitative global test of the complexity trade-off hypothesis: The case of nominal and verbal grammatical marking. Linguistics Vanguard, 9(s1), 155–167. https://doi.org/10.1515/lingvan-2021-0011
Sheng, D. (2023). Statistics in corpus linguistics: a practical guide. Social Semiotics, 33(4), 909–912. https://doi.org/10.1080/10350330.2021.1969215
Staples, S., & Biber, D. (2015). Cluster analysis. In Advancing Quantitative Methods in Second Language Research. https://doi.org/10.4324/9781315870908-11
van Eck, N. J., & Waltman, L. (2023). VOSviewer (1.6.20). Universiteit Leiden.
Xue, W., Cucchiarini, C., van Hout, R., & Strik, H. (2023). Measuring the intelligibility of dysarthric speech through automatic speech recognition in a pluricentric language. Speech Communication, 148, 23–30. https://doi.org/10.1016/j.specom.2023.02.004
Yoon, S.-Y., Lu, X., & Zechner, K. (2019). Features measuring vocabulary and grammar. In Automated Speaking Assessment: Using Language Technologies to Score Spontaneous Speech. https://doi.org/10.4324/9781315165103-8
Zhang, J., & Lu, X. (2019). Measuring and supporting second language development using computerized dynamic assessment. Language and Sociocultural Theory, 6(1), 92–115. https://doi.org/10.1558/lst.31710
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Danang Satria Nugraha
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.