The Effectiveness of AI Speech Recognition on Students’ English Pronunciation

Authors

  • Woro Kusmaryani Universitas Borneo Tarakan
  • Ramli Universitas Borneo Tarakan
  • Winarno Universitas Borneo Tarakan

DOI:

https://doi.org/10.24903/sj.v11i1.2282

Keywords:

AI Speech Recognition, Effectiveness, English Pronunciation

Abstract

Background

Pronunciation is among the most difficult skills for English learners, and conventional teaching approaches rarely offer the immediate, individualized feedback that learners need to improve it. AI speech recognition applications can provide real-time corrective feedback that may help overcome this limitation.

Methodology

The study used a pre-experimental one-group pre-test/post-test design, supplemented by qualitative observation. The participants were 30 first-semester students selected through purposive sampling. Data were collected through pre- and post-tests, observation forms, and AI-generated feedback. Quantitative data were analyzed using descriptive statistics and a paired-samples t-test, while qualitative observation data were analyzed through thematic coding.
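
As an illustration of the paired-samples t-test named above, the following minimal sketch computes the mean gain, the t statistic, and the degrees of freedom from matched pre- and post-test scores. The scores and the function name `paired_t` are hypothetical and are not the study's data or procedure; only the Python standard library is assumed.

```python
import math
import statistics


def paired_t(pre, post):
    """Return (mean_gain, t_statistic, degrees_of_freedom) for paired scores."""
    if len(pre) != len(post) or len(pre) < 2:
        raise ValueError("need two equal-length lists with at least two pairs")
    # Each student's gain: post-test score minus pre-test score.
    diffs = [after - before for before, after in zip(pre, post)]
    n = len(diffs)
    mean_gain = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample standard deviation (n - 1 denominator)
    if sd == 0:
        raise ValueError("all gains are identical; t is undefined")
    # t = mean difference divided by its standard error.
    t_statistic = mean_gain / (sd / math.sqrt(n))
    return mean_gain, t_statistic, n - 1


# Hypothetical scores for six students (illustrative only, not the study's data).
pre_scores = [60, 65, 70, 72, 68, 75]
post_scores = [74, 80, 82, 85, 79, 90]

mean_gain, t_stat, df = paired_t(pre_scores, post_scores)
print(f"mean gain = {mean_gain:.2f}, t = {t_stat:.2f}, df = {df}")
```

The t statistic is then compared with the critical value for the given degrees of freedom. With 30 participants, as in this study, df = 29, and the two-tailed critical value at the 0.05 level is approximately 2.045.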

Findings

The results showed a statistically significant improvement in students' pronunciation, with mean scores rising from 68.4 to 81.7 (p < 0.05). Qualitative observations indicated more accurate production of segmental features, including interdental sounds (/θ/, /ð/), voicing contrasts (/v/ vs. /f/), and vowel length distinctions, as well as gains in suprasegmental features such as word stress, rhythm, and intonation. Students' self-confidence, motivation, and independence in practicing pronunciation with AI-supported learning also improved.
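
To put the reported means in perspective, the short sketch below works out the absolute and relative gain from the two values given above (68.4 and 81.7). The variable names are illustrative; no figures beyond those reported are assumed.

```python
# Mean scores as reported in the findings.
pre_mean = 68.4
post_mean = 81.7

# Absolute gain in score points and gain relative to the pre-test mean.
gain = post_mean - pre_mean
relative_gain = gain / pre_mean * 100

print(f"absolute gain = {gain:.1f} points")
print(f"relative gain = {relative_gain:.1f}% of the pre-test mean")
```

This gives a gain of about 13.3 points, or roughly 19.4% of the pre-test mean.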

Conclusion

AI speech recognition technology is a valuable aid for improving English pronunciation. The feedback learners received was consistent, individualized, and timely, which led to gains in accuracy and self-regulated learning.

Originality

The study provides classroom-based evidence on the use of AI speech recognition in learning English pronunciation. It shows the potential of AI to support pronunciation teaching, strengthen learner autonomy, and improve learning outcomes.

Published

2026-04-11