Identifikasi Penggunaan Chat GPT Pada Esai TOEFL Menggunakan Metode Long Short Term Memory


Authors

  • Karina Natasya Darmawan Universitas Trilogi, Jakarta, Indonesia
  • Silvester Dian Handy Permana Universitas Trilogi, Jakarta, Indonesia
  • Ketut Bayu Yogha Bintoro Universitas Trilogi, Jakarta, Indonesia

DOI:

https://doi.org/10.47065/bulletincsr.v6i3.772

Keywords:

Identification; Chat GPT; LSTM; Human; TOEFL

Abstract

The use of Artificial Intelligent (AI) technology is increasing along with technological developments. One of the technologies that is often used is Chat GPT (Generative Pre-trained Transformers). Chat GPT is an application used for many things such as source of information, write an essay, and answer TOEFL essay questions. Because of its easiness, people will excessively use this that can cause people to lose creativity because they do not understand the material context and rely too much on the AI text result, which poses academic risks. Teachers also have difficulty to distinguish between AI and human text writing. Therefore, this research is to identify whether TOEFL essay are result of human text or GPT. This research used the Long Short Term Memory (LSTM) method to identify the use of GPT in TOEFL essay. This research also used 3 different split data configurations to find the best results. This research consists of 2 TOEFL essay datasets with the same prompt and has total of 220 data samples. The LSTM method is a modification of algorithm Recurrent Neural Network (RNN) and part of Deep Learning. The LSTM method involves memory cell controlled by three gates, such as input gate, forgot fate, output gate, and the hidden state. The gates are used to decide and control the information added, deleted, and removed from memory cell. The results of this research is a system that can help teachers detect the use of GPT in TOEFL essay. This research successfully identified the use of GPT in TOEFL essay in a 70:30 data split configuration with a loss score of 25,07%, accuracy score of 89,83%, and prediction score of 64,32%. Therefore, it is hoped that this system can help teachers identify the use of GPT and facilitate the assessment of TOEFL essay.

Downloads

Download data is not yet available.

References

I. Belcic and C. Stryker, “What is GPT (generative pre-trained transformer)? | IBM.” Accessed: May 31, 2025. [Online]. Available: https://www.ibm.com/think/topics/gpt

J. Fleckenstein, J. Meyer, T. Jansen, S. D. Keller, O. Köller, and J. Möller, “Do teachers spot AI? Evaluating the detectability of AI-generated texts among student essays,” Computers and Education: Artificial Intelligence, vol. 6, Jun. 2024, doi: 10.1016/j.caeai.2024.100209.

A. Fiedler and J. Döpke, “Do humans identify AI-generated text better than machines? Evidence based on excerpts from German theses,” International Review of Economics Education, vol. 49, p. 100321, Jun. 2025, doi: 10.1016/J.IREE.2025.100321.

A. Mizumoto, S. Yasuda, and Y. Tamura, “Identifying ChatGPT-generated texts in EFL students’ writing: Through comparative analysis of linguistic fingerprints,” Applied Corpus Linguistics, vol. 4, no. 3, Dec. 2024, doi: 10.1016/j.acorp.2024.100106.

Y. Luan and S. Lin, “Research on Text Classification Based on CNN and LSTM,” 2019 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), pp. 352–355, Mar. 2019, doi: 10.1109/ICAICA.2019.887345410.1109/ICAICA.2019.8873454.

T. Rahman et al., “Human vs AI: Evaluating the Effectiveness of Deep Learning in AI Text Detection,” in 2024 27th International Conference on Computer and Information Technology, ICCIT 2024 - Proceedings, Institute of Electrical and Electronics Engineers Inc., 2024, pp. 2736–2741. doi: 10.1109/ICCIT64611.2024.11021749.

R. An, Y. Yang, F. Yang, and S. Wang, “Use prompt to differentiate text generated by ChatGPT and humans,” Machine Learning with Applications, vol. 14, p. 100497, Dec. 2023, doi: 10.1016/j.mlwa.2023.100497.

GlobalExam, “TOEFL Writing - Online Training.” Accessed: Jun. 08, 2025. [Online]. Available: https://global-exam.com/blog/en/toefl-writing-online-training/

Toeflessays.Com and G. Gao, “Answers to All TOEFL Essay Questions.” Accessed: Jun. 28, 2025. [Online]. Available: https://www.goodreads.com/book/show/41726611-answers-to-all-toefl-essay-questions

M. Goodine, “Master the TOEFL Writing Section in 2025 | Test Resources.” Accessed: Jun. 08, 2025. [Online]. Available: https://www.toeflresources.com/writing-section/

S. Zhu and F. Chollet, “Understanding masking & padding | TensorFlow Core.” Accessed: Jan. 05, 2025. [Online]. Available: https://www.tensorflow.org/guide/keras/understanding_masking_and_padding#padding_sequence_data

Krithika, “Introduction to FastText Embeddings and its Implication -.” Accessed: Jan. 05, 2025. [Online]. Available: https://www.analyticsvidhya.com/blog/2023/01/introduction-to-fasttext-embeddings-and-its-implication/

J. Barnard, “What Are Word Embeddings? | IBM.” Accessed: Jan. 05, 2025. [Online]. Available: https://www.ibm.com/think/topics/word-embeddings

T. Mikolov, E. Grave, P. Bojanowski, C. Puhrsch, and A. Joulin, “Advances in Pre-Training Distributed Word Representations,” LREC 2018 - 11th International Conference on Language Resources and Evaluation, pp. 52–55, Dec. 2017, Accessed: Feb. 04, 2025. [Online]. Available: https://arxiv.org/abs/1712.09405v1

A. S. Gillis, “What is data splitting and why is it important?” Accessed: Jan. 09, 2025. [Online]. Available: https://www.techtarget.com/searchenterpriseai/definition/data-splitting

M. Bansal, A. Goyal, and A. Choudhary, “A comparative analysis of K-Nearest Neighbor, Genetic, Support Vector Machine, Decision Tree, and Long Short Term Memory algorithms in machine learning,” Decision Analytics Journal, vol. 3, p. 100071, Jun. 2022, doi: 10.1016/J.DAJOUR.2022.100071.

V. Gallan, “LSTM (Long Short Term Memory). Sebelum mempelajari tentang LSTM, kita… | by Varellino Gallan | Bina Nusantara IT Division | Medium.” Accessed: Feb. 09, 2025. [Online]. Available: https://medium.com/bina-nusantara-it-division/lstm-long-short-term-memory-d29779e2ebf8

A. Satyo Bayangkari Karno, “Prediksi Data Time Series Saham Bank BRI Dengan Mesin Belajar LSTM (Long ShortTerm Memory),” Journal of Information and Information Security (JIFORTY), vol. 1, no. 1, pp. 1–8, 2020, [Online]. Available: http://ejurnal.ubharajaya.ac.id/index.php/jiforty

M. Umer et al., “Impact of convolutional neural network and FastText embedding on text classification,” Multimed Tools Appl, vol. 82, no. 4, pp. 5569–5585, Feb. 2023, doi: 10.1007/S11042-022-13459-X/FIGURES/3.

K. Chowdhury, “10 Hyperparameters to keep an eye on for your LSTM model — and other tips | by Kuldeep Chowdhury | Geek Culture | Medium,” Geek Culture. Accessed: Feb. 10, 2025. [Online]. Available: https://medium.com/geekculture/10-hyperparameters-to-keep-an-eye-on-for-your-lstm-model-and-other-tips-f0ff5b63fcd4

M. A. H. Wadud, M. M. Kabir, M. F. Mridha, M. A. Ali, M. A. Hamid, and M. M. Monowar, “How can we manage Offensive Text in Social Media - A Text Classification Approach using LSTM-BOOST,” International Journal of Information Management Data Insights, vol. 2, no. 2, p. 100095, Nov. 2022, doi: 10.1016/J.JJIMEI.2022.100095.

N. K. Hemalatha, R. N. Brunda, G. S. Prakruthi, B. V. B. Prabhu, A. Shukla, and O. S. J. Narasipura, “Sugarcane leaf disease detection through deep learning,” Deep Learning for Sustainable Agriculture, pp. 297–323, Jan. 2022, doi: 10.1016/B978-0-323-85214-2.00003-3.

P. K. Yechuri and S. Ramadass, “Classification of Image and Text Data Using Deep Learning-Based LSTM Model,” Traitement du Signal, vol. 38, no. 6, pp. 1809–1817, Dec. 2021, doi: 10.18280/ts.380625.

englishclubmskh, “An Essay Collection For TOEFL,” 2018. Accessed: Jun. 28, 2025. [Online]. Available: https://englishclubmskh.wordpress.com/wp-content/uploads/2018/09/toefl-essays.pdf

S. K. Ahmed, “How to choose a sampling technique and determine sample size for research: A simplified guide for researchers,” Oral Oncology Reports, vol. 12, p. 100662, Dec. 2024, doi: 10.1016/J.OOR.2024.100662.

fchollet, “The Sequential model.” Accessed: Jul. 01, 2025. [Online]. Available: https://keras.io/guides/sequential_model/

“Keras Input Layer - GeeksforGeeks.” Accessed: Jul. 01, 2025. [Online]. Available: https://www.geeksforgeeks.org/deep-learning/keras-input-layer/

B. Krishnamurthy, “ReLU Activation Function Explained | Built In.” Accessed: Jul. 03, 2025. [Online]. Available: https://builtin.com/machine-learning/relu-activation-function

B. Janeczko and G. Srivastava, “The use of deep learning in image analysis for the study of oncology,” Internet of Multimedia Things (IoMT): Techniques and Applications, pp. 133–150, Jan. 2022, doi: 10.1016/B978-0-32-385845-8.00011-3.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Identifikasi Penggunaan Chat GPT Pada Esai TOEFL Menggunakan Metode Long Short Term Memory

Dimensions Badge

ARTICLE HISTORY

Published: 2026-04-30

Abstract View: 14 times
PDF Download: 2 times

How to Cite

Darmawan, K. N., Permana, S. D. H., & Bintoro, K. B. Y. (2026). Identifikasi Penggunaan Chat GPT Pada Esai TOEFL Menggunakan Metode Long Short Term Memory. Bulletin of Computer Science Research, 6(3), 975-985. https://doi.org/10.47065/bulletincsr.v6i3.772

Issue

Section

Articles

Most read articles by the same author(s)