dc.description.abstract | Artificial Intelligence (AI) has revolutionized
many parts of modern life including written content.
Because of this reason, it is challenging to identify separate
AI-generated documents and human-written documents.
There are different positive and negative effects of AI-
generated documents in different fields including
Education. Therefore, this research objective is to detect
AI-generated documents and human-written documents
automatically using machine learning (ML) algorithms.
The acquired AI-generated and human-written documents
were pre-processed by cleaning the data set and Term
Frequency-Inverse Document Frequency (TF-IDF) was
used for feature extraction. Then the study continued
utilizing five classification methods such as Naïve Bayes,
Random Forest, Decision Tree, Support Vector Machine
(SVM), and ensemble learning algorithm that combined the
four individual algorithms listed above. The Random
Forest individual algorithm shows the best testing accuracy
with 65% training and 35% testing dataset for the
classification. Ensemble learning outperformed the
outcomes in the precision, accuracy, recall, f-measure, and
error values. Based on the results, the study can
successfully detect AI-generated documents and human-
written documents separately using an ensemble learning
approach. | en_US |