• Login
    • University Home
    • Library Home
    • Lib Catalogue
    • Advance Search
    View Item 
    •   KDU-Repository Home
    • INTERNATIONAL RESEARCH CONFERENCE ARTICLES (KDU IRC)
    • 2024 IRC Articles
    • Computing
    • View Item
    •   KDU-Repository Home
    • INTERNATIONAL RESEARCH CONFERENCE ARTICLES (KDU IRC)
    • 2024 IRC Articles
    • Computing
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Classification of Research Papers Based on Their Subject Field Using an Ensemble Machine Learning Approach

    Thumbnail
    View/Open
    IRC-FOC-2024-02.pdf (921.6Kb)
    Date
    2024-09
    Author
    Dilushika, GKE
    Rupasingha, RAHM
    Metadata
    Show full item record
    Abstract
    Research papers are the most important things for researchers and scholars. Scholars face the challenge of sifting through relevant research data to find papers that align with their educational interests. This abundance of information can complicate the process of identifying valuable insights across various research types. Therefore, finding related research papers becomes a time-consuming process. Here classification of research papers automatically helps researchers do their research easily and effectively. This study introduces a novel method to classify research papers based on subject fields using machine learning. Unlike previous approaches, which rely on abstracts, limited subjects, or individual algorithms, research analyzes full paper content and expands classification to five disciplines using an ensemble learning approach by combining four individual algorithms. With a dataset of 2000 papers, preprocessed and extracted feature vectors using Term Frequency-Inverse Document Frequency (TF-IDF). The research used five machine learning algorithms namely NavieBayes, Random Forest, Decision Tree (J48), SVM, and ensemble learning. Employing Naive Bayes and ensemble learning, our results demonstrate high accuracy, with ensemble learning surpassing individual algorithms in 5-fold cross validation. The performance of the classification system was evaluated using metrics such as accuracy, precision, recall, and F-measure, as well as error rates. Results indicate that Naive Bayes exhibited the highest accuracy among individual algorithms, while ensemble learning, particularly through the Majority Voting combination rule outperformed individual algorithms with an accuracy of 94.20%. This research underscores ensemble learning-based machine learning's efficacy in enhancing research paper classification processes and accessing relevant research.
    URI
    http://ir.kdu.ac.lk/handle/345/8518
    Collections
    • Computing [52]

    Library copyright © 2017  General Sir John Kotelawala Defence University, Sri Lanka
    Contact Us | Send Feedback
     

     

    Browse

    All of KDU RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsFacultyDocument TypeThis CollectionBy Issue DateAuthorsTitlesSubjectsFacultyDocument Type

    My Account

    LoginRegister

    Library copyright © 2017  General Sir John Kotelawala Defence University, Sri Lanka
    Contact Us | Send Feedback