11
of being selected, with criteria for difficulty level, question type, and knowledge points. The paper also discusses
the use of Expert-Based algorithms and Genetic algorithms for better question selection and large-scale test
paper creation. It highlights the importance of standardized test question banks to reduce teacher workload and
improve exam quality.
A research paper by Rachman, Alfian, and Yuhana (2023) focuses on using Deep Learning to classify questions
in the Indonesian National Assessment. It addresses the challenge of small datasets by applying data
augmentation and using Convolutional Neural Networks (CNN) to classify illustrated images into two
categories: information and literature. The study found that ResNet-50 gave the best performance, and transfer
learning helped solve the dataset issue. This work shows how Deep Learning can improve automated
classification in educational assessments, supporting Intelligent Question Bank Systems.
Yu, B. (2023) presents an intelligent system for creating exam papers using a Random Extraction algorithm.
This algorithm is selected for its speed and simplicity in meeting constraints like question type, quantity,
difficulty, and knowledge coverage. The study highlights the shift from traditional exams to AI-aided methods,
improving efficiency and fairness in assessments. It also compares the random extraction approach with other
algorithms and explores its application in teaching Chinese as a foreign language to enhance test quality.
The article by Zhang, J., and Li, Z. (2023) describes a question bank system using Machine Learning to support
intelligent paper grouping, online mock exams, and automated grading. Built with JSP, Java, and MySQL, it
includes roles for administrators, teachers, and students, ensuring fair and accurate marking by integrating
natural language processing and text similarity detection libraries. The system minimizes grading errors,
enhances personalized learning, and streamlines exams, reducing educators' workload while improving
assessment accuracy.
The research paper by Liu, X., and Yang, C. (2022) introduces an intelligent system for creating English test
papers using the Radial Basis Function (RBF) algorithm. The RBF neural network, known for its simple structure
and fast learning, helps select and organize questions based on knowledge points, difficulty levels, and total
scores. This system aims to reduce teachers’ workload and improve fairness in exams by ensuring a standardized
and efficient test composition process. It also addresses challenges in China’s paper-making systems, such as
inefficient grouping and the need for a large question bank, using advanced clustering and optimization
techniques. This innovative approach enhances the quality and accuracy of assessments in education.
Banujan et al. (2023) present an automated system for classifying over 16,000 exam questions from Sri Lankan
universities into the six levels of revised Bloom's Taxonomy (remember, understand, apply, analyze, evaluate,
create) using Deep Learning models like LSTM combined with BERT and GloVe embeddings, alongside ANN
with TF-IDF, achieving superior accuracy through semantic pattern recognition and NLP preprocessing such as
tokenization and stop-word removal. This approach addresses manual classification's subjectivity and scalability
issues in large question banks, outperforming traditional ML like SVM or DT by capturing contextual
relationships in text that directly complementing this project, which similarly analyzes keywords, complexity,
and cognitive demands for Easy/Medium/Hard categorization aligned with Bloom's framework, thus enhancing
consistency and efficiency in assessment design for educators managing extensive repositories.
While, Romadhony et al. (2022) present a Machine Learning framework for classifying primary and high school
exam questions into Bloom's Taxonomy's six cognitive levels (remembering, understanding, applying,
analyzing, evaluating, creating), using feature extraction techniques like TF-IDF variants and word embeddings
to detect semantic complexity and reduce teacher workload in manual categorization. This also directly
complements this project, which automates difficulty classification (Easy, Medium, Hard) aligned with Bloom's
hierarchy for large-scale question banks, by demonstrating ML's efficacy in handling diverse educational
datasets and ensuring consistent, scalable assessments across subjects addressing similar challenges of
inefficiency and subjectivity in traditional methods.
METHODOLOGY
The Intelligent Question Bank System (IQBS) uses a well-structured database to manage questions, categorized
by subject, topic, and difficulty. The system supports roles for Teacher (create, edit/manage, and categorize