Hybrid SVM-TLBO model for improved frequent itemset classification: a performance comparison
Seenam Bee 1 and Animesh Kumar Dubey1
Corresponding Author : Seenam Bee
Recieved : 25-Dec-2024; Revised : 10-Mar-2025; Accepted : 14-Mar-2025
Abstract
Frequent itemset classification is a critical task in data mining, enabling efficient discovery of patterns in large transactional and categorical datasets. In this work, a hybrid support vector machine–teaching learning-based optimization (SVM–TLBO) model is proposed to enhance classification performance. The pipeline consists of four stages: preprocessing and transformation of transactional datasets, frequent itemset mining with Apriori, construction of discriminative feature vectors, and optimization of SVM parameters through TLBO. The model was validated on benchmark datasets, namely T10I4D100K and Mushroom, obtained from the UCI repository. Experimental evaluation using precision, recall, F1-score, accuracy, and AUC-ROC metrics demonstrated that the proposed hybrid approach outperformed baseline SVM. Notable improvements were observed in recall and AUC values, highlighting the ability of the SVM–TLBO model to reduce false negatives and improve decision boundary discrimination. The results confirm the effectiveness, scalability, and robustness of the hybrid model for frequent itemset classification in both large-scale transactional and categorical domains.
Keywords
Frequent itemset mining, Support vector machine (SVM), Teaching learning-based optimization (TLBO), SVM-TLBO.
References
[1] Chen S, Xue Y, Cui X. Information literacy of college students from library education in smart classrooms: based on big data exploring data mining patterns using Apriori algorithm. Soft Computing. 2024; 28(4):3571-89.
[2] Pamnani HK, Raja L, Ives T. Developing a novel H-Apriori algorithm using support-leverage matrix for association rule mining. International Journal of Information Technology. 2024; 16(8):5395-405.
[3] Oktory HD, Hadiwandra TY. Penerapan algoritma apriori untuk penentuan pola pembelian kacamata pada optik indah optikal: application of an apriori algorithm to determine eyeglass purchasing patterns at optik indah optik. MALCOM: Indonesian Journal of Machine Learning and Computer Science. 2024; 4(4):1275-81.
[4] Verma AK, Dubey AK. Enhancing frequent itemset mining through machine learning and nature-inspired algorithms: a comprehensive review. International Journal of Advanced Computer Research. 2024; 14(68):97-103.
[5] Wang J, Xue B, Wang Y, Wang G, Han D. Identification of pollution source and prediction of water quality based on deep learning techniques. Journal of Contaminant Hydrology. 2024; 261:104287.
[6] Rahman SI, Ahmed S, Fariha TA, Mohammad A, Haque MN, Chellappan S, et al. Unsupervised machine learning approach for tailoring educational content to individual student weaknesses. High-Confidence Computing. 2024; 4(4):100228.
[7] Stylianou T, Pantelidou A. A machine learning approach to consumer behavior in supermarket analytics. Decision Analytics Journal. 2025: 100600.
[8] Deepika R, Gogula S, Kanagalakshmi K, Mehta A, Vivekanandan SJ, Vetrithangam D. Frequent pattern mining using artificial intelligence and machine learning. Natural Language Processing for Software Engineering. 2025:15-28.
[9] Al-Zeiadi MA, AI-Maqaleh BM. Incremental closed frequent itemsets mining based approach using maximal candidates. IEEE Access. 2025; 13:34023- 37.
[10] Aljehani SS, Alotaibi YA. Preserving privacy in association rule mining using metaheuristic-based algorithms: a systematic literature review. IEEE Access. 2024; 12:21217-36.
[11] Gao Z, Han M, Liu S, Li A, Mu D. High utility itemsets mining based on hybrid harris hawk optimization and beluga whale optimization algorithms. Journal of Intelligent & Fuzzy Systems. 2024; 46(4):7567-602.
[12] Raj S, Ramesh D, Gantela P. CrossFIM: a spark-based hybrid frequent itemset mining algorithm for large datasets. Cluster Computing. 2025; 28(4):231.
[13] Abualigah L, Abu-Dalhoum E, Ikotun AM, Zitar RA, Alsoud AR, Khodadadi N, et al. Teaching–learning-based optimization algorithm: analysis study and its application. In Metaheuristic Optimization Algorithms 2024 (pp. 59-71). Morgan Kaufmann.
[14] Usman S, Lu C, Gao G. Flexible job-shop scheduling with limited flexible workers using an improved multiobjective discrete teaching–learning based optimization algorithm. Optimization and Engineering. 2024; 25(3):1237-70.
[15] Yadav R, Kaur M. Teaching learning based optimization-a review on background and development. In AIP conference proceedings 2024 (p. 030173). AIP Publishing LLC.
[16] Tiwari G, Dubey SM, Sharma G, Bansal A. Modified improved apriori algorithm for reduced time complexity. In 2025 4th OPJU international technology conference (OTCON) on smart computing for innovation and advancement in industry 5.0 2025 (pp. 1-5). IEEE.
[17] Shafaati S, Mohammadzadeh J. Scalable anomaly detection and pattern mining in IoT aquaponics systems via federated learning. In 11th international conference on web research (ICWR) 2025 (pp. 39-43). IEEE.
[18] Prabha M, KS VA. Tailoring content with keyword-based recommendation engine using machine learning. In international conference on intelligent and innovative technologies in computing, electrical and electronics (IITCEE) 2025 (pp. 1-6). IEEE.
[19] Agarwal S, Madhubala P, Singh MK, Arulini K, Keerthana S, Jenitha T. Association rule mining for optimizing inventory management in retail: improving stock control and reducing costs using ML model. In 3rd international conference on communication, security, and artificial intelligence (ICCSAI) 2025 (pp. 855-9). IEEE.
[20] Wang S, Meng Q. Intelligent enterprise financial anomaly detection using association rule mining for enhanced risk assessment. In 4th international conference on distributed computing and electrical circuits and electronics (ICDCECE) 2025 (pp. 1-7). IEEE.
[21] Fnu H, Murri S. Algorithmic approach for fraudulent transaction detection using market basket analysis with big data. In 4th international conference on distributed computing and electrical circuits and electronics (ICDCECE) 2025 (pp. 1-7). IEEE.
[22] Surapareddy PR, Wan HH, Pirouz M. A novel framework for early prediction of student grades using click stream analysis. In 15th annual computing and communication workshop and conference (CCWC) 2025 (pp. 272-80). IEEE.
[23] Zhong Y, Chen P, Zhang H. ESX: a self-generated control policy for remote access with SSH based on eBPF. IEEE Access. 2024; 13:6487-506.