Outlier Detection On Graduation Data Of Darussalam Gontor University Using One-Class Support Vector Machine


Deteksi Outlier Pada Data Kelulusan Mahasiswa Universitas Darussalam Gontor Menggunakan One-Class Support Vector Machine


  • (1) * Oddy Virgantara Putra            Universitas Darussalam Gontor  
            Indonesia

  • (2)  Triana Harmini            Universitas Darussalam Gontor  
            Indonesia

  • (3)  Ahmad Saroji            Universitas Darussalam Gontor  
            Indonesia

    (*) Corresponding Author

Abstract

Outlier detection is an important field of study because it is able to detect abnormal data distribution from a set of data. In the student graduation data, there are students with high semester GPA but do not graduate on time but students with low semester GPA can graduate on time. This study aims to detect outlier values ​​in student graduation data for the 2020-2021 class. Factors (attributes) used in this study are Student Academic Support Credit Scores (AKPAM) and Social Studies from semester one to semester six. The dataset used is 1204 graduates. The outlier detection method used is One-Class Support Vector Machine (SVM). One-class SVM is a derivative of SVM method that detects outliers based on data outside the specified class. The results of outlier detection using One-Class SVM method with three types of kernels produce the following reference values: kernel 'rbf' n by 91.4%, kernel 'linear' by 90% and kernel 'poly' by 90%. After normalization using MinMaxScaler the reference value increased by 2% in each kernel. The results of testing the One-Class SVM method get an average 90.3%, thus it can be concluded that the One-Class SVM method is feasible to be used as an outlier detection method.

Downloads

Download data is not yet available.

References

I. Kartikasari, “laporan kamisan 23 September 2021.pdf,” BAAK Data, Ponorogo.

T. Kieu, B. Yang, and C. S. Jensen, “Outlier detection for multidimensional time series using deep neural networks,” Proc. - IEEE Int. Conf. Mob. Data Manag., vol. 2018-June, pp. 125–134, 2018, doi: 10.1109/MDM.2018.00029. DOI: https://doi.org/10.1109/MDM.2018.00029

P. Nair and I. Kashyap, “Hybrid Pre-processing Technique for Handling Imbalanced Data and Detecting Outliers for KNN Classifier,” Proc. Int. Conf. Mach. Learn. Big Data, Cloud Parallel Comput. Trends, Prespectives Prospect. Com. 2019, pp. 460–464, 2019, doi: 10.1109/COMITCon.2019.8862250. DOI: https://doi.org/10.1109/COMITCon.2019.8862250

I. Atastina, “Analysis of missing value handling by collateral missing value estimation (cmve) method,” 2011.

D. A. Nasution, H. H. Khotimah, and N. Chamidah, “Perbandingan Normalisasi Data untuk Klasifikasi Wine Menggunakan Algoritma K-NN,” Comput. Eng. Sci. Syst. J., vol. 4, no. 1, p. 78, 2019, doi: 10.24114/cess.v4i1.11458. DOI: https://doi.org/10.24114/cess.v4i1.11458

Y. Wang, J. Wong, and A. Miner, “Anomaly intrusion detection using one class SVM,” Proc. fron Fifth Annu. IEEE Syst. Man Cybern. Inf. Assur. Work. SMC, pp. 358–364, 2004, doi: 10.1109/iaw.2004.1437839. DOI: https://doi.org/10.1109/IAW.2004.1437839

R. Arthana, “Mengenal Accuracy, Precision, Recall dan Specificity serta yang diprioritaskan dalam Machine Learning,” Arthana, Resika. https://rey1024.medium.com/mengenal-accuracy-precission-recall-dan-specificity-serta-yang-diprioritaskan-b79ff4d77de8.

D. S. Informasi, MENGGUNAKAN METODE SUPPORT VECTOR MACHINE FORECASTING THE NUMBER OF TUBERCULOSIS DISEASE PATIENTS IN EAST JAVA REGION USING. 2019.

Picture in here are illustration from public domain image (License) or provided by the author, as part of their works
Published
2021-12-01
 
How to Cite
[1]
O. V. Putra, T. Harmini, and A. Saroji, “Outlier Detection On Graduation Data Of Darussalam Gontor University Using One-Class Support Vector Machine”, PELS, vol. 2, Dec. 2021.