Pair data division algorithm for handling data classification
Information Security and Applied Computing
2019 6th International Conference on Social Networks Analysis, Management and Security, SNAMS 2019
Data heterogeneity creates the complexity for understanding the different domains. Most of domains require sophisticated algorithm to understand the complexity. To handle the data heterogeneity problem, several methods have been proposed (e.g., Support Vector Machine (SVM), Decision Tree, Random Forest, Liner regression etc.). However, these existing methods suffer due to accuracy, outlying. In this paper, we introduce Pair Data Division (PDD) with support of SMV for handling the data heterogeneity. PDD divides the data into the pairs then further pairs are processed using SVM. The PDD algorithm is implanted by using Python and PyCharm is used as Interface Development Environment (IDE). Furthermore, training datasets will be given to PDD for testing purpose. Based on the testing results, we confirm that our proposed algorithm provides better accuracy as compared to known algorithms: Random Forest and Linear regression when handling the data heterogeneity.
Link to Published Version
Razaque, A., Kanapina, A., Sailaubek, M., Tsoy, D., Turganov, Z., Almiani, M., Amsaad, F., & Almahamed, M. A. (2019). Pair data division algorithm for handling data classification. 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) , 316–320. https://doi.org/10.1109/SNAMS.2019.8931871