Data Mining Techniques for Predicting Employability in Morocco

 
 
 
  • Abstract


    One of the biggest challenges for Big Data applications is exploring large volumes of data and extracting valuable information and knowledge to guide future actions. Employment is the main form of social integration, a factor in improving living conditions and preventing the risks of poverty and vulnerability, and the most appropriate indicator for assessing the level of social cohesion in a country. Mining employability data gives decision makers a clearer view of that data and reveals opportunities for improvement in this sector. In this paper, we present an experimental study comparing three classification algorithms on employability data from Morocco: Decision tree, Logistic regression, and Naïve Bayes. These algorithms were selected with reference to the top 10 data mining algorithms identified by the IEEE International Conference on Data Mining. The objective of our experiment is to identify the most efficient and best-suited algorithm for employability data.
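
    As a rough illustration of the kind of comparison described above, the sketch below scores the three classifiers by cross-validated accuracy. It is a minimal sketch under stated assumptions, not the study's actual procedure: the use of scikit-learn, the synthetic data from `make_classification`, the 5-fold setting, accuracy as the sole metric, and the helper name `compare_classifiers` are all illustrative choices that do not come from the abstract.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier


def compare_classifiers(X, y, folds=5):
    """Return the mean cross-validated accuracy of each classifier.

    Hypothetical helper: the abstract does not describe the evaluation
    protocol, so k-fold cross-validation is an assumption.
    """
    models = {
        "Decision tree": DecisionTreeClassifier(random_state=0),
        "Logistic regression": LogisticRegression(max_iter=1000),
        "Naive Bayes": GaussianNB(),
    }
    results = {}
    for name, model in models.items():
        fold_scores = cross_val_score(model, X, y, cv=folds, scoring="accuracy")
        results[name] = float(fold_scores.mean())
    return results


# Synthetic stand-in data (assumption): the real employability data set
# is not available here, so a generated binary problem is used instead.
X, y = make_classification(
    n_samples=500, n_features=10, n_informative=5, random_state=0
)

scores = compare_classifiers(X, y)
best = max(scores, key=scores.get)

for name, accuracy in scores.items():
    print(f"{name}: {accuracy:.3f}")
print("Highest mean accuracy:", best)
```

    On real data, accuracy alone may not be enough to call one algorithm the best suited; training time or class-sensitive metrics could be added to the same loop by changing the `scoring` argument.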

     


     

  • Keywords


    Data mining, Big Data, Employability, Classification, Decision tree, Logistic regression, Naïve Bayes.



 


Article ID: 23237
 
DOI: 10.14419/ijet.v7i4.32.23237




Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.