Big Data in E-government: Classification and Prediction using Machine Learning Algorithms


Big data
Data mining
Machine learning


Many countries have used big data to develop their institutions, such as Estonia in policing, India in health care, and the development of agriculture in the United Kingdom, etc. Data is very important as it is no longer oil that is the most valuable resource in the world, but data. This research examines ways to develop the Iraqi state institutions by using the big data of one of its institutions (electronic civil registry) (ECR) using the mining and analysis of this data. The pre-processing and analysis of this data are carried out depending on the needs of each institution and then using Machine Learning (ML) techniques. Its use has shown remarkable results in many areas, especially in data analysis, classification and forecasting. We applied five ML algorithms that are Support Vector Machine (SVM), Decision Tree (DT), K-Nearest Neighbor (KNN), Random Forest (RF), and Naive Bayes (NB) carried out in the Orange Data Mining tool. According to the simulation results of the proposed system, the accuracy of the classifications was around 100%, 99%, and 100% for the military department by the SVM classifier, the social welfare department by the RF classifier, and the statistics-planning department by the SVM classifier, respectively.