Air pollution analysis using big data technology: towards a better world

  • Authors

    • Siva Krishna kvs
    • Saikumar Pulluri
    • Kamalakannan J
    2018-06-08
    https://doi.org/10.14419/ijet.v7i2.33.15532
  • Air Pollution, Big Data, Classifiers, Decision Tree, Logistic Regression, Naïve Byes, Pollutants, Random Forest.
  • Big data is generally perceived as being one of the most intense drivers to advance profitability, Enhance effectiveness, furthermore, bolsters advancement. It is quite anticipated which would analyze big data and transform big data into big values. To find the answer of the fascinat-ing question whether there are characteristic connections between the two inclinations of big data and green challenges, a study has exam-ined the issues on greening the entire life cycle of big data frameworks. As the data which is captured from different sensors is huge, to analysis that data and find patterns to predict the future data, we need big data technology which can handle that huge amount of data in a better way. In this paper, we have used different classifiers to analysis the results based on available data in the spark framework using the Python and Scala programming languages. We showed a comparative study between python and Scala technology based on classifiers. For this research data set of Andhra-Pradesh and Tamilnadu (states in India) are utilized to show the analysis of air pollution with the help of big data concept. We compared the classifiers on based on time and accuracy. Generally random forest gives good results but in our case deci-sion tree and logistic regression have given high accuracy.

     

  • References

    1. [1] Carson K. Leung, Fan Jiang, Hao Zhang, and Adam G.M. Pazdor, “A Data Science Model for Big Data Analytics of Frequent Patterns†978-1-5090-4065-0© 2016 IEEE.

      [2] Mihaela Oprea, Hai-Ying Liu, “A knowledge-based approach for PM2.5 air pollution effects analysis†978-1-4673-9910-4 ©2016 IEEE.

      [3] Haripriya Ayyalasomayajula, Edgar Gabriel, Peggy Lindner, Daniel Price,†Air Quality Simulations using Big Data Programming Models†978-1-5090-2251-9/16 © 2016 IEEE.

      [4] Elena Baralis, Tania Cerquitelli, Silvia Chiusano, Paolo Garza, and Mohammad Reza Kavoosifar, “Analyzing air pollution on the urban environment†MIPRO 2016, May 30 - June 3, 2016.

      [5] Navjot Kaur Walia, Parul Kalra, Deepti Mehrotra, “Prediction of Carbon Stock Available in Forest using Naive Bayes Approach†978-1-5090-0210-8/16 © 2016 IEEE.

      [6] David G. Rickerby, Andreas N. Skouloudis, “Big data for innovative air-pollution assessments in the era of verifiable regulatory decisions†978-1-5090-0058-6/16 ©2016 European Union.

      [7] Jinsong Wu, Senior Member, IEEE, Song Guo, Senior Member, IEEE, Jie Li, Senior Member, IEEE, “Big Data application in Green Challenges†1932-8184 © 2016 IEEE.

      [8] James Manyika, Michael Chui, Brad Brown, “Big data: The next frontier for innovation, competition, and productivity†Report, McKinsey Global Institute, USA, May 2011.

      [9] Nancy Agrawal and Arushi Baboota, “The Importance of Including Carcinogenic Benzene in Real-Time Ambient Air Quality Data in Delhi†COMSNETS 2016 - Net Health Workshop, 978-1-4673-9622-6/16 ©2016 IEEE.

      [10] Sreemoyee Roy and Abhik Mukherjee, “Information system analysis for monitoring of air quality in peri-urban Howrah†2012 Third International Conference on Emerging Applications of Information Technology (EAIT), 978-1-4673-1827-3/12 ©2012 IEEE.

      [11] Lily Bui, “Breathing Smarter: A critical look at Representation of air quality sensing data across platform and publics†2015 IEEE.

      [12] Wenjun Lv, Yu Kang, Zerui Li, Yunbo Zhao “Fusion Approach for Real-Time Mapping Street Atmospheric Pollution Concentration†978-1-5090-1729-4/16©2016 IEEE.

      [13] A. Cuzzocrea and C. K. Leung, “Computing theoretically-sound upper bounds to expected support for frequent pattern mining problems over uncertain big data,†Proc. IPMU 2016, Part II, pp. 379–392.

      [14] Elena Baralis, Tania Cerquitelli, Silvia Chiusano, “Analyzing air pollution on the urban environment†MIPRO 2016, May 30 - June 3, 2016, Opatija, Croatia.

      [15] Abhishek Pandey, Amit Sinha, “An Analytical Approach to Check the Development of any State in India†2016 Second International Conference on Computational Intelligence & Communication Technology, 978-1-5090-0210-8/16 © 2016 IEEE.

      [16] Valerio Persico, Antonio Montieri, Antonio Pescape, “On the Network Performance of Amazon S3 Cloud-storage Service†2016 fifth IEEE International Conference on Cloud Networking, 978-1-5090-5093-2/16 © 2016 IEEE.

  • Downloads

  • How to Cite

    Krishna kvs, S., Pulluri, S., & J, K. (2018). Air pollution analysis using big data technology: towards a better world. International Journal of Engineering & Technology, 7(2.33), 919-923. https://doi.org/10.14419/ijet.v7i2.33.15532