A Study on Big Data Hadoop Map Reduce Job Scheduling

N Deshai; S Venkataramana; I Hemalatha; G P. S. Varma

doi:10.14419/ijet.v7i3.31.18202

Article Summary Abstract References Full Article How to cite

Authors
- N Deshai
- S Venkataramana
- I Hemalatha
- G P. S. Varma
2018-08-24

https://doi.org/10.14419/ijet.v7i3.31.18202
Big data, Hadoop, HDFS, Map Reduce, Scheduling,
Abstract

A latest tera to zeta era has been created during huge volume of data sets, which keep on collected from different social networks, machine to machine devices, google, yahoo, sensors etc. called as big data. Because day by day double the data storage size, data processing power, data availability and digital world data size in zeta bytes. Apache Hadoop is latest market weapon to handle huge volume of data sets by its most popular components like hdfs and mapreduce, to achieve an efficient storage ability and efficient processing on massive volume of data sets. To design an effective algorithm is a key factor for selecting nodes are important, to optimize and acquire high performance in Big data. An efficient and useful survey, overview, advantages and disadvantages of these scheduling algorithms provided also identified throughout this paper.
Â
Â
References
1. [1] Ehab Mohamed Zheng Hong, â€œHadoop-MapReduce Job Scheduling Algorithms Surveyâ€, 7^th International confrence on Cloud Computing and Big Data, (2016), 237â€“ 242.
  [2] Abhishek Verma, Ludmila Cherkasova, Roy H. Campbell, "ARIA: Automatic Resource Inference And Allocation for MapReduce environments", 8th Autonomic computing ACM, IEEE, (2011), 235 â€“ 244.
  [3] S. Bardhan, D. A. Menasce, "Queuing Network Models to Predict the Completion Time of the Map Phase of Map reduce Jobs", ICMG, IEEE, ( 2012).
  [4] J.V.Gautam, Harshadkumar, Vipul K Dabhi, Sanjay Chaud hary B,"A survey on job scheduling Algrithms in Big data processingâ€, (ICECCT), IEEE, (2015), 1 â€“ 11.
  [5] Nikos Zacheilas, Vana Kalogeraki, â€œPareto-Based Scheduling of Map Reduce Workloadsâ€ 19th International Symposium on Real-Time Distributed Computing (ISORC), IEEE, (2016), 174 â€“ 181.
  [6] Mark Yong, Nitin Garegrat, Shiwali Mohan, â€œTowards a Resource Aware Scheduler in Hadoopâ€, in Proc. ICWS, (2009), 102â€“ 109.
  [7] A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. A nthon, H. Liu, P. Wyckoff, R. Murthy, â€œHive - A W rehousing Solution over a Map-Reduce Frameworkâ€, PV LDB, (2009), 1626 â€“ 1629.
  [8] J. K. Laurila, D. Gatica-Perez, I. Aad, O. Bornet, T.-M.T.D O. Dousse, J. Eberle, and M. Miettinen, â€œThe Mobile Data Challenge: Big Data for Mobile Computing Researchâ€, Nokia Mobile Data Challenge Workshop, Newcastle, U K, (2012), 321 â€“ 330.
  [9] Sanjay G, Howard G, S.T.Leung, â€œThe Google file systemâ€ , 19^th Symposium Op. Sys. Principle, New York, (2003), 29 â€“ 43.
  [10] Casavant et al, â€œTaxonomy of scheduling in general purpose Distributed computing systems", IEEE Transactions, (1988), 141 â€“ 154.
  [11] N. Tiwari, "Classification Framework of Map Reduce Scheduling Algorithms", ACM Computing Surveys, 47, 3, (2015), 49.
  [12] Quan Chen, Daqiang Zhang, Minyi Guo, Qianni Deng, Song Guo, "SAMR: A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment", IEEE 10^th International Conference, (2010), 2736 â€“ 2743.
  [13] M.Zaharia, "Delay scheduling: a simple technique for achieving locality and fairness in cluster schedulingâ€, 5^th European conference on computers, New York, ( 2010), 265-278.
  [14] PEI Shu-jun, Zheng Xi-min, Hu Da-ming, Lou Shu-hui, Zhang Yuan-xu, â€œOptimization and Research of Hadoop Platform Based on FIFO Schedulerâ€, Seventh Internation al Conference on Measuring Technology and Mechatroni cs Automation, IEEE, (2015), 727 â€“ 730
  [15] Kc K, Anyanwu K, â€Scheduling Hadoop Jobs to Meet Deadlines Cloud Computing Technology and Science (C1oudCom)â€ 2^nd International Conference, IEEE, (2010), 388-392.
  [16] J. Chen et al, "A Task Scheduling Algorithm for Hadoop Platform", journal of Computers, 8, 4, (2013), 29 â€“ 936.
  [17] Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz, Ion Stoica, "Improving Map Reduce Performance in Heterogeneous environments", 8th USENIX Symposium, (2008), 26 â€“ 33.
  [18] Geetha J., N. Uday Bhaskar, P. Chenna Reddy, Neha Sniha,â€ Hadoop Scheduler with Deadline Costraint ", (IJCCSA), 4, 5, (2014), 1â€“ 7.
  [19] Mark Yong, Nitin Garegrat, Shiwali Mohan, â€œTowards a Resource Aware Scheduler in Hadoopâ€, Proc. ICWS, (2009), 102â€“109.
  [20] Archana G.K, V. Deeban, "HPCA: A Node Selection and Scheduling Method for Hadoop MapReduce", (ICCCTâ€™15) , IEEE, (2015), 368 â€“ 372.
  [21] Y. Wang, Ruonan Rao, Yinglin Wang, â€œA Round Robin with Multiple Feedback Job Scheduler in Hadoop", Progress in Informatics and Computing (PIC) International Conference, IEEE, (2014), 471 â€“ 475.
  [22] J. Chen, Dan Wang, Wenbing Zhao," A Task Scheduling Algorithm for Hadoop Platform ", Journal of Computers, 8, 4, (2013), 929â€“ 936.
  [23] Yingjie Guoa, Linzhi Wub, Wei Yuc, Bin Wud, Xiaotian Wang, â€œResearch and Improvement of Job Scheduling Algorithms in Hadoop Platformâ€, IEEE, (2010), 15â€“ 21.
  [24] Wei Zhang, Sundaresan R., â€œTimothy Wood, Min gfa Zhu " MIMP: Deadline and Interference Aware Sched-uling of Hadoop Virtual Machines",14^th IEEE/ACM I international Symposium on Cluster, Cloud and Grid Com putting, IEEE, (2014), 394 â€“ 403.
  [25] Saima Gulzar Ahmad, Chee Sun Liew, M. Mustafa Rafique, Ehsan Ullah Munir, Samee U. Khan " Data-Intensive Work flow Optimiztion based on Application Task Graph Partitioning in Heterogeneous Computing Systems ", Fourth International Conference on Big Data and Cloud Computing, IEEE, (2014), 129 â€“ 136.
  [26] Zhao-Rong Lai1, Che-Wei Chang, Xue Liu, Tei-Wei Kuo, Pi-Cheng Hsiu, â€œDeadline-Aware Load Balancing for MapReduce ", 20^th (RTCSA), IEEE, (2014), 1â€“10.
  [27] J. Wang, Jiayin Wang, Yi Yao, Ying Mao, Bo Sheng, Ning fang Mi, "FRESH: Fair and Efficient Slot Configuration and Scheduling for Hadoop Clusters", International Conference on Cloud Computing, IEEE, (2014), 761â€“ 768.
  [28] Paolo Bellavista, Antonio Corradi, Andrea Reale, and Nicola Ticca, " Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applicationsâ€, ACM 7^th International Conference on Utility and Cloud Co mputing, IEEE, (2014), 363 â€“ 370.
  [29] C. He, Y. Lu and D. Swanson," Real-Time Scheduling in MapReduce Clusters ", doi: 10.1109/HPCC.and.EUC. 216, 2013.
Downloads
How to Cite
Deshai, N., Venkataramana, S., Hemalatha, I., & P. S. Varma, G. (2018). A Study on Big Data Hadoop Map Reduce Job Scheduling. International Journal of Engineering & Technology, 7(3.31), 59-65. https://doi.org/10.14419/ijet.v7i3.31.18202
ACM

ACS

APA

ABNT

Chicago

Harvard

IEEE

MLA

Turabian

Vancouver

Download Citation

Endnote/Zotero/Mendeley (RIS)

BibTeX
Received date: 2018-08-25

Accepted date: 2018-08-25

Published date: 2018-08-24

A Study on Big Data Hadoop Map Reduce Job Scheduling

Authors

Abstract

References

Downloads

How to Cite

Published