BigData (Hadoop,Spark & Memcached)

Overview

Apache Hadoop and Spark are gaining prominence in handling Big Data and analytics. Similarly, Memcached in Web 2.0 environment is becoming important for large-scale query processing. These middleware are traditionally written with sockets and do not deliver best performance on datacenters with modern high performance networks. In this tutorial, we will provide an in-depth overview of the architecture of Hadoop components (HDFS, MapReduce, RPC, HBase, etc.), Spark and Memcached. We will examine the challenges in re-designing the networking and I/O components of these middleware with modern interconnects, protocols (such as InfiniBand, iWARP, RoCE, and RSocket) with RDMA and storage architecture. Using the publicly available software packages in the High-Performance Big Data (HiBD, http://hibd.cse.ohio-state.edu) project, we will provide case studies of the new designs for several Hadoop/Spark/Memcached components and their associated benefits. Through these case studies, we will also examine the interplay between high performance interconnects, storage systems (HDD and SSD), and multi-core platforms to achieve the best solutions for these components.

Software Distribution

Link to BigData (Hadoop,Spark & Memcached)

Results

Link to HiBD Resulsts

Journals (6)
1	M. W. Rahman, N. Islam, X. Lu, D. Shankar, and DK Panda, MR-Advisor: A Comprehensive Tuning, Profiling, and Prediction Tool for MapReduce Execution Frameworks on HPC Clusters, Journal of Parallel and Distributed Computing (JPDC), Nov 2017.
2	X. Lu, D. Shankar, and DK Panda, Scalable and Distributed Key-Value Store-based Data Management Using RDMA-Memcached, "IEEE Data Engineering Bulletin (DEBull), Volume 40", Bulletin of the Technical Committee on Data Engineering (TCDE), (Invited Paper), Mar 2017.
3	N. Islam, X. Lu, M. W. Rahman, J. Jose, and DK Panda, A Micro-Benchmark Suite for Evaluating HDFS Operations on Modern Clusters, Special Issue of LNCS on papers from WBDB '12 Workshop, May 2012.
4	D. Shankar, X. Lu, M. W. Rahman, N. Islam, and DK Panda, Characterizing and benchmarking stand-alone Hadoop MapReduce on modern HPC clusters, The Journal of Supercomputing - Springer, Jun 2016.
5	M. W. Rahman, N. Islam, X. Lu, and DK Panda, A Comprehensive Study of MapReduce over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters, IEEE Transactions on Parallel and Distributed Systems, Jul 2016.
6	X. Lu, H. Shi, R. Biswas, M. H. Javed, and DK Panda, DLoBD: A Comprehensive Study of Deep Learning over Big Data Stacks on HPC Clusters, IEEE Transactions on Multi-Scale Computing Systems, Jun 2018.

Conferences & Workshops (59)
1	MPI4Spark Meets YARN: Enhancing MPI4Spark through YARN support for HPC K. Al Attar, A. Shafi, H. Subramoni, and DK Panda, 11th International Workshop on Distributed Storage and Blockchain Technologies for Big Data (IEEE Big Data '23), Dec 2023 [Bib - Plain]
2	Spark Meets MPI: Towards High-Performance Communication Framework for Spark using MPI K. Al Attar, A. Shafi, M. Abduljabbar, H. Subramoni, and DK Panda, IEEE Cluster '22, Sep 2022 [Bib - Plain]
3	Towards Java-based HPC using the MVAPICH2 Library: Early Experiences K. Al Attar, A. Shafi, H. Subramoni, and DK Panda, HIPS '22 (IPDPSW), May 2022 [Bib - Plain]
4	Efficient MPI-based Communication for GPU-Accelerated Dask Applications A. Shafi, J. Hashmi, H. Subramoni, and DK Panda, The 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, May 2021 [Bib - Plain]
5	SIMD-KV: Accelerating End-to-End Performance in Key-Value Stores with SIMD and RDMA over Emerging CPU Architectures D. Shankar, X. Lu, and DK Panda, 26th IEEE International Conference on High Performance Computing, Data, Analytics and Data Science (HiPC '19), Dec 2019 [Bib - Plain]
6	SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures D. Shankar, X. Lu, and DK Panda, 2019 IEEE International Symposium on Workload Characterization, Nov 2019 [Best Paper Finalist] [Bib - Plain]
7	Accelerating TensorFlow with Adaptive RDMA-based gRPC R. Biswas, X. Lu, and DK Panda, 25th IEEE International Conference on High Performance Computing, Data, and Analytics, Dec 2018 [Bib - Plain]
8	Spark-uDAPL: Cost-Saving Big Data Analytics on Microsoft Azure Cloud with RDMA Networks X. Lu, D. Shankar, H. Shi, and DK Panda, 2018 IEEE International Conference on Big Data, Dec 2018 [Short Paper] [Bib - Plain]
9	EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures H. Shi, X. Lu, and DK Panda, 2018 International Symposium on Benchmarking, Measuring and Optimizing, Dec 2018 [Best Paper Award] [Bib - Plain]
10	High-Performance Multi-Rail Erasure Coding Library over Modern Data Center Architectures: Early Experiences H. Shi, X. Lu, D. Shankar, and DK Panda, ACM Symposium on Cloud Computing (SoCC) 2018, Oct 2018 [Poster Paper] [Bib - Plain]
11	Cutting the Tail: Designing High Performance Message Brokers to Reduce Tail Latencies in Stream Processing M. H. Javed, X. Lu, and DK Panda, IEEE Cluster 2018, Sep 2018 [Bib - Plain]
12	Designing a Micro-Benchmark Suite to Evaluate gRPC for TensorFlow: Early Experiences R. Biswas, X. Lu, and DK Panda, The Ninth Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, Mar 2018 [Bib - Plain]
13	Characterizing and Accelerating Indexing Techniques on Distributed Ordered Tables S. Gugnani, X. Lu, H. Qi, L. Zha, and DK Panda, 2017 IEEE International Conference on Big Data (IEEE Big Data 2017), Dec 2017 [Bib - Plain]
14	Performance Characterization and Acceleration of Big Data Workloads on OpenPOWER System X. Lu, H. Shi, D. Shankar, and DK Panda, 2017 IEEE International Conference on Big Data (IEEE Big Data 2017), Dec 2017 [Bib - Plain]
15	NVMD: Non-Volatile Memory Assisted Design for Accelerating MapReduce and DAG Execution Frameworks on HPC Systems M. W. Rahman, N. Islam, X. Lu, and DK Panda, 2017 IEEE International Conference on Big Data (IEEE Big Data 2017), Dec 2017 [Short Paper] [Bib - Plain]
16	Characterization of Big Data Stream Processing Pipeline: A Case Study using Flink and Kafka M. H. Javed, X. Lu, and DK Panda, 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, Dec 2017 [Bib - Plain]
17	Characterizing Deep Learning over Big Data (DLoBD) Stacks on RDMA-capable Networks X. Lu, H. Shi, M. H. Javed, R. Biswas, and DK Panda, The 25th Annual Symposium on High-Performance Interconnects (HotI), Aug 2017 [Bib - Plain]
18	High-Performance and Resilient Key-Value Store with Online Erasure Coding for Big Data Workloads D. Shankar, X. Lu, and DK Panda, 37th IEEE International Conference on Distributed Computing Systems (ICDCS 2017), Jun 2017 [Bib - Plain]
19	Benchmarking Kudu Distributed Storage Engine on High-Performance Interconnects and Storage Devices N. Islam, M. W. Rahman, X. Lu, and DK Panda, The 8th Workshop on Big Data Benchmarks, Performance, Optimization, and Emerging Hardware (BPOE-8), Apr 2017 [Bib - Plain]
20	NRCIO: NVM-aware RDMA-based Communication and I/O Schemes for Big Data Analytics X. Lu, N. Islam, M. W. Rahman, and DK Panda, The 8th Annual Non-Volatile Memories Workshop (NVMW '17), Mar 2017 [Bib - Plain]
21	Designing Virtualization-aware and Automatic Topology Detection Schemes for Accelerating Hadoop on SR-IOV-enabled Clouds S. Gugnani, X. Lu, and DK Panda, 8th IEEE International Conference on Cloud Computing Technology and Science (IEEE CloudCom '16), Dec 2016 [Bib - Plain]
22	Impact of HPC Cloud Networking Technologies on Accelerating Hadoop RPC and HBase X. Lu, D. Shankar, S. Gugnani, H. Subramoni, and DK Panda, 8th IEEE International Conference on Cloud Computing Technology and Science (IEEE CloudCom '16), Dec 2016 [Bib - Plain]
23	Performance Characterization of Hadoop Workloads on SR-IOV-enabled Virtualized InfiniBand Clusters S. Gugnani, X. Lu, and DK Panda, 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT'16), Dec 2016 [Bib - Plain]
24	Efficient Data Access Strategies for Hadoop and Spark on HPC Cluster with Heterogeneous Storage N. Islam, M. W. Rahman, X. Lu, and DK Panda, 2016 IEEE International Conference on Big Data, Dec 2016 [Bib - Plain]
25	High-Performance Design of Apache Spark with RDMA and Its Benefits on Various Workloads X. Lu, D. Shankar, S. Gugnani, and DK Panda, 2016 IEEE International Conference on Big Data, Dec 2016 [Bib - Plain]
26	Boldio: A Hybrid and Resilient Burst-Buffer Over Lustre for Accelerating Big Data I/O D. Shankar, X. Lu, and DK Panda, 2016 IEEE International Conference on Big Data, Dec 2016 [Short Paper] [Bib - Plain]
27	Can Non-Volatile Memory Benefit MapReduce Applications on HPC Clusters? M. W. Rahman, N. Islam, X. Lu, and DK Panda, First Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems (PDSW-DISCS, SC Workshop), Nov 2016 [Bib - Plain]
28	MR-Advisor: A Comprehensive Tuning Tool for Advising HPC Users to Accelerate MapReduce Applications on Supercomputers M. W. Rahman, N. Islam, X. Lu, D. Shankar, and DK Panda, 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'16), Oct 2016 [Bib - Plain]
29	Experiences and Benefits of Running RDMA Hadoop and Spark on SDSC Comet M. Tatineni, X. Lu, D. J. Choi, A. Majumdar, and DK Panda, The 5th Annual Conference on Extreme Science and Engineering Discovery Environment (XSEDE), Jul 2016 [Bib - Plain]
30	High Performance Design for HDFS with Byte-Addressability of NVM and RDMA N. Islam, M. W. Rahman, X. Lu, and DK Panda, 24th International Conference on Supercomputing (ICS '16), Jun 2016 [Bib - Plain]
31	High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits D. Shankar, X. Lu, N. Islam, M. W. Rahman, and DK Panda, The 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS '16), May 2016 [Bib - Plain]
32	Characterizing Cloudera Impala Workloads with BigDataBench on InfiniBand Clusters K. Kulkarni, X. Lu, and DK Panda, The 7th Workshop on Big Data Benchmarks, Performance, Optimization, and Emerging Hardware (BPOE-7), Apr 2016 [Bib - Plain]
33	Benchmarking Key-Value Stores on High-Performance Storage and Interconnects for Web-Scale Workloads D. Shankar, X. Lu, M. W. Rahman, N. Islam, and DK Panda, 2015 IEEE International Conference on Big Data, Oct 2015 [Short Paper] [Bib - Plain]
34	Performance Characterization and Acceleration of In-Memory File Systems for Hadoop and Spark Applications on HPC Clusters N. Islam, M. W. Rahman, X. Lu, D. Shankar, and DK Panda, 2015 IEEE International Conference on Big Data, Oct 2015 [Bib - Plain]
35	Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-based Key-Value Store N. Islam, D. Shankar, X. Lu, M. W. Rahman, and DK Panda, The 44th International Conference on Parallel Processing (ICPP '15), Sep 2015 [Bib - Plain]
36	A Plugin-based Approach to Exploit RDMA Benefits for Apache and Enterprise HDFS A. Bhat, N. Islam, X. Lu, M. W. Rahman, D. Shankar, and DK Panda, The Sixth workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, Aug 2015 [Bib - Plain]
37	High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA M. W. Rahman, X. Lu, N. Islam, R. Rajachandrasekar, and DK Panda, IPDPS '15, May 2015 [Bib - Plain]
38	Triple-H: A Hybrid Approach to Accelerate HDFS on HPC Clusters with Heterogeneous Storage Architecture N. Islam, X. Lu, M. W. Rahman, D. Shankar, and DK Panda, CCGrid '15, May 2015 [Bib - Plain]
39	Can RDMA Benefit On-Line Data Processing Workloads with Memcached and MySQL D. Shankar, X. Lu, J. Jose, M. W. Rahman, N. Islam, and DK Panda, ISPASS '15, Mar 2015 [Poster Paper] [Bib - Plain]
40	In-Memory I/O and Replication for HDFS with Memcached: Early Experiences N. Islam, X. Lu, M. W. Rahman, R. Rajachandrasekar, and DK Panda, IEEE BigData'14, Oct 2014 [Short Paper] [Bib - Plain]
41	A Micro-benchmark Suite for Evaluating Hadoop MapReduce on High-Performance Networks D. Shankar, X. Lu, M. W. Rahman, N. Islam, and DK Panda, The 5th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-5), Sep 2014 [Bib - Plain]
42	Performance Modeling for RDMA-Enhanced Hadoop MapReduce M. W. Rahman, X. Lu, N. Islam, and DK Panda, 43rd International Conference on Parallel Processing (ICPP), Sep 2014 [Bib - Plain]
43	MapReduce over Lustre: Can RDMA-based Approach Benefit? M. W. Rahman, X. Lu, N. Islam, R. Rajachandrasekar, and DK Panda, 20th International European Conference on Parallel Processing (Euro-Par), Aug 2014 [Bib - Plain]
44	Accelerating Spark with RDMA for Big Data Processing: Early Experiences X. Lu, M. W. Rahman, N. Islam, D. Shankar, and DK Panda, International Symposium on High Performance Interconnects (HotI'14), Aug 2014 [Bib - Plain]
45	HOMR: A Hybrid Approach to Exploit Maximum Overlapping in MapReduce over High Performance Interconnects M. W. Rahman, X. Lu, N. Islam, and DK Panda, International Conference on Supercomputing (ICS '14), Jun 2014 [Bib - Plain]
46	SOR-HDFS: A SEDA-based Approach to Maximize Overlapping in RDMA-Enhanced HDFS N. Islam, X. Lu, M. W. Rahman, and DK Panda, ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC '14), Short Paper, Jun 2014 [Bib - Plain]
47	Does RDMA-based Enhanced Hadoop MapReduce Need a New Performance Model? M. W. Rahman, X. Lu, N. Islam, and DK Panda, ACM Symposium on Cloud Computing (SoCC '13), Poster Paper, Oct 2013 [Bib - Plain]
48	High-Performance Design of Hadoop RPC with RDMA over InfiniBand X. Lu, N. Islam, M. W. Rahman, J. Jose, H. Subramoni, H. Wang, and DK Panda, International Conference on Parallel Processing (ICPP '13), Oct 2013 [Bib - Plain]
49	Can Parallel Replication Benefit HDFS for High-Performance Interconnects? N. Islam, X. Lu, M. W. Rahman, and DK Panda, International Symposium on High-Performance Interconnects (HotI '13), Aug 2013 [Bib - Plain]
50	A Micro-Benchmark Suite for Evaluating Hadoop RPC on High-Performance Networks X. Lu, M. W. Rahman, N. Islam, and DK Panda, International Workshop on Big Data Benchmarking (WBDB '13), Jul 2013 [Bib - Plain]
51	High-Performance RDMA-based Design of Hadoop MapReduce over InfiniBand M. W. Rahman, N. Islam, X. Lu, J. Jose, H. Subramoni, H. Wang, and DK Panda, International Workshop on High Performance Data Intensive Computing (HPDIC), May 2013 [Bib - Plain]
52	A Micro-benchmark Suite for Evaluating HDFS Operations on Modern Clusters N. Islam, X. Lu, M. W. Rahman, J. Jose, and DK Panda, Special Issue of LNCS on papers from WBDB '12 Workshop., May 2013 [Bib - Plain]
53	High Performance RDMA-Based Design of HDFS over InfiniBand N. Islam, M. W. Rahman, J. Jose, R. Rajachandrasekar, H. Wang, H. Subramoni, C. Murthy, and DK Panda, International Conference on Supercomputing (SC '12), Nov 2012 [Slides] [Bib - Plain]
54	SSD-Assisted Hybrid Memory to Accelerate Memcached over High Performance Networks X. Ouyang, N. Islam, R. Rajachandrasekar, J. Jose, M. Luo, H. Wang, and DK Panda, International Conference on Parallel Processing (ICPP '12), Sep 2012 [Bib - Plain]
55	High-Performance Design of HBase with RDMA over InfiniBand J. Huang, X. Ouyang, J. Jose, M. W. Rahman, H. Wang, M. Luo, H. Subramoni, C. Murthy, and DK Panda, International Parallel and Distributed Processing Symposium (IPDPS '12), May 2012 [Bib - Plain]
56	Understanding the Communication Characteristics in HBase: What are the Fundamental Bottlenecks? M. W. Rahman, J. Huang, J. Jose, X. Ouyang, H. Wang, N. Islam, H. Subramoni, C. Murthy, and DK Panda, International Symposium on Performnce Analysis of Systems and Software (ISPASS '12), Poster Paper, Apr 2012 [Bib - Plain]
57	Memcached Design on High Performance RDMA Capable Interconnects J. Jose, H. Subramoni, M. Luo, M. Zhang, J. Huang, M. W. Rahman, N. Islam, X. Ouyang, H. Wang, S. Sur, and DK Panda, International Conference on Parallel Processing (ICPP '11), Sep 2011 [Slides] [Bib - Plain]
58	Scalable Memcached design for InfiniBand Clusters using Hybrid Transports J. Jose, H. Subramoni, K. Kandalla, M. W. Rahman, H. Wang, S. Narravula, and DK Panda, International Symposium on Cluster, May 2011 [Bib - Plain]
59	Can High-Performance Interconnects Benefit Hadoop Distributed File System? S. Sur, H. Wang, J. Huang, X. Ouyang, and DK Panda, Workshop on Micro Architectural Support for Virtualization, Dec 2010 [Slides] [Bib - Plain]

Ph.D. Disserations (3)
1	D. Shankar, Designing Fast, Resilient and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters, Jul 2019
2	J. Zhang, Designing and Building Efficient HPC Cloud with Modern Networking Technologies on Heterogeneous HPC Clusters, Jul 2018
3	J. Jose, Designing High Performance and Scalable Unified Communication Runtime (UCR) for HPC and Big Data Middleware, Aug 2014

M.S. Thesis (3)
1	R. Biswas, Benchmarking and Accelerating TensorFlow-based Deep Learning on Modern HPC Systems, Jul 2018
2	K. Kulkarni, Performance Characterization and Improvements of SQL-on-Hadoop Systems, Aug 2016
3	A. Bhat, RDMA-based Plugin Design and Profiler for Apache and Enterprise Hadoop Distributed Filesystem, Aug 2015

NOWLAB: Network Based Computing Lab