Xiaoyi Lu

  • Research Assistant Professor, Computer Science & Engineering
  • 472 Dreese Laboratories
    2015 Neil Ave
    Columbus, OH 43210
  • 614-292-6371

Chapters

2017

  • Lu, X.; Zhang, J.; Panda, D.K. 2017. "Building Efficient HPC Cloud with SR-IOV-Enabled InfiniBand: The MVAPICH2 Approach." In Research Advances in Cloud Computing, edited by Chaudhary, S.; Somani, G.; Buyya, R.,

2016

  • 2016. "Accelerating Big Data Processing on Modern HPC Clusters." In Conquering Big Data Using High Performance Computing, edited by Arora, R.,

Journal Articles

2019

  • Jiang, Z.; Gao, W.; Wang, L.; Xiong, X. et al., 2019, "HPC AI500: A Benchmark Suite for HPC AI Systems." Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11459 LNCS, 10-22.
  • Javed, M.H.; Ibrahim, K.Z.; Lu, X., 2019, "Performance analysis of deep learning workloads using roofline trajectories." CCF Transactions on High Performance Computing 1, no. 3, 224-239.

2018

  • Lu, X.; Shi, H.; Biswas, R.; Javed, M.H. et al., 2018, "DLoBD: A Comprehensive Study of Deep Learning over Big Data Stacks on HPC Clusters." IEEE Transactions on Multi-Scale Computing Systems
  • Chu, C.H.; Lu, X.; Awan, A.A.; Subramoni, H. et al., 2018, "Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast." IEEE Transactions on Parallel and Distributed Systems
  • Panda, D.; Lu, X-Y.; Subramoni, H., 2018, "Networking and communication challenges for post-exascale systems." Frontiers of Information Technology & Electronic Engineering 19, no. 10, 1230-1235.

2017

  • Rahman, M.W.U.; Islam, N.S.; Lu, X.; Panda, D.K., 2017, "A Comprehensive Study of MapReduce over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters." IEEE Transactions on Parallel and Distributed Systems 28, no. 3, 633-646.
  • Rahman, M.W.U.; Islam, N.S.; Lu, X.; Shankar, D. et al., 2017, "MR-Advisor: A comprehensive tuning, profiling, and prediction tool for MapReduce execution frameworks on HPC clusters." Journal of Parallel and Distributed Computing

2016

  • Shankar, D.; Lu, X.; Wasi-ur-Rahman, M.; Islam, N. et al., 2016, "Characterizing and benchmarking stand-alone Hadoop MapReduce on modern HPC clusters." Journal of Supercomputing 1-28.

2015

  • Liang, F.; Lu, X., 2015, "Accelerating Iterative Big Data Computing Through MPI." Journal of Computer Science and Technology 30, 283-294.

2014

  • Shankar, D.; Lu, X.; Wasi-ur-Rahman, M.; Islam, N. et al., 2014, "A Micro-benchmark Suite for Evaluating Hadoop MapReduce on High-Performance networks." Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 8807, 19-33.
  • Liang, F.; Feng, C.; Lu, X.; Xu, Z., 2014, "Performance Benefits of DataMPI: A Case Study with BigDataBench." Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 8807, 111-123.
  • Han, R.; Lu, X.; Xu, J., 2014, "On Big Data Benchmarking." Big Data Benchmarks, Performance Optimization, and Emerging Hardware - 4th and 5th Workshops, BPOE 2014, Salt Lake City, USA, March 1, 2014 and Hangzhou, China, September 5, 2014, Revised Selected Papers 3-18.

2011

  • Lu, X.; Lin, J.; Zha, L., 2011, "Architecture and key technologies of LingCloud." Jisuanji Yanjiu yu Fazhan/Computer Research and Development 48, 1111-1122.

Presentations

  • "Designing High-Performance, Scalable, and Resilient Middleware over RDMA-based Networks." 2019, Presented at HPC-AI Advisory Council China Workshop, co-located with National Annual Conference on High Performance Computing (HPC China),
  • "Paving the Way for RDMA-based Networked Systems into High-Performance Data Centers." 2019, Presented at Forum on Intelligent Systems, co-located with BenchCouncil International Symposium on Intelligent Computers,
  • "Scalable, Resilient, and Distributed Key-Value Store-Based Data Management over RDMA Networks." 2019, Presented at International OpenFabrics Alliance Workshop (OFAW),
  • "Accelerating TensorFlow with RDMA for High-Performance Deep Learning." 2019, Presented at International OpenFabrics Alliance Workshop (OFAW),
  • "Benchmarking and Accelerating Big Data and Deep Learning Systems on Modern HPC and Cloud Architectures." 2019, Presented at International Workshop on Benchmarking in the Data Center, co-located with International Conference on High-Performance Computing in Asia-Pacific Region (HPC Asia),
  • "Characterizing and Benchmarking Deep Learning Systems on Modern Data Center Architectures." 2018, Presented at International Symposium on Benchmarking, Measuring, and Optimizing (Bench),
  • "Paving the Way for RDMA into High-Performance Data Centers." 2018, Presented at Forum on Data Center Computing, co-located with The 15th China National Computer Congress (CNCC),
  • "Paving the Way for RDMA into High-Performance Data Centers." 2018, Presented at Alibaba Group,
  • "A Convergent Trajectory for High-Performance Data-Intensive Computing." 2018, Presented at Supercomputing Center of the Chinese Academy of Sciences (SCCAS),
  • "Designing High-Performance Cloud Systems on Modern Data Center Architectures: Opportunities and Challenges." 2018, Presented at Forum on System Benchmarking and Optimization, co-located with National Annual Conference on High-Performance Computing (HPC China),
  • "High-Performance Datacenters and Clouds Need RDMA Networks and Systems." 2018, Presented at Forum on Benchmarking and Performance Optimization, co-located with the 35th China National Database Conference,
  • "Designing High-Performance Non-Volatile Memory-aware RDMA Communication Protocols for Big Data Processing." 2018, Presented at Storage Developer Conference (SDC), Santa Clara, CA, USA,
  • "NeuroScience Meets Cloud: Designing High-Performance HPC and Big Data Libraries on Clouds for Accelerating NeuroScience Applications." 2018, Presented at The 3rd Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), Cleveland, Ohio, USA,
  • "Exploiting HPC Technologies to Accelerate Big Data Processing and Associated Deep Learning." 2018, Presented at Intel Collaboration Hub Technical Sessions, co-located with the 33rd International Supercomputing Conference (ISC), Frankfurt, Germany,
  • "Paving the Way for RDMA Towards High-Performance Data Computing." 2018, Presented at Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS), Beijing, China,
  • "DLoBD: An Emerging Paradigm of Deep Learning Over Big Data Stacks." 2018, Presented at Spark+AI Summit, San Francisco, California, USA,
  • "Building Efficient HPC Clouds with MVAPICH2 and OpenStack over SR-IOV-enabled Heterogeneous Clusters." 2018, Presented at OpenStack Summit, Vancouver, Canada,
  • "Paving the Way for RDMA Towards High-Performance Data Computing." 2018, Presented at Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA,
  • "High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD." 2018, Presented at International OpenFabrics Alliance Workshop (OFAW), Boulder, Colorado USA,
  • "Building Efficient Clouds for HPC, Big Data, and Neuroscience Applications over SR-IOV-enabled InfiniBand Clusters." 2018, Presented at International OpenFabrics Alliance Workshop (OFAW), Boulder, Colorado USA,
  • "DLoBD: An Emerging Paradigm of Deep Learning over Big Data Stacks on RDMA-enabled Clusters." 2018, Presented at International OpenFabrics Alliance Workshop (OFAW), Boulder, Colorado USA,
  • "Accelerating TensorFlow with RDMA for High-Performance Deep Learning." 2018, Presented at DataWorks Summit, Berlin, Germany,
  • "Benchmarking, Characterizing, and Accelerating Deep Learning over Big Data (DLoBD) Stacks." 2018, Presented at The 9th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE), in conjunction with The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Williamsburg, VA, USA,
  • "High-Performance Hadoop and Spark on OpenPOWER Platform." 2018, Presented at OpenPOWER Summit, Las Vegas, USA,
  • "Accelerating Big Data Processing and Associated Deep Learning on Modern HPC Clusters and Clouds." 2017, Presented at Center of High-Performance Computing National Meeting and Conference (CHPC), Pretoria, South Africa,
  • "Overview of High-Performance Big Data Stacks." 2017, Presented at OSU Booth, co-located with the 30th International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Denver, Colorado, USA,
  • "Accelerate Big Data Processing (Hadoop, Spark, Memcached, and TensorFlow) with HPC Technologies." 2017, Presented at Intel HPC Developer Conference Technical Sessions, Denver, Colorado, USA,
  • "Accelerating Big Data Processing and Associated Deep Learning on HPC Clusters and Clouds." 2017, Presented at The 14th China National Computer Congress (CNCC), Fuzhou, China,
  • "Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits." 2017, Presented at HPC Plus Forum, in conjunction with HPC China, Hefei, China,
  • "Overview of High-Performance Big Data Processing and Associated Deep Learning Stacks on Modern HPC Clusters." 2017, Presented at Computing and Big Data in Engineering: An Interdisciplinary Discussion, The Ohio State University,
  • "NeuroScience Meets HPC Cloud: Designing High-Performance MPI and Big Data Libraries on Virtualized InfiniBand Clusters for NeuroScience Applications." 2017, Presented at The 2nd Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), Bloomington, Indiana, USA,
  • "Exploiting HPC Technologies to Accelerate Big Data Processing (Hadoop, Spark, and Memcached)." 2017, Presented at Intel Collaboration Hub Technical Sessions, co-located with the 32nd International Supercomputing Conference (ISC), Frankfurt, Germany,
  • "Building Efficient HPC Clouds with MVAPICH2 and OpenStack over SR-IOV enabled InfiniBand Clusters." 2017, Presented at OpenStack Summit, Boston, MA, USA,
  • "Accelerating OpenStack Swift with RDMA for Building Efficient HPC Clouds." 2017, Presented at OpenStack Summit, Boston, MA, USA,
  • "Accelerating Big Data Processing and Framework Provisioning with OpenStack Heat-based Hadoop/Spark." 2017, Presented at OpenStack Summit, Boston, MA, USA,
  • "Accelerating Big Data Processing (Spark and Hadoop) and Associated Deep Learning on HPC Clusters." 2017, Presented at Technical Interchange Meeting on High Performance Signal Processing with Accelerators, Dayton, Ohio, USA,
  • "Accelerating Big Data Processing and Management on Modern HPC Clusters." 2017, Presented at Xidian University, Xi’an, Shaanxi, China,
  • "Building Efficient Clouds over SR-IOV enabled HPC Clusters: Opportunities and Challenges." 2017, Presented at Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS), Beijing, China,
  • "NVM-aware RDMA-Based Communication and I/O Schemes for High-Perf Big Data Analytics." 2017, Presented at International OpenFabrics Alliance Workshop (OFAW), Austin, Texas, USA,
  • "Building Efficient HPC Clouds with MVAPICH2 and RDMA-Hadoop over SR-IOV InfiniBand Clusters." 2017, Presented at International OpenFabrics Alliance Workshop (OFAW), Austin, Texas, USA,
  • "HPC Meets Big Data: Accelerating Hadoop, Spark, and Memcached with HPC Technologies." 2017, Presented at International OpenFabrics Alliance Workshop (OFAW), Austin, Texas, USA,
  • "Exploiting HPC Technologies to Accelerate Big Data Processing (Hadoop, Spark, and Memcached)." 2016, Presented at Intel HPC Developer Conference Technical Sessions, Salt Lake City, Utah USA,
  • "High-Performance Big Data Processing Tools for Neuroscience and A Demo on Chameleon Cloud." 2016, Presented at The 1st Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), Ann Arbor, Michigan, USA,
  • "High-Performance Big Data Processing Tools for Neuroscience." 2016, Presented at The 1st Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), Ann Arbor, Michigan, USA,
  • "Building Efficient HPC Clouds with MVAPICH2 and OpenStack over SR-IOV enabled InfiniBand Clusters." 2016, Presented at OpenStack Summit, Austin, Texas, USA,
  • "High-Performance MPI Library with SR-IOV and Slurm for Virtualized InfiniBand Clusters." 2016, Presented at International OpenFabrics Alliance Workshop (OFAW), Monterey, California, USA,
  • "Accelerating Apache Hadoop through High-Performance Networking and I/O Technologies." 2016, Presented at Hadoop Summit, Dublin, Ireland,
  • "High Performance Big Data Processing Tools: MVAPICH2 and HiBD." 2016, Presented at Midwest Big Data Hub All-Hands Meeting, Rosemont, Illinois, USA,
  • "Designing High-Performance Middleware for HPC and Big Data Applications." 2015, Presented at International Symposium on High Performance Computing Middleware Technologies, in conjunction with HPC China, Wuxi, China,
  • "Accelerating Big Data Processing and Management on Modern HPC Clusters." 2015, Presented at The 9th ChinaSys Workshop (ChinaSys), Shanghai, China,
  • "Accelerating Big Data Management and Analytics through HPC Technologies." 2015, Presented at The 12th China National Computer Congress (CNCC), Hefei, China,
  • "Supporting SR-IOV and IVSHMEM in MVAPICH2 on Slurm: Challenges and Benefits." 2015, Presented at The 6th Slurm User Group Meeting (SLUG), Washington DC, USA,
  • "Accelerating Apache Spark with RDMA for Big Data Processing." 2015, Presented at Spark Meetup, Beijing, China,
  • "Accelerating Big Data Processing with RDMA on Modern Clusters." 2014, Presented at 2014 Big Data Technology Conference (BDTC), Beijing, China,
  • "Extending MPI to Big Data Computing: Challenges and Benefits of DataMPI." 2013, Presented at 2013 Big Data Technology Conference (BDTC), Beijing, China,
  • "SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures." 2019, Presented at IEEE International Symposium on Workload Characterization (IISWC), Orlando, Florida, USA,
  • "Analyzing, Modeling, and Provisioning QoS for NVMe SSDs." 2018, Presented at The 11th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), Zurich, Switzerland,
  • "Spark-uDAPL: Cost-Saving Big Data Analytics on Microsoft Azure Cloud with RDMA Networks." 2018, Presented at IEEE International Conference on Big Data (IEEE BigData), Seattle, WA, USA,
  • "Cutting the Tail: Designing High Performance Message Brokers to Reduce Tail Latencies in Stream Processing." 2018, Presented at IEEE International Conference on Cluster Computing (IEEE Cluster), Belfast, UK,
  • "Performance Characterization and Acceleration of Big Data Workloads on OpenPOWER System." 2017, Presented at IEEE International Conference on Big Data (IEEE BigData), Boston, MA, USA,
  • "Characterizing and Accelerating Indexing Techniques on Distributed Ordered Tables." 2017, Presented at IEEE International Conference on Big Data (IEEE BigData), Boston, MA, USA,
  • "NVMD: Non-Volatile Memory Assisted Design for Accelerating MapReduce and DAG Execution Frameworks on HPC Systems." 2017, Presented at IEEE International Conference on Big Data (IEEE BigData), Boston, MA, USA,
  • "Characterizing Deep Learning over Big Data (DLoBD) Stacks on RDMA-capable Networks." 2017, Presented at The 25th Annual Symposium on High-Performance Interconnects (HotI), Santa Clara, California, USA,
  • "Swift-X: Accelerating OpenStack Swift with RDMA for Building an Efficient HPC Cloud." 2017, Presented at The 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Madrid, Spain,
  • "Designing Locality and NUMA Aware MPI Runtime for Nested Virtualization based HPC Cloud with SR-IOV Enabled InfiniBand." 2017, Presented at The 13th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), Xi’an, Shaanxi, China,
  • "Benchmarking Kudu Distributed Storage Engine on High-Performance Interconnects and Storage Devices." 2017, Presented at The 8th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-8), in conjunction with the 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Xi’an, Shaanxi, China,
  • "NRCIO: NVM-aware RDMA-based Communication and I/O Schemes for Big Data Analytics." 2017, Presented at The 8th Annual Non-Volatile Memories Workshop (NVMW), San Diego, California, USA,
  • "High-Performance Design of Apache Spark with RDMA and Its Benefits on Various Workloads." 2016, Presented at IEEE International Conference on Big Data (IEEE BigData), Washington D.C., USA,
  • "Characterizing Cloudera Impala Workloads with BigDataBench on Infiniband Clusters." 2016, Presented at The 7th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-7), in conjunction with the 21st ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Atlanta, GA, USA,
  • "A Plugin-based Approach to Exploit RDMA Benefits for Apache and Enterprise HDFS." 2015, Presented at The 6th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-6), in conjunction with the 41st International Conference on Very Large Data Bases (VLDB), Hawaii, USA,
  • "Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-based Key-Value Store." 2015, Presented at The 44th International Conference on Parallel Processing (ICPP), Beijing, China,
  • "Triple-H: A Hybrid Approach to Accelerate HDFS on HPC Clusters with Heterogeneous Storage Architecture." 2015, Presented at The 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Shenzhen, Guangdong, China,
  • "A Micro-benchmark Suite for Evaluating Hadoop MapReduce on High-Performance Networks." 2014, Presented at The 5th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-V), in conjunction with the 40th International Conference on Very Large Data Bases (VLDB), Hangzhou, China,
  • "Accelerating Spark with RDMA for Big Data Processing: Early Experiences." 2014, Presented at The 22nd Annual Symposium on High-Performance Interconnects (HotI), Mountain View, California, USA,
  • "DataMPI: Extending MPI to Hadoop-like Big Data Computing." 2014, Presented at The 28th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Arizona, USA,
  • "Performance Benefits of DataMPI: A Case Study with BigDataBench." 2014, Presented at The 4th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-IV), in conjunction with the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Salt Lake City, Utah, USA,
  • "High-Performance Design of Hadoop RPC with RDMA over InfiniBand." 2013, Presented at The 42nd International Conference on Parallel Processing (ICPP), Lyon, France,
  • "Vega LingCloud: A Resource Single Leasing Point System to Support Heterogeneous Application Modes on Shared Infrastructure." 2011, Presented at The 9th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), Busan, Korea,
  • "JAMILA: A Usable Batch Job Management System to Coordinate Heterogeneous Clusters and Diverse Applications over Grid or Cloud Infrastructure." 2010, Presented at The 7th IFIP International Conference on Network and Parallel Computing (NPC), Zhengzhou, China,
  • "ICOMC: Invocation Complexity Of Multi-language Clients for Classified Web Services and its Impact on Large Scale SOA Applications." 2009, Presented at The 10th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Hiroshima, Japan,
  • "Tutorial: How to Accelerate Your Big Data and Associated Deep Learning Applications with Hadoop and Spark?" 2019, Presented at International Conference on Practice & Experience in Advanced Research Computing (PEARC), Chicago, IL, USA,
  • "Tutorial: Accelerating Big Data Processing and Associated Deep Learning on Modern Datacenters." 2019, Presented at The 46th International Symposium on Computer Architecture (ISCA), Phoenix, Arizona, USA,
  • "Tutorial: Accelerating Big Data Processing on Modern HPC Clusters." 2019, Presented at The 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Larnaca, Cyprus,
  • "Tutorial: HPC Meets Cloud: Building Efficient Clouds for HPC, Big Data, and Deep Learning Middleware and Applications." 2018, Presented at The 11th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), Zurich, Switzerland,
  • "Tutorial: Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning." 2018, Presented at The 31st International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Dallas, Texas, USA,
  • "Tutorial: Building Efficient Cloud Middleware for HPC, Big Data, and Deep Learning Applications." 2018, Presented at The 38th IEEE International Conference on Distributed Computing Systems (ICDCS), Vienna, Austria,
  • "Tutorial: How to Accelerate Your Big Data and Associated Deep Learning Applications with Hadoop and Spark?" 2018, Presented at International Conference on Practice & Experience in Advanced Research Computing (PEARC), Pittsburgh, PA, USA,
  • "Tutorial: Accelerating Big Data Processing and Associated Deep Learning on Datacenters with Modern Architectures." 2018, Presented at The 45th International Symposium on Computer Architecture (ISCA), Los Angeles, California, USA,
  • "Tutorial: Accelerating Big Data Processing and Associated Deep Learning on Data Centers and HPC Clouds with Modern Architectures." 2018, Presented at The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Williamsburg, VA, USA,
  • "Tutorial: Accelerating Big Data Processing and Associated Deep Learning on Datacenters and HPC Clouds with Modern Architectures." 2018, Presented at The 24th IEEE International Symposium on High Performance Computer Architecture (HPCA), Vienna, Austria,
  • "Tutorial: HPC Meets Cloud: Building Efficient Clouds for HPC, Big Data, and Deep Learning Middleware and Applications." 2017, Presented at The 10th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), Austin, Texas, USA,
  • "Tutorial: Big Data Meets HPC: Exploiting HPC Technologies for Accelerating Big Data Processing and Management." 2017, Presented at Center of High-Performance Computing National Meeting and Conference (CHPC), Pretoria, South Africa,
  • "Tutorial: Building HPC Cloud with InfiniBand: Efficient Support in MVAPICH2 for KVM, Docker, Singularity, OpenStack, and SLURM." 2017, Presented at The 5th Annual MVAPICH User Group (MUG) Meeting, Columbus, Ohio, USA,
  • "Tutorial: Big Data Meets HPC: Exploiting HPC Technologies for Accelerating Big Data Processing and Management." 2017, Presented at The 30th International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Denver, Colorado, USA,
  • "Tutorial: Exploiting High-Performance Interconnects to Accelerate Big Data Processing with Hadoop, Spark, Memcached, and gRPC/TensorFlow." 2017, Presented at The 25th Annual Symposium on High-Performance Interconnects (HotI), Santa Clara, California, USA,
  • "Tutorial: How to Accelerate Your Big Data Applications with Hadoop and Spark?" 2017, Presented at International Conference on Practice & Experience in Advanced Research Computing (PEARC), New Orleans, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark, and Memcached on Datacenters with Modern Architectures." 2017, Presented at The 44th International Symposium on Computer Architecture (ISCA), Toronto, Canada,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark, and Memcached on Modern Clusters." 2017, Presented at The 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Madrid, Spain,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2017, Presented at The 22nd International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Xi’an, China,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2017, Presented at The 23rd IEEE International Symposium on High Performance Computer Architecture (HPCA), Austin, Texas, USA,
  • "Tutorial: Big Data Meets HPC: Exploiting HPC Technologies for Accelerating Apache Hadoop, Spark, and Memcached." 2016, Presented at The 29th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Salt Lake City, Utah, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark, and Memcached on Datacenters with Modern Architectures." 2016, Presented at The 26th International Conference on Field-Programmable Logic and Applications (FPL), Lausanne, Switzerland,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark, and Memcached over High-Performance Interconnects." 2016, Presented at The 24th Annual Symposium on High-Performance Interconnects (HotI), Santa Clara, California, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2016, Presented at The 43rd International Symposium on Computer Architecture (ISCA), Seoul, Korea,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Modern Clusters." 2016, Presented at The 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Cartagena, Colombia,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2016, Presented at The 21st International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Atlanta, Georgia, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2016, Presented at The 22nd IEEE International Symposium on High Performance Computer Architecture (HPCA), Barcelona, Spain,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Modern Clusters." 2015, Presented at The 28th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Austin, Texas, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached over High-Performance Interconnects." 2015, Presented at The 23rd Annual Symposium on High-Performance Interconnects (HotI), Oracle Santa Clara, California, USA,
  • "Tutorial: Accelerating Big Data Applications with Hadoop, Spark, and Memcached on Modern HPC Clusters." 2015, Presented at The 4th Conference on Extreme Science and Engineering Discovery Environment (XSEDE), St. Louis, Missouri, USA,
  • "Tutorial: Accelerating Big Data Applications with Hadoop, Spark, and Memcached on Datacenters with Modern Architectures." 2015, Presented at The 42nd International Symposium on Computer Architecture (ISCA), Portland, Oregon, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Architectures." 2015, Presented at The 20th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Istanbul, Turkey,
  • "Tutorial: Accelerating Big Data Processing with Hadoop, Spark and Memcached on Datacenters with Modern Networking and Storage Architecture." 2015, Presented at The 21st IEEE International Symposium on High Performance Computer Architecture (HPCA), Bay Area, California, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached on Modern Clusters." 2014, Presented at IEEE International Conference on Cluster Computing (IEEE Cluster), Madrid, Spain,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached over High-Performance Interconnects." 2014, Presented at The 22nd Annual Symposium on High-Performance Interconnects (HotI), Google Headquarters, Mountain View, California, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached on Datacenters with Modern Networking and Storage Architecture." 2014, Presented at The 41st International Symposium on Computer Architecture (ISCA), Minneapolis, Minnesota, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached on Modern Clusters." 2014, Presented at The 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Chicago, Illinois, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached on Datacenters with Modern Networking and Storage Architecture." 2014, Presented at The 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Salt Lake City, Utah, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached on Datacenters with Modern Networking and Storage Architecture." 2014, Presented at The 20th IEEE International Symposium on High Performance Computer Architecture (HPCA), Orlando, Florida, USA,
  • "Tutorial: Accelerating Big Data Processing with Hadoop and Memcached Using High Performance Interconnects: Opportunities and Challenges." 2013, Presented at The 21st Annual Symposium on High-Performance Interconnects (HotI), Cisco Headquarters, San Jose, California, USA,

Papers in Proceedings

2019

  • Shi, H.; Lu, X. "TriEC: Tripartite Graph Based Erasure Coding NIC Offload." in The 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC). (11 2019).
  • Shi, H.; Lu, X. "Designing High-Performance Erasure Coding Schemes for Next-Generation Storage Systems." in The 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Denver, Colorado, USA. (11 2019).
  • Shankar, D.; Lu, X.; Panda, D.K. "SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures." in International Symposium on Workload Characterization (IISWC). (11 2019).
  • Hui, Y.; Lien, J.; Lu, X. "Three-Dimensional Characterization on Edge AI Processors with Object Detection Workloads." in The 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019. (11 2019).
  • Zhang, J.; Lu, X.; Chu, C.H.; Panda, D.K. "C-GDR: High-performance container-aware GPUDirect MPI communication schemes on RDMA networks." in The 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2019. (5 2019).
  • Shi, H.; Lu, X.; Shankar, D.; Panda, D.K. "UMR-EC: A unified and multi-rail Erasure Coding library for high-performance distributed storage systems." in The 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2019. (6 2019).
  • Gugnani, S.; Lu, X.; Panda, D.K. "Analyzing, modeling, and provisioning QoS for NVME SSDs." in The 11th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), 2018. (1 2019).
  • Hui, Y.; Lien, J.; Lu, X. "Early Experience in Benchmarking Edge AI Processors with Object Detection Workloads." in International Symposium on Benchmarking, Measuring, and Optimizing (Bench). (11 2019).

2018

  • Shi, H.; Lu, X.; Panda, D.K. "EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures." in International Symposium on Benchmarking, Measuring, and Optimizing (Bench), Seattle, WA, USA. (12 2018).
  • Javed, M.H.; Lu, X.; Panda, D.K. "Cutting the Tail: Designing High Performance Message Brokers to Reduce Tail Latencies in Stream Processing." in IEEE International Conference on Cluster Computing (IEEE Cluster), 2018. (10 2018).
  • Awan, A.A.; Chu, C.; Subramoni, H.; Lu, X. et al. "OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training." in IEEE 25th International Conference on High Performance Computing (HiPC), 2018. (12 2018).
  • Biswas, R.; Lu, X.; Panda, D.K. "Accelerating TensorFlow with Adaptive RDMA-Based gRPC." in IEEE 25th International Conference on High Performance Computing (HiPC), 2018. (12 2018).
  • Lu, X.; Shankar, D.; Shi, H.; Panda, D.K. "Spark-uDAPL: Cost-Saving Big Data Analytics on Microsoft Azure Cloud with RDMA Networks." in IEEE International Conference on Big Data (Big Data). (1 2018).
  • Gugnani, S.; Lu, X.; Pestilli, F.; Caiafa, C. et al. "MPI-LiFE: Designing High-Performance Linear Fascicle Evaluation of Brain Connectome with MPI." in The 24th IEEE International Conference on High Performance Computing, HiPC 2017. (2 2018).
  • Gugnani, S.; Lu, X.; Qi, H.; Zha, L. et al. "Characterizing and accelerating indexing techniques on distributed ordered tables." in IEEE International Conference on Big Data (IEEE BigData), 2017. (1 2018).
  • Li, M.; Subramoni, H.; Lu, X.; Panda, D.K. "Multi-threading and lock-free MPI RMA based graph processing on KNL and power architectures." in International Conference on EuroMPI (EuroMPI), 2018. (9 2018).
  • Li, M.; Lu, X.; Subramoni, H.; Panda, D.K. "Designing registration caching free high-performance MPI library with implicit on-demand paging (ODP) of InfiniBand." in The 24th IEEE International Conference on High Performance Computing, HiPC 2017. (2 2018).
  • Lu, X.; Shi, H.; Shankar, D.; Panda, D.K. "Performance characterization and acceleration of big data workloads on OpenPOWER system." in IEEE International Conference on Big Data (IEEE BigData), 2017. (1 2018).
  • Rahman, M.W.U.; Islam, N.S.; Lu, X.; Panda, D.K. "NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systems." in IEEE International Conference on Big Data (IEEE BigData), 2017. (1 2018).
  • Shi, H.; Lu, X.; Shankar, D.; Panda, D.K. "High-performance multi-rail erasure coding library over modern data center architectures: Early experiences." in 2018 ACM Symposium on Cloud Computing (SoCC), 2018. (10 2018).

2017

  • Shankar, D.; Lu, X.; Panda, D.K. "High-Performance and Resilient Key-Value Store with Online Erasure Coding for Big Data Workloads." in 37th IEEE International Conference on Distributed Computing Systems (ICDCS). (1 2017).
  • Bayatpour, M.; Chakraborty, S.; Subramoni, H.; Lu, X. et al. "Scalable reduction collectives with data partitioning-based multi-leader design." in The 30th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2017. (11 2017).
  • Chu, C.H.; Lu, X.; Awan, A.A.; Subramoni, H. et al. "Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning." in The 46th International Conference on Parallel Processing (ICPP), 2017. (9 2017).
  • Gugnani, S.; Lu, X.; Panda, D.K. "Swift-X: Accelerating OpenStack Swift with RDMA for building an efficient HPC cloud." in The 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2017. (7 2017).
  • Islam, N.S.; Wasi-Ur-Rahman, M.; Lu, X.; Panda, D.K. "Efficient data access strategies for Hadoop and Spark on HPC cluster with heterogeneous storage." in IEEE International Conference on Big Data (IEEE BigData), 2016. (2 2017).
  • Zhang, J.; Lu, X.; Panda, D.K. "High-Performance Virtual Machine Migration Framework for MPI Applications on SR-IOV enabled InfiniBand Clusters." in 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS). (1 2017).
  • Panda, D.K.; Lu, X. "HPC Meets Cloud: Building Efficient Clouds for HPC, Big Data, and Deep Learning Middleware and Applications." in The 10th International Conference on Utility and Cloud Computing (UCC), 2017. (1 2017).
  • Zhang, J.; Lu, X.; Panda, D.K. "Is Singularity-based Container Technology Ready for Running MPI Applications on HPC Clouds?" in The 10th International Conference on Utility and Cloud Computing (UCC), 2017. (1 2017).
  • Lu, X.; Shi, H.; Javed, M.H.; Biswas, R. et al. "Characterizing Deep Learning over Big Data (DLoBD) Stacks on RDMA-capable Networks." in 25th IEEE Annual Symposium on High-Performance Interconnects (HOTI). (1 2017).
  • Subramoni, H.; Lu, X.; Panda, D.K. "A Scalable Network-Based Performance Analysis Tool for MPI on Large-Scale HPC Systems." in IEEE International Conference on Cluster Computing (CLUSTER). (1 2017).
  • Javed, M.H.; Lu, X.; Panda, D.K. "Characterization of Big Data Stream Processing Pipeline: A Case Study Using Flink and Kafka." in The 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT), 2017. (1 2017).
  • Zhang, J.; Lu, X.; Panda, D.K. "Designing Locality and NUMA Aware MPI Runtime for Nested Virtualization based HPC Cloud with SR-IOV Enabled InfiniBand." in 13th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE). (7 2017).

2016

  • Shankar, D.; Lu, X.; Panda, D.K. "Boldio: A Hybrid and Resilient Burst-Buffer Over Lustre for Accelerating Big Data I/O." in 4th IEEE International Conference on Big Data (Big Data). (1 2016).
  • Subramoni, H.; Augustine, A.M.; Arnold, M.; Perkins, J. et al. "INAM²: InfiniBand Network Analysis and Monitoring with MPI." in 31st International Conference on ISC High Performance. (1 2016).
  • Panda, D.K.; Zhan, J.; Lu, X. "IEEE international workshop on high-performance big data computing - HPBDC 2016." in IEEE 30th International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2016. (7 2016).
  • Lu, X.; Shankar, D.; Gugnani, S.; Subramoni, H. et al. "Impact of HPC Cloud Networking Technologies on Accelerating Hadoop RPC and HBase." in 8th IEEE International Conference on Cloud Computing Technology and Science (CloudCom). (1 2016).
  • Lu, X.; Shankar, D.; Gugnani, S.; Panda, D.K. "High-Performance Design of Apache Spark with RDMA and Its Benefits on Various Workloads." in 4th IEEE International Conference on Big Data (Big Data). (1 2016).
  • Wasi-ur-Rahman, M.; Islam, N.S.; Lu, X.; Shankar, D. et al. "MR-Advisor: A Comprehensive Tuning Tool for Advising HPC Users to Accelerate MapReduce Applications on Supercomputers." in 28th IEEE International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD). (1 2016).
  • Islam, N.S.; Wasi-Ur-Rahman, M.; Lu, X.; Panda, D.K. "High performance design for HDFS with byte-addressability of NVM and RDMA." in The 30th International Conference on Supercomputing (ICS), 2016. (6 2016).
  • Gugnani, S.; Lu, X.; Panda, D.K. "Designing Virtualization-aware and Automatic Topology Detection Schemes for Accelerating Hadoop on SR-IOV-enabled Clouds." in 8th IEEE International Conference on Cloud Computing Technology and Science (CloudCom). (1 2016).
  • Gugnani, S.; Lu, X.; Panda, D.K. "Performance characterization of Hadoop workloads on SR-IOV-enabled virtualized InfiniBand clusters." in The 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT), 2016. (12 2016).
  • Tatineni, M.; Lu, X.; Choi, D.; Majumdar, A. et al. "Experiences and Benefits of Running RDMA Hadoop and Spark on SDSC Comet." in Conference on Diversity, Big Data, and Science at Scale (XSEDE). (1 2016).
  • Bhat, A.; Islam, N.S.; Lu, X.; Wasi-ur-Rahman, M. et al. "A plugin-based approach to exploit RDMA benefits for Apache and enterprise HDFS." in The 6th International Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware (BPOE-6), in conjunction with the 41st International Conference on Very Large Data Bases (VLDB), 2015. (1 2016).
  • Shankar, D.; Lu, X.; Islam, N.; Wasi-ur-Rahman, M. et al. "High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits." in 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS). (1 2016).
  • Zhang, J.; Lu, X.; Panda, D.K. "Performance Characterization of Hypervisor- and Container-based Virtualization for HPC on SR-IOV Enabled InfiniBand Clusters." in 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS). (1 2016).
  • Wasi-ur-Rahman, M.; Islam, N.S.; Lu, X.; Panda, D.K. "Can Non-Volatile Memory Benefit MapReduce Applications on HPC Clusters?" in 1st Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS). (1 2016).
  • Li, M.; Lu, X.; Hamidouche, K.; Zhang, J. et al. "Mizan-RMA: Accelerating Mizan Graph Processing Framework with MPI RMA." in 23rd IEEE International Conference on High Performance Computing (HiPC). (1 2016).
  • Zhang, J.; Lu, X.; Chakraborty, S.; Panda, D.K. "Slurm-V: Extending Slurm for Building Efficient HPC Cloud with SR-IOV and IVShmem." in 22nd International Conference on Parallel and Distributed Computing (Euro-Par). (1 2016).
  • Zhang, J.; Lu, X.; Panda, D.K. "High Performance MPI Library for Container-based HPC Cloud on InfiniBand Clusters." in 45th International Conference on Parallel Processing (ICPP). (1 2016).
  • Li, M.; Hamidouche, K.; Lu, X.; Subramoni, H. et al. "Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits." in International Conference on High Performance Computing, Networking, Storage and Analysis (SC). (1 2016).

2015

  • Shankar, D.; Lu, X.; Wasi-ur-Rahman, M.; Islam, N. et al. "Benchmarking Key-Value Stores on High-Performance Storage and Interconnects for Web-Scale Workloads." in IEEE International Conference on Big Data. (1 2015).
  • Wasi-ur-Rahman, M.; Lu, X.; Islam, N.S.; Rajachandrasekar, R. et al. "High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA." in 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS). (1 2015).
  • Panda, D.K.; Zhan, J.; Lu, X. "Message from the HPBDC 2015 workshop co-chairs." in IEEE International Parallel and Distributed Processing Symposium (IPDPS) Workshops, 2015. (1 2015).
  • Lin, J.; Hamidouche, K.; Lu, X.; Li, M. et al. "High-Performance Coarray Fortran Support with MVAPICH2-X: Initial Experience and Evaluation." in The 29th IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015. (1 2015).
  • Islam, N.S.; Wasi-ur-Rahman, M.; Lu, X.; Shankar, D. et al. "Performance Characterization and Acceleration of In-Memory File Systems for Hadoop and Spark Applications on HPC Clusters." in IEEE International Conference on Big Data. (1 2015).
  • Islam, N.S.; Shankar, D.; Lu, X.; Wasi-ur-Rahman, M. et al. "Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-based Key-Value Store." in The 44th Annual International Conference on Parallel Processing (ICPP), 2015. (1 2015).
  • Li, M.; Subramoni, H.; Hamidouche, K.; Lu, X. et al. "High Performance MPI Datatype Support with User-mode Memory Registration: Challenges, Designs and Benefits." in IEEE International Conference on Cluster Computing (CLUSTER). (1 2015).
  • Lin, J.; Liang, F.; Lu, X.; Zha, L. et al. "Modeling and Designing Fault-Tolerance Mechanisms for MPI-Based MapReduce Data Computing Framework." in IEEE First International Conference on Big Data Computing Service and Applications, 2015. (3 2015).
  • Zhang, J.; Lu, X.; Arnold, M.; Panda, D.K. "MVAPICH2 over OpenStack with SR-IOV: An Efficient Approach to Build HPC Clouds." in 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2015. (1 2015).
  • Li, M.; Hamidouche, K.; Lu, X.; Lin, J. et al. "High-Performance and Scalable Design of MPI-3 RMA on Xeon Phi Clusters." in 21st International Conference on Parallel and Distributed Computing (Euro-Par). (1 2015).
  • Li, M.; Hamidouche, K.; Lu, X.; Zhang, J. et al. "High Performance OpenSHMEM Strided Communication Support with InfiniBand UMR." in 22nd International Conference on High Performance Computing. (1 2015).
  • Islam, N.S.; Lu, X.; Wasi-ur-Rahman, M.; Shankar, D. et al. "Triple-H: A Hybrid Approach to Accelerate HDFS on HPC Clusters with Heterogeneous Storage Architecture." in 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2015. (1 2015).
  • Shankar, D.; Lu, X.; Jose, J.; Wasi-ur-Rahman, M. et al. "Can RDMA Benefit Online Data Processing Workloads on Memcached and MySQL?" in IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). (1 2015).
  • Lin, J.; Hamidouche, K.; Zhang, J.; Lu, X. et al. "Accelerating k-NN Algorithm with Hybrid MPI and OpenSHMEM." in OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, 2015. (1 2015).
  • Chao, L.; Li, C.; Liang, F.; Lu, X. et al. "Accelerating Apache Hive with MPI for Data Warehouse Systems." in IEEE 35th International Conference on Distributed Computing Systems (ICDCS), 2015. (6 2015).

2014

  • Shi, R.; Lu, X.; Potluri, S.; Hamidouche, K. et al. "HAND: A Hybrid Approach to Accelerate Non-contiguous Data Movement using MPI Datatypes on GPU Clusters." in 43rd Annual International Conference on Parallel Processing (ICPP). (1 2014).
  • Islam, N.S.; Lu, X.; Rahman, M.W.; Panda, D.K. "SOR-HDFS: A SEDA-based approach to maximize overlapping in RDMA-enhanced HDFS." in The 23rd International ACM Symposium on High Performance Parallel and Distributed Computing (HPDC), 2014. (1 2014).
  • Lu, X.; Wasi-ur-Rahman, M.; Islam, N.S.; Panda, D.K. "A micro-benchmark suite for evaluating Hadoop RPC on high-performance networks." in The 3rd Workshop on Big Data Benchmarking (WBDB), 2013. (1 2014).
  • Luo, M.; Lu, X.; Hamidouche, K.; Kandalla, K. et al. "Initial Study of Multi-Endpoint Runtime for MPI plus OpenMP Hybrid Programming Model on Multi-Core Systems." in 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). (1 2014).
  • Li, M.; Hamidouche, K.; Lin, J.; Tomko, K. et al. "Scalable MiniMD design with hybrid MPI and OpenSHMEM." in 2014 OpenSHMEM User Group (OUG) Meeting, in conjunction with the 8th International Conference on Partitioned Global Address Space Programming Models (PGAS). (1 2014).
  • Jose, J.; Potluri, S.; Subramoni, H.; Lu, X. et al. "Designing scalable out-of-core sorting with hybrid MPI+PGAS programming models." in The 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), 2014. (1 2014).
  • Lu, X.; Wasi-ur-Rahman, M.; Islam, N.; Shankar, D. et al. "Accelerating Spark with RDMA for Big Data Processing: Early Experiences." in 22nd IEEE Annual Symposium on High-Performance Interconnects (HOTI). (1 2014).
  • Wasi-Ur-Rahman, M.; Lu, X.; Islam, N.S.; Rajachandrasekar, R. et al. "MapReduce over Lustre: Can RDMA-based approach benefit?" in The 20th International Conference on Euro-Par Parallel Processing (Euro-Par), 2014. (1 2014).
  • Zhang, J.; Lu, X.; Jose, J.; Shi, R. et al. "Can Inter-VM Shmem Benefit MPI Applications on SR-IOV Based Virtualized InfiniBand Clusters?" in 20th European Conference on Parallel Computing (Euro-Par). (1 2014).
  • Islam, N.S.; Lu, X.; Wasi-ur-Rahman, M.; Jose, J. et al. "A Micro-benchmark Suite for Evaluating HDFS Operations on Modern Clusters." in 1st Workshop on Big Data Benchmarking (WBDB). (1 2014).
  • Wasi-ur-Rahman, M.; Lu, X.; Islam, N.S.; Panda, D.K. "Performance Modeling for RDMA-Enhanced Hadoop MapReduce." in 43rd Annual International Conference on Parallel Processing (ICPP). (1 2014).
  • Li, M.; Lu, X.; Potluri, S.; Hamidouche, K. et al. "Scalable Graph500 Design with MPI-3 RMA." in 16th IEEE International Conference on Cluster Computing (CLUSTER). (1 2014).
  • Jose, J.; Hamidouche, K.; Lu, X.; Potluri, S. et al. "High Performance OpenSHMEM for Xeon Phi Clusters: Extensions, Runtime Designs and Application Co-design." in 16th IEEE International Conference on Cluster Computing (CLUSTER). (1 2014).
  • Zhang, J.; Lu, X.; Jose, J.; Li, M. et al. "High Performance MPI Library over SR-IOV Enabled InfiniBand Clusters." in 21st International Conference on High Performance Computing (HiPC). (1 2014).
  • Lu, X.; Liang, F.; Wang, B.; Zha, L. et al. "DataMPI: Extending MPI to Hadoop-Like Big Data Computing." in IEEE 28th International Parallel and Distributed Processing Symposium (IPDPS), 2014. (5 2014).
  • Wasi-ur-Rahman, M.; Lu, X.; Islam, N.S.; Panda, D.K. "HOMR: A Hybrid Approach to Exploit Maximum Overlapping in MapReduce over High Performance Interconnects." in 28th ACM International Conference on Supercomputing (ICS). (1 2014).
  • Islam, N.S.; Lu, X.; Wasi-Ur-Rahman, M.; Rajachandrasekar, R. et al. "In-Memory I/O and Replication for HDFS with Memcached: Early Experiences." in IEEE International Conference on Big Data. (1 2014).
  • Liang, F.; Feng, C.; Lu, X.; Xu, Z. "Performance Characterization of Hadoop and DataMPI Based on Amdahl’s Second Law." in The 9th IEEE International Conference on Networking, Architecture, and Storage (NAS), 2014. (8 2014).

2013

  • Shi, R.; Potluri, S.; Hamidouche, K.; Lu, X. et al. "A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters." in 15th IEEE International Conference on Cluster Computing (CLUSTER). (1 2013).
  • Lu, X.; Islam, N.S.; Wasi-ur-Rahman, M.; Jose, J. et al. "High-Performance Design of Hadoop RPC with RDMA over InfiniBand." in 42nd Annual International Conference on Parallel Processing (ICPP). (1 2013).
  • Jose, J.; Li, M.; Lu, X.; Kandalla, K.C. et al. "SR-IOV Support for Virtualization on InfiniBand Clusters: Early Experience." in 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid). (1 2013).
  • Wasi-Ur-Rahman, M.; Lu, X.; Islam, N.S.; Panda, D.K. "Does RDMA-based enhanced Hadoop MapReduce need a new performance model?" in The 4th Annual Symposium on Cloud Computing (SoCC), 2013. (1 2013).
  • Wasi-Ur-Rahman, M.; Islam, N.S.; Lu, X.; Jose, J. et al. "High-performance RDMA-based design of Hadoop MapReduce over InfiniBand." in IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), 2013. (1 2013).
  • Islam, N.S.; Lu, X.; Wasi-ur-Rahman, M.; Panda, D.K. "Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects?" in 21st Annual IEEE Symposium on High-Performance Interconnects (HOTI). (1 2013).

2011

  • Lu, X.; Lin, J.; Zha, L.; Xu, Z. "Vega LingCloud: A Resource Single Leasing Point System to Support Heterogeneous Application Modes on Shared Infrastructure." (5 2011).
  • Lu, X.; Wang, B.; Zha, L.; Xu, Z. "Can MPI Benefit Hadoop and MapReduce Applications?" (9 2011).

2010

  • Lin, J.; Lu, X.; Yu, L.; Zou, Y. et al. "VegaWarden: A Uniform User Management System for Cloud Applications." in IEEE Fifth International Conference on Networking, Architecture, and Storage (NAS), 2010. (7 2010).
  • Peng, J.; Lu, X.; Cheng, B.; Zha, L. "JAMILA: A Usable Batch Job Management System to Coordinate Heterogeneous Clusters and Diverse Applications over Grid or Cloud Infrastructure." in The 7th IFIP International Conference on Network and Parallel Computing (NPC), 2010. (1 2010).
  • Lu, X.; Lin, J.; Zou, Y.; Peng, J. et al. "Investigating, Modeling, and Ranking Interface Complexity of Web Services on the World Wide Web." (7 2010).

2009

  • Yue, Q.; Lu, X.; Shan, Z.; Xu, Z. et al. "A Model of Message-Based Debugging Facilities for Web or Grid Services." in Congress on Services - I, 2009. (7 2009).
  • Lu, X.; Zou, Y.; Xiong, F.; Lin, J. et al. "ICOMC: Invocation complexity of multi-language clients for classified web services and its impact on large scale SOA applications." in The 10th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2009. (1 2009).

2008

  • Lu, X.; Yue, Q.; Zou, Y.; Wang, X. "An Experimental Analysis for Memory Usage of GOS Core." (1 2008).