For Citations

If you need a citation for BigDataBench, please cite the following papers related with your work:

BigDataBench: a Dwarf-based Big Data and AI Benchmark Suite. [PDF]

Wanling Gao, Jianfeng Zhan, Lei Wang, Chunjie Luo, Daoyi Zheng, Rui Ren, Chen Zheng, Gang Lu, Jingwei Li, Zheng Cao, Shujie Zhang, and Haoning Tang. Technical Report, arXiv preprint arXiv:1802.08254, January 27, 2018.

BOPS, Not FLOPS! A New Metric, Measuring Tool, and Roofline Performance Model For Datacenter Computing. [PDF]

Lei Wang, Jianfeng Zhan, Wanling Gao, Rui Ren, Xiwen He, Chunjie Luo, Gang Lu, Jingwei Li. Technical Report, arXiv preprint arXiv:1801.09212, January 28, 2018.

Big Data Dwarfs: Towards Fully Understanding Big Data Analytics Workloads. [PDF]

Wanling Gao, Lei Wang, Jianfeng Zhan, Chunjie Luo, Daoyi Zheng, Zhen Jia, Biwei Xie, Chen Zheng, Qiang Yang and Haibin Wang. Technical Report, arXiv preprint arXiv:1802.00699, February 1, 2018.

BigDataBench: a Big Data Benchmark Suite from Internet Services. [PDF]

Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, WanlingGao, Zhen Jia, Yingjie Shi, Shujie Zhang, Cheng Zhen, Gang Lu, Kent Zhan, Xiaona Li, and Bizhu Qiu. The 20th IEEE International Symposium On High Performance Computer Architecture (HPCA-2014), February 15-19, 2014, Orlando, Florida, USA.

Understanding Big Data Analytics Workloads on Modern Processors. [PDF]

Zhen Jia, Jianfeng Zhan, Lei Wang, Chunjie Luo, Wanling Gao, Yi Jin, Rui Han and Lixin Zhang. IEEE Transactions on Parallel and Distributed Systems, 28(6), 1797-1810, 2017.

Understanding Processors Design Decisions for Data Analytics in Homogeneous Data Centers. [PDF]

Zhen Jia, Wanling Gao, Yingjie Shi, Sally A. McKee, Jianfeng Zhan, Lei Wang, Lixin Zhang. IEEE Transactions on Big Data, 2017.

A Dwarf-based Scalable Big Data Benchmarking Methodology. [PDF]

Wanling Gao, Lei Wang, Jianfeng Zhan, Chunjie Luo, Daoyi Zheng, Zhen Jia, Biwei Xie, Chen Zheng, Qiang Yang, and Haibin Wang. arXiv preprint arXiv: 1711.03229

Characterization and Architectural Implications of Big Data Workloads. [PDF]
Lei Wang, Jianfeng Zhan, Zhen Jia and Rui Han.  arXiv:1506.07943 [cs.DC]

Characterizing data analysis workloads in data centers. [PDF]
Zhen Jia, Lei Wang, Jianfeng Zhan, Lixin Zhang, Chunjie Luo. 2013 IEEE International Symposium on Workload Characterization (IISWC 2013) (Best paper award).

Identifying Dwarfs Workloads in Big Data Analytics.  [PDF]

W Gao, C Luo, J Zhan, H Ye, X He, L Wang, Y Zhu, X Tian. 
arXiv preprint arXiv:1505.06872

BDGS: A Scalable Big Data Generator Suite in Big Data Benchmarking. [PDF]

Zijian Ming, Chunjie Luo, Wanling Gao, Rui Han, Qiang Yang, Lei Wang, and Jianfeng Zhan. In Advancing Big Data Benchmarks (pp. 138-154). Springer International Publishing.

BigDataBench-MT: A Benchmark Tool for Generating Realistic Mixed Data Center Workloads.  [PDF]

R Han, S Zhan, C Shao, J Wang, J Xu, LK John, L Wang, J Zhan. arXiv preprint arXiv:1504.02205

Characterizing and Subsetting Big Data Workloads. [PDF]

Zhen Jia, Jianfeng Zhan, Wang Lei, Rui Han, Sally A. McKee, Qiang Yang, Chunjie Luo, and Jingwei Li.  In 2014 IEEE International Symposium on Workload Characterization (IISWC). IEEE, 2014.

BigOP: generating comprehensive big data workloads as a benchmarking framework. [pdf]

Yuqing Zhu, Jianfeng Zhan, Chuliang Weng, Raghunath Nambiar, Jingchao Zhang,
Xingzhen Chen, and Lei Wang. The 19th International Conference on Database  Systems for Advanced Applications (DASFAA 2014), 2014.

BigDataBench: a Big Data Benchmark Suite from Web Search Engines.

Wanling Gao, Yuqing Zhu, Zhen Jia, Chunjie Luo, Lei Wang, Jianfeng Zhan, Yongqiang He, Shiming Gong, Xiaona Li, Shujie Zhang, and Bizhu Qiu. Third Workshop on Architectures and Systems for Big Data (ASBD 2013) in conjunction with The 40th International Symposium on Computer Architecture, May 2013.

CloudRank-D: Benchmarking and Ranking Private Cloud Computing System
for Data Processing Applications.

Chunjie Luo, Jianfeng Zhan, Zhen Jia, Lei Wang, Gang Lu, Lixin Zhang, Cheng-Zhong Xu, Ninghui Sun. Front. Comput. Sci., 2012, 6(4): 347-362.

Characterization of real workloads of web search engines.

Xi, H., Zhan, J., Jia, Z., Hong, X., Wang, L., Zhang, L., … Lu, G. (2011, November). In
2011 IEEE International Symposium on Workload Characterization (IISWC),
on (pp. 15-25). IEEE.

A half-day tutorial at Micro’ 14

Presentations are available from the tutorial homepage.

(1) What is BigDataBench? /BigDataBench methodology [Slides]

(2) BigDataBench workloads and scalable data sets[Slides-1 ,Slides-2 ]

(3) How to generate large-scale data from small-scale real-world one [Slides ]

(4) Multi-tenancy version of BigDataBench [Slides ]

(5) Subsetting big data workloads from BigDataBench [Slides ]

(6) How to use the simulator versions? [Slides ]