Jia Zou

Assistant Professor

Computer Science and Engineering

School of Computing and Augmented Intelligence

Arizona State University

     jia.zou@asu.edu

Bio

Jia Zou is a Tenure-Track Assistant Professor in the School of Computing and Augmented IntelligenceIra A. Fulton Schools of Engineering,  Arizona State University - Tempe,  starting in summer 2019. She is the director of the CACTUS data-intensive systems lab founded in the summer of 2020. Jia Zou is also affiliated with the Lawrence Berkeley National Lab since 2023.  Before 2019, she was a Research Scientist in the Department of Computer Science of Rice University.  Before that she worked in IBM Research - China as a researcher. She received her Ph.D in Computer Science from Tsinghua University, China. 

Jia Zou received a prestigious NSF CAREER award in 2022, an Amazon Research Award in 2023, and an IBM faculty award in 2021. Her research interests include database systems, AI/ML in database, applied AI/ML to database, and federated data management. She has more than 20 papers published in VLDB, SIGMOD, ICDE, VLDB journal, SoCC, ICDCS, ICDM, and so on, and has been granted 15 patents. Her work received VLDB 2019 best paper honorable mention award and SIGMOD 2020 research highlight award.



More project information is here.

[New!!!] We have 2 Ph.D openings starting for 2024 fall,  if you are interested in database systems research, please send your CV to jia.zou@asu.edu. [More Details] . 

News and Activities

Awards

Publications

(Supervised students are marked with *)

2024


[ICDE 2024] Lixi Zhou*, K. Selçuk Candan, Jia Zou. DeepMapping: Learned Data Mapping for Lossless Compression and Efficient Lookup. The 40th IEEE International Conference on Data Engineering (ICDE 2024) Preprint: CoRR abs/2307.05861 (2023) [PDF] [13 pages]To Appear

[EDBT 2024] Lixi Zhou*, Qi Lin*, Kanchan Chowdhury, Saif Masood*, Eichenberger, Alexandre, Hong Min, Alexander Sim, Jie Wang, Yida Wang, Kesheng Wu, Binhang Yuan, Jia Zou. "Serving Deep Learning Model in Relational Databases." 27th International Conference on Extending Database Technology (EDBT'24). Research Track. [9 pages] [PDF]. 

Guan, H*., Gautier, S.*, Gupta, D., Ambrish, R.H.*, Wang, Y., Lakamsani, H.*, Giriyan, D.*, Maslanka, S.*, Xiao, C., Yang, Y. and Zou, J., 2024. A Learning-based Declarative Privacy-Preserving Framework for Federated Data Management. arXiv preprint arXiv:2401.12393.

2023


[IEEE Big Data 2023] Ankita Sharma*,  Xuanmao Li*,  Hong Guan*,  Guoxin Sun*, Liang Zhang, Lanjun Wang, Kesheng Wu, Lei Cao, Erkang Zhu, Alexander Sim, Teresa Wu, Jia Zou. Automatic Data Transformation Using Large Language Model: An Experimental Study on Building Energy Data. 2023 IEEE International Conference on Big Data. (Industrial and Government Track) Pre-Print CoRR abs/2309.01957 (2023) [PDF] [10 pages] (The first four student authors have equal contributions to the work)

[SoCC 2023] Hong Guan*, Saif Masood*, Mahidhar Dawrampudi*, Venkatesh Gunda*, Hong Min, Lei Yu, Soham Nag*, Jia Zou.  A Comparison of End-to-End Decision Forest Inference Pipelines,  2023 ACM Symposium on Cloud Computing SoCC'23 [16 pages][PDF]

[SSDBM 2023] Lixi Zhou*, Lei Yu, Jia Zou, Hong Min. Privacy-Preserving Redaction of Diagnosis Data through Source Code Analysis. In Proceedings of the 35th International Conference on Scientific and Statistical Database Management, SSDBM 2023 [4 pages] [PDF]


Liang Zhang, Jianli Chen, Jia Zou. Taxonomy, Semantic Data Schema, and Schema Alignment for Open Data in Urban Building Energy Modeling. CoRR abs/2311.08535 (2023) [PDF]

2022

[VLDB 2022] Lixi Zhou*, Jiaqing Chen*, Amitabh Das*, Hong Min, Lei Yu, Ming Zhao, and Jia Zou. "Serving Deep Learning Models with Deduplication from Relational Databases." VLDB 2022, PVLDB Volume 15 Issue 10. [14 pages][PDF]

[SSDBM 2022] Lixi Zhou*, Arindam Jain*, Zijie Wang*, Amitabh Das*,  Yingzhen Yang, and Jia Zou, "Benchmark of DNN Model Search at Deployment Time." SSDBM 2022. [12 pages] [PDF]

2021

[VLDB 2021] Jia Zou, Amitabh Das*, Pratik Barhate*, Arun Iyengar, Binhang Yuan, Dimitrije Jankov, and Chris Jermaine. "Lachesis: Automatic Partitionings for UDF-Centric Analytics.",  VLDB 2021, PVLDB Volume 14 Issue 8 [14 pages][PDF]

[VLDB 2021]  Binhang Yuan, Dimitrije Jankov, Jia Zou, Yuxin Tang, Daniel Bourgeois, and Chris Jermaine. “Tensor Relational Algebra for Machine Learning System Design.” VLDB 2021, PVLDB Volume 14 Issue 8 [13 pages][PDF]

[CIDR 2021] Jia Zou, "Using Deep Learning Models to Replace Large Materialized Views in Relational Database",  CIDR 2021 (Abstract)  [1 page] [PDF]

2020

Zijie Wang,* Lixi Zhou*, Amitabh Das*, Valay Dave*, Zhanpeng Jin, Jia Zou, "Survive the Schema Changes: Integration of Unmanaged Data Using Deep Learning",  arxiv:2010.07586  [cs.DB] (pre-submission) [12 pages]

Zhou, Lixi*, Zijie Wang*, Amitabh Das*, and Jia Zou. "It's the Best Only When It Fits You Most: Finding Related Models for Serving Based on Dynamic Locality Sensitive Hashing." arXiv preprint arXiv:2010.09474 (2020) [9pages]

[MSST 2020] Jia Zou, Ming Zhao, Juwei Shi and Chen Wang. "WATSON: A Workflow-based Data Storage Optimizer for Analytics." MSST 2020 [14 pages][PDF]

[SIGMOD Record] Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine,  Zekai J. Gao. Declarative recursive computation on an RDBMS, or, why you should use a database for distributed machine learning, SIGMOD Record, Volume 49 No. 1. [8 pages] [PDF]  (Invited)

[SFDI 2020] Zijie Wang*, Lixi Zhou*, Jia Zou. "Integration of Fast-Evolving Data Sources Using A Deep Learning Approach." SFDI 2020, workshop co-located with VLDB 2020 [14 pages] [PDF][Video]

[VLDB Journal] Jia Zou,  Arun Iyengar, and Chris Jermaine. "Architecture of a distributed storage that combines file system, memory and computation in a single layer." The VLDB Journal 29(5) (2020): 1049-1073.  [25 pages][PDF] 

2019 and before

[VLDB 2019]  Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine,  Zekai J. Gao. Declarative recursive computation on an RDBMS, or, why you should use a database for distributed machine learning, VLDB 2019, PVLDB Volume 12 Issue 7. [14 pages] (PDF)  (Honorable Mention, VLDB 2019 Best Paper Award runner-up, 2020 SIGMOD Research Highlight Award)

[VLDB 2019]  Jia Zou,  Arun Iyengar, Chris Jermaine, Pangea: Monolithic Distributed Storage for Data Analytics, VLDB 2019, PVLDB Volume 12 Issue 6.  [14 pages] (PDF) 

[SIGMOD 2018]  Jia Zou, R Matthew Barnett, Tania Lorido-Botran, Shangyu Luo, Carlos Monroy, Sourav Sikdar, Kia Teymourian, Binhang Yuan, Chris Jermaine, PlinyCompute: A Platform for High- Performance, Distributed, Data-Intensive Tool Development, SIGMOD 2018.  [16 pages] (PDF)

[ICDCS 2015] Jia Zou, Juwei Shi, Tongping Liu, Zhao Cao, Chen Wang, Foreseer: Workload-aware Data Storage for MapReduce, ICDCS 2015.  [2 pages]

[VLDB 2015] Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi, Limei Jiao, Jia Zou, Chen Wang, Schema Management for Document Stores, VLDB 2015, PVLDB Volume 8 Issue 9. [12 pages] (PDF)

[VLDB 2014] Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, Chen Wang, MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs, VLDB 2014, PVLDB Volume 7 Issue 13.  [12 pages] (PDF)

[ICDCS 2011] Jia Zou, Gong Su, Arun Iyengar, Yu Yuan, Yi Ge, Design and Analysis of a Distributed Multi-leg Stock Trading System, ICDCS 2011. [12 pages]

[ICDM 2010] Jia Zou, Jing Xiao, Rui Hou, Yanqi Wang, Frequent Instruction Sequential Pattern Mining in Hardware Sample Data, ICDM 2010. [6 pages]

[ICPP 2008] Jia Zou, Zhiyong Liang, Yiqi Dai, Scalability Evaluation and Optimization of Multi-core SIP Proxy Server, ICPP 2008.  [8 pages][PDF]

[ICEBE 2008] Jianguo Hao, Jia Zou, Yiqi Dai, A real-time payment scheme for SIP service based on hash chain, ICEBE 2008.  [8 pages]

[GLOBECOM 2007] Jia Zou, Wei Xue, Zhiyong Liang, Yixin Zhao, Bo Yang and Ling Shao, SIP Parsing Offload: Design and Performance, GLOBECOM 2007.  [6 pages]

[ICCCN 2007] Jia Zou, Yiqi Dai, Motivating and Modeling SIP Offload, ICCCN 2007. [6 pages]

Granted Patents

Teaching

Professional Activities

Department Services

Recent Presentations