Presenter Profile

REMOTE PRESENTER

Erman Ayday

Associate Professor
Case Western Reserve University, Department of Computer and Data Sciences

Erman Ayday is an Associate Professor of Computer Science at Case Western Reserve University, Cleveland, OH, USA. Before that he was a Post-Doctoral Researcher at Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland. He received his M.S. and Ph.D. degrees from Georgia Institute of Technology, Atlanta, GA, in 2007 and 2011, respectively. Dr. Ayday's research interests include privacy-enhancing technologies (including big data and genomic privacy), ethical computing, and fingerprinting algorithms. Dr. Ayday is the recipient of 2022 CWRU Computer and Data Sciences Department Research Award, Distinguished Student Paper Award at IEEE S&P 2015, 2010 Outstanding Research Award from the Center of Signal and Image Processing (CSIP) at Georgia Tech, and 2011 ECE Graduate Research Assistant (GRA) Excellence Award from Georgia Tech. Other various accomplishments of Dr. Ayday include several patents, research grants, H2020 Marie Curie individual fellowship, and NSF CAREER award. Dr. Ayday holds more than $4M research funding related to data privacy and ethical computing. He also published more than 90 peer reviewed papers. He is a member of the IEEE and the ACM.

TALK TITLE
Accelerating medical data sharing and collaborative research with privacy protection

KEYWORDS
Privacy, collaborative research, medicine, genomics

ABSTRACT
The rapid progress in medical research has led to significant data collection. Analyzing this data can be transformative in answering the key questions about disease associations and our evolution. However, due to growing privacy concerns about the sensitive information of participants, access to medical datasets used in studies, such as genome-wide association studies (GWAS), is restricted to only a limited number of large groups. On the other hand, collaborative research over medical datasets, which will also lead to democratizing medical data sharing, requires sharing data across collaborators. One way to share such datasets across collaborators is through the IRB process and the use of institutional data use agreements. Currently, due to the sensitivity of data, the collaborative medical research can only be carried out after IRB review for all collaborators. In this talk, I will present a sandbox environment in which potential collaborators come together and obtain an accurate "preview" of their collaborative research in an efficient, reproducible (verifiable), and privacy-preserving way. Our proposed framework allows each collaborator to share information about their dataset in a privacy-preserving way within the proposed sandbox environment. This will help the researchers (1) rectify their federated datasets from low-quality, biased, or statistically dependent records, (2) generate an accurate preview of their collaborative research results to provide evidence for benefit versus risk tradeoff in IRB approval, and (3) identify what part of the datasets should be shared among the collaborators (once they obtain the full IRB approval). To achieve these goals, we will develop (1) novel algorithms that enable quality control over federated data while preserving ownership and privacy and (2) algorithms that promote reproducibility research results by developing novel techniques for verifying the correctness of the computation and for sharing the whole research datasets while preserving privacy. Our preliminary results show that the proposed framework accurately provides evidence of reproducibility of research results, identifies low-quality (e.g., statistically dependent) data in federated datasets, and preserves the privacy of individuals in collaborators' datasets.