06 

Data Quality

Evolving Employer Expectations

Thursday June 29, 2023

2:00-3:30 PM EST | 1:00-2:30 PM CST | 12:00-1:30 PM MST | 11:00 AM-12:30 PM PST

Description

What data skills and knowledge do employers expect when recruiting? How can we help our students and researchers acquire those skills? How does artificial intelligence (AI) impact data quality, and can technological advancements alone solve data quality issues? A panel from academia and industry – Douglas Berman (Stanford University), Jack Goetz (Meta Platforms), David Rothschild (Microsoft), and Rujuta Umarji (ICPSR) – will lead a lively discussion.

Session Speakers

Jack Goetz

Senior Research Scientist (Machine Learning Specialist), Meta Platforms, Inc.

Jack Goetz received his PhD in Statistics from the University of Michigan in 2020. His dissertation was on Active Learning, or the algorithmic selection of training data for machine learning models. After graduating, Jack worked at Meta as a Research Scientist, specializing in Natural Language Understanding for the voice assistant deployed on Meta hardware products (Portal, Oculus, Ray-Ban Stories, etc.). He focused on using synthetic and user data for quality measurement and improvement, increasing data efficiency, and developing new methods for privacy-preserving machine learning.


Rujuta Umarji 

Director of Data Curation, Inter-university Consortium for Political and Social Research (ICPSR)

Rujuta Umarji is the Director of Data Curation at the Inter-university Consortium for Political and Social Research (ICPSR). She joined ICPSR over 10 years ago as a Data Curator and is currently responsible for directing Data Curation Unit efforts in the production, archiving, and dissemination of ICPSR data. The Data Curation Unit has grown to 4 teams, each led by a Curation Supervisor, and 25 Curators at 4 levels of experience. As a result, Rujuta has overseen the hiring and onboarding of many staff, ensured consistent hiring standards, and developed career paths within the unit. In addition, she has overseen the implementation of curation standards and improvements to the curation workflow to maintain high-quality data production.


David Rothschild

Economist/Principal Researcher, Microsoft Research

David Rothschild is an economist at Microsoft Research. He has a Ph.D. in applied economics from the Wharton School of Business at the University of Pennsylvania. He has written extensively in both the academic and popular press. His work pushes the boundaries on varying data and methods: polling, prediction markets, social media and online data, and large behavioral and administrative data. His work focuses on solving practical and interesting questions, including mapping and updating public opinion, the markets for both news and advertising, finance, and an economist's take on public policy.


Douglas Berman

Data Governance Program Director, Stanford University

Douglas Berman leads Stanford University’s Data Governance Program. He has over 30 years’ experience managing the implementation, development, and use of data for reporting, analytics, and research. He has led notable projects, including implementing the data warehouse for Kaiser Permanente’s first national electronic health record and providing data services for analytics and research at University of California, San Francisco and University of California Davis Health. Throughout this work, ensuring accurate and appropriate use of data through governance has been a central and critical theme.

He authored and taught the UC Davis Extension class ‘Healthcare Data Quality and Governance’, which has enrolled over 7,000 learners on Coursera.

Presentation Slides

2023.06.29_Session06_Slides.pdf

Webinar Recording

Audio and Transcription

GMT20230629-180001_Recording.m4a
Session 6_Cleaned Transcription
Session 6_Cleaned Transcription.pdf

Transcription (Word) 

Transcription (PDF) 

Supplementary Materials

Glossary