Rahul Pathak
General Manager, Amazon EMR & Amazon Athena, Amazon Web Services
Rahul Pathak runs the Amazon EMR and Amazon Athena businesses at AWS and helps customers solve big data problems at petabyte-scale and beyond, utilizing techniques of distributed systems, performance engineering, and open source frameworks. With Amazon EMR Rahul enables customers to launch and run managed Apache Hadoop, Hive, Spark, Presto, HBase clusters at massive scale with support for HDFS, S3, and EC2 spot. With Amazon Athena he enables customers to query their data in Amazon S3 using standard SQL with no infrastructure to manage. Besides Amazon EMR & Amazon Athena, Rahul's specialities include Hadoop, Spark, Presto, Hive, Big Data, Data Warehousing, RDBMS, Product Management, Memcached, Oracle, MySQL, MPP, Redshift, RDS, and Cloud Computing. Rahul is an alumni of Massachusetts Institute of Technology.
John Carnahan
Executive Vice President, Ticketmaster
John Carnahan is responsible for overseeing all aspects of data science and strategy at Ticketmaster. With more than 25 years experience in software development, big/fast data computation and data science, John has held engineering and leadership positions at Overture, Yahoo, Fox Interactive Media. He joined Ticketmaster from the Rubicon Project where he was the CTO building innovative solutions in the area of computational advertising. He joined Ticketmaster to bring the power and legacy of advertising technology into the live event industry. Since joining Ticketmaster he has built cohesive and independent teams building software in areas such as abuse prevention, distributed commerce, personalization and marketing. Mr. Carnahan has been a founding member and key contributor to many open source projects. Mr. Carnahan has been a researcher and author in a wide range of disciplines including machine learning, distributed computing and quantitative genetics for over 20 years.
Ian Small
Chairman, TokBox
Ian is currently Chairman at TokBox, the leading real-time video Platform-as-a-Service provider, and a wholly-owned independently operated subsidiary of Telefónica. Ian was formerly the global Chief Data Officer at Telefónica SA and a member of Telefónica's global Executive Commitee. As gCDO, Ian formulated and drove implementatino of Telefónica Group’s data strategy, as well as being responsible for group-level BI and Big Data units, Telefónica's advertising business, the development and operation of next-generation communication services across Telefónica’s operating geographies, as well as management and oversight of group innovation and R&D units. Prior to Telefónica and TokBox, Ian was Senior Vice President, Products at MarkLogic, a highly-scalable high-performance NoSQL database provider, where he was responsible for all product management and engineering. Prior to MarkLogic, Ian was an executive at CKS, USWeb/CKS and MarchFIRST, and held engineering management and research roles at Apple. Ian holds numerous US patents, has degrees in Engineering Science and Computer Science from the University of Toronto, and definitely prefers dogs to cats.
Reynold Xin
Chief Architect, Databricks
Reynold is a co-founder and Chief Architect of Databricks, responsible for technical directions of the company. He is the top committer on the Apache Spark project, and led the efforts to create Spark 2.0, revamping the entire user-facing APIs as well as a whole new execution engine. He holds the 2014 world record for creating the fastest system to sort 100TB of data, and the current (2016) record for the cheapest to sort 100TB. Prior to Databricks, he was pursuing PhD research at the UC Berkeley AMPLab, where he focused on scalable data processing. He wrote the highest cited papers in SIGMOD 2011, 2013, and 2015, and won Best Demo Award at VLDB 2011 and SIGMOD 2012.
Cindi Thompson
Principal Data Scientist, Apple
Cindi is a principal data scientist with and a unique blend of academic and industry experience in machine learning, natural language understanding, and R&D. Naturally collaborative problem solver able to bridge technical and business concerns using strong communication and facilitation skills, Cindi created curriculum and taught courses in a top MS Analytics program. At US Firm (PwC), she led a research project that resulted in a text analysis tool deployment to entire company. Cindi's skills include designing, building, and applying machine learning algorithms, agile scrum & traditional software project management, user requirements elicitation, user acceptance testing, teaching, and proposal presentation. She obtained three patents and authored several dozen publication. Cindi holds a PhD degree in artificial intelligence from The University of Texas at Austin.
Mark Madsen
President, Third Nature, Inc.
Mark spent most of the past 25 years working in the data engineering & science field, starting in AI at the University of Pittsburgh and robotics at Carnegie Mellon University. Today he is president of Third Nature, where he advises global organizations on data strategy and technology infrastructure to support data science and data governance. Mark focuses on two types of work: applying data engineering & science to business problems and guiding the construction of data infrastructure. He has designed analysis, data collection, and data management infrastructure for companies worldwide. He is also involved with emerging technology as a researcher, speaks about analytics internationally, sits on the O’Reilly Strata conference committee and chairs the Accelerate data science and analytics conference. Mark holds Masters degree in Software Engineering from Carnegie Mellon University.
Chris Wensel
Principal Architect/VP, Central Performance at Salesforce
Chris Wensel is the author of the Cascading data processing open-source project and was recently the Founder/CTO of Concurrent Inc. He also co-founded Scale Unlimited, the first Hadoop and "Big Data" related professional services and training company, where he mentored companies like Sun Microsystems, Apple, and numerous startups in the Bay Area. Chris bootstrapped his first Internet startup in the early 90's, creating an early Web server-side scripting language used by companies in the real estate and insurance verticals. During the late 90's, Chris focused on distributed-agent based systems where he received several patents on distributed computing. From there he became Chief Architect for the fastest growing business unit at Thomson Reuters. Chris also advises several startups in the "Big Data" technology space. Chris is an alumni of Texas A&M University
Ben White
Co-founder and VP, Engineering Services, Data Republic Inc.
Ben is responsible for Data Republic's unique approach for accelerating and simplifying the process of building analytic pipelines on Big Data systems. Prior to founding Data Republic, Ben was one of the first field engineers for Cloudera, driving commercial adoption of Hadoop. He previously spent a decade designing, building and deploying data-driven forecasting applications, and earned Bachelor's and Master's degrees in Computer Science from the University of Melbourne, Australia, where he also contributed to Deductive Database research.
Mikhail Lyukmanov
Vice President, Data Engineering, Amobee
Mikhail Lyukmanov is the Vice President, Data Engineering of Amobee, the company defining digital marketing. Amobee brings marketers, agencies, publishers and operators an innovative digital marketing technology platform and solutions. The Amobee technology platform enables advertisers to run data-driven, targeted, cross channel digital marketing campaigns across display, video, social, mobile, email and native that deliver results at scale. Amobee Brand Intelligence allows brands to analyze and target the world's content consumption across web, social, and video in real time. Prior to Amobee, Mikhail was the VP of Data Engineering of Adconion Direct. Mikhail is an alumni of Nizhniy Novgorod State Technical University.
Julian Hyde
Architect, Hortonworks
Julian Hyde is an expert in query optimization, in-memory analytics and streaming, and an active developer of open source database software. An architect at Hortonworks, he is the original developer of Apache Calcite, the query planning framework behind Apache Hive, Drill and Kylin. He was also the original developer of Mondrian, an open source OLAP engine that was acquired by Pentaho (now Hitachi Ventara). In 2003, he co-founded SQLstream, a pioneering engine that queries data-in-flight using standard SQL; before that, he was a database kernel engineer at Broadbase and Oracle. He holds a degree in Computer Science from Cambridge University, England.
Greg Rokita
Executive Director, Technology, Edmunds.com
Greg is responsible for Edmunds Data Engineering platform, Inventory and Pricing & Statistics. Prior to his current role, Greg architected Edmunds [Data] Publishing System, co-architected Content Management System and authored Digital Asset Management platform. Prior to Edmunds, he worked on United Airlines aircraft scheduling system, created protein biochip Data Profiling platform, and authored video streaming solutions for a company acquired by Cisco. Beyond architecting content, messaging and search solutions for companies in Silicon Valley, including the fastest growing subsidiary of Thomson Reuters, he spoke at several conferences, including Hadoop World, organized the first ever Edmunds Technology Conference and funded the Data Engineering & Science Council. Greg holds an MS degree in Computer Science from Stanford University with specialization in Databases and Distributed Systems.