Cell: 650-861-6858
WEBLINKS
https://sites.google.com/view/shih-yu-chang/
https://www.linkedin.com/in/shih-yu-chang-904781116/
https://github.com/shihyuch/
OBJECTIVE
Seeking a position in Data Science, Machine Learning, and their applications.
EMPLOYMENT
Senior Data Scientist Lead, Appzen, Santa Clara, 2018 Oct. – Present
o Design and implement a Chinese-to-English translator, which saves Appzen several hundred US dollars per day.
o Design and implement a new feature extraction function for Chinese receipts, which improves the amount and date identification accuracy rate from 50% to 80%.
Senior Machine Learning Algorithm Engineer, Teradata, Santa Clara, 2017 Nov. – 2018 Oct.
o Designed and implemented a collaborative optimizer analytic function library, improving the time efficiency of distributed machine learning algorithms by 4X.
o Designed a distributed model training framework with light data by applying the sample size theorem, reducing training data and time by 1-2 orders of magnitude.
Principal Software Engineer, Oracle, Santa Clara, USA, 2016 Mar. – 2017 Oct.
o Identified a Solaris 10 network performance bottleneck in the Oracle Public Cloud (OPC) network environment and built a prototype to enhance network throughput by 3X.
o Designed the Role-Based Access Control (RBAC) security architecture for Goolago, an Oracle internal data warehouse for storing machine logs.
Professor, National Tsing Hua University, Taiwan, 2006 Aug. – 2016 Feb.
o Predicted IT marketing by utilizing particle swarm optimization (PSO) and real-valued genetic algorithms (RGA) based on least squares support vector machines (LS-SVM).
o Established a mobile-based learning management system with NoSQL, PHP, and R, which improved the average student grade by 15%.
o Proposed a hierarchical SVM that performs multiclass classification with a parallel learning algorithm, which can reduce traditional SVM learning complexity from O(n^3) to O(n^2).
o Designed INVENRELATION, a framework for generating new ideas mathematically (sold at http://www.amazon.com/INVENRELATION-Shih-Yu-Chang-book/dp/B00EPF4RRI), and guided students to implement the invention methods on Android.
EDUCATION (New)
University of California, Berkeley, Berkeley, CA Jan. 2016 – Aug. 2017
Master of Information and Data Science
Focus Areas: Statistics for Data Science, Statistical Methods for Time Series and Panel Data, Applied Machine Learning, Machine Learning at Large Scale, Storing and Retrieving Data, Data Visualization and Communication, Causal Analysis and Experimental Design.
Selected Data Projects at Berkeley (details can be found on personal website)
· Stock Trend Prediction and Business Strategy Design
o Scraped financial news and performed TextRank semantic analysis to predict companies' stock trends with a 30% accuracy improvement, then applied DQN reinforcement learning to design promotion routes based on the geographical response of Tweets.
· Saving Unemployed by Data Science
o Programmed a data acquisition API for the Federal Reserve Economic Data (FRED) databases with a location query feature, and applied time-series analysis to predict the survival duration for the unemployed by considering individual location and financial factors.
· Impact of Financial Aid and Total Costs on University Graduation Rates
o Identified common problems in analyzing graduation rates and proposed a difference linear model to resolve them; the new model found that universities should not attempt to provide financial aid or reduce total costs in order to improve graduation rates.
· The Causal Effect of Health Education Link for Flu Shot Misconceptions on Twitter
o Designed a Tweet bot to run a randomized controlled experiment; the results showed that the treatment effect was positive in the U.S. but negative outside the U.S.
· Power Data Visualization Projects
o Monitored battery storage stations to understand costs, functions and response alerts.
o Predicted global green energy capabilities based on real-time weather conditions with the PubNub data stream platform.
EDUCATION
· University of Michigan, Ann Arbor Ph.D. EECS (2006), M.S. Mathematics (2005)
· University of Southern California M.S. EE (2001)
· National Taiwan University B.S. EE (1998)
AWARDS and LEADERSHIP EXPERIENCE
· (BEST PAPER AWARD) Best Paper Award, IEEE, IWCMC 2005.
· (RESEARCH AWARD) Awards for papers published in first-class journals, regulated by the NTHU R&D Office, 2009, 2010.
· (Program Chair) The 9th Workshop on Wireless, Ad Hoc and Sensor Networks (WASN 2013)
· (Project Award) Contribution to Oracle Public Cloud networking performance evaluation, 2016.
SKILLS
· Software/Tools: AWS, Tableau, OpenRefine, Hadoop, MapReduce, Spark
· Programming Languages: Python, R, SQL, Java, C, JavaScript/D3.js
· Technical Writing.