Hey, I'm
-Hangyu (Cedric) Liu-
A growing data scientist!
I am currently a final-year Master's student at Brown University, majoring in Data Science.
I am now working as a Data Science Co-op on Wayfair's MAD science team, focusing on developing a Bayesian hierarchical marketing mix model to optimize the allocation of advertising expenditure.
This summer I worked at Biogen Inc. as a Data Mining intern, where I focused on NLP tasks such as topic modeling and a text similarity search engine. For the topic modeling part, I collaborated with Prof. Tracy Ke at Harvard University to extend her new SVD-based topic modeling method using an autoencoder. I also developed a web application with Flask and Dash to improve ease of use. (Web Application Demo)
I received my B.S. degrees in Management Science and Information System & Mathematical Economics from Xiamen University (2014-2018).
My areas of interest are: Advanced Data Visualization (Dash and Plotly), Machine Learning, Computer Vision, Natural Language Processing, Recommender Systems, Reinforcement Learning, Causal Inference, A/B Testing and Data Mining.
Data Collection and Wrangling: Python: BeautifulSoup, pandas, NumPy, SciPy, json; R: rvest, dplyr, jsonlite; RegEx
Data Visualization: Python: seaborn, matplotlib, Plotly, Dash, TabPy; R: ggplot2, lattice, htmlwidgets
Data Modeling and Machine Learning: Python: scikit-learn, XGBoost, mlens.ensemble; R: e1071, nnet, caret
Hyperparameter Optimization: Grid Search, Random Search and Bayesian Optimization in Python and R
Version Control: GitHub, ReviewNB
Deep Learning Frameworks: TensorFlow, Keras
Big Data & Scalability: PySpark, Airflow
Other Skills: Julia (built SVM, neural network, etc. from scratch), Unix Shell (Bash), SQL, C, Jupyter Notebook, A/B Testing, Python-driven Web Applications, Time Series Analysis, NLTK and gensim (NLP)
English (Full Professional Proficiency); Mandarin (Native)
Wayfair Inc., Data Science Co-op, Boston MA Aug. 2019-Present
Biogen Inc., Data Mining Intern (Demo), Cambridge MA May 2019-Aug. 2019
Xiamen Yucheng Limited, Co-Founder, Xiamen China Apr. 2016-Jun. 2018
Toxic Comment Classification (Demo) Apr. 2019-May. 2019
One-Shot Learning & CV: Face Verification System (Demo) Dec. 2018-Jan. 2019
Kaggle Competition: Predicting Housing Prices with High-dimensional Features (Top 15%) Nov. 2018-Dec. 2018
Data Acquisition & Deployment: Spotify and Billboard Text Data Analysis Oct. 2018-Nov. 2018
JDD-2017 Global Data Challenge – Transaction Risk Detection Oct. 2017-Dec. 2017
Contact:
E-mail: hangyu_liu@brown.edu
Phone Number: (+1) 401-601-1828
LinkedIn: www.linkedin.com/in/hliu5
GitHub: https://github.com/Cedric-Liu/Coding-Journal/tree/master/Model%20Building%20From%20Scratch
Projects Writing Sample: https://xinyanhe1.wixsite.com/2040finalproject
Address:
Greater Boston Area