A critical thinker who is passionate about delivering actionable HUMAN insights using data-driven solutions
Montana State University |
|
Pi Mu Epsilon (Math Honor Society) |
04/'17
|
Graduate with the Highest Honor |
04/'17
|
• Completed graduate level statistics courses on sampling, experimental design, probability theory |
|
• Completed data science courses on DataCamp and Coursera |
|
• Fluent in Chinese and English |
|
the Refugee Center Online · Data Science Consultant |
11/'17 - 08/'18
|
Portland, OR |
Quant Research Analyst II · Google LLC ·
Seattle, WA
|
07/'18 - Present
|
- Designed and developed workflows and dashboards to monitor unexpected internal user behavior such as fraudulent invoices/PO/Payment, data policy violation and etc
- Created user metrics of several Google core systems such as Ads and YouTube to quantify business impact of users' risky events
- Built advanced analytical tools for auditors to enable them to do statistical sampling and visualize text data using one click button
- Created and hosted data manipulation trainings and statistical learning sessions to empower auditors and people analysts to work with data independently, effectively and rigorously
Applied social science and statistics to support Google's diversity, equity, and inclusion goals
Data Scientist · CityBldr ·
Seattle, WA
|
10/'17 - 06/'18
|
- Lead project on translating unstructured information into interpretable features for CityBldr's machine learning pipeline using NLP techniques such as Naive Bayes and Topic Modeling
- Developed comprehensive model of each city’s unique development infrastructure by combining data from various sources (the only one in the data science team to build and implement the project pipeline)
- Scraped and stored real estate/GIS data using web scraping, SQL and QGIS
- Pulled insights using multivariate data analysis to inform company strategy and public communication messages, and presenting results to engineering, project management and sales teams
Statistical Consultant · Montana State University ·
Bozeman, MT
|
01/'17 - 08/'17
|
- Helped researchers identify patterns in antibody profiles by performing hierarchical cluster analysis, and applied a chi-squared test to compare these patterns to the antibody profiles
- Presented Monte Carlo simulation of African lion population sampling to ecology researcher using R shiny
- Summarized project status and provided feedback to consultants and clients
Improving a Predictive Model of Student Progress by Adding Learned Features from Unstructured Text Data |
|
Applied mixed effects logistic regression approach and investigating NLP methods to derive additional features from unstructured text responses.
Financial Health of United States Banks in 2016 |
|
Used k‐means cluster analysis in R to study patterns of financial health; found failed banks were all in one cluster.
Study and Analysis of Household Wi‐Fi Speed |
|
Designed balanced 3‐factor factorial experiment; collected data; used three‐way ANOVA and orthogonal contrast test in SAS.
Algorithms for Climate Data Sonification |
|
Built generalized additive model for temperature data; created R algorithm to see if sonification can enhance understanding.
Analysis of Website Traffic Data |
|
Used Google Analytics API for traffic data; applied ANOVA F‐Test and Tukey‐Kramer test to find optimal maintenance days.
Conditional Probability and Information Retrieval |
|
Explained how to use conditional probability to solve a particular problem in the field of information retrieval.