Huafeng (Hua) Zhang

A data-driven researcher who is passionate about delivering actionable insights

Coding: R, Python, SQL, GitHub
Statistics: Hypothesis Testing, Clustering & Regression Modeling, Natural Language Processing, Sampling, Experimental Design, Mediation Analyses
Business: Project Management & Planning, Communication and Reporting, Research Prioritization, Time Management, Influencing, DEI Awareness
Survey: Survey Design, Survey Implementation, Survey Analysis, Survey Testing
Montana State University
B.S. Mathematics - Statistics 2017
Minor in Economics (GPA: 3.76)
Major GPA 3.80, Overall GPA: 3.80
Additional Info
• Completed graduate level statistics courses on sampling, experimental design, probability theory
• Completed data science courses on DataCamp and Coursera
• Fluent in Chinese and English
the Refugee Center Online · Data Science Consultant
11/'17 - 08/'18
Portland, OR
Quant Research Analyst II · Google LLC · 
Seattle, WA
07/'18 - Present

  • Drove long-term strategies to improve user experience by effectively influencing product leads to act on foundational global user insights on a core product with over 4B users

  • Identified growth opportunities across several Google core products such as Ads, Search and YouTube by quantifying business impact of users interactions. Recommendations are helping leadership plan efforts across functions

  • Ensured equitable compensation by leading Google's Annual Pay Equity program in  2020. Insights from statistical analysis were reviewed by Google's top leadership. Rigorously calculated adjustments were directly applied to employee's annual compensation

  • Led mixed methods research on hiring,  performance reviews and internal mobility to help leadership design more inclusive and effective people processes

  • Enabled auditors, program managers and product managers to pull data insight, and to visualize findings using one click button solution  by  providing them advanced analytical tools
  • Empowered auditors and analysts to work with data independently by leading data trainings and statistical learning sessions
  • Received Superb rating (top 2% of all employees) 

Data Scientist · CityBldr · 
Seattle, WA
10/'17 - 06/'18

  •  Led projects on translating unstructured information into interpretable features for CityBldr's machine learning pipeline using NLP techniques such as Naive Bayes and Topic Modeling 
  • Developed comprehensive models of extracting business insights by combining data from various sources (the only one in the data science team to build and implement the project pipeline)
  • Created insights using multivariate data analysis to inform company strategy and public communication messages, and presenting results to engineering, project management and sales teams

Predictive Modeling on Refugees' Online Learning Behavior

Applied mixed effects logistic regression approach and NLP methods to derive understanding of refugees' online learning behavior to better support them getting citizenship tests passed in the states.

Financial Health of United States Banks in 2016

Used k means cluster analysis in R to study patterns of financial health; model is able to detect all failed banks during 2008 financial crisis in one cluster.

Analysis of Website Traffic Data

Used Google Analytics API for traffic data; applied ANOVA test and Tukey‐Kramer test to find optimal maintenance days for web developers.