Huafeng (Hua) Zhang

A critical thinker who is passionate about delivering actionable HUMAN insights using data-driven solutions

Coding: R, Python, SQL, GitHub
Statistics: Hypothesis Testing, Clustering & Regression Modeling, Natural Language Processing, Sampling, Experimental Design, Mediation Analyses
Montana State University
B.S. Mathematics - Statistics 2017
Minor in Economics (GPA: 3.76)
Major GPA 3.80, Overall GPA: 3.80
Pi Mu Epsilon (Math Honor Society)
Graduate with the Highest Honor
Additional Info
• Completed graduate level statistics courses on sampling, experimental design, probability theory
• Completed data science courses on DataCamp and Coursera
• Fluent in Chinese and English
the Refugee Center Online · Data Science Consultant
11/'17 - 08/'18
Portland, OR
Quant Research Analyst II · Google LLC · 
Seattle, WA
07/'18 - Present

  • Designed and developed workflows and dashboards to monitor unexpected internal user behavior such as fraudulent invoices/PO/Payment, data policy violation and etc
  • Created user metrics of several Google core systems such as Ads and YouTube to quantify business impact of users' risky events 
  • Built advanced analytical tools for auditors to enable them to do statistical sampling and visualize text data using one click button
  • Created and hosted data manipulation trainings and statistical learning sessions to empower auditors and people analysts to work with data independently, effectively and rigorously 
  •  Applied social science and statistics to support Google's diversity, equity, and inclusion goals

Data Scientist · CityBldr · 
Seattle, WA
10/'17 - 06/'18

  •  Lead project on translating unstructured information into interpretable features for CityBldr's machine learning pipeline using NLP techniques such as Naive Bayes and Topic Modeling 
  • Developed comprehensive model of each city’s unique development infrastructure by combining data from various sources (the only one in the data science team to build and implement the project pipeline)
  • Scraped and stored real estate/GIS data using web scraping, SQL and QGIS
  • Pulled insights using multivariate data analysis to inform company strategy and public communication messages, and presenting results to engineering, project management and sales teams

Statistical Consultant · Montana State University · 
Bozeman, MT
01/'17 - 08/'17

  • Helped researchers identify patterns in antibody profiles by performing hierarchical cluster analysis, and applied a chi-squared test to compare these patterns to the antibody profiles 
  • Presented Monte Carlo simulation of African lion population sampling to ecology researcher using R shiny
  • Summarized project status and provided feedback to consultants and clients

Improving a Predictive Model of Student Progress by Adding Learned Features from Unstructured Text Data

Applied mixed effects logistic regression approach and investigating NLP methods to derive additional features from unstructured text responses.

Financial Health of United States Banks in 2016

Used k‐means cluster analysis in R to study patterns of financial health; found failed banks were all in one cluster.

Study and Analysis of Household Wi‐Fi Speed

Designed balanced 3‐factor factorial experiment; collected data; used three‐way ANOVA and orthogonal contrast test in SAS.

Algorithms for Climate Data Sonification

Built generalized additive model for temperature data; created R algorithm to see if sonification can enhance understanding.

Analysis of Website Traffic Data

Used Google Analytics API for traffic data; applied ANOVA F‐Test and Tukey‐Kramer test to find optimal maintenance days.

Conditional Probability and Information Retrieval

Explained how to use conditional probability to solve a particular problem in the field of information retrieval.