Summary Profile

I am a curious, solution-oriented Data Scientist seeking to assist end-to-end decision making. After earning a degree in Computer Science at the University of Georgia, I used my programming skills at Griffin & Strong where I communicated with experts who specialize in Disparity Studies, automated their analysis, and tailored a unique data pipeline for four different governments. The final analysis now serves as a legal factual predicate. I currently work at Guidehouse where I consult with the Center of Medicare and the NIH supporting a variety of their data and automation needs.

Work Experience
GuidehouseData Scientist
Jan. 2021 - Current

  • Supports data analytics and compliance oversight for the Center for Medicare and Medicaid Innovation's Kidney Care Choices model

  • Supports statistical programming for the Surgeon General's Oral Report

  • Consults with clients from a variety of backgrounds and helps articulate their software needs
  • Gathers requirements and translates ETL needs of clients creating dashboards to drive decision making

Griffin & Strong P.C.Data Scientist
May 2020 - Jan. 2021

  • Leads efforts to take on time consuming and costly problems using automation and machine learning
  • Creates ETL pipelines for reproducible transformations of data
  • Standardizes operating practices for efficient and validated research
  • Mentors junior analyst in improving their analysis process

Griffin & Strong P.C.Data Analyst
Nov. 2018 - May 2020
  • Performs multifaceted analysis on collected data from governmental organizations
  • Translates research methodology into software and adapts models to each organization's constraints
  • Isolates data gaps and anomalies and cleans data for reproducible analysis
  • Communicates with clients to collect data and present findings 
Projects
Disparity Analysis

The completed disparity analysis of the City of Chattanooga, City of Frederick, Cuyahoga County, and Mecklenburg procurement data. The analysis is meant to serve as a factual predicate for proposed policy changes and inclusion programs. This includes everything from data collection to analysis. The findings were then presented to stakeholders within these organizations.

Data Profiler

A Python software allowing non-programmers to take advantage of the open-source "pandas-profiling" library which creates comprehensive HTML summaries of raw data files. This summary includes data warning, missing values, and correlations. The front-end was built using PyQt and can run on any operating system.

Equity Lib

A open-source Python library containing pre- and post-processing algorithms for wrangling common problem patterns in Excel files. The primary problem solved is resolving entities in joined data systems and traceability of rule based decisions.

Forecasting Avocado Prices

An analysis of historical avocado price data implemented using the open-source Prophet library to predict future prices

PASSNYC: Data Science for Good

An analysis of NYC public schools which provides a recommendation to PASSNYC for how they should distribute their services to improve the diversity of the specialized high schools

Tools
Programming: Python, C++, Java, JavaScript
Data Analysis: Pandas, Plotly, Dash, Altair, Seaborn, Superset, Prophet
Database/Big Data: SQL, NoSQL, Spark, Vaex
Machine Learning: Scikit-learn, Dedupe, Tensorflow
Workflow: Bash, Git, Jupyter, Org-mode, Unix, Emacs, Kedro
Cloud: AWS, GCP
Devops: Docker, Kubernetes, Terraform
Education
University of Georgia
Bachelor of Science Computer Science 2018
• Cumulative GPA: 3.55 (Cum Laude)