MediaHound · Data Engineer |
Implemented Apache Airflow for data ingest pipeline.
Updated codebase to a container-based architecture.
Upgraded codebase from Python 2 to Python 3.
Drove increased unit test coverage.
Created integration and end-to-end tests for several key projects.
Developed automated CI testing on Bamboo.
Automated deployments with Bamboo.
Built out pipelines for new data sources.
Refactored large portions of the codebase into a more module style.
Added new Python 3 features to existing codebase.
Edgewater Ranzal · Consultant - Big Data Practice |
Refactored internal library leading to a 8x speedup while adding features.
Engineered production ETL pipelines with CloverETL, Java and Python.
Automated deployments with Ansible.
Developed BI dashboards with Kibana.
Deployed applications with Ubuntu, Gunicorn and Nginx.
Integrated, cleaned and transformed large datasets with Python Pandas.
Delivered Data Discovery Apps with Oracle Information Discovery.
Built custom data pipelines using Python, MongoDB and S3.
Created dashboards using Flask.
Built developer tools with Python, reducing development time and error rate.
Developed data lakes with Hadoop, Pig, Hive and Sqoop.
Deployed applications with AWS Ec2.
Optimized SQL queries to improve ETL performance.
Installed and configured Oracle EBS on AWS.
ITW · Sr. Sourcing Analyst |
Built reporting application that integrated multiple external data sources.
Transformed, cleansed and analyzed large data sets with Python Pandas.
Integrated large datasets from multiple legacy systems.
Delivered key insights leading to cost savings of $700,000.
Automated data collection and reporting with Python.
Optimized SQL queries.
Developed a data mart and analytics dashboard with MySQL and Sinatra.
|
Resources Global Professionals · Analytics Consultant |
Developed MS Access databases.
Integrated and normalized large datasets with Excel & Python.
Built data pipelines and analytics workbooks with Python and Excel.
Delivered high quality analysis using SQL.