Over the 4 years, I wore many hats, but I'm most proud of establishing, growing and managing the data team that was responsible for building, maintaining and scaling the company’s data pipelines.
- Major challenges :
- seeking out new sources of data for finding and categorizing companies
- managing how we tracked 4000+ companies in their daily activities, scraping the latest news, seeking out their future plans to start clinical trials, attend conferences, hire new talent, release results, apply for funding, etc
- designing data models and pipeline processes for the acquisition of all the above collected data
- automating the processes to reduce the need for human intervention and review
- creating machine learning models to assist in the categorization process
- Technologies: Ruby, Ruby on Rails, React.js, Javascript, Mongodb
- Skills Developed :
- test-driven software development, with weekly code reviews
- extracting, transforming and loading (ETL) large datasets
- designing and automating the loading of new datasets
- defining metrics to help the team stay focused on projects aligned with the business
- experimenting with processes to help improve the team’s workflow
- growing a team
- managing team member’s personal growthT