At the start of COVID, our team at Boston Children's Hospital was competing with Johns Hopkins University to create THE reference map for tracking the spread. As you know, we were not successful in that endeavor, but we did manage to create a functional map that could display current cases and show an animation of how the numbers grew over time. My responsibilities during this time included:
Collaborate with a team of curators to streamline data input processes.
Implement scripts for data cleaning and verification.
Create an ETL process connecting the database and the website.
Create web scrapers that could automatically retrieve data and add it to our database.
Create strategies for data management, including database choice, features to keep or drop, and AWS pricing.
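To give a flavor of the cleaning and verification work, here is a minimal sketch in Python. It is not the project's actual code: the record layout (`country`, `date`, `cases`), the date format, and the function names are all hypothetical, chosen only to illustrate the kind of checks such a script might run before a row reaches the database.

```python
from datetime import date, datetime

# Hypothetical record layout; the real project schema is not shown here.
REQUIRED_FIELDS = ("country", "date", "cases")


def clean_record(raw):
    """Return a normalized record, or None if the row fails verification."""
    # Reject rows with a missing or blank required field.
    for field in REQUIRED_FIELDS:
        value = raw.get(field)
        if value is None or not str(value).strip():
            return None
    try:
        # Assumed ISO date format (YYYY-MM-DD).
        day = datetime.strptime(str(raw["date"]).strip(), "%Y-%m-%d").date()
        # Tolerate thousands separators such as "1,234".
        cases = int(str(raw["cases"]).replace(",", "").strip())
    except ValueError:
        return None
    # Case counts cannot be negative, and reports cannot come from the future.
    if cases < 0 or day > date.today():
        return None
    return {
        "country": str(raw["country"]).strip().title(),
        "date": day,
        "cases": cases,
    }


def clean_all(rows):
    """Clean every row and drop the ones that fail verification."""
    cleaned = (clean_record(row) for row in rows)
    return [row for row in cleaned if row is not None]
```

Rejected rows are simply dropped here to keep the sketch short; in practice they would more likely be logged and sent back to the curators for review.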
All in all, we created an amazing tool that I am very proud of. Unfortunately, Johns Hopkins got there before us. The project was eventually dropped in order to focus on creating a database that could handle unexpectedly large volumes of data in any future pandemic.
You may see the remnants by following this link.
Tags: Backend, Python, Data Engineering, ETL, Sheets