Update to this blog – Since it was such an interesting question, I decided to continue my search for city level weekly dengue case data to answer the original question. This GitHub link provides my research notes to date.
One of the first steps in the Data Science process is identifying what data you need to answer the question. In March 2017, I featured a series of blogs about characteristics of a data scientist. Today I want to add to that discussion by giving a case study of how clarifying questions is also a key part of the data science process.
It all began when I decided to participate in a crowdsourced Driven Data competition to predict local epidemics of dengue fever. I’m passionate about using machine learning and predictive analytics to solve some of the most challenging questions and thought this would be an excellent use of free time. I love learning new domains and data mining techniques to add improve my skills and help others at the same time.
Dengue fever is a mosquito-borne disease with 60,000 reported cases in Perú and Puerto Rico in 2016. (I…
View original post 596 more words