Hacking skills, Math & statistics knowledge, Substantive Expertise
Machine Learning, Data Science, Traditional Research
Raw data – processing – data set – statistical models / analysis – machine learning predictions – data driven products, reports visualization blogs
‘substantive expertise’
– knows which question to ask
– can interpret the data well
– understands structure of the data
– data scientist often work in team
Data science can solve problems you’d expect…
– netflix, social media, web apps, (okcupid, uber, etc)
bioin formatics, urban planning, a straphysics, public health, public health, sports
Tools to use
– Numpy
– multidimensional arrays + matrices
– mathematical functions
– Pandas
– handle data in a way suited to analysis
– similar to R
Both common among data scientists