Links from talk "what is Data Science?" (hackNY lecture 2016-06-15):
====
## context
- [google search trends: data science](https://www.google.com/trends/explore#q=%22data%20science%22%2C%20%22data%20scientist%22)
- [indeed job trends: data science](http://www.indeed.com/jobtrends?q=%22data+science%22%2C%22data+scientist%22&l=)
- [drew's data science venn diagram, sept 2010](http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram)
- [har har](https://twitter.com/chrishwiggins/status/610468719169568769)
## modern history (2009-2001)
- [modern published origin: hammerbacher chapter](http://www.amazon.com/Beautiful-Data-Stories-Elegant-Solutions/dp/0596157118)
- [contemporary discussion](http://flowingdata.com/2009/06/04/rise-of-the-data-scientist/#comment-30568)
- [elaboration, 2009](http://flowingdata.com/2009/06/04/rise-of-the-data-scientist/#comment-30568)
- [nathan's advisor, early advocate for "science of data"](http://www.journalism.columbia.edu/profile/428-mark-hansen/10)
- [cleveland 2001 "data science"](http://www.stat.purdue.edu/~wsc/papers/datascience.pdf)
- [cleveland publication list](http://www.stat.purdue.edu/~wsc/papers.html)
## ancient history (1962-2000)
- [tukey on visualization](http://stat-graphics.org/movies/prim9.html)
- [tukey quotes and biography](http://www-history.mcs.st-and.ac.uk/Biographies/Tukey.html)
- [john chambers on statistical computation including tukey influence](http://statweb.stanford.edu/~jmc4/papers/Neyman98.ps)
- [tufte's VDQI](http://www.edwardtufte.com/tufte/books_vdqi)
- [JWT's EDA](http://www.amazon.com/Exploratory-Data-Analysis-John-Tukey/dp/0201076160)
## NYT
- [TimesMachine: Thursday September 18, 1851 - NYTimes.com](http://timesmachine.nytimes.com/timesmachine/1851/09/18/issue.html "Visit this page")
- [The New York Times Introduces a Web Site - NYTimes.com](http://www.nytimes.com/1996/01/22/business/the-new-york-times-introduces-a-web-site.html "Visit this page")
- [The New York Times Company | The New York Times Company](http://www.nytco.com/ "Visit this page")
- [The leaked New York Times innovation report is one of the key documents of this media age » Nieman Journalism Lab](http://www.niemanlab.org/2014/05/the-leaked-new-york-times-innovation-report-is-one-of-the-key-documents-of-this-media-age/ "Visit this page")
- [How The New York Times Works](http://www.popularmechanics.com/technology/a14030/how-the-new-york-times-works/ "Visit this page")
- [Machine Learning For Journalism at The New York Times | Techjaw](http://techjaw.com/2015/02/01/machine-learning-for-journalism-at-the-new-york-times/?utm_content=buffer7adca&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer "Visit this page")
- [Free-fall: Adjusted for inflation, print newspaper advertising revenue in 2012 was lower than in 1950 - AEI | Carpe Diem Blog ](https://www.aei.org/publication/free-fall-adjusted-for-inflation-print-newspaper-advertising-revenue-in-2012-was-lower-than-in-1950/ "Visit this page")
- [Whats A Startup? First Principles. | Steve Blank](http://steveblank.com/2010/01/25/whats-a-startup-first-principles/ "Visit this page")
## data science including in industry
- [CW/JMH class spring 2015](http://modelingsocialdata.org)
- [FB paper](http://www.pnas.org/content/111/24/8788.full)
- [NFLX on A/B testing](http://techblog.netflix.com/2012/04/netflix-recommendations-beyond-5-stars.html)
- [GOOG on multi armed bandits](https://support.google.com/analytics/answer/2844870?hl=en)
- [site with visualization of topic models for articles](http://daeilkim.com)
- [example of an API](http://nsawatch.me/api)
## learn more
- [CW/JMH class data science in R github repo](http://github.com/jhofman/msd2015)
- [CW class data science in python repo](http://github.com/ledeprogram/courses/tree/master/algorithms/)
- [example of playing with data, by a hackNY alumnus](http://abe.is/analyzing-citibike-usage/)
- [list of datasets from JMH](http://modelingsocialdata.org/homework/2015/03/27/final-project.html)
- [Big free stats book "The Elements of Statistical Learning"](http://statweb.stanford.edu/~tibs/ElemStatLearn/)
- [textbook on statistics in R w/o calculus](http://pluto.huji.ac.il/~msby/StatThink/)