Introductory Resources for Data Science

Getting started in data science can be a daunting task. The area is broad, with little guidance on where one topic start and the next ends. Because of that, I've pulled together this list of resources that particularly helped me when I was starting out.

I've broken the links into the following four categories, each of which distinguishes between the different skills required by data scientists:

  1. Data Analytics - Involves feature engineering and statistical tests.
  2. Machine Learning - Involves model development and prediction.
  3. Data Engineering - Involves database management and design.
  4. Produce Analytics - Involves product design and user engagement.

Here is a list of texts that I found particularly interesting, which I hope is useful to others who are starting out.

For the Data Analyst

For the Machine Learning Coder

For the Data Engineer

  • The SQL School by Model Analytics is perfect to learn the basics.
  • SQLZoo is recommended to practice queries.
  • SQL Joins Visualizer is nice to see the difference between commands.
  • W3schools is my go-to reference to look up SQL keywords.
  • Bill Howe's Introduction to Data Science on Coursera has a nice few videos discussing the differences between database designs. The course also introduces - MapReduce and the history of different schemas. It's honestly a nice way to spend a weekend!

For the Product Analyst

  • Lean Analytics, by Eric Ries, is the most concise introduction to web analytics I've come across.
  • Customer Churn is discussed in this iPython Notebook, with a coded example.
  • Zero to One, by Peter Thiel, is a leisurely read that introduces the different factors to consider when building a product.
  • Hacker News is my way to catch up on tech news each evening! If you find yourself waiting around with nothing to do,  open it up and explore the latest from the world of tech.

Finally, Hilary Parker and Brian Coffey (both from Stitch-Fix) have their own excellent list of recommended data science books.

comments powered by Disqus