8 Of The Best Data Science Books for Learning The Science Of Data

Data science is no longer for the stereotypical “geek.” As the world continues to increase its online activity, data analytics are more important than ever. Data shapes the way we run businesses, manage political campaigns, and review scientific studies. Being able to look at a set of data and extrapolate meaning is a skill that is rapidly growing in demand, and one that data science books can help you with. 

On the surface, it can seem intimidating (especially for those of us who haven’t had to take a math class since high school). But, as is true with coding, you don’t need to excel at math to be able to work with data. That’s why this article on data science books has been divided into two parts: “Introduction to Data Science Skills” will give you a foundation, while “Applied Data Science” will give you a picture of the many ways in which data can be applied and how it affects the world around us. 

(Note: As with many disciplines in technology, there is a glaring lack of diversity both in who gets hired into the field and in who gets to write about it. There is one author of color on this list. There should be more. To take a peek into some of the data behind why diversity in tech is such a pressing issue, check out this article by Devskiller.) 

Data Science Books That Introduce Skills

Doing Data Science: Straight Talk from the Frontline by Cathy O’Neil and Rachel Schutt

Based on Columbia University’s Intro to Data Science class, Doing Data Science is a great starting point for anyone interested in the field. Data Scientists from companies such as Google and Microsoft share algorithms, models, and case studies they use to give the reader a more practical application. 

The Art of Data Science by Roger D. Peng and Elizabeth Matsui

Written by two authors with extensive experience in managing and conducting data analysis, The Art of Data Science distills their knowledge into a digestible form. This book sticks to the fundamentals, making it a bit basic for experienced data scientists, but perfect for beginners.

Data Science For Dummies by Lillian Pierson

For readers looking for a primer covering the expanse of data science, this is the book to start with. Rather than covering instructional information, Data Science for Dummies broaches which technologies, programming languages, and mathematical methods to focus on. 

Applied Data Science

The Signal And The Noise: Why So Many Predictions Fail—But Some Don’t by Nate Silver

Building on his groundbreaking experience in creating a system for predicting baseball performance, Nate Silver explores the world of prediction and the ways in which we can extract truth from data. Throughout the book, Silver examines forecasters of various types—including weather, poker, basketball, and the stock market—to evaluate the way they think and make predictions. He ultimately concludes that the most accurate forecasters have a superior command of probability, and by the end of this book, you’ll have the insight necessary to make stronger, more informed predictions. 

Naked Statistics by Charles Wheelan 

How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? In Naked Statistics, Wheelan shows that with the right data and some statistical tools, we can have the answers to these questions and more. Though some of us may want to bolt at the mention of statistics, Wheelan breaks down technical details and instead, focuses on the underlying human intuition. 

Automate This: How Algorithms Came To Rule Our World by Christopher Steiner

Increasingly, high-level tasks such as legal interpretation and policy analysis are being handled by algorithms. What started as basic computer logic has now surpassed our idea of what a bot can do. In Automate This, Steiner tells the story of how algorithms took over and are beginning to touch all aspects of our lives. He poses the question: What would the world look like if bots took over the human work of interpreting data and automating human instinct? 

Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are by Seth Stephens-Davidowitz

With the amount of data the world amasses on a daily basis, we’re often left with astonishing untapped data from which we can learn about the human psyche. This data offers insight into our biggest fears and desires, the decisions we make, and the behaviors that drive us. Everybody Lies examines big data to share insight into economics, ethics, sports, sex, gender, and more. Questions investigated include: Does where you go to school affect how successful you are in life? Do parents secretly favor boy children over girls? Do violent films affect the crime rate?

Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy by Cathy O’Neill

Living in the age of algorithms, decisions about our lives are increasingly made by mathematical models rather than by humans. The hope is that everyone is judged based on the same rules without bias, whether it’s the cost of health insurance or if we qualify for a loan. But in Weapons, O’Neill argues that the opposite is true. She brings to light the ways in which data and algorithms go largely unregulated, and often reinforce discrimination. As she puts it, “welcome to the dark side of big data.”  


For a broader introduction to computer science, check out our list of books about computer science for beginners.

Sign up to receive Check Your Shelf, the Librarian's One-Stop Shop For News, Book Lists, And More.
By signing up you agree to our Terms of Service