51 Free Data Science Books
51 Free Data Science Books

A great collection of free data science books covering a wide range of topics from Data Science, Business Analytics, Data Mining and Big Data to Machine Learning, Algorithms and Data Science Tools.
Data Science Overviews
- An Introduction to Data Science (Jeffrey Stanton, 2013)
- School of Data Handbook (2015)
- Data Jujitsu: The Art of Turning Data into Product (DJ Patil, 2012)
- Art of Data Science (Roger D. Peng & Elizabeth Matsui, 2015)
Data Scientists Interviews
- The Data Science Handbook (Carl Shan, Henry Wang, William Chen, & Max Song, 2015)
- The Data Analytics Handbook (Brian Liou, Tristan Tao, & Declan Shener, 2015)
How To Build Data Science Teams
- Data Driven: Creating a Data Culture (Hilary Mason & DJ Patil, 2015)
- Building Data Science Teams (DJ Patil, 2011)
- Understanding the Chief Data O€fficer (Julie Steele, 2015)
Data Analysis
- The Elements of Data Analytic Style (Jeff Leek, 2015)
Distributed Computing Tools
- Hadoop: The Definitive Guide (Tom White, 2011)
- Data-Intensive Text Processing with MapReduce (Jimmy Lin & Chris Dyer, 2010)
Data Mining and Machine Learning
- Introduction to Machine Learning (Amnon Shashua, 2008)
- Machine Learning (Abdelhamid Mellouk & Abdennacer Chebira)
- Machine Learning – The Complete Guide (Wikipedia)
- Social Media Mining An Introduction (Reza Zafarani, Mohammad Ali Abbasi, & Huan Liu, 2014)
- Data Mining: Practical Machine Learning Tools and Techniques (Ian H. Witten & Eibe Frank, 2005)
- Mining of Massive Datasets (Jure Leskovec, Anand Rajaraman, & Jeff Ullman, 2014)
- A Programmer’s Guide to Data Mining (Ron Zacharski, 2015)
- Data Mining with Rattle and R (Graham Williams, 2011)
- Data Mining and Analysis: Fundamental Concepts and Algorithms (Mohammed J. Zaki & Wagner Meria Jr., 2014)
- Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More (Matthew A. Russell, 2014)
- Probabilistic Programming & Bayesian Methods for Hackers (Cam Davidson-Pilon, 2015)
- Data Mining Techniques For Marketing, Sales, and Customer Relationship Management (Michael J.A. Berry & Gordon S. Linoff, 2004)
- Inductive Logic Programming: Techniques and Applications (Nada Lavrac & Saso Dzeroski, 1994)
- Pattern Recognition and Machine Learning (Christopher M. Bishop, 2006)
- Machine Learning, Neural and Statistical Classification (D. Michie, D.J. Spiegelhalter, & C.C. Taylor, 1999)
- Information Theory, Inference, and Learning Algorithms (David J.C. MacKay, 2005)
- Data Mining and Business Analytics with R (Johannes Ledolter, 2013)
- Bayesian Reasoning and Machine Learning (David Barber, 2014)
- Gaussian Processes for Machine Learning (C. E. Rasmussen & C. K. I. Williams, 2006)
- Reinforcement Learning: An Introduction (Richard S. Sutton & Andrew G. Barto, 2012)
- Algorithms for Reinforcement Learning (Csaba Szepesvari, 2009)
- Big Data, Data Mining, and Machine Learning (Jared Dean, 2014)
- Modeling With Data (Ben Klemens, 2008)
- KB – Neural Data Mining with Python Sources (Roberto Bello, 2013)
- Deep Learning (Yoshua Bengio, Ian J. Goodfellow, & Aaron Courville, 2015)
- Neural Networks and Deep Learning (Michael Nielsen, 2015)
- Data Mining Algorithms In R (Wikibooks, 2014)
- Data Mining and Analysis: Fundamental Concepts and Algorithms (Mohammed J. Zaki & Wagner Meira Jr., 2014)
- Theory and Applications for Advanced Text Mining (Shigeaki Sakurai, 2012)
Statistics and Statistical Learning
- Think Stats: Exploratory Data Analysis in Python (Allen B. Downey, 2014)
- Think Bayes: Bayesian Statistics Made Simple (Allen B. Downey, 2012)
- The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Trevor Hastie, Robert Tibshirani, & Jerome Friedman, 2008)
- An Introduction to Statistical Learning with Applications in R (Gareth James, Daniela Witten, Trevor Hastie, & Robert Tibshirani, 2013)
- A First Course in Design and Analysis of Experiments (Gary W. Oehlert, 2010)
Data Visualization
- D3 Tips and Tricks (Malcolm Maclean, 2015)
- Interactive Data Visualization for the Web (Scott Murray, 2013)
Big Data
- Disruptive Possibilities: How Big Data Changes Everything (Jeffrey Needham, 2013)
- Real-Time Big Data Analytics: Emerging Architecture (Mike Barlow, 2013)
- Big Data Now: 2012 Edition (O’Reilly Media, Inc., 2012)
51 Free Data Science Books的更多相关文章
- Awesome (and Free) Data Science Books[转]
Post Date: September 3, 2014By: Stephanie Miller Marty Rose, Data Scientist in the Acxiom Product an ...
- 【Repost】A Practical Intro to Data Science
Are you a interested in taking a course with us? Learn about our programs or contact us at hello@zip ...
- Competing in a data science contest without reading the data
Competing in a data science contest without reading the data Machine learning competitions have beco ...
- Comprehensive learning path – Data Science in Python深入学习路径-使用python数据中学习
http://blog.csdn.net/pipisorry/article/details/44245575 关于怎么学习python,并将python用于数据科学.数据分析.机器学习中的一篇非常好 ...
- R8:Learning paths for Data Science[continuous updating…]
Comprehensive learning path – Data Science in Python Journey from a Python noob to a Kaggler on Pyth ...
- 15 Most Read Data Science Articles in 2015. So far …
15 Most Read Data Science Articles in 2015. So far … We've compiled the latest set of "most rea ...
- 11 Facts about Data Science that you must know
11 Facts about Data Science that you must know Statistics, Machine Learning, Data Science, or Analyt ...
- 40 Questions to test your skill in Python for Data Science
Comes from: https://www.analyticsvidhya.com/blog/2017/05/questions-python-for-data-science/ Python i ...
- 【转】The most comprehensive Data Science learning plan for 2017
I joined Analytics Vidhya as an intern last summer. I had no clue what was in store for me. I had be ...
随机推荐
- 车牌识别算法库EasyPR的使用
主要参考以下两个博客: http://blog.csdn.net/junmuzi/article/details/49888123 http://blog.csdn.net/Lucas66666/ar ...
- mysql 数字类型的长度区别
mysql整型bigint.int.mediumint.smallint 和 tinyint的语法介绍,如下: 1.bigint 从 -2^63 (-9223372036854775808) 到 2^ ...
- Windows下 OpenSSL的安装与简单使用
1. openssl的最新版本 最新版本是 openssl1.1.1 官方地址 https://www.openssl.org/source/ TLS1.3的协议(RFC8446)在2018.8.12 ...
- C++的继承与多态
◆ 概念介绍 继承:为了代码的重用,保留基类的原本结构,并新增派生类的部分,同时可能覆盖(overide)基类的某些成员. 多态:一种将不同的特殊行为和单个泛化记号相关联的能力,分为静态多态和动态多态 ...
- 反爬虫之JS反编译:PyExecJS
PyExecJS是python运行JavaScript代码的包 安装: pip install PyExecJS (或者easy_install PyExecJS) 依赖: 需要有执行JS的程序, ...
- 一文总结之MyBatis
目录 MyBatis 目标 MyBatis演示 Configuration.xml 映射文件 初始化配置文件 Dao Spring与MyBatis集成 pom Spring配置文件 MyBatis配置 ...
- SPOJ_NSUBSTR
题目意思是给你一个字符串,f[x]是长度为x的子串中,出现个数最多的那个串的出现次数. 给出原串,依次输出f[1],f[2],……. 后缀自动机.对于某一个状态,right[]值的大小就是出现的次数, ...
- Python学习---基础篇
###打开文件并打印: #!/usr/bin/python3 f = open('F://myproject/test.txt', encoding='utf-8',mode='r') content ...
- C++模式学习------适配器模式
适配器模式: 适配器模式属于结构型的设计模式,是将一个类的接口转换成使用方希望的另外一个接口,这样使得原本由于接口不兼容而不能一起工作的那些类可以一起工作. 适配器模式有两种: 1.类的适配器:继承不 ...
- Life Forms POJ - 3294(不小于k个字符串中的最长子串)
题意: 求不小于字符串一半长度个字符串中的最长字串 解析: 论文题例11 将n个字符串连起来,中间用不相同的且没有出现在字符串中的字符隔开, 求后缀数组, 然后二分答案变为判定性问题, 然后判断每组的 ...