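Original text: As far as I know, too many software engineers trying to move into data science blindly use machine learning frameworks such as TensorFlow or Apache Spark to process data, without fully understanding the statistical theory behind those frameworks. This brings us to statistical learning, the theoretical framework for machine learning, which grew out of statistics and functional analysis. Three recommended books: Intro to Statistical Learning (Hastie, Tibshirani, Witten, James)…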
Spark is a compelling multi-purpose platform for use cases that span investigative, as well as operational, analytics. Data science is a broad church. I am a data scientist — or so I’ve been told — but what I do is actually quite different from what…
Seven Python Tools All Data Scientists Should Know How to Use If you’re an aspiring data scientist, you’re inquisitive – always exploring, learning, and asking questions. Online tutorials and videos can help you prepare for your first role, but t…
8 Productivity hacks for Data Scientists & Business Analysts Introduction I was catching up with one of my friends from a past organization. She had always been interested in data science, but was only able to break into it about 10 months ago. She h…
Software development skills for data scientists Data scientists often come from diverse backgrounds and frequently don't have much, if any, in the way of formal training in computer science or software development. That being said, most data scientis…
Classification ============== #1. C4.5 Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc. Google Scholar Count in October 2006: 6907 #2. CART L. Breiman, J. Friedman, R. Olshen, and C. Stone. Classification and Reg…
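http://www.cocoachina.com/ios/20150902/13304.html Core Data is a good choice for storing and querying data in iOS and OS X applications. It not only reduces memory usage and improves performance, but also saves you from writing a lot of unnecessary boilerplate code. In addition, the Core Data API is very flexible and can be used in a wide range of applications, each with different data storage needs. However, this flexibility means that Core Data can sometimes be a little difficult to use. Even if you are a Core Data expert, you still have to handle some routine tasks, and will also…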
#!/usr/bin/env bash
PACKAGE_NAME=com.your.package
DB_NAME=data.db
rm -rf ${DB_NAME}
adb shell "run-as ${PACKAGE_NAME} chmod 666 /data/data/${PACKAGE_NAME}/databases/${DB_NAME}"
adb pull /data/data/${PACKAGE_NAME}/databases/${DB_NAME} /tmp/
adb s…
http://www.analyticsvidhya.com/blog/2015/07/github-special-data-scientists-to-follow-best-tutorials/ Covers deep learning frameworks and open-source code from a range of leading experts, including caffe, tensorflow, torch, theano, rcnn frameworks, and so on…
2017 will see a host of informed predictions, lower costs, and even business-centric gains, courtesy of the global adoption of Big Data and associated technologies. 2017 is already upon us, and Big Data seems to be growing in leaps and bounds. Be it…