属性与特征: attribute: e.g., 'Mileage' feature: an attribute plus its value, e.g., 'Mileage = 15000' Note that some regression algorithm can be used for classification as well,and vice versa. For example,Logistic Regression is commonly used for classifica…
the main steps: 1. look at the big picture 2. get the data 3. discover and visualize the data to gain insights 4. prepare the data for machine learning algorithms 5. select a model and train it 6. fine-tune your model 7. present your solution 8. laun…
本章介绍了机器学习的一些基本概念,已经应用场景.这部分知识在其它地方也经常看到,不再赘述. 这里只记录一些作者提到的,有趣的知识点. 回归(regression)名字的来源:这是由Francis Galton引入的一个统计学术语,当时他在研究这一现象:个子很高的人,其子女一般会比他们低.由于孩子是变低的,Francis Galton称之为:向平均值的回归(regression to the mean).从此他所使用的这种用于分析变量之间相关性的研究方法,被称作回归. 在机器学习中,一个属性(at…
去年在北京参加了一次由O'Reilly和Cloudera联合举办的大数据会议Strata Data Conference,并有幸获得了O'Reilly出版的Hands-On Machine Learning with Scikit-Learn and TensorFlow英文书,总体来说,这是一本不错的技术书,很多人也都在推荐这本书.这本书的作者通过具体的例子.很少的理论以及两款成熟的Python框架:Scikit-Learn和TensorFlow,帮助读者掌握构建智能系统所需要的概念和工具.这…
if you aggregate the predictions of a group of predictors,you will often get better predictions than with the best individual predictor. a group of predictors is called an ensemble:this technique is called Ensemble Learning,and an Ensemble Learning a…
SVM is capable of performing linear or nonlinear classification,regression,and even outlier detection. SVMs are particularly well suited for classification of complex but small- or medium-sized datasets. Linear SVM Classification: Soft Margin Classif…