#调整随机森林的参数(调整n_estimators随机森林中树的数量默认10个树,精度递增显著)

from sklearn import datasets
X, y = datasets.make_classification(n_samples=10000,n_features=20,n_informative=15,flip_y=.5, weights=[.2, .8]) import numpy as np
training = np.random.choice([True, False], p=[.8, .2],size=y.shape) from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix n_estimator_params = range(1, 100,5)
confusion_matrixes = {}
for n_estimator in n_estimator_params:
rf = RandomForestClassifier(n_estimators=n_estimator,n_jobs=-1, verbose=True)
rf.fit(X[training], y[training])
print ("Accuracy:\t", (rf.predict(X[~training]) == y[~training]).mean()) '''
======================== RESTART: E:/python/pp138.py ========================
[Parallel(n_jobs=-1)]: Done 1 out of 1 | elapsed: 0.0s finished
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s finished
Accuracy: 0.590083456063
[Parallel(n_jobs=-1)]: Done 6 out of 6 | elapsed: 0.1s finished
[Parallel(n_jobs=2)]: Done 6 out of 6 | elapsed: 0.0s finished
Accuracy: 0.618065783014
[Parallel(n_jobs=-1)]: Done 11 out of 11 | elapsed: 0.3s finished
[Parallel(n_jobs=2)]: Done 11 out of 11 | elapsed: 0.0s finished
Accuracy: 0.682866961217
[Parallel(n_jobs=-1)]: Done 16 out of 16 | elapsed: 0.5s finished
[Parallel(n_jobs=2)]: Done 16 out of 16 | elapsed: 0.0s finished
Accuracy: 0.692194403535
[Parallel(n_jobs=-1)]: Done 21 out of 21 | elapsed: 0.6s finished
[Parallel(n_jobs=2)]: Done 21 out of 21 | elapsed: 0.0s finished
Accuracy: 0.702012763868
[Parallel(n_jobs=-1)]: Done 26 out of 26 | elapsed: 0.9s finished
[Parallel(n_jobs=2)]: Done 26 out of 26 | elapsed: 0.0s finished
Accuracy: 0.697594501718
[Parallel(n_jobs=-1)]: Done 31 out of 31 | elapsed: 1.0s finished
[Parallel(n_jobs=2)]: Done 31 out of 31 | elapsed: 0.0s finished
Accuracy: 0.710358370152
[Parallel(n_jobs=-1)]: Done 36 out of 36 | elapsed: 1.1s finished
[Parallel(n_jobs=2)]: Done 36 out of 36 | elapsed: 0.0s finished
Accuracy: 0.704958271969
[Parallel(n_jobs=-1)]: Done 41 out of 41 | elapsed: 1.3s finished
[Parallel(n_jobs=2)]: Done 41 out of 41 | elapsed: 0.0s finished
Accuracy: 0.707412862052
[Parallel(n_jobs=-1)]: Done 46 out of 46 | elapsed: 1.5s finished
[Parallel(n_jobs=2)]: Done 46 out of 46 | elapsed: 0.0s finished
Accuracy: 0.716740304369
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.6s
[Parallel(n_jobs=-1)]: Done 51 out of 51 | elapsed: 1.8s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 51 out of 51 | elapsed: 0.0s finished
Accuracy: 0.713303878252
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 56 out of 56 | elapsed: 1.8s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 56 out of 56 | elapsed: 0.0s finished
Accuracy: 0.713303878252
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 61 out of 61 | elapsed: 2.0s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 61 out of 61 | elapsed: 0.0s finished
Accuracy: 0.717231222386
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 66 out of 66 | elapsed: 2.3s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 66 out of 66 | elapsed: 0.0s finished
Accuracy: 0.711340206186
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.6s
[Parallel(n_jobs=-1)]: Done 71 out of 71 | elapsed: 2.5s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 71 out of 71 | elapsed: 0.0s finished
Accuracy: 0.720667648503
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 76 out of 76 | elapsed: 2.4s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 76 out of 76 | elapsed: 0.0s finished
Accuracy: 0.721649484536
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.7s
[Parallel(n_jobs=-1)]: Done 81 out of 81 | elapsed: 3.0s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 81 out of 81 | elapsed: 0.0s finished
Accuracy: 0.721649484536
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 86 out of 86 | elapsed: 2.8s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 86 out of 86 | elapsed: 0.0s finished
Accuracy: 0.716740304369
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.5s
[Parallel(n_jobs=-1)]: Done 91 out of 91 | elapsed: 3.1s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 91 out of 91 | elapsed: 0.0s finished
Accuracy: 0.72410407462
[Parallel(n_jobs=-1)]: Done 46 tasks | elapsed: 1.4s
[Parallel(n_jobs=-1)]: Done 96 out of 96 | elapsed: 3.1s finished
[Parallel(n_jobs=2)]: Done 46 tasks | elapsed: 0.0s
[Parallel(n_jobs=2)]: Done 96 out of 96 | elapsed: 0.0s finished
Accuracy: 0.718213058419
'''

#调整随机森林的参数(调整n_estimators随机森林中树的数量默认10个树,精度递增显著,但并不是越多越好),加上verbose=True,显示进程使用信息的更多相关文章

  1. #调整随机森林的参数(调整max_features,结果未见明显差异)

    #调整随机森林的参数(调整max_features,结果未见明显差异) from sklearn import datasets X, y = datasets.make_classification ...

  2. Linux 查找指定名称的进程并显示进程详细信息

    实际应用中可能有这样的场景:给定一个进程名称特征串,查找所有匹配该进程名称的进程的详细信息. 解决的办法是: (1) 先用pgrep [str] 命令进行模糊匹配,找到匹配该特征串的进程ID: (2) ...

  3. Sysctl命令及linux内核参数调整

        一.Sysctl命令用来配置与显示在/proc/sys目录中的内核参数.如果想使参数长期保存,可以通过编辑/etc/sysctl.conf文件来实现.    命令格式:  sysctl [-n ...

  4. sklearn中随机森林的参数

    一:sklearn中决策树的参数: 1,criterion: ”gini” or “entropy”(default=”gini”)是计算属性的gini(基尼不纯度)还是entropy(信息增益),来 ...

  5. XGBoost中参数调整的完整指南(包含Python中的代码)

    (搬运)XGBoost中参数调整的完整指南(包含Python中的代码) AARSHAY JAIN, 2016年3月1日     介绍 如果事情不适合预测建模,请使用XGboost.XGBoost算法已 ...

  6. TensorFlow实现超参数调整

    TensorFlow实现超参数调整 正如你目前所看到的,神经网络的性能非常依赖超参数.因此,了解这些参数如何影响网络变得至关重要. 常见的超参数是学习率.正则化器.正则化系数.隐藏层的维数.初始权重值 ...

  7. Galera集群server.cnf参数调整--Innodb存储引擎内存相关参数(一)

    在innodb引擎中,内存的组成主要有三部分:缓冲池(buffer pool),重做日志缓存(redo log buffer),额外的内存池(additional memory pool).

  8. paip.提升性能----jvm参数调整.txt

    paip.提升性能----jvm参数调整.txt 作者Attilax  艾龙,  EMAIL:1466519819@qq.com 来源:attilax的专栏 地址:http://blog.csdn.n ...

  9. Storm集群参数调整

    Supervisor 参数调整 修改${STORM_HOME}conf/storm.yaml文件内容 supervisor变更参数 slots 配置: 若storm host仅仅执行superviso ...

随机推荐

  1. SpringBoot- springboot集成Redis出现报错:No qualifying bean of type 'org.springframework.data.redis.connection.RedisConnectionFactory'

    Springboot将accessToke写入Redisk 缓存,springboot集成Redis出现报错 No qualifying bean of type 'org.springframewo ...

  2. Codeforces 870E Points, Lines and Ready-made Titles:并查集【两个属性二选一】

    题目链接:http://codeforces.com/problemset/problem/870/E 题意: 给出平面坐标系上的n个点. 对于每个点,你可以画一条经过这个点的横线或竖线或什么都不画. ...

  3. 解决:xampp中Apache, MySql, Filezilla端口占用问题

    读 静觅 的python博客,学到requests库的用法,看到“”如果想请求JSON文件,可以利用 json() 方法解析 ”这里迷惑了: 我写了a.json文件放在本地,却不能被python调用, ...

  4. [Gym-101512C] 凸包+最远点对

    找最大的四边形或者三角形面积,先求凸包,然后枚举两个点,再通过旋转,找最大的另两个点 #include<bits/stdc++.h> #define fi first #define se ...

  5. release和retain还有多少人在用

    这年头,看到做IOS开发的专业公司,居然还在用release和retain这样的内存管理方式,是为了表示他很厉害吗? 我只能说他没有跟上时代的潮流,或者就是参加了某种老掉牙的IOS速成培训再入行的. ...

  6. SQL Compare

    http://pan.baidu.com/s/1jGIEEm6 来源:数据库同步工具: Red Gate Sql Compare

  7. crontab 总结

    crontab -e --------- linux定时任务提示-bash: crontab: command not found 执行 crontab 命令如果报 command not found ...

  8. Ajax与后台的交互

    Ajax Java 交互 jsp代码 <%@ page language="java" import="java.util.*" pageEncoding ...

  9. k-means算法的优缺点以及改进

    大家接触的第一个聚类方法,十有八九都是K-means聚类啦.该算法十分容易理解,也很容易实现.其实几乎所有的机器学习和数据挖掘算法都有其优点和缺点.那么K-means的缺点是什么呢? 总结为下: (1 ...

  10. hihocoder #1138 : Islands Travel

    题意,求1到n的最短路.不难想到单源最短路,难点在于数量级太大,因此如何建图是关键: 因为cost = min{|Xi-Xj|, |Yi-Yj|}:所以,点i的移动只有两种情况,. x距离最近的点,. ...