RankLib参数翻译

写在前面，metric2t指标详解：

NDCG（Normalized discounted cumulative gain）即DCG/IDCG
CG（cumulative gain）
DCG（Discounted Cumulative Gain）
MAP(Mean Average Precision)
MRR(Mean Reciprocal Rank)

Usage: java -jar RankLib.jar <Params>
Params:
[+] Training (+ tuning and evaluation)
-train <file> 训练数据
-ranker <type> 指定要使用的排名算法
0: MART (gradient boosted regression tree)
1: RankNet
2: RankBoost
3: AdaRank
4: Coordinate Ascent
6: LambdaMART
7: ListNet
8: Random Forests
[ -feature <file> ] 特征描述文件：列出要学习的特征，如果不指定，默认使用所有特征。
[ -metric2t <metric> ] Metric to optimize on the training data. Supported: MAP, NDCG@k, DCG@k, P@k, RR@k, ERR@k (default=ERR@10)
[ -gmax <label> ] Highest judged relevance label. It affects the calculation of ERR (default=4, i.e. 5-point scale {0,1,2,3,4})
[ -silent ] Do not print progress messages (which are printed by default)
[ -validate <file> ] 是否在验证数据集上调整模型 Specify if you want to tune your system on the validation data (default=unspecified). If specified, the final model will be the one that performs best on the validation data
[ -tvs <x \in [0..1]> ] 训练-验证数据集的分割比例 Set train-validation split to be (x)(1.0-x)
[ -save <model> ] 学习的模型保存到指定文件 Save the learned model to the specified file (default=not-save)
[ -test <file> ] 是否要在数据上测试训练的模型 Specify if you want to evaluate the trained model on this data (default=unspecified)
[ -tts <x \in [0..1]> ] 训练-测试数据集的分割比例 Set train-test split to be (x)(1.0-x). -tts will override -tvs
[ -metric2T <metric> ] 默认与metric2t一样 Metric to evaluate on the test data (default to the same as specified for -metric2t)
[ -norm <method>] 归一化特征向量，方法包括求和归一化，均值/标准差归一化，最大值/最小值归一化 Normalize feature vectors (default=no-normalization). Method can be:
sum: normalize each feature by the sum of all its values
zscore: normalize each feature by its mean/standard deviation
linear: normalize each feature by its min/max values
[ -kcv <k> ] 在训练数据集上执行交叉验证 Specify if you want to perform k-fold cross validation using ONLY the specified training data (default=NoCV)
-tvs can be used to further reserve a portion of the training data in each fold for validation
[ -kcvmd <dir> ] 交叉验证训练模型的目录 Directory for models trained via cross-validation (default=not-save)
[ -kcvmn <model> ] Name for model learned in each fold. It will be prefix-ed with the fold-number (default=empty)
[-] RankNet-specific parameters RankNet特定参数
[ -epoch <T> ] 训练迭代次数 The number of epochs to train (default=100)
[ -layer <layer> ] 隐含层个数 The number of hidden layers (default=1)
[ -node <node> ] 每层隐含节点个数 The number of hidden nodes per layer (default=10)
[ -lr <rate> ] 学习率 Learning rate (default=0.00005)
[-] RankBoost-specific parameters RankBoost特定参数
[ -round <T> ] 训练迭代次数 The number of rounds to train (default=300)
[ -tc <k> ] 搜索的阈值候选个数 Number of threshold candidates to search. -1 to use all feature values (default=10)
[-] AdaRank-specific parameters AdaRank特定参数
[ -round <T> ] 训练迭代次数 The number of rounds to train (default=500)
[ -noeq ] Train without enqueuing too-strong features (default=unspecified)
[ -tolerance <t> ] 连续两轮学习之间的误差 Tolerance between two consecutive rounds of learning (default=0.002)
[ -max <times> ] 一个特征可以被连续选择而不改变性能的最大次数 The maximum number of times can a feature be consecutively selected without changing performance (default=5)
[-] Coordinate Ascent-specific parameters
[ -r <k> ] The number of random restarts (default=5)
[ -i <iteration> ] The number of iterations to search in each dimension (default=25)
[ -tolerance <t> ] Performance tolerance between two solutions (default=0.001)
[ -reg <slack> ] Regularization parameter (default=no-regularization)
[-] {MART, LambdaMART}-specific parameters LanbdaMART特定参数
[ -tree <t> ] 树的个数 Number of trees (default=1000)
[ -leaf <l> ] 每个树的叶子个数 Number of leaves for each tree (default=10)
[ -shrinkage <factor> ] 学习率 Shrinkage, or learning rate (default=0.1)
[ -tc <k> ] 树分割时的候选特征个数 Number of threshold candidates for tree spliting. -1 to use all feature values (default=256)
[ -mls <n> ] 一个叶子最少的样本个数 Min leaf support -- minimum #samples each leaf has to contain (default=1)
[ -estop <e> ] Stop early when no improvement is observed on validaton data in e consecutive rounds (default=100)
[-] Random Forests-specific parameters 随机森林特定参数
[ -bag <r> ] Number of bags (default=300)
[ -srate <r> ] Sub-sampling rate (default=1.0)子集采样率
[ -frate <r> ] Feature sampling rate (default=0.3) 特征采样率
[ -rtype <type> ] Ranker to bag (default=0, i.e. MART)
[ -tree <t> ] Number of trees in each bag (default=1) 树个数
[ -leaf <l> ] Number of leaves for each tree (default=100) 每个树的叶节点个数
[ -shrinkage <factor> ] Shrinkage, or learning rate (default=0.1) 学习率
[ -tc <k> ] 树分割时使用的候选特征阈值个数 Number of threshold candidates for tree spliting. -1 to use all feature values (default=256)
[ -mls <n> ] Min leaf support -- minimum #samples each leaf has to contain (default=1)
[ -estop <e> ] Stop early when no improvement is observed on validaton data in e consecutive rounds (default=100)
[+] Testing previously saved models 测试已经保存的模型
-load <model> 加载模型 The model to load
-test <file> 测试数据Test data to evaluate the model (specify either this or -rank but not both)
-rank <file> 对制定文件中的样本排序，与-test不能同时指定 Rank the samples in the specified file (specify either this or -test but not both)
[ -metric2T <metric> ] Metric to evaluate on the test data (default=ERR@10)
[ -gmax <label> ] Highest judged relevance label. It affects the calculation of ERR (default=4, i.e. 5-point scale {0,1,2,3,4})
[ -score <file> ] Store ranker's score for each object being ranked (has to be used with -rank)
[ -idv ] 打印单个排名列表上的性能（必须与-test一起使用） Print model performance (in test metric) on individual ranked lists (has to be used with -test)
[ -norm ] 特征归一化 Normalize feature vectors (similar to -norm for training/tuning)

RankLib参数翻译的更多相关文章

JVM 参数翻译汉化解释
博客搬家,新地址:http://www.zicheng.net/article/38.htm Behavioral Options(行为参数) Option and Default Value Des ...
hbase-default.xml（Hbase 默认参数翻译）
hbase.tmp.dir ${java.io.tmpdir}/hbase-${user.name} 本地文件系统上的临时目录.将'/tmp'改为其他可以持久保存文件的位置,通常能够解决java. ...
以太坊客户端Geth命令用法-参数详解
Geth在以太坊智能合约开发中最常用的工具(必备开发工具),一个多用途的命令行工具. 熟悉Geth可以让我们有更好的效率,大家可收藏起来作为Geth命令用法手册. 本文主要是对geth help的翻译 ...
以太坊客户端Geth命令用法-参数详解【转载】
原文链接:http://www.cnblogs.com/tinyxiong/p/7918706.html Geth在以太坊智能合约开发中最常用的工具(必备开发工具),一个多用途的命令行工具.熟悉Get ...
Spark配置参数的三种方式
1.Spark 属性Spark应用程序的运行是通过外部参数来控制的,参数的设置正确与否,好与坏会直接影响应用程序的性能,也就影响我们整个集群的性能.参数控制有以下方式:(1)直接设置在SparkCon ...
nginx 常见参数以及重定向参数配置
nginx 各参数翻译,作用 $arg_PARAMETER #这个变量包含GET请求中,如果有变量PARAMETER时的值. $args #这个变量等于请求行中(GET请求)的参数,例如foo=123 ...
PHP的CURL方法curl_setopt()函数案例介绍(抓取网页,POST数据)
通过curl_setopt()函数可以方便快捷的抓取网页(采集很方便),curl_setopt 是php的一个扩展库使用条件:需要在php.ini 中配置开启.(PHP 4 >= 4.0.2) ...
nginx常用命令
ps -ef | grep nginx在进程列表里面找master进程,它的编号就是主进程号了. 步骤2:发送信号从容停止Nginx: kill -QUIT 主进程号快速停止Nginx: kill ...
Linux磁盘与文件系统概念理解
磁盘级别概念这里讲的主要是网上所谓的老式磁盘,它是由一个个盘片组成的,我们先从个盘片结构讲起.如图1所示,图中的一圈圈灰色同心圆为一条条磁道,从圆心向外画直线,可以将磁道划分为若干个弧段,每 ...

随机推荐

ASP.NET 页面间传递参数的方法
http://www.cnblogs.com/eoiioe/archive/2008/04/08/1142247.html
AngularJS select中ngOptions用法详解
AngularJS select中ngOptions用法详解一.用法 ngOption针对不同类型的数据源有不同的用法,主要体现在数组和对象上. 数组: label for value in a ...
浅谈android binder机制
binder机制是谷歌优化在android上更适合终端的IPC(多进程通信方式),满足系统对通信方式,传输性能和安全性的要求. 特性: 1. 用驱动程序来推进进程间的通信.2. 通过共享内存来提高性 ...
高德地图JavaScript开发
项目需求:标注一个或者两个点.显示信息窗体.自定义icon <!DOCTYPE html> <html lang="en"> <head> &l ...
解决xftp远程连接中文乱码
更改乱码服务器的属性,使用UTF-8编码
Oracle 数据库特殊查询总结
1. 查询本节点及本节点以下的所有节点: select * from table1 c start with c.p_id='0000000' connect by prior c.id=c.p_id ...
c# 获取本周、本月、本季度、本年的开始时间或结束时间
#region 获取本周.本月.本季度.本年的开始时间或结束时间 /// <summary> /// 获取结束时间 /// </summary> /// <param ...
Leetcode 详解（Substing without repeats character）
Given a string, find the length of the longest substring without repeating characters. Examples: Giv ...
iis里面浏览网页，提示找不到应用程序的解决办法
iis配置成功,数据库链接正确,代码无误,在iis里面,浏览某网页,提示找不到应用程序,这时一下子懵了. 处理办法:在浏览器中直接输入网址,例如:http://192.168.1.111,这时能够打开 ...
UIScrollView内容缩放
•有些时候,我们可能要对某些内容进行手势缩放,如下图所示 UIScrollView不仅能滚动显示大量内容,还能对其内容进行缩放处理也就是说,要完成缩放功能的话,只需要将需要缩放的内容添加到UIScr ...

RankLib参数翻译

RankLib参数翻译的更多相关文章

随机推荐

热门专题