Annotated libsvm easy.py (the pipeline script)

Because this script is so important, it is worth annotating in full so that libsvm can be used flexibly.

#!/usr/bin/env python

# Locating the interpreter through env (the shebang above) is more portable than hard-coding its path

import sys

import os

from subprocess import *

# With too few arguments, print the usage message and exit

if len(sys.argv) <= 1:

    print('Usage: {0} training_file [testing_file]'.format(sys.argv[0]))

    raise SystemExit

# svm, grid, and gnuplot executable files

is_win32 = (sys.platform == 'win32')

if not is_win32:

    # Executable paths on Linux

    svmscale_exe = "../svm-scale"

    svmtrain_exe = "../svm-train"

    svmpredict_exe = "../svm-predict"

    grid_py = "./grid.py"

    gnuplot_exe = "/usr/bin/gnuplot"   # Adjust this path as needed; it must point to the gnuplot executable itself, not to its directory

else:

    # Executable paths on Windows

    svmscale_exe = r"..\windows\svm-scale.exe"

    svmtrain_exe = r"..\windows\svm-train.exe"

    svmpredict_exe = r"..\windows\svm-predict.exe"

    gnuplot_exe = r"C:\gnuplot\bin\gnuplot.exe"

    grid_py = r".\grid.py"

assert os.path.exists(svmscale_exe),"svm-scale executable not found"

assert os.path.exists(svmtrain_exe),"svm-train executable not found"

assert os.path.exists(svmpredict_exe),"svm-predict executable not found"

assert os.path.exists(gnuplot_exe),"gnuplot executable not found"

assert os.path.exists(grid_py),"grid.py not found"

# Derive the names of the three files generated for the training set: ".scale", ".model", and ".range"

train_pathname = sys.argv[1]

assert os.path.exists(train_pathname),"training file not found"

file_name = os.path.split(train_pathname)[1]

scaled_file = file_name + ".scale"

model_file = file_name + ".model"

range_file = file_name + ".range"

# Derive the names of the two files generated for the test set: ".scale" and ".predict"

if len(sys.argv) > 2:

    test_pathname = sys.argv[2]

    file_name = os.path.split(test_pathname)[1]

    assert os.path.exists(test_pathname),"testing file not found"

    scaled_test_file = file_name + ".scale"

    predict_test_file = file_name + ".predict"

# Pipeline step 1: scale the training set with svm-scale; -s saves the scaling parameters to the range file

cmd = '{0} -s "{1}" "{2}" > "{3}"'.format(svmscale_exe, range_file, train_pathname, scaled_file)

print('Scaling training data...')

Popen(cmd, shell = True, stdout = PIPE).communicate()    

# Pipeline step 2: parameter selection; grid.py runs cross-validation, and its last output line holds the best c, g, and rate

cmd = '{0} -svmtrain "{1}" -gnuplot "{2}" "{3}"'.format(grid_py, svmtrain_exe, gnuplot_exe, scaled_file)

print('Cross validation...')

f = Popen(cmd, shell = True, stdout = PIPE).stdout

line = ''

while True:

    last_line = line

    line = f.readline()

    if not line: break

c,g,rate = map(float,last_line.split())

# Print the best parameters c and g together with the cross-validation rate

print('Best c={0}, g={1} CV rate={2}'.format(c,g,rate))

# Pipeline step 3: train with svm-train using the best c and g

cmd = '{0} -c {1} -g {2} "{3}" "{4}"'.format(svmtrain_exe,c,g,scaled_file,model_file)

print('Training...')

Popen(cmd, shell = True, stdout = PIPE).communicate()

print('Output model: {0}'.format(model_file))

if len(sys.argv) > 2:

    # Pipeline step 4: scale the test set with svm-scale; -r restores the scaling parameters saved from the training set

    cmd = '{0} -r "{1}" "{2}" > "{3}"'.format(svmscale_exe, range_file, test_pathname, scaled_test_file)

    print('Scaling testing data...')

    Popen(cmd, shell = True, stdout = PIPE).communicate()    

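    # Pipeline step 5: predict with svm-predict; the arguments are the scaled test file, the model file, and the prediction output file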

    cmd = '{0} "{1}" "{2}" "{3}"'.format(svmpredict_exe, scaled_test_file, model_file, predict_test_file)

    print('Testing...')

    Popen(cmd, shell = True).communicate()    

    print('Output prediction: {0}'.format(predict_test_file))
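
The cross-validation step in the script keeps only the last line printed by grid.py and splits it into c, g, and rate. If grid.py prints nothing (for example, because gnuplot or svm-train could not be started), that unpacking fails. The following is a minimal sketch of the same parsing with explicit checks. The helper name parse_best_params and the sample text are hypothetical and not part of libsvm; the only thing taken from the script above is that the last line contains three numbers.

```python
def parse_best_params(output_text):
    """Return (c, g, rate) from the last non-empty line of grid.py-style output.

    Hypothetical helper for illustration; it only assumes that the last
    non-empty line holds three whitespace-separated numbers.
    """
    # Drop blank lines so a trailing newline does not hide the result line.
    lines = [ln for ln in output_text.splitlines() if ln.strip()]
    if not lines:
        raise ValueError("no output to parse; did the grid search run?")
    fields = lines[-1].split()
    if len(fields) != 3:
        raise ValueError(
            "expected 'c g rate' on the last line, got: {0!r}".format(lines[-1])
        )
    c, g, rate = map(float, fields)
    return c, g, rate


# Made-up sample: one progress line followed by the final result line.
sample = "progress line (illustrative)\n32.0 0.0078125 95.0\n"
c, g, rate = parse_best_params(sample)
print('Best c={0}, g={1} CV rate={2}'.format(c, g, rate))
```

To use it with the script above, one could collect the full output with Popen(...).communicate()[0], decode it to text, and pass the result to this helper in place of the readline loop.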
