数据集

给出了4个行业的语料，餐馆、酒店、电脑、电视，及其组合数据。

数据格式

任务

根据给定格式的命令，生成自然语言。

方法、模型、策略

作者给出了5种模型，2种训练（优化）策略、2种解码方式

* Model

- (knn) kNN generator:

    k-nearest neighbor example-based generator, based on MR similarty.

- (ngram) Class-based Ngram generator [Oh & Rudnicky, 2000]:

    Class-based language model generator by utterance class partitions.

- (hlstm) Heuristic Gated LSTM [Wen et al, 2015a]:

    An MR-conditioned LSTM generator with heuristic gates.

- (sclstm) Semantically Conditioned LSTM [Wen et al, 2015b]:

    An MR-conditioned LSTM generator with learned gates.

- (encdec) Attentive Encoder-Decoder LSTM [Wen et al, 2015c]:

    An encoder-decoder LSTM with slot-value level attention.

* Training Strategy

- (ml) Maximum Likehood Training, using token cross-entropy

- (dt) Discriminative Training (or Expected BLEU training) [Wen et al, 2016]

* Decoding Strategy

- (beam) Beam search

- (sample) Random sampling

快速开始

需要python2环境，依赖：

* Theano 0.8.2 and accompanying packages such as numpy, scipy ...

* NLTK 3.0.0

创建虚机，Python2

virtualenv env

source env/bin/activate

pip install theano==0.8.2

pip install nltk==3.0.0

训练：python main.py -config config/sclstm.cfg -mode train

测试：python main.py -config config/sclstm.cfg -mode test

配置文件和参数

从上面的训练和测试的命令可以看出，参数在config目录下的文件配置，看看config/sclstm.cfg文件的内容

[learn] // parameters for training

lr          = 0.1 : learning rate of SGD.

lr_decay    = 0.5  : learning rate decay.

lr_divide   = 3 : the maximum number of times when validation gets worse.

                  for early stopping.

beta        = 0.0000001  : regularisation parameter.

random_seed = 5 : random seed.

min_impr    = 1.003 : the relative minimal improvement allowed.

debug       = True : debug flag

llogp       = -100000000 : log prob in the last epoch

[train_mode]

mode        = all : training mode, currently only support 'all'

obj         = ml  : training objective, 'ml' or 'dt'

###################################

* Training Strategy

- (ml) Maximum Likehood Training, using token cross-entropy

- (dt) Discriminative Training (or Expected BLEU training) [Wen et al, 2016]

###################################

gamma       = 5.0  : hyperparameter for DT training

batch       = 1 : batch size

[generator] // structure for generator

type        = sclstm : the model type, [hlstm|sclstm|encdec]

hidden      = 80 : hidden layer size

[data] // data and model file

domain      = restaurant  作者给出4种领域：餐馆、酒店、电脑、电视

train       = data/original/restaurant/train.json

valid       = data/original/restaurant/valid.json

test        = data/original/restaurant/test.json

vocab       = resource/vocab  词典

percentage  = 100 : the percentage of train/valid considered

wvec        = vec/vectors-80.txt  : pretrained word vectors 预训练的词向量，有多个维度

model       = model/sclstm-rest.model  : the produced model path 生成的模型文件名称

[gen] // generation parameters, decode='beam' or 'sample'

topk        = 5  : the N-best list returned

overgen     = 20  : number of over-generation

beamwidth   = 10  : the beam width used to decode utterances

detectpairs = resource/detect.pair  :  the mapping file for calculating the slot error rate 见下文

verbose     = 1  : verbose level of the model, not supported yet

decode      = beam  : decoding strategy, 'beam' or 'sample'

Below are knn/ngram specific parameters:

* [ngram]

- ngram         : the N of ngram

- rho           : number of slots considered to partition the dataset

结果

我在自己机器试了一下



inform(name=fresca;phone='4154472668')

Penalty TSER    ASER    Gen

0.0672  0       0       the phone number for fresca is 4154472668

0.1272  0       0       fresca s phone number is 4154472668

0.1694  0       0       the phone number of fresca is 4154472668

0.1781  0       0       the phone number for the fresca is 4154472668

0.2153  0       0       the phone number to fresca is 4154472668

文件resource/detect.pair

{

   "general" : {

       "address"    : "SLOT_ADDRESS",

       "area"       : "SLOT_AREA",

       "count"      : "SLOT_COUNT",

       "food"       : "SLOT_FOOD",

       "goodformeal": "SLOT_GOODFORMEAL",

       "name"       : "SLOT_NAME",

       "near"       : "SLOT_NEAR",

       "phone"      : "SLOT_PHONE",

       "postcode"	 : "SLOT_POSTCODE",

       "price"	     : "SLOT_PRICE",

       "pricerange" : "SLOT_PRICERANGE",

       "battery"    : "SLOT_BATTERY",

       "batteryrating"  : "SLOT_BATTERYRATING",

       "design"     : "SLOT_DESIGN",

       "dimension"  : "SLOT_DIMENSION",

       "drive"      : "SLOT_DRIVE",

       "driverange" : "SLOT_DRIVERANGE",

       "family"     : "SLOT_FAMILY",

       "memory"     : "SLOT_MEMORY",

       "platform"   : "SLOT_PLATFORM",

       "utility"    : "SLOT_UTILITY",

       "warranty"   : "SLOT_WARRANTY",

       "weight"     : "SLOT_WEIGHT",

       "weightrange": "SLOT_WEIGHTRANGE",

       "hdmiport"   : "SLOT_HDMIPORT",

       "ecorating"  : "SLOT_ECORATING",

       "audio"      : "SLOT_AUDIO",

       "accessories": "SLOT_ACCESSORIES",

       "color"      : "SLOT_COLOR",

       "powerconsumption"  : "SLOT_POWERCONSUMPTION",

       "resolution" : "SLOT_RESOLUTION",

       "screensize" : "SLOT_SCREENSIZE",

       "screensizerange" : "SLOT_SCREENSIZERANGE"

   },

   "binary"  : {

       "kidsallowed":["child","kid","kids","children"],

       "dogsallowed":["dog","dogs","puppy"],

       "hasinternet":["internet","wifi"],

       "acceptscreditcards":["card","cards"],

       "isforbusinesscomputing":["business","nonbusiness","home","personal","general"],

       "hasusbport" :["usb"]

   }

}

总结

将结构化的数据，转为非结构化的文本。整个任务的核心就是这个吧

学习笔记（11）- 文本生成RNNLG的更多相关文章

Spring MVC 学习笔记11 —— 后端返回json格式数据
Spring MVC 学习笔记11 -- 后端返回json格式数据我们常常听说json数据,首先,什么是json数据,总结起来,有以下几点: 1. JSON的全称是"JavaScript ...
《C++ Primer Plus》学习笔记11
<C++ Primer Plus>学习笔记11 第17章输入.输出和文件 <<<<<<<<<<<<<< ...
Spring 源码学习笔记11——Spring事务
Spring 源码学习笔记11--Spring事务 Spring事务是基于Spring Aop的扩展 AOP的知识参见<Spring 源码学习笔记10--Spring AOP> 图片参考了 ...
Ext.Net学习笔记11：Ext.Net GridPanel的用法
Ext.Net学习笔记11:Ext.Net GridPanel的用法 GridPanel是用来显示数据的表格,与ASP.NET中的GridView类似. GridPanel用法直接看代码: < ...
SQL反模式学习笔记11 限定列的有效值
目标:限定列的有效值,将一列的有效字段值约束在一个固定的集合中.类似于数据字典. 反模式:在列定义上指定可选值 1. 对某一列定义一个检查约束项,这个约束不允许往列中插入或者更新任何会导致约束失败的值 ...
golang学习笔记11 golang要用jetbrain的golang这个IDE工具开发才好
golang学习笔记11 golang要用jetbrain的golang这个IDE工具开发才好 jetbrain家的全套ide都很好用,一定要dark背景风格才装B 从File-->s ...
ArcGIS案例学习笔记2_2_等高线生成DEM和三维景观动画
ArcGIS案例学习笔记2_2_等高线生成DEM和三维景观动画计划时间:第二天下午教程:Pdf/405 数据:ch9/ex3 方法: 1. 创建DEM SA工具箱/插值分析/地形转栅格 2. 生成 ...
Python3+Selenium3+webdriver学习笔记11（cookie处理）
#!/usr/bin/env python# -*- coding:utf-8 -*-'''Selenium3+webdriver学习笔记11(cookie处理)'''from selenium im ...
并发编程学习笔记(11)----FutureTask的使用及实现
1. Future的使用 Future模式解决的问题是.在实际的运用场景中,可能某一个任务执行起来非常耗时,如果我们线程一直等着该任务执行完成再去执行其他的代码,就会损耗很大的性能,而Future接口 ...
SpringMVC:学习笔记(11)——依赖注入与@Autowired
SpringMVC:学习笔记(11)——依赖注入与@Autowired 使用@Autowired 从Spring2.5开始,它引入了一种全新的依赖注入方式,即通过@Autowired注解.这个注解允许 ...

随机推荐

mount命令实际操作样例
本篇文章主要介绍了如何在Linux(CentOS 7)命令行模式安装VMware Tools,具有一定的参考价值,感兴趣的小伙伴们可以参考一下. 本例中为在Linux(以CentOS 7为例)安装VM ...
HBuilder笔记
官网: https://uniapp.dcloud.io/quickstart HBuilderX - 高效极客技巧 https://ask.dcloud.net.cn/article/13191 插 ...
使用jps查看JVM进程信息
VM进程状态工具 - 列出目标系统上已检测的HotSpot Java虚拟机进程信息.可直接在装有java运行环境的Windows 或者 Linux机器上使用命令行执行jps命令.一个典型的应用场景,例 ...
PTA的Python练习题（十七）
第4章-19 矩阵运算 a=eval(input()) s=[] count=0 for i in range(a): b=input() s.append([int(i) for i in b.sp ...
第八届极客大挑战 Web-故道白云&Clound的错误
web-故道白云题目: 解题思路: 0x01 首先看到题目说html里有秘密,就看了下源代码如图, 重点在红圈那里,表示输入的变量是id,当然上一行的method=“get”同时说明是get方式获取 ...
修正SQLSERVER package连接
drop table #temp_mt; --WITH XMLNAMESPACES ('www.microsoft.com/SqlServer/Dts' AS DTS) SELECT ROW_NUMB ...
PyQt5打印机
1.打印机操作(打印默认文本里面的内容)from PyQt5 import QtGui,QtWidgets,QtPrintSupportfrom PyQt5.QtWidgets import *imp ...
Spring Boot Shiro 使用教程
Apache Shiro 已经大名鼎鼎,搞 Java 的没有不知道的,这类似于 .Net 中的身份验证 form 认证.跟 .net core 中的认证授权策略基本是一样的.当然都不知道也没有关系,因 ...
MariaDB-Galera部署
Galera Cluster:集成了Galera插件的MySQL集群,是一种新型的,数据不共享的,高度冗余的高可用方案,目前Galera Cluster有两个版本,分别是Percona Xtradb ...
误删/boot下文件或目录的修复方式！
步骤:进入硬盘的急救模式,进入磁盘,挂载光盘到/media上,rpm安装内核到media目录下,从装grub程序到/dev/sda,然后将grub文件从定向到/boot下,然后重启. 第一步:进入bi ...

学习笔记（11）- 文本生成RNNLG

数据集