数据集

给出了4个行业的语料，餐馆、酒店、电脑、电视，及其组合数据。

数据格式

任务

根据给定格式的命令，生成自然语言。

方法、模型、策略

作者给出了5种模型，2种训练（优化）策略、2种解码方式

* Model

- (knn) kNN generator:

    k-nearest neighbor example-based generator, based on MR similarty.

- (ngram) Class-based Ngram generator [Oh & Rudnicky, 2000]:

    Class-based language model generator by utterance class partitions.

- (hlstm) Heuristic Gated LSTM [Wen et al, 2015a]:

    An MR-conditioned LSTM generator with heuristic gates.

- (sclstm) Semantically Conditioned LSTM [Wen et al, 2015b]:

    An MR-conditioned LSTM generator with learned gates.

- (encdec) Attentive Encoder-Decoder LSTM [Wen et al, 2015c]:

    An encoder-decoder LSTM with slot-value level attention.

* Training Strategy

- (ml) Maximum Likehood Training, using token cross-entropy

- (dt) Discriminative Training (or Expected BLEU training) [Wen et al, 2016]

* Decoding Strategy

- (beam) Beam search

- (sample) Random sampling

快速开始

需要python2环境，依赖：

* Theano 0.8.2 and accompanying packages such as numpy, scipy ...

* NLTK 3.0.0

创建虚机，Python2

virtualenv env

source env/bin/activate

pip install theano==0.8.2

pip install nltk==3.0.0

训练：python main.py -config config/sclstm.cfg -mode train

测试：python main.py -config config/sclstm.cfg -mode test

配置文件和参数

从上面的训练和测试的命令可以看出，参数在config目录下的文件配置，看看config/sclstm.cfg文件的内容

[learn] // parameters for training

lr          = 0.1 : learning rate of SGD.

lr_decay    = 0.5  : learning rate decay.

lr_divide   = 3 : the maximum number of times when validation gets worse.

                  for early stopping.

beta        = 0.0000001  : regularisation parameter.

random_seed = 5 : random seed.

min_impr    = 1.003 : the relative minimal improvement allowed.

debug       = True : debug flag

llogp       = -100000000 : log prob in the last epoch

[train_mode]

mode        = all : training mode, currently only support 'all'

obj         = ml  : training objective, 'ml' or 'dt'

###################################

* Training Strategy

- (ml) Maximum Likehood Training, using token cross-entropy

- (dt) Discriminative Training (or Expected BLEU training) [Wen et al, 2016]

###################################

gamma       = 5.0  : hyperparameter for DT training

batch       = 1 : batch size

[generator] // structure for generator

type        = sclstm : the model type, [hlstm|sclstm|encdec]

hidden      = 80 : hidden layer size

[data] // data and model file

domain      = restaurant  作者给出4种领域：餐馆、酒店、电脑、电视

train       = data/original/restaurant/train.json

valid       = data/original/restaurant/valid.json

test        = data/original/restaurant/test.json

vocab       = resource/vocab  词典

percentage  = 100 : the percentage of train/valid considered

wvec        = vec/vectors-80.txt  : pretrained word vectors 预训练的词向量，有多个维度

model       = model/sclstm-rest.model  : the produced model path 生成的模型文件名称

[gen] // generation parameters, decode='beam' or 'sample'

topk        = 5  : the N-best list returned

overgen     = 20  : number of over-generation

beamwidth   = 10  : the beam width used to decode utterances

detectpairs = resource/detect.pair  :  the mapping file for calculating the slot error rate 见下文

verbose     = 1  : verbose level of the model, not supported yet

decode      = beam  : decoding strategy, 'beam' or 'sample'

Below are knn/ngram specific parameters:

* [ngram]

- ngram         : the N of ngram

- rho           : number of slots considered to partition the dataset

结果

我在自己机器试了一下



inform(name=fresca;phone='4154472668')

Penalty TSER    ASER    Gen

0.0672  0       0       the phone number for fresca is 4154472668

0.1272  0       0       fresca s phone number is 4154472668

0.1694  0       0       the phone number of fresca is 4154472668

0.1781  0       0       the phone number for the fresca is 4154472668

0.2153  0       0       the phone number to fresca is 4154472668

文件resource/detect.pair

{

   "general" : {

       "address"    : "SLOT_ADDRESS",

       "area"       : "SLOT_AREA",

       "count"      : "SLOT_COUNT",

       "food"       : "SLOT_FOOD",

       "goodformeal": "SLOT_GOODFORMEAL",

       "name"       : "SLOT_NAME",

       "near"       : "SLOT_NEAR",

       "phone"      : "SLOT_PHONE",

       "postcode"	 : "SLOT_POSTCODE",

       "price"	     : "SLOT_PRICE",

       "pricerange" : "SLOT_PRICERANGE",

       "battery"    : "SLOT_BATTERY",

       "batteryrating"  : "SLOT_BATTERYRATING",

       "design"     : "SLOT_DESIGN",

       "dimension"  : "SLOT_DIMENSION",

       "drive"      : "SLOT_DRIVE",

       "driverange" : "SLOT_DRIVERANGE",

       "family"     : "SLOT_FAMILY",

       "memory"     : "SLOT_MEMORY",

       "platform"   : "SLOT_PLATFORM",

       "utility"    : "SLOT_UTILITY",

       "warranty"   : "SLOT_WARRANTY",

       "weight"     : "SLOT_WEIGHT",

       "weightrange": "SLOT_WEIGHTRANGE",

       "hdmiport"   : "SLOT_HDMIPORT",

       "ecorating"  : "SLOT_ECORATING",

       "audio"      : "SLOT_AUDIO",

       "accessories": "SLOT_ACCESSORIES",

       "color"      : "SLOT_COLOR",

       "powerconsumption"  : "SLOT_POWERCONSUMPTION",

       "resolution" : "SLOT_RESOLUTION",

       "screensize" : "SLOT_SCREENSIZE",

       "screensizerange" : "SLOT_SCREENSIZERANGE"

   },

   "binary"  : {

       "kidsallowed":["child","kid","kids","children"],

       "dogsallowed":["dog","dogs","puppy"],

       "hasinternet":["internet","wifi"],

       "acceptscreditcards":["card","cards"],

       "isforbusinesscomputing":["business","nonbusiness","home","personal","general"],

       "hasusbport" :["usb"]

   }

}

总结

将结构化的数据，转为非结构化的文本。整个任务的核心就是这个吧

学习笔记（11）- 文本生成RNNLG的更多相关文章

Spring MVC 学习笔记11 —— 后端返回json格式数据
Spring MVC 学习笔记11 -- 后端返回json格式数据我们常常听说json数据,首先,什么是json数据,总结起来,有以下几点: 1. JSON的全称是"JavaScript ...
《C++ Primer Plus》学习笔记11
<C++ Primer Plus>学习笔记11 第17章输入.输出和文件 <<<<<<<<<<<<<< ...
Spring 源码学习笔记11——Spring事务
Spring 源码学习笔记11--Spring事务 Spring事务是基于Spring Aop的扩展 AOP的知识参见<Spring 源码学习笔记10--Spring AOP> 图片参考了 ...
Ext.Net学习笔记11：Ext.Net GridPanel的用法
Ext.Net学习笔记11:Ext.Net GridPanel的用法 GridPanel是用来显示数据的表格,与ASP.NET中的GridView类似. GridPanel用法直接看代码: < ...
SQL反模式学习笔记11 限定列的有效值
目标:限定列的有效值,将一列的有效字段值约束在一个固定的集合中.类似于数据字典. 反模式:在列定义上指定可选值 1. 对某一列定义一个检查约束项,这个约束不允许往列中插入或者更新任何会导致约束失败的值 ...
golang学习笔记11 golang要用jetbrain的golang这个IDE工具开发才好
golang学习笔记11 golang要用jetbrain的golang这个IDE工具开发才好 jetbrain家的全套ide都很好用,一定要dark背景风格才装B 从File-->s ...
ArcGIS案例学习笔记2_2_等高线生成DEM和三维景观动画
ArcGIS案例学习笔记2_2_等高线生成DEM和三维景观动画计划时间:第二天下午教程:Pdf/405 数据:ch9/ex3 方法: 1. 创建DEM SA工具箱/插值分析/地形转栅格 2. 生成 ...
Python3+Selenium3+webdriver学习笔记11（cookie处理）
#!/usr/bin/env python# -*- coding:utf-8 -*-'''Selenium3+webdriver学习笔记11(cookie处理)'''from selenium im ...
并发编程学习笔记(11)----FutureTask的使用及实现
1. Future的使用 Future模式解决的问题是.在实际的运用场景中,可能某一个任务执行起来非常耗时,如果我们线程一直等着该任务执行完成再去执行其他的代码,就会损耗很大的性能,而Future接口 ...
SpringMVC:学习笔记(11)——依赖注入与@Autowired
SpringMVC:学习笔记(11)——依赖注入与@Autowired 使用@Autowired 从Spring2.5开始,它引入了一种全新的依赖注入方式,即通过@Autowired注解.这个注解允许 ...

随机推荐

POCO的理解
POCO的名称有多种,pure old clr object. plain ordinary clr object等 POCO的概念是指那些没有从任何类继承,也没有实现任何接口,更没有被其它框架侵入的 ...
webRTC中回声消除(AEC)模块编译时aec_rdft.c文件报错：
webRTC中回声消除(AEC)模块编译时aec_rdft.c文件报错. 原因是: 局部变量ip跟全局变量冲突的问题,可以将局部变量重新命名一下,就可以通过编译了. aec_rdft.c修改以后文件代 ...
Django框架之Filters（过滤器）、母版的使用
在Django的模板语言中,通过使用过滤器来改变变量的显示. 过滤器的语法: {{ value|filter_name:参数 }} 使用管道符"|"来应用过滤器. 注意事项: ...
推荐算法之---FM算法；
一,FM算法: 1,逻辑回归上面进行了交叉特征.算法复杂度优化从O(n^3)->O(k*n^2)->O(k*n). 2,本质:每个特征都有一个k维的向量,代表的是每个特征都有k个不可告人的 ...
4、maven——构建生命周期
什么是生命周期? 构建生命周期是一组阶段的序列(sequence of phase),每个阶段定义了目标被执行的顺序,这里的阶段就是生命周期的一部分. 一个典型的Maven生命周期由一些几个阶段的序列 ...
Solr 8.2.0最新RCE漏洞复现
漏洞描述国外安全研究员s00py公开了一个Apache Solr的Velocity模板注入漏洞.该漏洞可以攻击最新版本的Solr. 漏洞编号无影响范围包括但不限于8.2.0(20191031最 ...
【HTML】输入密码访问
<script> (function(){ if('{{ page.password }}'){ if (prompt('请输入文章密码') !== '{{ page.password } ...
iOS 开发之提取图片的主色调用于更换应用主题颜色
从刷爆 IT 圈的一个事件说起: 新闻:某互联网公司产品经理提出一个需求--要求APP开发人员做到软件根据用户的手机壳改变软件的主题颜色. What Fuck!还有这操作,PM,你过来,保证不打屎你. ...
获取 input type=file 上次文件的路径
可以通过 $('这个元素').val();得到全路径:
微信+QQ跳转
加到对应页面的</body> 上面,或者<head> </head>之间 <script type="text/javascript"&g ...

学习笔记（11）- 文本生成RNNLG

数据集