PyTorch学习笔记之n-gram模型实现

 import torch

 import torch.nn as nn

 from torch.autograd import Variable

 import torch.nn.functional as F

 import torch.optim as optim

 CONTEXT_SIZE = 2            # the same as window_size

 EMBEDDING_DIM = 10

 test_sentence = "When forty winters shall besiege thy brow,And dig deep trenches in thy beauty's field,Thy youth's proud livery so gazed on now,Will be a totter'd weed of small worth held:Then being asked, where all thy beauty lies,Where all the treasure of thy lusty days;To say, within thine own deep sunken eyes,Were an all-eating shame, and thriftless praise.How much more praise deserv'd thy beauty's use,If thou couldst answer 'This fair child of mineShall sum my count, and make my old excuse,'Proving his beauty by succession thine!This were to be new made when thou art old,And see thy blood warm when thou feel'st it cold.".split()

 vocb = set(test_sentence)       # remove repeated words

 word2id = {word: i for i, word in enumerate(vocb)}

 id2word = {word2id[word]: word for word in word2id}

 # define model

 class NgramModel(nn.Module):

     def __init__(self, vocb_size, context_size, n_dim):

         # super(NgramModel, self)._init_()

         super().__init__()

         self.n_word = vocb_size

         self.embedding = nn.Embedding(self.n_word, n_dim)

         self.linear1 = nn.Linear(context_size*n_dim, 128)

         self.linear2 = nn.Linear(128, self.n_word)

     def forward(self, x):

         # the first step: transmit words and achieve word embedding. eg. transmit two words, and then achieve (2, 100)

         emb = self.embedding(x)

         # the second step: word wmbedding unfold to (1,200)

         emb = emb.view(1, -1)

         # the third step: transmit to linear model, and then use relu, at last, transmit to linear model again

         out = self.linear1(emb)

         out = F.relu(out)

         out = self.linear2(out)

         # the output dim of last step is the number of words, wo can view as a classification problem

         # if we want to predict the max probability of the words, finally we need use log softmax

         log_prob = F.log_softmax(out)

         return log_prob

 ngrammodel = NgramModel(len(word2id), CONTEXT_SIZE, 100)

 criterion = nn.NLLLoss()

 optimizer = optim.SGD(ngrammodel.parameters(), lr=1e-3)

 trigram = [((test_sentence[i], test_sentence[i+1]), test_sentence[i+2])

             for i in range(len(test_sentence)-2)]

 for epoch in range(100):

     print('epoch: {}'.format(epoch+1))

     print('*'*10)

     running_loss = 0

     for data in trigram:

         # we use 'word' to represent the two words forward the predict word, we use 'label' to represent the predict word

         word, label = data                          # attention

         word = Variable(torch.LongTensor([word2id[e] for e in word]))

         label = Variable(torch.LongTensor([word2id[label]]))

         # forward

         out = ngrammodel(word)

         loss = criterion(out, label)

         running_loss += loss.data[0]

         # backward

         optimizer.zero_grad()

         loss.backward()

         optimizer.step()

     print('loss: {:.6f}'.format(running_loss/len(word2id)))

 # predict

 word, label = trigram[3]

 word = Variable(torch.LongTensor([word2id[i] for i in word]))

 out = ngrammodel(word)

 _, predict_label = torch.max(out, 1)

 predict_word = id2word[predict_label.data[0][0]]

 print('real word is {}, predict word is {}'.format(label, predict_word))

PyTorch学习笔记之n-gram模型实现的更多相关文章

ArcGIS案例学习笔记-批量裁剪地理模型
ArcGIS案例学习笔记-批量裁剪地理模型联系方式:谢老师,135-4855-4328,xiexiaokui#qq.com 功能:空间数据的批量裁剪优点:1.批量裁剪:任意多个目标数据,去裁剪任意 ...
Java学习笔记之---单例模型
Java学习笔记之---单例模型单例模型分为:饿汉式,懒汉式 (一)要点 1.某个类只能有一个实例 2.必须自行创建实例 3.必须自行向整个系统提供这个实例 (二)实现 1.只提供私有的构造方法 2 ...
WebGL three.js学习笔记加载外部模型以及Tween.js动画
WebGL three.js学习笔记加载外部模型以及Tween.js动画本文的程序实现了加载外部stl格式的模型,以及学习了如何把加载的模型变为一个粒子系统,并使用Tween.js对该粒子系统进行 ...
ARMV8 datasheet学习笔记5：异常模型
1.前言 2.异常类型描述见 ARMV8 datasheet学习笔记4:AArch64系统级体系结构之编程模型(1)-EL/ET/ST 一文 3. 异常处理路由对比 AArch32.AArch64架 ...
Javascript MVC 学习笔记（一）模型和数据
写在前面近期在看<MVC的Javascript富应用开发>一书.本来是抱着一口气读完的想法去看的.结果才看了一点就傻眼了:太多不懂的地方了. 仅仅好看一点查一点,一点一点往下看吧,进度虽 ...
PowerDesigner 15学习笔记：十大模型及五大分类
个人认为PowerDesigner 最大的特点和优势就是1)提供了一整套的解决方案,面向了不同的人员提供不同的模型工具,比如有针对企业架构师的模型,有针对需求分析师的模型,有针对系统分析师和软件架构师 ...
[PyTorch 学习笔记] 3.1 模型创建步骤与 nn.Module
本章代码:https://github.com/zhangxiann/PyTorch_Practice/blob/master/lesson3/module_containers.py 这篇文章来看下 ...
[PyTorch 学习笔记] 7.1 模型保存与加载
本章代码: https://github.com/zhangxiann/PyTorch_Practice/blob/master/lesson7/model_save.py https://githu ...
PyTorch学习笔记之CBOW模型实践
import torch from torch import nn, optim from torch.autograd import Variable import torch.nn.functio ...

随机推荐

命令行执行Qt程序
原文网址 //helloworld.cpp #include <QApplication> #include <QPushButton> int main(int argc,c ...
POJ 3057 网络流 Evacuation
题意: 有一个n×m的房间,四周每个格子要么是墙要么是门.中间部分是墙或者人. 现在所有人要从房间逃出去,每个人的速度为1,也就是每个单位时间只能向上下左右四个方向走一格. 多个人可以站在同一个格子上 ...
Python虚拟机之if控制流（一）
Python虚拟机中的if控制流在所有的编程语言中,if控制流是最简单也是最常用的控制流语句.下面,我们来分析一下在Python虚拟机中对if控制流的实现 # cat demo.py a = 1 i ...
luogu1262 间谍网络
贿赂所有能贿赂的,如果还有人不被访问则显然是NO. 否则,必定为YES.强联通分量缩成一个DAG,若某点的入度为零,则答案要算上它的. #include <iostream> #inclu ...
Java学习笔记1---JVM、JRE、JDK
jdk包含jre,jre包含jvm. 用java语言进行开发时,必须先装jdk: 只运行java程序,不进行开发时,可以只装jre. JVM 即Java Virtual machine,Java虚拟机 ...
ThreeJs 3D 全景项目开发总结
本文来自网易云社区作者:唐钊项目背景那是在一个毫无征兆的下午,我还沉浸在 vue 的世界中,突然编辑跑过来说N的新官网想做一些3D全景的东西,一开始其实我的内心是拒绝的,一是没怎么实质性做过 W ...
python学习-- 数据库迁移 python manage.py makemigrations 和 python manage.py migrate
python manage.py makemigrations 和 python manage.py migrate
[git 学习篇] 关联github和本地创库
所以,github和本地创库是通过ssh传送,所以要将公钥拷贝到远程创库上(比如我ssh 远程服务器时,先将本地的公钥,拷贝到远程服务器的某个文件上(http://www.cnblogs.com/li ...
是男人就过 8 题--Pony.AI A AStringGame
链接:https://www.nowcoder.com/acm/contest/92/A来源:牛客网 AStringGame 时间限制:C/C++ 2秒,其他语言4秒空间限制:C/C++ 26214 ...
《程序员思维修炼》读书笔记——week4
<程序员思维修炼>读书笔记——week4 PB16061441 陈昶金这周读的是Andy Hunt的著作<程序员思维修炼>,这本书对于我这种刚刚入门的新手很友好,大多是讲一些 ...

PyTorch学习笔记之n-gram模型实现

PyTorch学习笔记之n-gram模型实现的更多相关文章

随机推荐

热门专题