keras—多层感知器MLP—IMDb情感分析

 import urllib.request

 import os

 import tarfile

 from keras.datasets import imdb

 from keras.preprocessing import sequence

 from keras.preprocessing.text import Tokenizer

 import re

 def rm_tags(text):

     re_tag=re.compile(r'<[^>]+>')

     return re_tag.sub('',text)

 def read_files(filetype):

     path="C:/Users/admin/.keras/aclImdb/"

     file_list=[]

     positive_path=path+filetype+"/pos/"

     for f in os.listdir(positive_path):

         file_list+=[positive_path+f]

     negative_path=path+filetype+"/pos/"

     for f in os.listdir(negative_path):

         file_list+=[negative_path+f]

     print('read',filetype,'files:',len(file_list))

     all_labels=([]*+[]*)

     all_texts=[]

     for fi in file_list:

         with open(fi,encoding='utf8') as file_input:

             all_texts+=[rm_tags(" ".join(file_input.readlines()))]

     return all_labels,all_texts

 y_train,train_text=read_files("train")

 y_test,test_text=read_files("test")

 print(train_text[])

 print(y_train[])

 token=Tokenizer(num_words=)

 token.fit_on_texts(train_text)

 print(token.document_count)

 print(token.word_index)

 x_train_seq=token.texts_to_sequences(train_text)

 x_test_seq=token.texts_to_sequences(test_text)

 print(train_text[])

 print(x_train_seq[])

 x_train=sequence.pad_sequences(x_train_seq,maxlen=)

 x_test=sequence.pad_sequences(x_test_seq,maxlen=)

 print('before pad_sequences lenfth=',len(x_train_seq[]))

 print(x_train_seq[])

 print('after pad_sequences lenfth=',len(x_train[]))

 print(x_train[])

 from keras.models import Sequential

 from keras.layers import Dense,Dropout,Flatten,Activation

 from keras.layers.embeddings import Embedding

 model=Sequential()

 model.add(Embedding(output_dim=,

                  input_dim=,

                  input_length=))

 model.add(Dropout(0.2))

 #model.add(SimpleRNN(units=))

 model.add(Flatten())

 model.add(Dense(units=,

                 activation='relu'))

 model.add(Dropout(0.35))

 model.add(Dense(units=,

                 activation='sigmoid'))

 print(model.summary())

 model.compile(loss='binary_crossentropy',

               optimizer='adam',

               metrics=['accuracy'])

 train_history=model.fit(x=x_train,y=y_train,batch_size=,

                         epochs=,verbose=,

                         validation_split=0.2)

 scores=model.evaluate(x_test,y_test,verbose=)

 print('accuracy',scores[])

 predict=model.predict_classes(x_test)

 print("prediction[:10]",predict[:])

 predict_classes=predict.reshape(-)

 print(predict_classes[:])

 SentimentDict = {: '正面的', : '负面的'}

 def display_test_Sentiment(i):

     print(test_text[i])

     print('label真实值:', SentimentDict[y_test[i]],

           '预测结果:', SentimentDict[predict_classes[i]])

 display_test_Sentiment()

 input_text='''

 I saw this film with my -year-old a couple weeks ago. While there's plenty about which to gripe, here's one of

 my biggest problems: I can't stand this constant CGI-heavy everything-must-be-a-sequel-or- a- remake era of film

 making. It's making movie makers lazy.

 '''

 input_seq=token.texts_to_sequences([input_text])

 len(input_seq[])

 print(input_seq[])

 pad_input_seq=sequence.pad_sequences(input_seq,maxlen=)

 len(pad_input_seq[])

 print(pad_input_seq[])

 predict_result=model.predict_classes(pad_input_seq)

 print(predict_result)

 print(predict_result[][])

 print(SentimentDict[predict_result[][]])

 def predict_review(input_text):

     input_seq=token.texts_to_sequences([input_text])

     pad_input_seq=sequence.pad_sequences(input_seq,maxlen=)

     predict_result=model.predict_classes(pad_input_seq)

     print(SentimentDict[predict_result[][]])

 predict_review('''

 They poured on the whole "LeFou is gay" thing a bit thick for my taste. It was the only thing that added levity to the movie (despite how much fun it should have been already), but it seemed a bit cheap. I'm not going to apologize for wanting more for my LGBTQ characters than to be just the comic relief.

 ''')

验证的准确率为0问题待解决

keras—多层感知器MLP—IMDb情感分析的更多相关文章

keras—多层感知器MLP—MNIST手写数字识别
一.手写数字识别现在就来说说如何使用神经网络实现手写数字识别. 在这里我使用mind manager工具绘制了要实现手写数字识别需要的模块以及模块的功能: 其中隐含层节点数量(即神经细胞数量)计算 ...
4.2tensorflow多层感知器MLP识别手写数字最易懂实例代码
自己开发了一个股票智能分析软件,功能很强大,需要的点击下面的链接获取: https://www.cnblogs.com/bclshuai/p/11380657.html 1.1 多层感知器MLP(m ...
"多层感知器"--MLP神经网络算法
提到人工智能(Artificial Intelligence,AI),大家都不会陌生,在现今行业领起风潮,各行各业无不趋之若鹜,作为技术使用者,到底什么是AI,我们要有自己的理解. 目前,在人工智能中 ...
TFboy养成记多层感知器 MLP
内容总结与莫烦的视频. 这里多层感知器代码写的是一个简单的三层神经网络,输入层,隐藏层,输出层.代码的目的是你和一个二次曲线.同时,为了保证数据的自然,添加了mean为0,steddv为0.05的噪声 ...
MLPclassifier，MLP 多层感知器的的缩写（Multi-layer Perceptron）
先看代码(sklearn的示例代码): from sklearn.neural_network import MLPClassifier X = [[0., 0.], [1., 1.]] y = [0 ...
神经网络与机器学习笔记—多层感知器（MLP）
多层感知器(MLP) Rosenblatt感知器和LMS算法,都是单层的并且是单个神经元构造的神经网络,他们的局限性是只能解决线性可分问题,例如Rosenblatt感知器一直没办法处理简单异或问题.然 ...
tensorflow学习笔记——自编码器及多层感知器
1,自编码器简介传统机器学习任务很大程度上依赖于好的特征工程,比如对数值型,日期时间型,种类型等特征的提取.特征工程往往是非常耗时耗力的,在图像,语音和视频中提取到有效的特征就更难了,工程师必须在这 ...
使用TensorFlow v2.0构建多层感知器
使用TensorFlow v2.0构建一个两层隐藏层完全连接的神经网络(多层感知器). 这个例子使用低级方法来更好地理解构建神经网络和训练过程背后的所有机制. 神经网络概述 MNIST 数据集概述此 ...
Spark Multilayer perceptron classifier (MLPC)多层感知器分类器
多层感知器分类器(MLPC)是基于前馈人工神经网络(ANN)的分类器. MLPC由多个节点层组成. 每个层完全连接到网络中的下一层. 输入层中的节点表示输入数据. 所有其他节点,通过输入与节点的权重w ...

随机推荐

python接口自动化20-requests获取响应时间(elapsed)与超时（timeout）
前言 requests发请求时,接口的响应时间,也是我们需要关注的一个点,如果响应时间太长,也是不合理的. 如果服务端没及时响应,也不能一直等着,可以设置一个timeout超时的时间关于reques ...
[转]Java 运算符的优先级
Java 运算符的优先级(从高到低) 优先级描述运算符 1 括号 ().[] 2 正负号 +.- 3 自增自减,非 ++.--.! 4 乘除,取余 *./.% 5 加减 +.- 6 移位运算 &l ...
springMVC 是单例还是的多例的？
曾经面试的时候有面试官问我spring的controller是单例还是多例,结果我傻逼的回答当然是多例,要不然controller类中的非静态变量如何保证是线程安全的,这样想起似乎是对的,但是不知道( ...
javascript的冻结对象之freeze(),isFrozen()方法
最严格的对象保护措施就是冻结对象了.冻结过后的对象,即不可以扩展,原有对象也不可以删除,因为[Writable]=false,所以对象的属性不可修改. 示例一: var person={name:&q ...
【Python编程：从入门到实践】chapter10 文件和异常
chapter10 文件和异常 10.1 从文件中读取数据 10.1.1 读取整个文件 with open("pi.txt") as file_object: contents = ...
web service初探
概述:Web service是一个平台独立.低耦合的.自包含的.基于可编程的web的应用程序,可使用开放的XML(标准通用标记语言下的一个子集)标准来描述.发布.发现.协调和配置这些应用程序,用于开发 ...
并发基础（四） java中线程的状态
一.程的五种状态线程的生命周期可以大致分为5种,但这种说法是比较旧的一种说法,有点过时了,或者更确切的来说,这是操作系统的说法,而不是java的说法.但对下面所说的六种状态的理解有所帮助,所以也 ...
内建函数（builtins）和functools
内建函数 Build-in Function,启动python解释器,输入dir(__builtins__), 可以看到很多python解释器启动后默认加载的属性和函数,这些函数称之为内建函数, 这些 ...
初试mysql5.7.2新特性：多源复制（MySQL 5.7 multi-source replication）
多源复制和多主复制的区别: 多主复制示意图: 多源复制示意图: 在my.cnf中添加crash safe特性参数:master_info_repository=TABLE;relay_log_info ...
java8函数式编程（转载）
1. 概述 1.1 函数式编程简介我们最常用的面向对象编程(Java)属于命令式编程(Imperative Programming)这种编程范式.常见的编程范式还有逻辑式编程(Logic Progr ...

keras—多层感知器MLP—IMDb情感分析

keras—多层感知器MLP—IMDb情感分析的更多相关文章

随机推荐

热门专题