论文《A Generative Entity-Mention Model for Linking Entities with Knowledge Base》
A Generative Entity-Mention Model for Linking Entities with Knowledge Base
一.主要方法
提出了一种生成概率模型,叫做entity-mention model.
Explanation:
In our model, each name mention to be linked is modeled as a sample generated through a three-step generative story, and the entity knowledge is encoded in the distribution of entities in document P(e), the distribution of possible names of a specific entity P(s|e), and the distribution of possible contexts of a specific entity P(c|e). To find the referent entity of a name mention, our method combines the evidences from all the three distributions P(e), P(s|e) and P(c|e).
The P(e), P(s|e) and P(c|e) are respectively called the entity popularity model, the entity name model and the entity context model
二.相关介绍
建模
Given a set of name mentions M = {m1, m2, …, mk} contained in documents and a knowledge base KB containing a set of entities E = {e1, e2, …, en}, an entity linking system is a function s : M ® E which links these name mentions to their referent entities in KB.
Popularity Knowledge
实体的流行度知识告诉我们一个实体出现在文档中的可能性
Name Knowledge
名称知识告诉我们实体的可能名称,以及名称引用特定实体的可能性。
Context Knowledge
上下文知识告诉我们一个实体出现在特定上下文中的可能性。
三.The Generative Entity-Mention Model for Entity Linking
Explanation

- 首先,该模型根据P(e)中实体的分布情况,从给定知识库中选择提及名称的引用实体e。
- 其次,该模型根据被引用实体P(s|e)的可能名称的分布情况输出所述名称的名称s。
- 最后,模型根据被引用实体P(c|e)可能的上下文分布输出所提到的名称的上下文c。
model
The probability of a name mention m (its context is c and its name is s) referring to a specific entity e can be expressed as the following formula (here assume that s and c are independent):

Give a name mention m, to perform entity linking, we need to find the entity e which maximizes the probability P(e|m).

Candidate Selection
building a name-to-entity dictionary using the redirect links, disambiguation pages, anchor texts of Wikipedia, then the candidate entities of a name mention are selected by finding its name’s corresponding entry in the dictionary
四.Model Estimation
Entity Popularity Model
----》
where Count(e) is the count of the name mentions whose referent entity is e, and the |M| is the total name mention size.
Entity Name Model
比如,我们希望 P(Michael Jordan|Michael Jeffrey Jordan) 高,,P(MJ|Michael Jeffrey Jordan) 也高。 P(Michael I. Jordan|Michael Jeffrey Jordan) 应该是0.
因此,名称模型可以通过首先从数据集中收集所有(实体、名称)对来估计。

缺点:它不能正确地处理一个不可见的实体或一个不可见的名称。
Eg: “MJ”在Wikipedia指的并不是Michael Jeffrey Jordan, 这个the name model 将不能识别 “MJ” 就是Michael Jeffrey Jordan.
↓
1) It is retained (translated into itself);
2) It is translated into its acronym;
3) It is omitted(translated into the word NULL);
4) It is translated into another word (misspelling or alias).


wheree is a normalization factor, f is the full name of entity e, lf is the length of f, ls is the length of the name s, si the i th word of s, fj is the j th word of f and t(si|fj) is the lexical translation probability which indicates the probability of a word fj in the full name will be written as si in the output name.
Entity Context Model
例如:
C1: __wins NBA MVP.
C2: __is a researcher in machine learning
P(C1|Michael Jeffrey Jordan)应该很高,因为NBA球员迈克尔杰弗里乔丹经常出现在C1和P(C2|Michael Jeffrey Jordan)应该是非常低的,因为他很少出现在C2.
a context c containing n terms t1,t2…tn (term: a word; a named entity; a Wikipedia concept) ,the entity context model estimates the probability P(c|e) as


where Pg(t) is a general language model which is estimated using the whole Wikipedia data, and the optimal value of λ is set to 0.2

where Counte(t) is the frequency of occurrences of a term t in the contexts of the name mentions whose referent entity is e
The NIL Entity Problem
假设:“如果一个名字被提到是指一个特定的实体,那么这个名字被提到的概率是由特定实体的模型产生的,应该显著高于由一般语言模型产生的概率
1. add a pseudo entity, the NIL entity, into the knowledge base
2. the probability of a name mention is generated by the NIL entity is higher than all other entities in Knowledge base, we link the name mention to the NIL entity.

五.Experiments

论文《A Generative Entity-Mention Model for Linking Entities with Knowledge Base》的更多相关文章
- Entity Framework Model First下改变数据库脚本的生成方式
在Entity Framework Model First下, 一个非常常见的需求是改变数据库脚本的生成方式.这个应用场景是指,当用户在Designer上单击鼠标右键,然后选择Generate Dat ...
- Entity Framework的核心 – EDM(Entity Data Model) 一
http://blog.csdn.net/wangyongxia921/article/details/42061695 一.EnityFramework EnityFramework的全程是ADO. ...
- EF,ADO.NET Entity Data Model简要的笔记
1. 新建一个项目,添加一个ADO.NET Entity Data Model的文件,此文件会生成所有的数据对象模型,如果是用vs2012生的话,在.Designer.cs里会出现“// Defaul ...
- Create Entity Data Model
http://www.entityframeworktutorial.net/EntityFramework5/create-dbcontext-in-entity-framework5.aspx 官 ...
- 论文分享|《Universal Language Model Fine-tuning for Text Classificatio》
https://www.sohu.com/a/233269391_395209 本周我们要分享的论文是<Universal Language Model Fine-tuning for Text ...
- Entity Framework Tutorial Basics(5):Create Entity Data Model
Create Entity Data Model: Here, we are going to create an Entity Data Model (EDM) for SchoolDB datab ...
- ASP.NET-MVC中Entity和Model之间的关系
Entity 与 Model之间的关系图 ViewModel类是MVC中与浏览器交互的,Entity是后台与数据库交互的,这两者可以在MVC中的model类中转换 MVC基础框架 来自为知笔记(Wiz ...
- How to: Use the Entity Framework Model First in XAF 如何:在 XAF 中使用EF ModelFirst
This topic demonstrates how to use the Model First entity model and a DbContext entity container in ...
- 创建实体数据模型【Create Entity Data Model】(EF基础系列5)
现在我要来为上面一节末尾给出的数据库(SchoolDB)创建实体数据模型: SchoolDB数据库的脚本我已经写好了,如下: USE master GO IF EXISTS(SELECT * FROM ...
随机推荐
- Spring学习记录5——数据库事务基础知识
何为数据库事务 “一荣共荣,一损共损”这句话很能体现事务的思想,很多复杂的事务要分步进行,但它们组成了一个整体,要么整体生效,要么整体失效.这种思想反映到数据库上,就是多条SQL语句,要么全部成功,要 ...
- 实战_Spring_Cloud
目录 前言 开发环境 源码地址 创建工程 服务注册中心(Eureka) Eureka Server Eureka Client 注册中心高可用 小结 负载均衡(Ribbon) RestTemplate ...
- 关于neo4j初入门(4)
关于admin管理员 数据库备份和恢复 Neo4j数据库备份 步骤1 -点击“Neo4j Community”,如下图所示 Windows“开始”按钮>> "All Progra ...
- 吸取教训:一段网上找的代码突然爆了,项目出现大BUG
本人是做游戏服务器开发的,碰到一个需求,给符某些要求的玩家的发送道具奖励,奖励的数量根据离线的天数计算. 这个需求实现起来很简单,只需要在玩家上线的时候计算上次离线时间和当前时间间隔的天数,然后根据策 ...
- 01.flask pycharm开启debug模式
代码照旧
- Jenkins自动执行python脚本输出测试报告
前言 在用python做自动化测试时,我们写好代码,然后需要执行才能得到测试报告,这时我们可以通过 Jenkins 来进一步完成自动化工作. 借助Jenkins,我们可以结合 Git/SVN 自动拉取 ...
- 痞子衡嵌入式:语音处理工具pzh-speech诞生记(2)- 界面构建(wxFormBuilder3.8.0)
大家好,我是痞子衡,是正经搞技术的痞子.今天痞子衡给大家介绍的是语音处理工具pzh-py-speech诞生之界面构建. 之前痞子衡设计过一个串口调试助手pzh-py-com,也专门写过一篇关于其界面构 ...
- Python通过win32 com接口实现offic自动化
最近几天通过Python做一些自动生成office报表的东东,比如解析.xml文件,导出.html/WORD/PPT等格式,html不足一提,只需要简单的html静态网页知识即可,这儿要说的是怎么生成 ...
- 安全性与收尾工作 运用过滤器进行授权 精通ASP-NET-MVC-5-弗瑞曼
- 个人第四次作业--Alpha项目测试
这个作业属于哪个课程 https://edu.cnblogs.com/campus/xnsy/GeographicInformationScience 这个作业要求在哪里 https://www.cn ...