http://haixun.olidu.com/probase.html

A Data Driven Semantic Network for Text Understanding

Probase is a data driven semantic network that consists of millions of fine-grained concepts and their relationships. One of the goal of Probase is to enable generalization in natural language processing. One important application we have built using Probase is short text analysis (a.k.a. deep query understanding). Using the knowledge in Probase, we perform segmentation, build dependency tree, and annotate terms in a short text. This enables us to understand the intent of keyword based queries.

Below is a comprehensive list of Probase related publications. More (and a little outdated) info can be found here.

Talks

Inferencing in Information Extraction: Techniques and Applications, ICDE 2015 Tutorial
Knowledge Base for Text Understanding: Haixun Wang, Dec 2014.
Learning Knowledge Bases for Text and Multimedia, Lexing Xie and Haixun Wang, Tutorial at ACM Multimedia, Nov 2014.
Probase: A Review, Haixun Wang, Feb 2014.
Short Text Understanding (invited talk), by Haixun Wang, in AKBC (Automated Knowledge Base Construction), 2013, San Francisco, USA.
Understanding Short Texts (keynote), by Haixun Wang, in APWeb, 2013, Sydney, Australia.

Under Submission

An Inference Approach to Basic Level of Categorization, by Zhongyuan Wang and Haixun Wang, Under Submission, 2015.
On the Transitivity of isA Relations in Data-Driven Semantic Networks, by Jiaqing Liang, Haixun Wang, Yanghua Xiao, Under Submission, 2015
Fine-grained Semantic Typing of FrameNet, by Seung-won Hwang, Haixun Wang, Under Submission, 2015
Probase+: A Comprehensive Conceptual Taxonomy, by Jiaqing Liang, Yanghua Xiao, and Haixun Wang, Under Submission, 2015.

2015

Learning Term Embeddings for Hypernymy Identification, by Yu Zheng, Haixun Wang, Xuemin Lin, and Min Wang, IJCAI 2015.
Query Understanding through Knowledge-Based Conceptualization, by Zhongyuan Wang and Haixun Wang, IJCAI 2015
On Conceptual Labeling of a Bag of Words, by Xiangyan Sun, Haixun Wang, Yanghua Xiao, IJCAI 2015
Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach, by Yangqiu Song, Shusen Wang, Haixun Wang, IJCAI 2015

Short Text Understanding Through Lexical-Semantic Analysis (Best Paper Award), by Wen Hua, Zhongyuan Wang, Haixun Wang, and Xiaofang Zhou, ICDE 2015.
Automatic Taxonomy Construction from Keywords via Scalable Bayesian Rose Trees, by Xueqing Liu, Yangqiu Song, Shixia Liu, and Haixun Wang, TKDE, 2015.

2014

Transfer Understanding from Head Queries to Tail Queries, by Yangqiu Song, Haixun Wang, Weizhu Chen, Shusen Wang, in CIKM, 2014, Shanghai, China.
Concept-based Short Text Classification and Ranking, by Zhongyuan Wang, Fang Wang, Wen Ji-Rong, Zhoujun Li, in CIKM, 2014, Shanghai, China.
Overcoming Semantic Drift in Information Extraction, by Zhixu Li, Hongsong Li, Haixun Wang, Yi Yang, Xiangliang Zhang, and Xiaofang Zhou, in EDBT, 2014, Athens, Greece.
Data Driven Metaphor Recognition and Explanation, by Hongsong Li, Kenny Zhu, and Haixun Wang, in TACL, 2014.
Head, Modifier, and Constraint Detection in Short Texts, by Zhongyuan Wang, Haixun Wang, and Zhirui Hu, in ICDE, 2014, Chicago, USA.
Semantic Multidimensional Scaling for Open-Domain Sentiment Analysis, by Erik Cambria, Yangqiu Song, Haixun Wang, and Newton Howard, in IEEE Intelligent Systems, 2014.

2013

Computing term similarity by large probabilistic isA knowledge, by Pei-Pei Li, Haixun Wang, Kenny Zhu, Zhongyuan Wang, and Xindong Wu, in CIKM, 2013, San Francisco, USA.
Assessing sparse information extraction using semantic contexts, by Pei-Pei Li, Haixun Wang, Hongsong Li, and Xindong Wu, in CIKM, 2013, San Francisco, USA.
Attribute extraction and scoring: A probabilistic approach, by Taesung Lee, Zhongyuan Wang, Haixun Wang, and Seung-won Hwang, in ICDE, 2013, Brisbane, Australia.
Automatic extraction of top-k lists from the web, by Zhixian Zhang, Kenny Zhu, Haixun Wang, and Hongsong Li, in ICDE, 2013, Brisbane, Australia.
Shallow Information Extraction for the knowledge Web (Tutorial), by Denilson Barbosa, Haixun Wang, and Cong Yu, in ICDE, 2013, Brisbane, Australia.
Context-Dependent Conceptualization, by Dongwoo Kim, Haixun Wang, and Alice H. Oh, in IJCAI, 2013, Beijing, China.
Identifying Users' Topical Tasks in Web Search, by Wen Hua, Yangqiu Song, Haixun Wang, and Xiaofang Zhou, in WSDM, 2013, Rome, Italy.
Semantic multi-dimensional scaling for open-domain sentiment analysis, by Eric Cambria, Yangqiu Song, Haixun Wang, and N Howard, in IEEE Intelligent Systems, 2013.

2012

A System for Extracting Top-K Lists from the Web (demo), by Zhixian Zhang, Kenny Zhu, and Haixun Wang, in SIGKDD, 2012, Beijing, China.
Automatic Taxonomy Construction from Keywords, by Xueqing Liu, Yangqiu Song, Shixia Liu, and Haixun Wang, in SIGKDD, 2012, Beijing, China.
Probase: A Probabilistic Taxonomy for Text Understanding, by Wentao Wu, Hongsong Li, Haixun Wang, and Kenny Zhu, in ACM International Conference on Management of Data (SIGMOD), 2012, Arizona, USA.
Optimizing Index for Taxonomy Keyword Search, by Bolin Ding, Haixun Wang, Ruomin Jin, Jiawei Han, and Zhongyuan Wang, in ACM International Conference on Management of Data (SIGMOD), 2012, Arizona, USA.

2011

Web Scale Taxonomy Cleansing, by Taesung Lee, Zhongyuan Wang, Haixun Wang, and Seung-won Hwang, in 37th International Conference on Very Large Data Bases (VLDB), 2011
Isanette: A common and common sense knowledge base for opinion mining, by Eric Cambria, Yangqiu Song, Haixun Wang, and A Hussain, in ICDM, 2011, Vancouver, Canada.
Short Text Conceptualization using a Probabilistic Knowledgebase, by Yangqiu Song, Haixun Wang, Zhongyuan Wang, and Hongsong Li, in The 26th International Joint Conference on Artificial Intelligence (IJCAI), 2011, Spain.

ProBase的更多相关文章

[python爬虫] Selenium定向爬取海量精美图片及搜索引擎杂谈
我自认为这是自己写过博客中一篇比较优秀的文章,同时也是在深夜凌晨2点满怀着激情和愉悦之心完成的.首先通过这篇文章,你能学到以下几点: 1.可以了解Python简单爬取图片的一些思路和方法 ...
追本溯源解析“大数据生态环境”发展现状(CSDN)
程学旗先生是中科院计算所副总工.研究员.博士生导师.网络科学与技术重点实验室主任.本次程学旗带来了中国大数据生态系统的基础问题方面的内容分享.大数据的发展越来越快,但是对于大数据的认知大都还停留在最初 ...
知识图谱顶刊综述 - (2021年4月) A Survey on Knowledge Graphs: Representation, Acquisition, and Applications
知识图谱综述(2021.4) 论文地址:A Survey on Knowledge Graphs: Representation, Acquisition, and Applications 目录知 ...

随机推荐

iOS图片设置圆角性能优化
问题圆角虽好,但如果使用不当,它就是你的帧数杀手,特别当它出现在滚动列表的时候.下面来看圆角如何毁掉你的流畅度的. 实测 layer.cornerRadius 我创建了一个简单地UITableVie ...
【原】配置MySQL服务器端的字符集
[简述] 通过直接配置my.cnf方式修改mysql的字符集,这种方式并不复杂,但是,在linux端配置时,特别容易出错,因此,记录之,以待后用. [配置步骤描述]Step 1:关闭当前的MySQL服 ...
解决 PermGen space Tomcat内存设置(转)
在使用Java程序从数据库中查询大量的数据或是应用服务器(如tomcat.jboss,weblogic)加载jar包时会出现java.lang.OutOfMemoryError异常.这主要是由于应用服 ...
HDU 4423 Simple Function（数学题，2012长春D题）
Simple Function Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others)T ...
swddude -- A SWD programmer for ARM Cortex microcontrollers.
Introducing swddude I love the ARM Cortex-M series of microcontrollers. The sheer computational po ...
USBDM RS08/HCS08/HCS12/Coldfire V1,2,3,4/DSC/Kinetis Debugger and Programmer -- BDM Construction and Firmware
Construction. Build the hardware using the information provided in the PCB download. The following a ...
IAR EWARM __iar_program_start, __iar_data_init3, __iar_copy_init3, __iar_zero_init3
#include <stdint.h> // The type of a pointer into the init table. typedef void const * table_p ...
MyEclipse使用总结——设置MyEclipse使用的Tomcat服务器
一.设置使用的Tomcat服务器如果不想使用MyEclipse自带的tomcat服务器版本,那么可以在MyEclipse中设置我们自己安装好的tomcat服务器设置步骤如下: Window→Pre ...
.Net Discovery 系列之七--深入理解.Net垃圾收集机制(拾贝篇)
关于.Net垃圾收集器(Garbage Collection),Aicken已经在“.Net Discovery 系列”文章中有2篇的涉及,这一篇文章是对上2篇文章的补充,关于“.Net Discov ...
解决ubuntu上在androidstudio中启动emulator闪退的问题（1）
作者彭东林 pengdonglin137@163.com 平台 Ubuntu14.04 64 androidstudio 2.3.3 现象在创建好模拟器后,点击启动时,模拟器界面刚出来就闪退了解 ...

ProBase