ProBase
http://haixun.olidu.com/probase.html

A Data Driven Semantic Network for Text Understanding
Probase is a data-driven semantic network that consists of millions of fine-grained concepts and their relationships. One of the goals of Probase is to enable generalization in natural language processing. One important application we have built using Probase is short text analysis (a.k.a. deep query understanding). Using the knowledge in Probase, we perform segmentation, build a dependency tree, and annotate terms in a short text. This enables us to understand the intent of keyword-based queries.
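As a rough illustration of the segmentation and term annotation steps, here is a minimal Python sketch. It is not the Probase API or the authors' method: the `ISA_COUNTS` table, its numbers, and the function names `segment`, `concept_distribution`, and `annotate` are all hypothetical, and greedy longest-match stands in for whatever segmentation the real system uses. Dependency tree construction is not covered.

```python
# Toy isA table: term -> {concept: co-occurrence count}.
# All entries and counts are made up for illustration only.
ISA_COUNTS = {
    "apple": {"fruit": 60, "company": 40},
    "ipad": {"device": 80, "product": 20},
    "new york": {"city": 90, "state": 10},
    "hotel": {"accommodation": 70, "business": 30},
}


def segment(text, vocab, max_len=3):
    """Greedy longest-match segmentation of a short text against known terms."""
    tokens = text.lower().split()
    segments = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate first; fall back to a single token.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if n == 1 or candidate in vocab:
                segments.append(candidate)
                i += n
                break
    return segments


def concept_distribution(term):
    """Estimate P(concept | term) by normalizing the toy isA counts."""
    counts = ISA_COUNTS.get(term, {})
    total = sum(counts.values())
    return {c: n / total for c, n in counts.items()} if total else {}


def annotate(text):
    """Pair each segment with its most likely concept (None if unknown)."""
    result = []
    for term in segment(text, ISA_COUNTS):
        dist = concept_distribution(term)
        best = max(dist, key=dist.get) if dist else None
        result.append((term, best))
    return result


print(annotate("cheap hotel new york"))
# [('cheap', None), ('hotel', 'accommodation'), ('new york', 'city')]
```

The sketch picks the single most frequent concept per term in isolation; resolving an ambiguous term such as "apple" from its neighbours would need a context-aware step that is left out here.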
Below is a comprehensive list of Probase-related publications. More (slightly outdated) information can be found here.
Talks
- Inferencing in Information Extraction: Techniques and Applications, ICDE 2015 Tutorial
- Knowledge Base for Text Understanding, Haixun Wang, Dec 2014.
- Learning Knowledge Bases for Text and Multimedia, Lexing Xie and Haixun Wang, Tutorial at ACM Multimedia, Nov 2014.
- Probase: A Review, Haixun Wang, Feb 2014.
- Short Text Understanding (invited talk), by Haixun Wang, in AKBC (Automated Knowledge Base Construction), 2013, San Francisco, USA.
- Understanding Short Texts (keynote), by Haixun Wang, in APWeb, 2013, Sydney, Australia.
Under Submission
- An Inference Approach to Basic Level of Categorization, by Zhongyuan Wang and Haixun Wang, Under Submission, 2015.
- On the Transitivity of isA Relations in Data-Driven Semantic Networks, by Jiaqing Liang, Haixun Wang, and Yanghua Xiao, Under Submission, 2015.
- Fine-grained Semantic Typing of FrameNet, by Seung-won Hwang and Haixun Wang, Under Submission, 2015.
- Probase+: A Comprehensive Conceptual Taxonomy, by Jiaqing Liang, Yanghua Xiao, and Haixun Wang, Under Submission, 2015.
2015
- Learning Term Embeddings for Hypernymy Identification, by Yu Zheng, Haixun Wang, Xuemin Lin, and Min Wang, IJCAI 2015.
- Query Understanding through Knowledge-Based Conceptualization, by Zhongyuan Wang and Haixun Wang, IJCAI 2015.
- On Conceptual Labeling of a Bag of Words, by Xiangyan Sun, Haixun Wang, and Yanghua Xiao, IJCAI 2015.
- Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach, by Yangqiu Song, Shusen Wang, and Haixun Wang, IJCAI 2015.
- Short Text Understanding Through Lexical-Semantic Analysis (Best Paper Award), by Wen Hua, Zhongyuan Wang, Haixun Wang, and Xiaofang Zhou, ICDE 2015.
- Automatic Taxonomy Construction from Keywords via Scalable Bayesian Rose Trees, by Xueqing Liu, Yangqiu Song, Shixia Liu, and Haixun Wang, TKDE, 2015.
2014
- Transfer Understanding from Head Queries to Tail Queries, by Yangqiu Song, Haixun Wang, Weizhu Chen, Shusen Wang, in CIKM, 2014, Shanghai, China.
- Concept-based Short Text Classification and Ranking, by Zhongyuan Wang, Fang Wang, Ji-Rong Wen, and Zhoujun Li, in CIKM, 2014, Shanghai, China.
- Overcoming Semantic Drift in Information Extraction, by Zhixu Li, Hongsong Li, Haixun Wang, Yi Yang, Xiangliang Zhang, and Xiaofang Zhou, in EDBT, 2014, Athens, Greece.
- Data Driven Metaphor Recognition and Explanation, by Hongsong Li, Kenny Zhu, and Haixun Wang, in TACL, 2014.
- Head, Modifier, and Constraint Detection in Short Texts, by Zhongyuan Wang, Haixun Wang, and Zhirui Hu, in ICDE, 2014, Chicago, USA.
- Semantic Multidimensional Scaling for Open-Domain Sentiment Analysis, by Erik Cambria, Yangqiu Song, Haixun Wang, and Newton Howard, in IEEE Intelligent Systems, 2014.
2013
- Computing term similarity by large probabilistic isA knowledge, by Pei-Pei Li, Haixun Wang, Kenny Zhu, Zhongyuan Wang, and Xindong Wu, in CIKM, 2013, San Francisco, USA.
- Assessing sparse information extraction using semantic contexts, by Pei-Pei Li, Haixun Wang, Hongsong Li, and Xindong Wu, in CIKM, 2013, San Francisco, USA.
- Attribute extraction and scoring: A probabilistic approach, by Taesung Lee, Zhongyuan Wang, Haixun Wang, and Seung-won Hwang, in ICDE, 2013, Brisbane, Australia.
- Automatic extraction of top-k lists from the web, by Zhixian Zhang, Kenny Zhu, Haixun Wang, and Hongsong Li, in ICDE, 2013, Brisbane, Australia.
- Shallow Information Extraction for the knowledge Web (Tutorial), by Denilson Barbosa, Haixun Wang, and Cong Yu, in ICDE, 2013, Brisbane, Australia.
- Context-Dependent Conceptualization, by Dongwoo Kim, Haixun Wang, and Alice H. Oh, in IJCAI, 2013, Beijing, China.
- Identifying Users' Topical Tasks in Web Search, by Wen Hua, Yangqiu Song, Haixun Wang, and Xiaofang Zhou, in WSDM, 2013, Rome, Italy.
- Semantic multi-dimensional scaling for open-domain sentiment analysis, by Erik Cambria, Yangqiu Song, Haixun Wang, and Newton Howard, in IEEE Intelligent Systems, 2013.
2012
- A System for Extracting Top-K Lists from the Web (demo), by Zhixian Zhang, Kenny Zhu, and Haixun Wang, in SIGKDD, 2012, Beijing, China.
- Automatic Taxonomy Construction from Keywords, by Xueqing Liu, Yangqiu Song, Shixia Liu, and Haixun Wang, in SIGKDD, 2012, Beijing, China.
- Probase: A Probabilistic Taxonomy for Text Understanding, by Wentao Wu, Hongsong Li, Haixun Wang, and Kenny Zhu, in ACM International Conference on Management of Data (SIGMOD), 2012, Arizona, USA.
- Optimizing Index for Taxonomy Keyword Search, by Bolin Ding, Haixun Wang, Ruomin Jin, Jiawei Han, and Zhongyuan Wang, in ACM International Conference on Management of Data (SIGMOD), 2012, Arizona, USA.
2011
- Web Scale Taxonomy Cleansing, by Taesung Lee, Zhongyuan Wang, Haixun Wang, and Seung-won Hwang, in 37th International Conference on Very Large Data Bases (VLDB), 2011.
- Isanette: A common and common sense knowledge base for opinion mining, by Erik Cambria, Yangqiu Song, Haixun Wang, and A Hussain, in ICDM, 2011, Vancouver, Canada.
- Short Text Conceptualization using a Probabilistic Knowledgebase, by Yangqiu Song, Haixun Wang, Zhongyuan Wang, and Hongsong Li, in The 22nd International Joint Conference on Artificial Intelligence (IJCAI), 2011, Spain.