Understand User's Intent from Speech and Text

http://research.microsoft.com/en-us/projects/IntentUnderstanding/

Understanding what users like to do/need to get is critical in human computer interaction. When natural user interface like speech or natural language is used in human-computer interaction, such as in a spoken dialogue system or with an internet search engine, language understanding becomes an important issue. Intent understanding is about indentifying the action a user wants a computer to take or the information she/he would like to obtain, conveyed in a spoken utterance or a text query.

In this project, we develop robust data-driven technologies applicable todifferent domains, make them morepractical by leveraging large amount of unlabeled data via unsupervised/semi-supervised machine learning;by innovating machine learning algorithms that work better with less data or mismatched data; and by augmenting statistical models with domain knowledge obtainedin a semi-supervised fashion.Research activities fall into the following areas:

Data-Driven Approaches to Spoken Language/Query Understanding
Unsupervised/Semi-Supervised Learning
Automatic/Semi-automatic Acquisition of Domain Knowledge
Authoring Tools for Spoken Language Understanding
Application of Intent Undrestanding Technology

We have contributed to Microsoft products from the following teams:

Microsoft Live Search/Commerce Search
Microsoft adCenter
Microsoft Speech Component Group
Tellme

People

Li Deng

Y. C. Ju

Dong Yu

Geoffrey Zweig

Publications

Larry Heck, Dilek Hakkani-Tur, Madhu Chinthakunta, Gokhan Tur, Rukmini Iyer, Partha Parthasarathy, Lisa Stifelman, Elizabeth Shriberg, and Ashley Fidler, Multimodal Conversational Search and Browse, IEEE Workshop on Speech, Language and Audio in Multimedia, August 2013
Malcolm Slaney, Pay Attention, Please: Attention at the Telluride Neuromorphic Cognition Workshop, in IEEE SLTC Newsletter, IEEE, November 2012
Jingjing Liu, Xiao Li, Alex Acero, and Ye-Yi Wang, Lexicon Modeling for Query Understanding, in ICASSP, IEEE, May 2011
Xiao Li, Understanding the Semantic Structure of Noun Phrase Queries, in ACL, Association for Computational Linguistics, July 2010
Xiao Li, Ye-Yi Wang, Dou Shen, and Alex Acero, Learning with Click Graph for Query Intent Classification, in ACM Transaction on Information Systems, vol. 28, no. 3, Association for Computing Machinery, Inc., June 2010
Mehdi Hafezi Manshadi and Xiao Li, Semantic Tagging of Web Search Queries, in ACL, Association for Computational Linguistics, August 2009
Xiao Li, On the Use of Virtual Evidence in Conditional Random Fields, in EMNLP, August 2009
Xiao Li, Ye-Yi Wang, and Alex Acero, Extracting Structured Information from User Queries with Semi-Supervised Conditional Random Fields, in SIGIR, July 2009
Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Seltzer, Ivan Tashev, and Alex Acero, Voice Search of Structured Media Data, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electornic Engineers, Inc., Taipei, Taiwan, April 2009
Ye-Yi Wang, Xiao Li, and Alex Acero, Inductive and Example-Based Learning for Text Classification, in Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008

Understand User's Intent from Speech and Text的更多相关文章

利用Google Speech API实现Speech To Text
很久很久以前, 网上流传着一个免费的,识别率暴高的,稳定的 Speech To Text API, 那就是Google Speech API. 但是最近再使用的时候,总是返回500 Error. 后来 ...
Csharp: speech to text, text to speech in win
using System; using System.Collections.Generic; using System.ComponentModel; using System.Data; usin ...
Speech to Text for iOS
找了一下 speech to text 可以用的 SDK for iOS 以下幾種方案: NDEV Mobile (有免費方案,不過似乎不支援離線,客戶清單中有 wallmart,支援不少語言) iS ...
一次神奇的Azure speech to text rest api之旅
错误Max retries exceeded with url: requests.exceptions.ConnectionError: HTTPSConnectionPool(host='%20e ...
Sample example for Speech to Text in iOS
There are several libraries for this kind of conversion - I host two of those on GitHub: libsprec (t ...
各大厂的语音识别Speech To Text API使用体验
最近发现有声读物能极大促进我的睡眠,但每个前面都有一段开场语,想把它剪掉,但是有多个开场语,所以就要用到语音识别判断一下再剪. 前两年在本地搭建过识别的环境,奈何识别准确率不行,只能找找API了,后面 ...
CMUSphinx Learn - Basic concepts of speech
Basic concepts of speech Speech is a complex phenomenon. People rarely understand how is it produced ...
Intent属性详解二 Action、Category
先看效果图: 1.Action:该activity可以执行的动作该标识用来说明这个activity可以执行哪些动作,所以当隐式intent传递过来action时,如果跟这里<intent-fi ...
Android开发之Intent略解
Intent是一种运行时绑定(run-time binding)机制,它能在程序运行过程中连接两个不同的组件.通过Intent,你的程序可以向Android表达某种请求或者意愿,Android会根据意 ...

随机推荐

【转】用java实例学习MVC模式
.1 MVC模式 MVC是三个单词的缩写,这三个单词分别为:模型(Model).视图(View)和控制(Controller).MVC模式的目的就是实现Web系统的职能分工.下面以J2EE开发进行介绍 ...
AMQ学习笔记 - 16. 确认机制的测试
概述对Acknowledge机制进行测试. 此处的测试是针对Consumer的确认设计的:对于Producer的确认是透明的,无法提供测试. 测试实例设计demo,测试三种确认机制. 测试机制测 ...
对象-关系Metadata映射模式
MetaData Mapping元数据映射在MetaData中保存object-relation映射的详细信息. 以表格形式定义映射,并可由通用代码来处理映射. 运行机制 MetaData中的信息如 ...
hibernate结合使用gson转换json中一点看法
转发请注明出处:http://www.cnblogs.com/shizhongtao/p/3680216.html 在前后台的交互中,经常需要把bean对象与xml或者json,这里就把自己以前遇到的 ...
超越luabind的luaBridge
此编是引用他人的文章,这里记录下,主要为以后自己查找方便,原文地址:http://www.cppblog.com/sunicdavy/archive/2013/12/07/204648.html 最近 ...
[Guava源码分析]Ordering：排序
我的技术博客经常被流氓网站恶意爬取转载.请移步原文:http://www.cnblogs.com/hamhog/p/3876466.html,享受整齐的排版.有效的链接.正确的代码缩进.更好的阅读体验 ...
Linux下OpenCV的环境搭建
OpenCV is the most popular and advanced code library for Computer Vision related applications today, ...
java.lang.ThreadLocal源码分析
ThreadLocal类提供线程本地变量,为变量在每个线程创建一个副本,每个线程可以访问自己内部的副本变量. 比如,有这样一个需求,需要为每个线程创建一个独一无二的标识,这个标识在第一次调用Threa ...
【风马一族_xml】xml的基本讲解笔记
xml是如何保存数据的在xml语言中,它允许用户自定义标签.每个标签用于描述一段数据; 一个标签可以分为开始标签和结束标签,在开始标签和结束标签之间又可以嵌套其它标签,利用标签间的嵌套其它标签,利用 ...
【风马一族_Android】强制activity的横屏与纵屏
<?xml version="1.0" encoding="utf-8"?> <manifest xmlns:android="ht ...

Understand User's Intent from Speech and Text

Understand User's Intent from Speech and Text的更多相关文章

随机推荐

热门专题