在UWP应用中加入Cortana语音指令集

本文介绍小娜语音指令集的使用场景，如何将UWP应用接入小娜的语音指令集，使用户直接通过小娜启动应用并使用应用中一些轻量级的功能。文中以必应词典作为实例讲解必应词典UWP版本是如何接入小娜语音功能的。

小娜算得上是Windows 10一大卖点，跟邻居家的Google Now和Siri比起来叫好声也更多。除了经典开始菜单的“回归”，UWP通用应用架构，营销人员口中三句话不离的，自然还有微软小娜。其实微软小娜是具有第三方应用整合能力的，而且隐隐可见微软让小娜平台化的意图，所以小娜的入口自然也就成了开发者的兵家必整之地了。

使用情景

目前小娜开放给第三方应用的接口主要是语音指令集（Voice Command Definitions）。

现来看看VCD到底能做些什么。VCD的使用场景概括说来有两种：

第一种是利用第三方应用本身的数据能力，在用户输入语音指令或文字指令后，在小娜的界面内显示由第三方应用提供的一些数据，完成一些轻量级功能。比如提供一些文字、信息。

第二种是将用户输入的语音指令或文字中的信息，作为第三方应用的启动参数，在应用打开后直接跳转到相应的功能页面，缩短导航的路径长度。比如对小娜说“在大众点评中查找附近吃烤鱼的饭馆”，小娜将会打开大众点评，直接跳转到能吃烤鱼的附近的餐厅。这里小娜为用户省去了打开应用，打开查找页，搜索附近吃烤鱼的餐厅这几步。

在新版本的必应词典中，主要用到的是第一种情景。我们先来看一看整个体验长啥样：

用户对着小娜说：“必应词典，告诉我cute是什么意思？” 小娜理解以后跟必应词典沟通，取回cute的意思，并显示出来

开发实现

要实现这些，有两个关键部分，第一个是定义语法。

小娜VCD的语法文件是一个xml格式的文件。先来看看官方的文档以及官文的实例代码：

https://msdn.microsoft.com/en-us/library/windows/apps/dn706593.aspx

https://github.com/Microsoft/Windows-universal-samples/tree/master/Samples/CortanaVoiceCommand

我在这里重点介绍必应词典的VCD实现。下面是必应词典VCD的语法文件：

<?xml version="1.0" encoding="utf-8" ?>

<Example> 翻译一下 friend </Example>

<Example> 翻译一下 friend </Example>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}[告诉我]{query}的意思 </ListenFor>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}[告诉我]{query}[是]什么意思 </ListenFor>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}{query}[用][英语][英文]怎么说 </ListenFor>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}[英语][英文]{query}怎么说 </ListenFor>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}{query}用[汉语][中文]怎么说 </ListenFor>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}什么是{query}</ListenFor>

<Feedback>正在查询{query}的释义...</Feedback>

</Command>

<Example> 翻译一下 friend</Example>

<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}翻译[一下][单词]{query}</ListenFor>

<Feedback>正在翻译{query}...</Feedback>

</Command>

<Subject> Words </Subject>

</PhraseTopic>

</CommandSet>

</VoiceCommands>

VCD中的语法是区分语言的，每个Cortana语言，都是一个CommandSet。对于中文来说，是zh-CN或zh-hans-CN。每一个CommandSet要求一个AppName。理论上这个AppName的名称是可以自定义的，未必非要与应用一模一样。比如我们的应用全名叫“必应词典Win10版”，如果用户需要说：“必应词典Win10版告诉我cute是什么意思？”估计用户会崩溃。不过取应用名的时候还是要稍微讲究一点，一来是用户用着方便，二来如果名字起得太常见可能会跟其它应用产生歧义，也可能有因为破坏了小娜自身的一些功能而被用户卸载的风险。

在ListenFor语句中，[]表示可选字，{}表示特殊字。一句ListenFor中，不能全部由可选字组成，否则就像正则表达式中的.或*一样，无法匹配了。{builtin:AppName} 是应用名字出现的位置，应用的名字可以出现在一句话的开头，也可以在其它位置。

例如：<ListenFor RequireAppName="ExplicitlySpecified"> {builtin:AppName}{query}[用][英语][英文]怎么说 </ListenFor>

对于这句语法，“必应词典xxx怎么说”，“必应词典xxx用英语怎么说”，“必应词典xxx用英文怎么说”，“必应词典xxx英语怎么说”，等等都是可识别的。

正如msdn所述，PhraseTopic可表示任意词，subject和scenario用来辅助语言识别模型更准确的识别语音输入。枚举类型可以通过msdn查到。

在应用的App.xaml.cs文件中，需要把写好的文件在应用启动时装载进Cortana。

protected async override void OnLaunched(LaunchActivatedEventArgs e)

{

…

InstallVoiceCommand();

…

}

private async Task InstallVoiceCommand()

{

try

{

//user can stop VCD in settings

if (AppSettings.GetInstance().CortanaVCDEnableStatus == false)

return;

// Install the main VCD. Since there's no simple way to test that the VCD has been imported, or that it's your most recent

// version, it's not unreasonable to do this upon app load.

StorageFile vcdStorageFile = await Package.Current.InstalledLocation.GetFileAsync(@"DictVoiceCommands.xml");

await Windows.ApplicationModel.VoiceCommands.VoiceCommandDefinitionManager.InstallCommandDefinitionsFromStorageFileAsync(vcdStorageFile);

}

catch (Exception ex)

{

System.Diagnostics.Debug.WriteLine("Installing Voice Commands Failed: " + ex.ToString());

}

第二个重要的部分是语音应用服务(app service)

仿照msdn的sample，必应词典也在解决方案中建立了一个BingDictUWP.VoiceCommands工程。需要注意的是，这个工程的output type必须是Windows Runtime Component。否则backgroundtask将不工作。如下图：

对于backgroundtask这个项目，大家仍然可以从github上下载刚才分享的链接里的项目。大体框架可以直接用那个sample，自己在相应位置做一些修改。

以下是必应词典用来处理小娜发回的语音指令的代码

namespace BingDictUWP.AppExtensions

{

/// <summary>

/// The VoiceCommandService implements the entrypoint for all headless voice commands

/// invoked via Cortana. The individual commands supported are described in the

/// AdventureworksCommands.xml VCD file in the AdventureWorks project. The service

/// entrypoint is defined in the Package Manifest (See section uap:Extension in

/// AdventureWorks:Package.appxmanifest)

/// </summary>

public sealed class DictVoiceCommandService : IBackgroundTask

{

...

/// <summary>

/// Background task entrypoint. Voice Commands using the <VoiceCommandService Target="...">

/// tag will invoke this when they are recognized by Cortana, passing along details of the

/// invocation.

///

/// Background tasks must respond to activation by Cortana within 0.5 seconds, and must

/// report progress to Cortana every 5 seconds (unless Cortana is waiting for user

/// input). There is no execution time limit on the background task managed by Cortana,

/// but developers should use plmdebug (https://msdn.microsoft.com/en-us/library/windows/hardware/jj680085%28v=vs.85%29.aspx)

/// on the Cortana app package in order to prevent Cortana timing out the task during

/// debugging.

///

/// Cortana dismisses its UI if it loses focus. This will cause it to terminate the background

/// task, even if the background task is being debugged. Use of Remote Debugging is recommended

/// in order to debug background task behaviors. In order to debug background tasks, open the

/// project properties for the app package (not the background task project), and enable

/// Debug -> "Do not launch, but debug my code when it starts". Alternatively, add a long

/// initial progress screen, and attach to the background task process while it executes.

/// </summary>

/// <param name="taskInstance">Connection to the hosting background service process.</param>

public async void Run(IBackgroundTaskInstance taskInstance)

{

mServiceDeferral = taskInstance.GetDeferral();

// Register to receive an event if Cortana dismisses the background task. This will

// occur if the task takes too long to respond, or if Cortana's UI is dismissed.

// Any pending operations should be cancelled or waited on to clean up where possible.

taskInstance.Canceled += OnTaskCanceled;

var triggerDetails = taskInstance.TriggerDetails as AppServiceTriggerDetails;

// Load localized resources for strings sent to Cortana to be displayed to the user.

mCortanaResourceMap = ResourceManager.Current.MainResourceMap.GetSubtree("Resources");

// Select the system language, which is what Cortana should be running as.

mCortanaContext = ResourceContext.GetForViewIndependentUse();

var lang = Windows.Media.SpeechRecognition.SpeechRecognizer.SystemSpeechLanguage.LanguageTag;

mCortanaContext.Languages = new string[] { Windows.Media.SpeechRecognition.SpeechRecognizer.SystemSpeechLanguage.LanguageTag };

// Get the currently used system date format

mDateFormatInfo = CultureInfo.CurrentCulture.DateTimeFormat;

// This should match the uap:AppService and VoiceCommandService references from the

// package manifest and VCD files, respectively. Make sure we've been launched by

// a Cortana Voice Command.

if ((triggerDetails != null) && (triggerDetails.Name == "DictVoiceCommandService"))

{

try

{

mVoiceServiceConnection = VoiceCommandServiceConnection.FromAppServiceTriggerDetails(triggerDetails);

mVoiceServiceConnection.VoiceCommandCompleted += OnVoiceCommandCompleted;

VoiceCommand voiceCommand = await mVoiceServiceConnection.GetVoiceCommandAsync();

//var properties = voiceCommand.SpeechRecognitionResult.SemanticInterpretation.Properties.Values.First()[0];

// Depending on the operation (defined in AdventureWorks:AdventureWorksCommands.xml)

// perform the appropriate command.

switch (voiceCommand.CommandName)

{

case "searchWord":

case "translate":

var keyword = voiceCommand.Properties["query"][0];

await SendCompletionMessageForKeyword(keyword);

break;

}

catch (Exception ex)

{

System.Diagnostics.Debug.WriteLine("Handling Voice Command failed " + ex.ToString());

}

…}

大家需要在Package.appxmanifest里声名App Service并正确填写该service的entry point，如下图：

写在最后

关于Cortana语音指令集，目前还存在一些可以改进的地方，比如语音指令集的语法全靠手写，并没有自然语音理解的能力。如果开发者在使用中还有其它痛点，也欢迎给我们留言一起讨论。说不定这些痛点，下个版本就能解决了呢 :)

在UWP应用中加入Cortana语音指令集的更多相关文章

Win10/UWP开发—使用Cortana语音指令与App的前台交互
Win10开发中最具有系统特色的功能点绝对少不了集成Cortana语音指令,其实Cortana语音指令在以前的wp8/8.1时就已经存在了,发展到了Win10,Cortana最明显的进步就是开始支持调 ...
Win10/UWP开发—使用Cortana语音指令启动前台App
这两天进群(53078485)找大咖的童鞋比较多,只是大咖比较忙,目前Demo还没有要到,这里先给大家转载一篇Aran大咖的博客学习下,以下是原文: Win10开发中最具有系统特色的功能点绝对少不了集 ...
Win10/UWP开发—使用Cortana语音与App后台Service交互
上篇文章中我们介绍了使用Cortana调用前台App,不熟悉的移步到:Win10/UWP开发—使用Cortana语音指令与App的前台交互,这篇我们讲讲如何使用Cortana调用App的后台任务,相比 ...
WP8.1下 Cortana语音命令 VCD文件设计
Windows Phone8.1下的Cortana,可以通过语音的方式,打开.设置应用,进行页面跳转.执行任务. 我们先要创建VCD(VoiceCommand.xml)文件 <?xml vers ...
Windows 10 IoT Serials 4 - 如何在树莓派上使用Cortana语音助手
从Windows 10 IoT Core 14986版本开始,微软已经加入Cortana语音助手功能.之前,我们只能使用本地语音识别,需要编写应用程序,下载到设备中才能实现.从现在开始,微软已经从系统 ...
Viewbox在UWP开发中的应用
Windows 8.1 与Windows Phone 8.1的UAP应用,终于在Windows 10上统一到了UWP之下.原来3个不同的project也变为一个.没有了2套xaml页面,我们需要用同一 ...
WP8微信5.3开始内测支持Cortana语音两微破冰了?
WP版微信v5.3内测版昨发布了,进行了一些小幅升级,最意外的是原生支持WP8.1版Cortana语音命令操作.要知道微软的聊天机器人“小冰”在微信上存在不到4天,就被微信全面封杀退出,现在微信又内测 ...
UWP开发中的方向传感器
在UWP开发中,我们能使用的到方向有三种: OrientationSensor下的四元数:Compass罗盘的HeadingMagneticNorth:以及SimpleOrientationSenso ...
13、在 uwp应用中，给图片添加高斯模糊滤镜效果（一）
如果在应用中,如果想要给app 添加模糊滤镜,可能第一想到的是第三方类库,比如 Win2d.lumia Imaging SDK .WriteableBitmapEx,不可否认,这些类库功能强大,效果也 ...

随机推荐

【python】GTK 例子
# encoding:utf-8 # box_multi_button_data.py import pygtk, gtk def destroy(widget, data=None): gtk.ma ...
Arch Linux 简易打包指南
本文时代久远,请参阅更可靠的:Arch User Repository (简体中文) - 分享和维护软件包这两天给 Kreogist µ 打 Arch Linux 包,照着 wiki 跟着搞,同时在 ...
hibernate的update() 更新延迟或者无法更新，导致同个service调用存储过程执行方法不精确
hibernate的update()方法无法更新,不报错原因是hibernate的update方法操作的是缓存,可以flush下先. 设置缓存为false理论上也可. 在一个serivce方法里,执 ...
AD帐户操作C#示例代码（二）——检查密码将过期的用户
本文接着和大家分享AD帐户操作,这次开发一个简单的检查密码将过期用户的小工具. 首先,新建一个用户实体类,属性是我们要取的用户信息. public class UserInfo { /// <s ...
讲解版的导航高亮（新手福利）原生JS
1.先写样式: 导航的排版样式: 导航对应高亮样式: .d6000f{ background:red; } .d6000f a{ color:#fff; } 我这个地方导航高亮样式为背景红色,字体颜色 ...
mybatis报invalue types（）错误
错误信息: Cause: org.apache.ibatis.reflection.ReflectionException: Error instantiating class cn.qd.mybat ...
tmpfs介绍
tmpfs 前几天发现服务器的内存(ram)和swap使用率非常低,于是就想这么多的资源不用岂不浪费了?google了一下,认识了tmpfs,总的来说tmpfs是一种虚拟内存文件系统正如这个定义它最大 ...
Java回调函数
维基百科上的定义:在计算机程序设计中,回调函数,或简称回调,是指通过函数参数传递到其它代码的,某一块可执行代码的引用.这一设计允许了底层代码调用在高层定义的子程序. 所谓回调,就是客户程序C调用服务程 ...
Java学习笔记 05 数据包装类
一.包装类综述 >>java.lang包中的Integer类.Long类和Short类,分别将基本数据类型int.long和short封装成一个类.这些类都是Number的子类. Int ...
MATLAB cvx 工具包使用
一个例子 m = ; n = ; p = ; A = randn(m,n); b = randn(m,); C = randn(p,n); d = randn(p,); e = rand; cvx_b ...

在UWP应用中加入Cortana语音指令集

在UWP应用中加入Cortana语音指令集的更多相关文章

随机推荐

热门专题