- Learning Microsoft Cognitive Services
- Leif Larsen
- 229字
- 2021-08-13 15:40:14
Speech
Adding one of the Speech APIs allows your application to hear and speak to your users. The APIs can filter noise and identify speakers. Based on the recognized intent, they can drive further actions in your application.
The speech domain contains three APIs that are outlined in the following sections.
Bing Speech
Adding the Bing Speech API to your application allows you to convert speech to text and vice versa. You can convert spoken audio to text either by utilizing a microphone or other sources in real time or by converting audio from files. The API also offers speech intent recognition, which is trained by the Language Understanding Intelligent Service (LUIS) to understand the intent.
Speaker recognition
The speaker recognition API gives your application the ability to know who is talking. By using this API, you can verify that the person that is speaking is who they claim to be. You can also determine who an unknown speaker is based on a group of selected speakers.
Translator speech API
The translator speech API is a cloud-based automatic translation service for spoken audio. Using this API, you can add end-to-end translation across web apps, mobile apps, and desktop applications. Depending on your use cases, it can provide you with partial translations, full translations, and transcripts of the translations cover all speech-related APIs in Chapter 5, Speak with Your Application.
- 電腦軟硬件維修大全(實例精華版)
- 電腦組裝與維修從入門到精通(第2版)
- Intel FPGA/CPLD設計(高級篇)
- 硬件產品經理手冊:手把手構建智能硬件產品
- Mastering Manga Studio 5
- Learning Stencyl 3.x Game Development Beginner's Guide
- 分布式微服務架構:原理與實戰
- SiFive 經典RISC-V FE310微控制器原理與實踐
- VMware Workstation:No Experience Necessary
- 單片機原理及應用
- 微服務實戰
- 創客電子:Arduino和Raspberry Pi智能制作項目精選
- ARM接口編程
- 微型計算機原理及應用教程(第2版)
- Spring Cloud微服務架構開發