机器人:语音识别哪些可利用的技术可利用、机器人、语音识别、技术

2023-09-06 16:37:01 作者:骑着蜗牛环游世界

我是新来的android的语音识别的区域。

I am new to the area of "voice recognition" in android.

我在我的应用程序的要求有语音识别。因此,我做我的功课。我找到1. Android SDK有支持这一点,它使用了谷歌语音识别所以从我了解的天气是我们通过一个意图调用识别器或者我们使用类SpeechRecogniser,实际的识别在谷歌的云服务器上完成。我尝试使用这两种方法,并在这两个案例的匹配率是很低的示例应用程序\\(首先是我找吧?我没有得到正确的比赛的大部分单词/句子我试过的)。

I have a requirement in my app to have "speech recognition". So i am doing my homework. I found that 1. android SDK has support for this and it used the "google voice recognition" So from what i understand weather we invoke the recogniser by an intent or we use the class SpeechRecogniser , the actual recognition is done at the google cloud server. I tried sample apps using both methods and the matching rate in both case is very low\ ( First of all is my finding right ? i didn't get right match for most of the words/ sentence i tried ).

会不会有这两种方法在输出任何区别,即通过发射意图/或使用SpeechRecogniser类)

Will there be any difference in output for these two methods ie launching by intent / or using the SpeechRecogniser class )

取决于这个谷歌的技术,其中的声音发出声音字节,而在云端服务器识别的所有应用程序。我看见Shazam的采用了不同的技术,但他们有自己的数据库。是否有使用的任何其他技术

Is all the apps depending on this google technology where voice is send as sound bytes and recognized at cloud server . I saw Shazam uses a different technology but they have their own database. Are there any such other technologies used

我看到了许多为Android的Siri。这些应用程序是如何工作的任何笔记吗?

I saw many "siri for android" . Any notes on how these applications actually work ?

非常感谢您的时间和帮助。

Thanks a lot for your time and help.

推荐答案

1)使用时,你会得到相同的结果无论是 RecognizerIntent SpeechRecognizer 。主要的区别是在用户交互。在 RecognizerIntent 强制用户去通过一个标准的语音识别过程。随着 SpeechRecognizer 你能控制应用程序如何收集演讲,当它处理它。 RecognizerIntent 的优点在于,它是容易编程和为用户所熟悉。随着 SpeechRecognizer 可以实现先进的东西,喜欢听在后台讲话。您还可以得到更好的错误报告。

1) you will get identical results when using either the RecognizerIntent or SpeechRecognizer. The main difference is in the User interaction. The RecognizerIntent forces the user to go through a standard speech recognition procedure. With the SpeechRecognizer you get to control how the app collects speech and when it processes it. The advantage of RecognizerIntent is that it is easy to program and familiar to users. With SpeechRecognizer you can implement advanced things like listening for speech in the background. You also get better error reporting.

此外,有些话是很容易的识别理解像苹果但也有一些坚硬如孜然由于各种原因。你将不得不聪明​​匹配是谷歌返回到实现一些可靠的。

Also, some words are easy for the recognizer to understand like "apple" but some are hard like "cumin" for various reasons. You will have to be clever with matching what google returns to implement something reliable.

2)我不知道你自己的数据库是什么意思。您的应用程序都会有各种各样的资料库您正在尝试来匹配用户说的话

2) I'm not sure what you mean by their own database. Your app will have a "database" of sorts which you are trying to match against what the user says

3)可能自然语言处理,用户建模,技术的组合,以模仿人的对话。或者,他们只是一大束手codeD规则,以使他们看起来聪明。我的猜测是大量的工作,力图使一些可信的。

3) Probably a mix of natural language processing, user modeling, techniques to emulate a human dialogue. Or they are just a big bunch of hand coded rules to make them look smart. My guess it is a lot of work to try to make something believable.

检查出我的一些样本code在这里: https://github.com/gmilette/Say-the-Magic-Word-

Check out some of my sample code here: https://github.com/gmilette/Say-the-Magic-Word-

 
精彩推荐
图片推荐