A Golden Wii U Opportunity

Mazty

Banned

7,730

1951 posts since 29/08/09

Recent Badges:

Pata 100 wall post comments made on gamrConnect.
1st Birthday Has been a VGChartz member for over 1 year.
'Ello Princess! Awarded for signing up.
The Nuts & Bolts Add a total of 25 games to your collection.
Everything's Falling Into Place Add a total of 100 games to your collection.
One Piece at a Time Add your first game to your collection.

Mazty on 07 January 2013

timmah said:

Mazty said:

timmah said:

You must have missed what I said about picking out keywords from the sentence. This greatly simplifies the overall process and would allow something like what I'm describing to work. For the example we're using, no matter how you ask to upgrade your suit, the words 'upgrade' and 'suit' should always be in that sentence (hell, even if only the word 'suit' was in the sentence, it could say 'What would you like to do with your suit' and give you options), so the system knows which menu to take you into based on those keywords alone. From there it would be as simple as the computer asking you which upgrade you would like to apply, and either inputting the answer via touchscreen or voice. Because this is in-game and therefore very specific to that game, and because you're interacting with a virtual assistant that only performs specific functions within the game, the number of commands does not have to be even close to what Siri is able to process. It's been a while since I've done any programming, but when I was in college (back in '02) one of my classmates made a proof of concept in VB similar to this and was able to process queries, then present a response from natural sentence structure by picking out pre-programmed keywords without having to parse the entire sentence using either keyboard or off-the-shelf voice recognition software as the input, and without using significant processing power... so I know it could be done. It probably won't, but it is possible.

Though something like this could (unfairly) get compared to Siri, there could still be some cool applications.

EDIT: Though if this were implemented via the cloud like Siri, this discussion would become a moot point.

But you still have to record the entire sentence to determine where the key words are. You can't just "go deaf" when it's not a word that is recognised. I don't understand how you couldn't parse the entire sentence as the entire sentence still needs to be analysed to find the key words.

The only issue with the cloud is latency, although if Nintendo managed the cloud, that would seem more realistic then a dev or publisher.

If you convert the spoken sentence to text, the system does not have to understand the entire sentence. It can analyze the words as text and process only the ones in a keywords database, resulting in a more simple command to execute, such as turning the phrase "Let's go ahead and upgrade my suit" into "upgrade suit". It is much simpler to extract keywords than to process and understand an entire sentence using natural speech and grammar, thus requiring significantly less complex code, making for significantly less processing power required.

I know it doesn't have to understand the sentence, but it still has to record the entire setence and convert it. PC software that does just that takes a dual core a@ 2.2Ghz if we go by retail software.

I understand exactly what you are saying, but I think you aren't realising that it still needs to be able to be write down correctly everything that is written.

timmah

Currently Offline

3,182

720 posts since 04/03/08

Recent Badges:

8 Years Has been a VGChartz member for over 8 years.
10 Years Has been a VGChartz member for over 10 years.
Trust Me, It'll Have Legs 100 replies made to user's most popular thread.
11 Years Has been a VGChartz member for over 11 years.
Site Veteran Has been a VGChartz member for over 5 years.
13 Years Has been a VGChartz member for over 13 years.

timmah on 07 January 2013

Mazty said:

timmah said:

Mazty said:

timmah said:

You must have missed what I said about picking out keywords from the sentence. This greatly simplifies the overall process and would allow something like what I'm describing to work. For the example we're using, no matter how you ask to upgrade your suit, the words 'upgrade' and 'suit' should always be in that sentence (hell, even if only the word 'suit' was in the sentence, it could say 'What would you like to do with your suit' and give you options), so the system knows which menu to take you into based on those keywords alone. From there it would be as simple as the computer asking you which upgrade you would like to apply, and either inputting the answer via touchscreen or voice. Because this is in-game and therefore very specific to that game, and because you're interacting with a virtual assistant that only performs specific functions within the game, the number of commands does not have to be even close to what Siri is able to process. It's been a while since I've done any programming, but when I was in college (back in '02) one of my classmates made a proof of concept in VB similar to this and was able to process queries, then present a response from natural sentence structure by picking out pre-programmed keywords without having to parse the entire sentence using either keyboard or off-the-shelf voice recognition software as the input, and without using significant processing power... so I know it could be done. It probably won't, but it is possible.

Though something like this could (unfairly) get compared to Siri, there could still be some cool applications.

EDIT: Though if this were implemented via the cloud like Siri, this discussion would become a moot point.

But you still have to record the entire sentence to determine where the key words are. You can't just "go deaf" when it's not a word that is recognised. I don't understand how you couldn't parse the entire sentence as the entire sentence still needs to be analysed to find the key words.

The only issue with the cloud is latency, although if Nintendo managed the cloud, that would seem more realistic then a dev or publisher.

If you convert the spoken sentence to text, the system does not have to understand the entire sentence. It can analyze the words as text and process only the ones in a keywords database, resulting in a more simple command to execute, such as turning the phrase "Let's go ahead and upgrade my suit" into "upgrade suit". It is much simpler to extract keywords than to process and understand an entire sentence using natural speech and grammar, thus requiring significantly less complex code, making for significantly less processing power required.

I know it doesn't have to understand the sentence, but it still has to record the entire setence and convert it. PC software that does just that takes a dual core a@ 2.2Ghz if we go by retail software.

I understand exactly what you are saying, but I think you aren't realising that it still needs to be able to be write down correctly everything that is written.

The part of 'writing down what is said' is a very small part of the power equation, and has been possible at some level since the late 80's. Dragon takes a bit more power because it has to take natural speech and know where to insert punctuation, make paragraph breaks, etc. meaning it needs to understand complex language structure and create properly formatted sentences. That being said, it still doesn't take a very large chunk out of overall processor time, AND, the first versions of the software came out in the 90's and worked on those terrible processors. In addition, there were a lot of applications that did speech to text in the 90's with very, very limited processing power. You're flat wrong in thinking translating speech to simple text takes much power at all. It just doesn't.

EDIT: By the way, smartphones actually do the voice recognition on the device (using a low powered ARM processor), then upload the translated text to the search service (such as Siri or Google Voice Actions) for processing of the command. You can prove this on an android phone by enabling airplane mode, then using the speech to text feature in the compose email window. It processes the speech to text without any access to the web, meaning on the local device.

Existing User Log In

New User Registration

Forums - Nintendo - A Golden Wii U Opportunity

Recent Badges:

Recent Badges: