Changes

USpeak (view source)

Revision as of 06:30, 28 March 2009

5 bytes removed , 06:30, 28 March 2009

→‎Long Term Vision:

Line 62: Line 62: −

====Long Term Vision:====

+

=====Long Term Vision:=====

This project aims at introducing speech as an alternative to typing as a system-wide mode of input.

Line 69: Line 69:

# '''''Port an existing speech engine to the less powerful computers like XO.''''' ( This has been a part of the work that I have been doing so far. I chose Julius as the Speech engine as it is lighter, wriiten in C and is more suitable to dictation based activities. I have been able to compile Julius on the XO and am continuing to optimize it to make it work faster.)

−

# '''''Writing a system service that will take speech as an input and generate corresponding keystrokes and then proceed as if the input was given through the keyboard.''''' (This method was suggested by Benjamin M. Schwartz as an simpler approach as compared to writing a speech library in Python which would use DBUS to connect the engine to the activities in which case changes have to be made to the existing activities to use the library.)

−

# '''''Starting with recognition of alphabets of a language rather than full-blown speech recognition.''''' This will give an achievable target for the summer of code. As the alphabet set is limited to a small number for most languages, this target will be feasible considering both computational power requirements and attainable efficiency.

−

# '''''Demonstrating its use by applying it to activities like listen and spell which can benefit immediately from this feature.''''' (see the benefits section below.)

−

# '''''Create acoustic models where the corpus is recorded by children and where the dictionary maps to the vocabulary of children to improve recognition.''''' (I have been working on creating acoustic models for Indian English and Hindi. This part needs active community participation to bring in support for more languages. The Qt application can come in handy for anyone who is interested in contributing.)

−

# '''''Use the model in activities like Speak and implement a dictation activity.'''''

−

# '''''Introduce a command mode.''''' This would be based on the system service mentioned in step 2 but would differ in interpretation of speech. It will handle speech as commands instead of stream of characters.

−

====Proposal for GSoC 09====

Mavu

52

edits

Changes

USpeak (view source)

Revision as of 06:30, 28 March 2009

Navigation menu

Search