<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.sugarlabs.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Chiragjain1989</id>
	<title>Sugar Labs - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.sugarlabs.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Chiragjain1989"/>
	<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/go/Special:Contributions/Chiragjain1989"/>
	<updated>2026-05-13T05:55:02Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.0</generator>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Summer_of_Code/2010/speech-recognition&amp;diff=51108</id>
		<title>Talk:Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Summer_of_Code/2010/speech-recognition&amp;diff=51108"/>
		<updated>2010-04-11T14:40:30Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;It would be great to have speech reco integrated into Sugar and this proposal covers much of the back-end. However, there is a gap between the ability to recognize commands and having Sugar execute upon those commands. I am particularly worried about the Sugar views, which don&#039;t yet have support for keyboard short cuts in part because they are using a non-standard graphics framework. I hope that the project doesn&#039;t get rendered obsolete because of this current deficiency in Sugar itself. --[[User:Walter|Walter]] 12:05, 9 April 2010 (UTC)&lt;br /&gt;
&lt;br /&gt;
In reference to the keyboard shortcuts for sugar views, I am a little confused. As far as I know, there are keyboard shortcuts F1-4, to switch between home and other views. &lt;br /&gt;
But yes there are no keyboard shortcuts for launching an activity. There is a terminal command though &amp;quot;sugar-launch &amp;lt;activity_name&amp;gt;&amp;quot; but that too is inefficient because you have to type in the full bundle id. http://bugs.sugarlabs.org/ticket/897&lt;br /&gt;
So if I want to launch my ListenAndSpell activity I have to type at the terminal: &amp;quot;sugar-launch in.seeta.ListenAndSpell&amp;quot;&lt;br /&gt;
And if I want the command &amp;quot;sugar-launch ListenAndSpell&amp;quot; to work, I can simply search for the string &amp;quot;ListenAndSpell&amp;quot; in the bundle ids of all the installed activities. -- Chirag Jain&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=51033</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=51033"/>
		<updated>2010-04-09T05:13:40Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, including features like automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       One more thing I would like to mention is that my first international paper submission was recently accepted at the PyCon Asia Pacific Conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       This paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see with the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities both for activity developers and for end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who is not able to operate the keyboard and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be serving two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and thus not able to interact with the system, could open activities (like the Write activity) just by saying “Open write activity”. He/she could then interact with the activity, with speech recognition running in the background, by saying simple commands. For example, he/she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation by incorporating oral testing. Oral testing is a method of giving users feedback on their pronunciation by recognizing their speech. Thus a child speaking the word “Apple” correctly should be recognized, and otherwise not. This is only one example, and we can create numerous activities around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a more useful educational tool.&lt;br /&gt;
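&lt;br /&gt;
:        As a rough illustration of the oral-testing idea, the check could be as simple as comparing the recognizer&#039;s output with the expected word. This is only a sketch under assumptions: the function names are hypothetical, the recognizer itself is not shown, and a real activity would probably also need a confidence score from the engine.&lt;br /&gt;

```python
# Hypothetical sketch of an oral-testing check. The recognizer is not shown;
# recognized_text stands for whatever text the speech engine returns.


def is_pronounced_correctly(expected_word, recognized_text):
    """Return True when the recognizer heard exactly the expected word."""
    expected = expected_word.strip().lower()
    heard = recognized_text.strip().lower().split()
    return heard == [expected]


def score_session(attempts):
    """Count correct attempts in a list of (expected, recognized) pairs."""
    return sum(1 for expected, heard in attempts
               if is_pronounced_correctly(expected, heard))
```

:        With this sketch, a child asked to say “Apple” passes when the engine returns &amp;quot;apple&amp;quot;, and does not pass when it returns something like &amp;quot;a pull&amp;quot;.&lt;br /&gt;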
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. The developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a Speech recognition engine that can be integrated into sugar over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it accordingly as per our needs and requirements.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech. Thus we can set them up to recognize many languages. At present they have been tested successfully on Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world speaking different languages. I collected this information through discussion with a Sphinx developer on IRC, and I am testing Sphinx 3 and Pocket Sphinx too.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. On hovering over the Speech recognition button, a sugar palette will be exposed which will display the speech recognition parameters that can be modified by the user. Sugar Controls like Sliders, Palette Buttons, and Combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the key shortcut must be made in the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome Desktop that allows the user to control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome Desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
:     A block view of the above implementation plan is as shown below:&lt;br /&gt;
&lt;br /&gt;
[[Image:Svc.jpg|center|Block view of Sugar Voice Control]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I have already mentioned, a lot of features can be implemented around Speech Recognition. I have sub-divided my proposal into  following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice control, we can then look into providing speech recognized text to unmodified sugar activities. Thus activities like Write can be made to get the required inputs either from Keyboard or through microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A key board shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before actual GSoC work starts): I will be studying more about Gnome Voice Control and Pocket Sphinx. By this time I will be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
:   &#039;&#039;&#039;Weekdays&lt;br /&gt;
:   During this time, I also have studies. I have classes from 9:30 AM to 4:30 PM. Thus from now until the end of May I will be working around 2 hours per day between 8 PM and 11 PM (IST). &lt;br /&gt;
:   &#039;&#039;&#039;Weekends&lt;br /&gt;
:   I have weekends off, so I can spare around 4 hours per day on weekends. During weekends I can communicate with my mentor any time suitable for him/her. &lt;br /&gt;
    &lt;br /&gt;
:    From the end of May I will have my summer break, which continues until the end of August. I will then be free of any other distraction and can devote all my energy to development. During this period I can spare around 4-5 hours per day. Again, I can communicate with the mentor at any time, as I am in the habit of working late at night too.    &lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementation of a command line interface, which will run speech recognition in the background and accept simple speech commands such as open an activity, go to home or desktop, close activity, etc.&lt;br /&gt;
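&lt;br /&gt;
:    As a rough sketch of how recognized text might be turned into such control commands, the phrases could be looked up in a small table. Everything here is an assumption made for illustration: the command table, the action names and the parse_command function are hypothetical and are not taken from Gnome Voice Control or Sugar.&lt;br /&gt;

```python
# Hypothetical sketch: map recognized phrases to Sugar actions.
# The command table and action names are illustrative assumptions,
# not part of Gnome Voice Control or Sugar.

COMMANDS = {
    "go to home": ("show_view", "home"),
    "close activity": ("close_activity", None),
}

OPEN_PREFIX = "open "
ACTIVITY_SUFFIX = " activity"


def parse_command(text):
    """Turn recognized text into an (action, argument) pair, or None."""
    # Normalize case and collapse repeated whitespace.
    phrase = " ".join(text.lower().split())
    if phrase in COMMANDS:
        return COMMANDS[phrase]
    if phrase.startswith(OPEN_PREFIX):
        name = phrase[len(OPEN_PREFIX):]
        # Accept both "open write" and "open write activity".
        if name.endswith(ACTIVITY_SUFFIX):
            name = name[:-len(ACTIVITY_SUFFIX)]
        if name:
            return ("launch_activity", name)
    return None
```

:    Here &amp;quot;Open Write Activity&amp;quot; would map to a launch request for &amp;quot;write&amp;quot;, and unrecognized phrases are ignored by returning None, so that stray speech does not trigger an action.&lt;br /&gt;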
&lt;br /&gt;
:    June 14 - June 25: Test the implemented framework of Sugar Voice Control on limited resource devices like the XO-1.0. Take the community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as mentioned above. &lt;br /&gt;
&lt;br /&gt;
:    June 26- July 11: Implementation of Sugar Voice Control button in the GUI. This button will be implemented in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. Implementation of Sugar Voice Control Panel as mentioned in the GUI considerations part. &lt;br /&gt;
&lt;br /&gt;
:    Thus before the midterm evaluations we should be done with part a) and part b) as mentioned above.&lt;br /&gt;
&lt;br /&gt;
:    July 12-July 16: Submitting mid term evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating different Language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support different types of Languages.       &lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically I would like to create a language model for recognizing Hindi control commands. Then I would like to test the implementation in a Primary school situated in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9- Aug 16: Documenting the entire work, and especially how to create language models. I have gone through some tutorials on how to create them, but most of them are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE http://millee.org project under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. This project aimed at detecting human speech in WAV-format audio files recorded on mobile phones. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will be getting almost 2.5 months break during my summer vacations right from the end of May to August. Therefore I can concentrate entirely on this project with all my energies.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     &#039;&#039;&#039;My answer:&lt;br /&gt;
:      If Sugar Voice Control is successfully implemented, it will greatly increase the usability of Sugar, because Sugar could then be controlled by physically challenged children too and would thus reach a larger section of users.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      As I already mentioned, I would like to implement Hindi language models too, which will help me test the framework in my locality. We have some primary schools where students know Hindi very well although they have poor English speaking skills. So testing with the Hindi language and seeing how this affects the children would be a great idea, and I am more than happy to set up the Sugar pilot.  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have always found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can mail my progress reports to sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school there were some teachers who believed in education through entertainment, so they always performed entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could make out which number was greater or lesser, I got confused in selecting the right sign. So one day I approached my teacher. She removed my confusion with a nice method. She told me to put two dots in front of the greater number, like :, and one dot in front of the lesser number, like . For example, if I have to place the sign between 2___ 5, then I put one dot in front of 2 and two dots in front of 5, like this: 2. : 5 &lt;br /&gt;
Now on joining these dots we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     Nopes :-)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50928</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50928"/>
		<updated>2010-04-07T14:42:20Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging  methdologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Please see the details at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       One more thing that I would like to mention is that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       This paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem that I see with the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our reach to this section of children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the remaining part of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who is not able to operate the keypad and touchpad: he/she can now open activities by just speaking &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others by simply speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control the whole of Windows by just speaking commands.&lt;br /&gt;
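&lt;br /&gt;
:       As a rough illustration of how a recognized phrase could be turned into an action, here is a minimal Python sketch. It is only an assumption of how the mapping might look, not the final design: the function name command_to_argv, the ACTIVITY_BUNDLES table and the bundle ids in it are hypothetical, and it assumes that the sugar-launch command is available to start an activity by its bundle id.&lt;br /&gt;

```python
import re

# Hypothetical table of spoken activity names and bundle ids. The ids are
# assumptions for illustration and must be checked against installed activities.
ACTIVITY_BUNDLES = {
    "write": "org.laptop.AbiWordActivity",
    "turtle art": "org.laptop.TurtleArtActivity",
}


def command_to_argv(recognized_text):
    """Turn recognized text such as 'Open Write Activity' into an argv list.

    Returns None when the text is not a known 'open ...' command.
    """
    # Lower-case the text and drop everything except letters and spaces.
    words = re.sub(r"[^a-z ]", "", recognized_text.lower()).split()
    if not words or words[0] != "open":
        return None
    # The trailing word 'activity' is optional in the spoken command.
    if words[-1] == "activity":
        words = words[:-1]
    bundle_id = ACTIVITY_BUNDLES.get(" ".join(words[1:]))
    if bundle_id is None:
        return None
    # sugar-launch is assumed to be on the PATH inside a Sugar session.
    return ["sugar-launch", bundle_id]
```

:       The returned list could be passed to subprocess by the caller. Phrases that do not match a known command return None, so they can simply be ignored.&lt;br /&gt;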
&lt;br /&gt;
:       Correct pronunciation is among the first lessons in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks a word correctly it should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be serving two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus not able to interact with the system: he/she can now open activities (like the Write activity) by just saying “Open write activity”. He/she can then interact with the activity, with speech recognition running in the background, by saying simple commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Thus Sugar will become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation through oral testing. Oral testing is a method of giving users feedback on their pronunciation by recognizing their speech. Thus a child speaking the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a more useful educational tool.&lt;br /&gt;
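&lt;br /&gt;
:        The oral testing idea above can be sketched in a few lines of Python. This is a simplified assumption of how an activity might give feedback: it compares the text returned by the recognizer with the expected word, so the similarity value is only a text-level proxy and not an acoustic pronunciation score. The function name score_pronunciation is hypothetical.&lt;br /&gt;

```python
import difflib


def score_pronunciation(expected_word, recognized_text):
    """Return (is_correct, similarity) for one spoken attempt.

    is_correct is True only when the recognizer returned exactly the
    expected word (ignoring case and surrounding spaces). similarity is a
    value between 0.0 and 1.0 that an activity could use for softer feedback.
    """
    expected = expected_word.strip().lower()
    heard = recognized_text.strip().lower()
    similarity = difflib.SequenceMatcher(None, expected, heard).ratio()
    return expected == heard, similarity
```

:        An activity could show a success message when is_correct is True and ask the child to try again otherwise, perhaps with a hint when the similarity is high.&lt;br /&gt;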
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. The developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take its typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it as per our needs and requirements.&lt;br /&gt;
:       3.      It should not consume a lot of memory at run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU, and it is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time memory requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but I expect them to be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully for Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world speaking different languages. I collected this information through discussion with a Sphinx developer on IRC, and I am testing Sphinx 3 and Pocket Sphinx as well.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again will stop the recognition process. On hovering over the button, a Sugar palette will appear, displaying the speech recognition parameters that the user can modify. Sugar controls like sliders, palette buttons and combo boxes will be used within the palette.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
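&lt;br /&gt;
:      The start/stop behaviour described above can be sketched independently of the GUI toolkit. The class below is a hypothetical illustration, not existing Sugar code: the frame button handler and the keyboard shortcut handler would both call toggle(), and start_fn and stop_fn stand in for whatever starts and stops the recognizer.&lt;br /&gt;

```python
class RecognitionToggle:
    """Tracks whether background speech recognition is running.

    start_fn and stop_fn are hypothetical callbacks supplied by the caller,
    for example functions that start and stop a recognizer process.
    """

    def __init__(self, start_fn, stop_fn):
        self._start_fn = start_fn
        self._stop_fn = stop_fn
        self.active = False

    def toggle(self):
        """Flip the state and return the new one.

        Intended to be called from both the frame button and the shortcut.
        """
        if self.active:
            self._stop_fn()
        else:
            self._start_fn()
        self.active = not self.active
        return self.active
```

:      Keeping the state in one place means the button and the shortcut cannot get out of step with each other.&lt;br /&gt;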
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that allows the user to control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
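&lt;br /&gt;
:     To make the idea of detecting a significant audio signal concrete, here is a minimal Python sketch based on a simple energy threshold. This is a generic technique chosen for illustration; I am not claiming that Gnome Voice Control or Pocket Sphinx detects speech this way. The function names, the frame size and the threshold value are assumptions and would need tuning on real microphone input.&lt;br /&gt;

```python
import math


def frame_rms(samples):
    """Root-mean-square amplitude of one frame of PCM samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def find_active_frames(samples, frame_size=160, threshold=500.0):
    """Return the indices of frames whose RMS energy exceeds the threshold.

    frame_size and threshold are illustrative values, not tuned ones.
    """
    active = []
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        if frame_rms(frame) > threshold:
            active.append(start // frame_size)
    return active
```

:     Only the frames reported as active would then need to be handed to the recognizer, which keeps the background process cheap while nobody is speaking.&lt;br /&gt;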
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
:     A block view of the above implementation plan is shown below:&lt;br /&gt;
&lt;br /&gt;
[[Image:Svc.jpg|center|Block view of Sugar Voice Control]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, many features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that starts speech recognition in the background and begins taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can then look into providing speech-recognized text to unmodified Sugar activities. Activities like Write can then take their input either from the keyboard or from the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a speech recognition button in the Sugar frame (for example at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition.&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before actual GSoC work starts): I will be studying more about Gnome Voice Control and Pocket Sphinx. By this date I will be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
:   &#039;&#039;&#039;Weekdays&lt;br /&gt;
:   During this time I am also attending classes, from 9:30 AM to 4:30 PM. Thus from now until the end of May I will be working around 2 hours per day, between 8 PM and 11 PM (IST). &lt;br /&gt;
:   &#039;&#039;&#039;Weekends&lt;br /&gt;
:   I have weekends off, so I can spare around 4 hours per day on weekends. During weekends I can communicate with my mentor at any time that suits him/her. &lt;br /&gt;
    &lt;br /&gt;
:    From the end of May I will have my summer break, which continues until the end of August. I will then be free of other distractions and can devote all my energy to development. During this period I can spare around 4-5 hours per day. Again, I can communicate with the mentor at any time, as I am used to working late at night too.    &lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementation of a command line interface that runs speech recognition in the background and takes simple speech commands such as open an activity, go to home or desktop, close activity, etc.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the implemented Sugar Voice Control framework on limited-resource devices like the XO-1.0. Gather community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as described above. &lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementation of the Sugar Voice Control button in the GUI. This button will be placed in the Sugar frame (for example at the top right); when clicked, it will start recognizing speech in the background. Clicking the same button again will stop the recognition process. Implementation of the Sugar Voice Control panel as described in the GUI considerations part. &lt;br /&gt;
&lt;br /&gt;
:    Thus before the midterm evaluations we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12 - July 16: Submitting midterm evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating different language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.       &lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands. I would then like to test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, especially how to create language models. I have gone through some tutorials on how to create them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones. This project aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed it between Dec 2009 and Feb 2010, and during this time I gained decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. Therefore I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     &#039;&#039;&#039;My answer:&lt;br /&gt;
:      If Sugar Voice Control is successfully implemented, it will greatly increase the usability of Sugar. Sugar could then be controlled by physically challenged children too, and thus would reach a larger section of users.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      As I already mentioned, I would like to implement Hindi language models too, which will help me test the framework in my locality. We have some primary schools where students know Hindi very well but have poor English speaking skills. So testing with Hindi and seeing how this affects the children will be a great idea, and I am more than happy to set up the Sugar pilot.  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, there were some teachers who believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could make out which number was greater or lesser, I got confused in selecting the right sign. So one day I approached my teacher. She removed my confusion with a nice method. She told me to put two dots next to the number which is greater, like :, and one dot next to the number which is lesser, like . (a single dot). For example, if I have to place a sign between 2___ 5, then I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Now on joining these dots we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     Nopes :-)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50927</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50927"/>
		<updated>2010-04-07T14:01:48Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Please see the details at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       One more thing that I would like to mention is that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       This paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem that I see with the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our reach to this section of children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the remaining part of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who is not able to operate the keypad and touchpad: he/she can now open activities by just speaking &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others by simply speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control the whole of Windows by just speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is among the first lessons in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks a word correctly it should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be serving two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and thus unable to interact with the system, can open activities (such as the Write activity) just by saying “Open write activity”. He/she can then interact with the activity through simple spoken commands while speech recognition runs in the background. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who speaks the word “Apple” correctly is recognized, and otherwise is not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still well known. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java, so we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been successfully tested with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users from different parts of the world who speak different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the speech recognition button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that allows the user to control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal, and then executes the desired action on the Gnome desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
:     A block view of the above implementation plan is shown below:&lt;br /&gt;
&lt;br /&gt;
[[Image:Svc.jpg|center|Block view of Sugar Voice Control]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, many features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
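:    As an illustration of item 4, here is a minimal Python sketch of how recognized text might be turned into a launch command. It assumes the recognizer hands back plain text, that activities are started with sugar-launch and a bundle id, and that the bundle ids listed are the right ones; the ACTIVITY_BUNDLES table and the command_for helper are hypothetical names and not part of Sugar or Gnome Voice Control.&lt;br /&gt;

```python
# Hypothetical table from spoken activity names to bundle ids.
# The ids below are assumptions and should be checked against the
# activities actually installed on the target system.
ACTIVITY_BUNDLES = {
    "write": "org.laptop.AbiWordActivity",
    "turtle art": "org.laptop.TurtleArtActivity",
}


def command_for(utterance):
    """Return the argv list for a recognized utterance, or None if unknown."""
    words = utterance.lower().split()
    # Only "open ..." commands are handled in this sketch.
    if not words or words[0] != "open":
        return None
    # Drop the leading "open" and an optional trailing "activity".
    name_words = words[1:]
    if name_words and name_words[-1] == "activity":
        name_words = name_words[:-1]
    bundle_id = ACTIVITY_BUNDLES.get(" ".join(name_words))
    if bundle_id is None:
        return None
    return ["sugar-launch", bundle_id]
```

:    The returned list could be handed to a process launcher; commands the sketch does not know return None, so the caller can ignore them.&lt;br /&gt;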
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can look into providing speech-recognized text to unmodified Sugar activities, so that activities like Write can take their input either from the keyboard or from the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut such as Alt+S for starting speech recognition.&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part is to create an API that gives activity developers easy access to speech recognition. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before the actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx in more depth. By this date I will be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementing a command line interface that runs speech recognition in the background and accepts simple speech commands such as open an activity, go to home or desktop, and close an activity.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the implemented Sugar Voice Control framework on limited-resource devices like the XO-1.0. Gather community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus, by the end of June, we should have completed the implementation of part a) as described above. &lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementation of the Sugar Voice Control button in the GUI. This button will be placed in the Sugar frame (for example, at the top right) and, when clicked, will start recognizing speech in the background. Clicking the same button again will stop the recognition process. Implementation of the Sugar Voice Control panel as described in the GUI considerations part. &lt;br /&gt;
&lt;br /&gt;
:    Thus, before the midterm evaluations, we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12 - July 16: Submitting midterm evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating language models and data sets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.       &lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands and then test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, especially how to create language models. I have gone through some tutorials on creating them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam of Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in the process gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     &#039;&#039;&#039;My answer:&lt;br /&gt;
:      If Sugar Voice Control is successfully implemented, it will greatly increase the usability of Sugar: Sugar could then be controlled by physically challenged children too, and would thus reach a larger section of users.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      As I already mentioned, I would like to implement Hindi language models too, which will help me test the framework in my locality. We have some primary schools where students know Hindi very well but have poor English speaking skills. So testing with Hindi and seeing how this affects the children would be a great idea, and I am more than happy to set up the Sugar pilot.  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screenshot is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots then gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     Nopes :-)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50916</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50916"/>
		<updated>2010-04-06T17:59:32Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I will be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity at the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description: what am I making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach this section of children. But when the technology is available, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open up a whole new set of opportunities for both activity developers and end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who cannot operate the keyboard or touchpad can open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control the whole of Windows by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they will be recognized, and otherwise they will not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be meeting the needs of two audiences: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and thus unable to interact with the system, can open activities (such as the Write activity) just by saying “Open write activity”. He/she can then interact with the activity through simple spoken commands while speech recognition runs in the background. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who speaks the word “Apple” correctly is recognized, and otherwise is not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still well known. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java, so we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been successfully tested with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users from different parts of the world who speak different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the speech recognition button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that allows the user to control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When it detects a significant audio signal, the software captures, processes, and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
:     A block view of the above implementation plan is as shown below:&lt;br /&gt;
&lt;br /&gt;
[[Image:Svc.jpg|center|Block view of Sugar Voice Control]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, many features can be built around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After Sugar Voice Control is implemented successfully, we can look into providing speech-recognized text to unmodified Sugar activities, so that activities like Write can take their input from either the keyboard or the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx in more depth. By this date I will be sure and confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementing a command line interface that runs speech recognition in the background and accepts simple speech commands such as opening an activity, going to home or desktop, and closing an activity.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the Sugar Voice Control framework on resource-limited devices like the XO-1.0. Gather community feedback on the implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussion. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as described above. &lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementing the Sugar Voice Control button in the GUI. The button will sit in the Sugar frame (for example, at the top right) and will start recognizing speech in the background when clicked. Clicking the same button again will stop recognition. Implementing the Sugar Voice Control panel described under GUI considerations. &lt;br /&gt;
&lt;br /&gt;
:    Thus before the midterm evaluations we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12 - July 16: Submitting midterm evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating language models and data sets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.&lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands and then test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, and especially how to create language models. I have gone through some tutorials on creating them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a lot about the Sugar platform and how its development works.&lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam of Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in that time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they often used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. I could tell which number was greater or lesser, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place the sign in 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=File:Svc.jpg&amp;diff=50915</id>
		<title>File:Svc.jpg</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=File:Svc.jpg&amp;diff=50915"/>
		<updated>2010-04-06T17:56:04Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50898</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50898"/>
		<updated>2010-04-06T15:27:56Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that let physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes that I will show in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who cannot operate the keypad and touchpad opening activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into Write and other activities simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control the whole of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks a word correctly it should be recognized properly, and otherwise it should not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be serving two audiences: non-technical end users and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and therefore unable to interact with the system: he/she can now open activities (like the Write activity) just by saying “Open write activity”, and can then interact with the activity through simple spoken commands while speech recognition runs in the background. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation by incorporating oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we need a speech recognition engine that can be integrated into Sugar and on which we can build the entire framework. The major requirements for such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it accordingly as per our needs and requirements.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in several versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and is written entirely in Java. Sphinx 3 and Pocket Sphinx are older but still widely used. Integrating Sphinx 4 into Sugar does not seem feasible because it is written in Java, which leaves two options: Sphinx 3 or Pocket Sphinx. The choice between them can only be made by experimenting with both on Sugar. It also depends on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but I expect them to be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is designed to run on desktops and consumes a considerable amount of memory. So Pocket Sphinx, at least, can be used in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines need training data sets and language models to recognize speech, so they can be set up to recognize many languages. At present they have been tested successfully on Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can therefore target a wide range of users who speak different languages in different parts of the world. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) that starts recognizing speech in the background when clicked. Clicking the same button again stops recognition. Hovering over the button will open a Sugar palette showing the speech recognition parameters that the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that lets the user control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When it detects a significant audio signal, the software captures, processes, and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, many features can be built around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After Sugar Voice Control is implemented successfully, we can look into providing speech-recognized text to unmodified Sugar activities, so that activities like Write can take their input from either the keyboard or the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx in more depth. By this date I will be sure and confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementing a command line interface that runs speech recognition in the background and accepts simple speech commands such as opening an activity, going to home or desktop, and closing an activity.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the Sugar Voice Control framework on resource-limited devices like the XO-1.0. Gather community feedback on the implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussion. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as described above. &lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementing the Sugar Voice Control button in the GUI. The button will sit in the Sugar frame (for example, at the top right) and will start recognizing speech in the background when clicked. Clicking the same button again will stop recognition. Implementing the Sugar Voice Control panel described under GUI considerations. &lt;br /&gt;
&lt;br /&gt;
:    Thus before the midterm evaluations we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12 - July 16: Submitting midterm evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating language models and data sets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.&lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands and then test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, and especially how to create language models. I have gone through some tutorials on creating them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a great deal about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in the process gained a decent understanding of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they would teach us through entertaining activities. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could tell which number was greater, I got confused about which sign to use. So one day I approached my teacher, who cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place the sign between 2___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50861</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50861"/>
		<updated>2010-04-05T10:44:20Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One problem I see in the current version of Sugar is the lack of features that would let physically challenged users interact with the system easily. This keeps us from reaching these children. But when the technology exists, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How Speech Recognition can help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who is not able to operate the keyboard and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks a word correctly it should be recognized, and otherwise it should not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be serving two audiences: non-technical end users and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and so cannot interact with the system, could open an activity (such as the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she could then interact with the activity using simple spoken commands. For example, he/she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who speaks the word “Apple” correctly should be recognized, and otherwise not. This is only one example; many activities can be built around speech recognition. It makes more interactive activities for children possible, which can help make Sugar a more useful educational tool.&lt;br /&gt;
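&lt;br /&gt;
:        The oral testing idea can be sketched in a few lines of Python. This is only an illustration, assuming the recognizer returns its best guess as a plain text string; the function name check_pronunciation is hypothetical and not part of Sphinx or Sugar.&lt;br /&gt;

```python
def check_pronunciation(target_word, recognized_text):
    # Normalize case and surrounding whitespace before comparing.
    target = target_word.strip().lower()
    heard = recognized_text.strip().lower()
    # The attempt counts as correct only when the recognizer
    # heard exactly the target word.
    return heard == target
```

:        An activity could call this with the word shown on screen and the text returned by the recognizer, and give the child feedback based on the result.&lt;br /&gt;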
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: it could be modified to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a Speech recognition engine that can be integrated into sugar over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in several versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java, which leaves two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can thus target a wide range of users from different parts of the world speaking different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right); clicking it starts speech recognition in the background, and clicking it again stops recognition. Hovering over the button will expose a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls such as sliders, palette buttons and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that lets the user control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;. &lt;br /&gt;
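&lt;br /&gt;
:     As a rough illustration of the last step (turning recognized text into a desktop action), here is a minimal Python sketch. It assumes the recognizer has already produced a plain text string; the function name, the alias table and the action names are hypothetical and are not part of Gnome Voice Control, Pocket Sphinx or Sugar.&lt;br /&gt;

```python
# Hypothetical alias table; the phrases come from the examples in this proposal.
ACTIVITY_ALIASES = {
    'write': 'Write',
    'turtle art': 'TurtleArt',
}


def parse_command(utterance):
    # Map recognized text to an (action, argument) pair, or None if unknown.
    words = utterance.lower().split()
    if not words:
        return None
    if words[0] == 'open':
        name = ' '.join(words[1:])
        # Accept both 'open write' and 'open write activity'.
        if name.endswith(' activity'):
            name = name[:-len(' activity')]
        if name in ACTIVITY_ALIASES:
            return ('launch', ACTIVITY_ALIASES[name])
        return None
    if words == ['go', 'to', 'home']:
        return ('show_view', 'home')
    if words == ['close', 'activity']:
        return ('close', None)
    return None
```

:     Unknown phrases return None, so a misrecognized utterance is ignored rather than triggering an action. How the returned action is passed on to Sugar is left open here.&lt;br /&gt;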
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As already mentioned, many features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can look into providing speech-recognized text to unmodified Sugar activities, so that activities like Write can take their input either from the keyboard or through the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to at least achieve part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before the actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx further. By this date I should be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarize Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implement a command line interface that runs speech recognition in the background and accepts simple speech commands such as open an activity, go to home or desktop, and close an activity.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the implemented Sugar Voice Control framework on limited-resource devices like the XO-1.0. Gather community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus, by the end of June, the implementation of part a) described above should be complete. &lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implement the Sugar Voice Control button in the GUI. The button will sit in the Sugar frame (for example, at the top right); clicking it starts speech recognition in the background, and clicking it again stops recognition. Also implement the Sugar Voice Control panel described in the GUI considerations part. &lt;br /&gt;
&lt;br /&gt;
:    Thus, before the midterm evaluations, parts a) and b) described above should be done.&lt;br /&gt;
&lt;br /&gt;
:    July 12 - July 16: Submit midterm evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Create language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support multiple languages. &lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Test the language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands and then test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Document the entire work, especially how to create language models. I have gone through some tutorials on creating them, but most are very complicated. I would like to write simple documentation so that anyone can create basic language models for their own languages. This will make Sugar Voice Control extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a great deal about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in the process gained a decent understanding of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they would teach us through entertaining activities. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could tell which number was greater, I got confused about which sign to use. So one day I approached my teacher, who cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place the sign between 2___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50860</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50860"/>
		<updated>2010-04-05T10:43:46Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methods. Learn more about the activity at the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       I have recently taken on some packaging work for Sugar artwork, the Sugar presence service, and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system, and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper describes how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This keeps us from reaching these children. But when the technology exists, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How Speech Recognition can help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who is unable to operate the keyboard and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control Windows entirely by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will meet the needs of two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and thus unable to interact with the system, could open activities (like the Write activity) just by saying “Open write activity”. He or she could then interact with the activity through simple spoken commands, with speech recognition running in the background. For example, he or she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition to help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech. Thus a child who speaks the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be built around speech recognition. This will make it possible to develop more interactive activities for children, helping make Sugar a useful educational tool.&lt;br /&gt;
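&lt;br /&gt;
:        The oral testing idea above can be sketched as a simple comparison between the target word and whatever text the recognizer returns. This is only a minimal illustration: the function names are hypothetical, and the recognized text is passed in as a plain string rather than obtained from a real speech engine.&lt;br /&gt;

```python
def normalize(text):
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def is_pronounced_correctly(expected_word, recognized_text):
    """Return True when the recognizer output matches the target word.

    recognized_text stands in for the string a speech engine would return;
    no real engine is called in this sketch.
    """
    return normalize(expected_word) == normalize(recognized_text)
```

:        An activity could call is_pronounced_correctly("Apple", recognized_text) after each attempt and give the child feedback based on the result.&lt;br /&gt;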
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we could modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
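&lt;br /&gt;
:       To give a feel for what such an interface might look like, here is a rough sketch of a callback-based wrapper. The class name SpeechInput and its methods are hypothetical and are not part of Sugar or Sphinx; recognition results are simulated by calling emit by hand.&lt;br /&gt;

```python
class SpeechInput:
    """Hypothetical wrapper an activity could use to receive recognized text."""

    def __init__(self):
        self._listeners = []

    def connect(self, callback):
        """Register a function to call with each recognized phrase."""
        self._listeners.append(callback)

    def emit(self, text):
        """Deliver a recognized phrase to all registered listeners.

        In a real integration the speech engine would trigger this; here it
        is called manually to simulate recognition results.
        """
        for callback in self._listeners:
            callback(text)


# Simulated use by a Write-like activity: recognized words are appended
# to the document buffer in the order they arrive.
document = []
speech = SpeechInput()
speech.connect(document.append)
speech.emit("hello")
speech.emit("world")
```

:       With a design like this, an existing activity would only need to register one callback to accept spoken input alongside keyboard input.&lt;br /&gt;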
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a Speech recognition engine that can be integrated into sugar over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and is written entirely in Java. Sphinx 3 and Pocket Sphinx are written in C and are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java, so we are left with two options: Sphinx 3 or Pocket Sphinx. The choice between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but I expect them to be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, was developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be used in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models to recognize speech, so they can be set up to recognize many languages. At present they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can therefore target a wide range of users in different parts of the world speaking different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. On hovering over the Speech recognition button, a sugar palette will be exposed which will display the speech recognition parameters that can be modified by the user. Sugar Controls like Sliders, Palette Buttons, and Combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the keyboard shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that lets the user control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
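&lt;br /&gt;
:     One common way to decide whether an audio frame is "significant" is to compare its root-mean-square energy against a threshold. The sketch below illustrates that idea only; it is an assumption for illustration and not necessarily how Gnome Voice Control or Pocket Sphinx detect speech. The function names and the threshold value are hypothetical.&lt;br /&gt;

```python
import math


def rms_energy(samples):
    """Root-mean-square amplitude of one frame of audio samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_significant(samples, threshold=500.0):
    """Treat a frame as speech when its RMS energy reaches the threshold.

    The default threshold is an arbitrary illustrative value for 16-bit
    samples and would need tuning for a real microphone.
    """
    return rms_energy(samples) >= threshold
```

:     Only frames that pass such a check would need to be handed to the recognizer, which helps keep the background process light on limited hardware.&lt;br /&gt;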
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot; &lt;br /&gt;
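&lt;br /&gt;
:     As a rough illustration of the last step, a recognized phrase such as "Open Write Activity" could be translated into a sugar-launch command. The sketch below only builds the argument list and does not run it. The function name and the phrase format are hypothetical, and the bundle ids in the table are assumed values for illustration that should be checked against the installed activities.&lt;br /&gt;

```python
# Spoken activity names mapped to bundle ids (assumed, illustrative values).
ACTIVITY_BUNDLES = {
    "write": "org.laptop.AbiWordActivity",
    "turtle art": "org.laptop.TurtleArtActivity",
}


def command_for(phrase):
    """Translate a recognized phrase into an argument list, or None.

    Only phrases of the form "open NAME" or "open NAME activity" are
    handled. The command is returned rather than executed so that it can
    be inspected or logged first.
    """
    words = phrase.lower().split()
    if not words or words[0] != "open":
        return None
    if words[-1] == "activity":
        words = words[:-1]
    name = " ".join(words[1:])
    bundle_id = ACTIVITY_BUNDLES.get(name)
    if bundle_id is None:
        return None
    return ["sugar-launch", bundle_id]
```

:     Keeping the phrase table separate from the matching logic means new spoken commands can be added without touching the code.&lt;br /&gt;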
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, many features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    #a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    #b) After the successful implementation of Sugar Voice control, we can then look into providing speech recognized text to unmodified sugar activities. Thus activities like Write can be made to get the required inputs either from Keyboard or through microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    #c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer and, if time permits, to implement part b) as well. Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before the actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx in more depth. By this time I will be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementing a command line interface that runs speech recognition in the background and accepts simple speech commands like open an activity, go to home or desktop, close activity, etc.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Test the implemented framework of Sugar Voice Control on limited resource devices like the XO-1.0. Take the community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as described above. &lt;br /&gt;
&lt;br /&gt;
:    June 26- July 11: Implementation of Sugar Voice Control button in the GUI. This button will be implemented in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. Implementation of Sugar Voice Control Panel as mentioned in the GUI considerations part. &lt;br /&gt;
&lt;br /&gt;
:    Thus before the mid-term evaluations we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12-July 16: Submitting mid term evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating different Language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support different types of Languages.       &lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically I would like to create a language model for recognizing Hindi control commands. Then I would like to test the implementation in a Primary school situated in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9- Aug 16: Documenting the entire work, and especially how to create language models. I have gone through some tutorials on how to create them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users. &lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the past 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a lot about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can mail my progress reports to sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they used entertaining activities to teach us. For example, when I was in the third or fourth standard, I kept confusing the less-than and greater-than signs. I could tell which number was greater, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots next to the greater number, like : and one dot next to the lesser number, like . For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50859</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50859"/>
		<updated>2010-04-05T10:30:50Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methods. Learn more about the activity at the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       I have recently taken on some packaging work for Sugar artwork, the Sugar presence service, and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system, and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper describes how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This keeps us from reaching these children. But when the technology exists, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How Speech Recognition can help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who is unable to operate the keyboard and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control Windows entirely by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will meet the needs of two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and thus unable to interact with the system, could open activities (like the Write activity) just by saying “Open write activity”. He or she could then interact with the activity through simple spoken commands, with speech recognition running in the background. For example, he or she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. It will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on which we can build the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx, an open source speech recognition engine developed at CMU, is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in several versions.&lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still well known. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM and the run time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully for Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world speaking different languages. I collected this information from a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls like sliders, palette buttons and combo boxes will be used within the palette.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome Desktop which allows the user to control the entire system by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome Desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, a lot of features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can look into providing speech-recognized text to unmodified Sugar activities. Activities like Write could then take their input either from the keyboard or from the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition.&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before actual work for GSoC starts): I will be studying more about Gnome Voice Control and Pocket Sphinx. By this time I will be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementation of a command line interface which runs speech recognition in the background and accepts simple speech commands such as open an activity, go to home or desktop, close activity, etc.&lt;br /&gt;
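&lt;br /&gt;
:    As a rough illustration of the command-handling step, the sketch below maps recognized text such as &amp;quot;open write activity&amp;quot; to a sugar-launch command line. It is only a minimal sketch under assumptions, not the actual Sugar Voice Control design: the function name, the phrase table and the bundle ids in it are hypothetical, and the code only builds the command instead of running it.&lt;br /&gt;
&lt;br /&gt;
```python
# Hypothetical phrase-to-activity table; the bundle ids are illustrative assumptions.
ACTIVITY_BUNDLES = {
    'write': 'org.laptop.AbiWordActivity',
    'turtle art': 'org.laptop.TurtleArtActivity',
}


def command_for(utterance):
    """Map recognized text to an argv list, or None if it is not understood."""
    words = utterance.lower().split()
    if not words or words[0] != 'open':
        return None
    # Drop the leading 'open' and an optional trailing 'activity'.
    name_words = words[1:]
    if name_words and name_words[-1] == 'activity':
        name_words = name_words[:-1]
    bundle_id = ACTIVITY_BUNDLES.get(' '.join(name_words))
    if bundle_id is None:
        return None
    # sugar-launch needs the full bundle id, so the table does the lookup.
    return ['sugar-launch', bundle_id]
```
&lt;br /&gt;
:    Returning the argument list rather than executing it keeps the mapping easy to test; a caller could hand the list to the process launcher of its choice.&lt;br /&gt;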
&lt;br /&gt;
:    June 14 - June 25: Test the implemented framework of Sugar Voice Control on limited resource devices like the XO-1.0. Take the community feedback on the current implementation. Add more &amp;quot;Control Commands&amp;quot; to the framework after discussions. &lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June we should have completed the implementation of part a) as described above.&lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementation of the Sugar Voice Control button in the GUI. This button will be placed in the Sugar frame (for example at the top right) and, when clicked, will start recognizing speech in the background. Clicking the same button again will stop the recognition process. Implementation of the Sugar Voice Control panel as described in the GUI considerations part.&lt;br /&gt;
&lt;br /&gt;
:    Thus before the mid-term evaluations we should be done with part a) and part b) as described above.&lt;br /&gt;
&lt;br /&gt;
:    July 12-July 16: Submitting mid term evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.&lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically I would like to create a language model for recognizing Hindi control commands. Then I would like to test the implementation in a Primary school situated in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, especially how to create language models. I have gone through some tutorials on how to create them, but most of them are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users.&lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.&lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE http://millee.org project under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java based cell phones, which aimed at detecting human speech in WAV audio files recorded on the phone. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. Therefore I can concentrate entirely on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater, I got confused when selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots in front of the greater number, like :, and one dot in front of the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot in front of 2 and two dots in front of 5, like this: 2. : 5 &lt;br /&gt;
Now on joining these dots we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50858</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50858"/>
		<updated>2010-04-05T10:30:34Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration in Listen and Spell.&lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken up some packaging work for Sugar artwork, Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       For details, please visit:&lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, including features like automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit:&lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       One more thing that I would like to mention is that my first international paper submission was recently accepted at the PyCon Asia Pacific Conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       This publication was entirely based on how Python and its batteries helped in the development of the ShruthLaikh project.&lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem that I see in the current version of Sugar is the lack of features that would let physically challenged users interact with the system easily. This keeps us from reaching this group of children. But when we have the technology, why restrict ourselves?&lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities both for activity developers and for end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes that I will show in the remaining part of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who cannot operate the keyboard or touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open Turtle Art&amp;quot;. They could even type into Write and other activities by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, which lets you control Windows by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not.&lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and therefore cannot operate the system in the usual way, could open an activity (such as Write) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity through simple spoken commands. For example, he/she can say “Start typing” and then speak the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. It will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
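&lt;br /&gt;
:        The oral-testing idea can be sketched as a comparison between the word the child was asked to say and the text returned by the recognizer. This is only a minimal sketch under assumptions: heard_target_word is a hypothetical helper, the recognizer itself is not shown, and a real activity would probably need a language model limited to the practice words and some tolerance for recognizer errors.&lt;br /&gt;
&lt;br /&gt;
```python
def heard_target_word(target_word, recognized_text):
    """Return True only when the recognizer heard exactly the target word."""
    # Compare case-insensitively and ignore surrounding whitespace.
    heard = recognized_text.lower().split()
    return heard == [target_word.strip().lower()]
```
&lt;br /&gt;
:        For example, heard_target_word(&#039;Apple&#039;, &#039;apple&#039;) is true, while a hypothesis containing a different word or more than one word is treated as a miss.&lt;br /&gt;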
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on which we can build the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx, an open source speech recognition engine developed at CMU, is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in several versions.&lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still well known. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM and the run time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully for Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world speaking different languages. I collected this information from a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls like sliders, palette buttons and combo boxes will be used within the palette.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome Desktop which allows the user to control the entire system by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome Desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, a lot of features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
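&lt;br /&gt;
:    As a rough sketch of item 4 above, the step that turns a recognized phrase into a launch command could look like the following. It assumes that recognition has already produced plain text and that activities are started with the sugar-launch terminal command, which takes a bundle id. The function name command_for, the phrase table, and the bundle ids shown are hypothetical and only for illustration.&lt;br /&gt;

```python
# Hypothetical phrase-to-bundle table; the bundle ids are illustrative assumptions.
ACTIVITY_BUNDLES = {
    "write": "org.laptop.AbiWordActivity",
    "turtle art": "org.laptop.TurtleArtActivity",
}


def command_for(utterance):
    """Return the argument list for a recognized phrase, or None if unknown."""
    words = utterance.lower().split()
    # Accept both "open write" and "open write activity".
    if words[-1:] == ["activity"]:
        words = words[:-1]
    # Only phrases that start with "open" are treated as launch commands here.
    if words[:1] == ["open"]:
        bundle = ACTIVITY_BUNDLES.get(" ".join(words[1:]))
        if bundle is not None:
            return ["sugar-launch", bundle]
    return None


if __name__ == "__main__":
    print(command_for("Open Write Activity"))
```

:    A phrase that is not in the table returns None, so the background listener can simply ignore speech that is not a command.&lt;br /&gt;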
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can look into providing recognized text to unmodified Sugar activities. Activities like Write could then take their input either from the keyboard or from the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition.&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to at least achieve part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Detailed time line:&lt;br /&gt;
:    Present to May 24 (before the actual GSoC work starts): I will study Gnome Voice Control and Pocket Sphinx in more depth. By the end of this period I should be confident about how Sugar Voice Control is to be derived from Gnome Voice Control. We also need to test the compatibility of Pocket Sphinx with Sugar.&lt;br /&gt;
&lt;br /&gt;
:    May 24 to June 13: Sugarizing Gnome Voice Control to obtain &amp;quot;Sugar Voice Control&amp;quot;. Implementing a command line interface that runs speech recognition in the background and accepts simple speech commands such as opening an activity, going to home or desktop, and closing an activity.&lt;br /&gt;
&lt;br /&gt;
:    June 14 - June 25: Testing the implemented Sugar Voice Control framework on limited-resource devices like the XO-1.0. Gathering community feedback on the current implementation. Adding more &amp;quot;Control Commands&amp;quot; to the framework after discussions.&lt;br /&gt;
&lt;br /&gt;
:    Thus by the end of June the implementation of part a) as mentioned above should be complete.&lt;br /&gt;
&lt;br /&gt;
:    June 26 - July 11: Implementation of the Sugar Voice Control button in the GUI. This button will be placed in the Sugar frame (for example on the top right-hand side); clicking it will start recognizing speech in the background, and clicking it again will stop the recognition process. Implementation of the Sugar Voice Control panel as mentioned in the GUI considerations part.&lt;br /&gt;
&lt;br /&gt;
:    Thus before the midterm evaluations we should be done with part a) and part b) as mentioned above.&lt;br /&gt;
&lt;br /&gt;
:    July 12-July 16: Submitting mid term evaluations.&lt;br /&gt;
&lt;br /&gt;
:    July 17 - July 30: Creating different language models and datasets so that &amp;quot;Sugar Voice Control&amp;quot; can support different languages.&lt;br /&gt;
&lt;br /&gt;
:    Aug 1 - Aug 8: Testing the different language models on XOs. Specifically, I would like to create a language model for recognizing Hindi control commands and then test the implementation in a primary school in my locality.&lt;br /&gt;
&lt;br /&gt;
:   Aug 9 - Aug 16: Documenting the entire work, especially how to create language models. I have gone through some tutorials on how to create them, but most are very complicated. I would like to write simple documentation so that anyone can create simple language models for their favourite languages. In this way Sugar Voice Control will be extensible for multilingual users.&lt;br /&gt;
  &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.&lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam of Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. I can therefore concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots in front of the number which is greater, like : and one dot in front of the number which is lesser, like . For example, if I have to place a sign between 2___ 5, I put one dot in front of 2 and two dots in front of 5, like this: 2. : 5&lt;br /&gt;
Joining these dots then gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50856</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50856"/>
		<updated>2010-04-05T09:47:49Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html&lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration in Listen and Spell.&lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken up some packaging work for Sugar artwork, Sugar presence service, and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at:&lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, including features like automatic database adaptation, a user profile system, and automatic feedback generation. For more details please visit:&lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1&lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project.&lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem that I see with the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves?&lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities both for activity developers and for end users (especially physically challenged ones).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes that I will show in the remaining part of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How Speech Recognition can help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who is not able to operate the keyboard and touchpad opening activities by just saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others by simply speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows by just speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not.&lt;br /&gt;
&lt;br /&gt;
:       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will be fulfilling the needs of two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus not able to interact with the system: he/she can now open activities (like the Write activity) by just saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity by saying simple commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation by incorporating oral testing. Oral testing is a method of giving users feedback on their pronunciation by recognizing their speech. Thus a child speaking the word “Apple” correctly should be recognized, and otherwise not. This is only one example, and we can create numerous activities around speech recognition. This will make it possible to develop more interactive activities for children, helping make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple and easy-to-use interfaces that give developers full control over speech recognition. The developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a Speech recognition engine that can be integrated into sugar over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it accordingly as per our needs and requirements.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions.&lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully for recognizing Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world speaking different languages. I collected this information from a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. On hovering over the Speech recognition button, a sugar palette will be exposed which will display the speech recognition parameters that can be modified by the user. Sugar Controls like Sliders, Palette Buttons, and Combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the key shortcut must be made in the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome Desktop that lets users control the entire system by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes, and recognizes the signal and then executes the desired action on the Gnome Desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Tasks Division:&lt;br /&gt;
:     As I already mentioned, a lot of features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After the successful implementation of Sugar Voice Control, we can look into providing recognized text to unmodified Sugar activities. Activities like Write could then take their input either from the keyboard or from the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a Speech recognition button in the sugar frame (for example on Top Right hand side) which when clicked will automatically start recognizing speech in the background. Clicking the same button again will stop the recognition process. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition.&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part can be creating an API for providing easy Speech Recognition access to activity developers. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to at least achieve part a) this summer, and if time permits I would also like to implement part b). Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    I will be presenting a more detailed timeline.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.&lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam of Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. I can therefore concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots in front of the number which is greater, like : and one dot in front of the number which is lesser, like . For example, if I have to place a sign between 2___ 5, I put one dot in front of 2 and two dots in front of 5, like this: 2. : 5&lt;br /&gt;
Joining these dots then gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2010 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50825</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50825"/>
		<updated>2010-04-04T08:12:43Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you assign. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service, and Ubuntu Sugar Remix.&lt;br /&gt;
:       For details, please visit: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system, and automatic feedback generation. For more details, please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has the potential to become an excellent educational platform. One problem I see in the current version of Sugar is the lack of features that would let physically challenged users interact with the system easily, which limits our ability to reach these children. Since the technology exists, why should we restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims to integrate speech recognition into Sugar, which will open up a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You speak, and the system automatically converts the audio into editable text, which can be used for the many purposes described in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who cannot operate the keypad and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into Write and other activities simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, which lets you control Windows entirely by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson taught in any educational system. With speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks the words correctly they should be recognized, and otherwise they should not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact just by speaking.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       The speech recognition system will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and therefore unable to operate the system, could open an activity (such as Write) just by saying “Open write activity”. With speech recognition running in the background, he/she could then interact with the activity using simple spoken commands. For example, he/she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. It will make it possible to develop more interactive activities for children, helping make Sugar a more useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make their activities more useful. For example, the Write activity could be modified to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       A speech recognition system requires a speech recognition engine that can be integrated into Sugar and on which we can build the entire framework. The major requirements for such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory at run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java, which leaves two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices that Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So Pocket Sphinx, at least, can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
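&lt;br /&gt;
:        The memory figures quoted above can be put in proportion with a little arithmetic. The following is only a rough sketch: the numbers are the ones stated in this proposal (the Sphinx 3 value is a lower bound, not a measurement), and the names XO_RAM_MB, ENGINE_FOOTPRINT_MB and ram_share are made up for this illustration.&lt;br /&gt;

```python
# Figures quoted in the proposal; the Sphinx 3 value is only a stated lower bound.
XO_RAM_MB = 256
ENGINE_FOOTPRINT_MB = {"Pocket Sphinx": 20, "Sphinx 3": 30}


def ram_share(footprint_mb, total_mb=XO_RAM_MB):
    """Return the footprint as a percentage of total RAM."""
    return 100.0 * footprint_mb / total_mb


for name, mb in ENGINE_FOOTPRINT_MB.items():
    # Pocket Sphinx comes out at about 7.8% and Sphinx 3 at 11.7% or more.
    print("%s: about %.1f%% of XO RAM" % (name, ram_share(mb)))
```

:        By these figures Pocket Sphinx would take roughly 8% of an XO&#039;s RAM and Sphinx 3 at least 12%, before counting Sugar itself and any running activity.&lt;br /&gt;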
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. So far they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can therefore target a wide range of users in different parts of the world speaking different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops recognition. Hovering over the button exposes a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Gnome Voice Control to Sugar Voice Control&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control is a voice control system for the Gnome desktop that lets the user control the entire system by speaking commands. &lt;br /&gt;
&lt;br /&gt;
:     The system consists of an application that monitors the audio input (microphone). When a significant audio signal is detected, the software captures, processes and recognizes the signal and then executes the desired action on the Gnome desktop.&lt;br /&gt;
&lt;br /&gt;
:     For more details please visit: http://live.gnome.org/GnomeVoiceControl&lt;br /&gt;
&lt;br /&gt;
:     Gnome Voice Control uses Pocket Sphinx. The idea is to sugarize it to implement &amp;quot;Sugar Voice Control&amp;quot; &lt;br /&gt;
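&lt;br /&gt;
:     To illustrate the step between recognizing a phrase and acting on it, here is a minimal sketch of how recognized text might be mapped to an activity. It assumes the recognizer has already returned plain text; the ACTIVITY_NAMES table and the parse_open_command function are hypothetical names invented for this example, not part of Sugar, Gnome Voice Control or Pocket Sphinx, and the actual launching step is left out.&lt;br /&gt;

```python
# Hypothetical phrase-to-activity table; the names are illustrative, not real bundle ids.
ACTIVITY_NAMES = {
    "write": "Write",
    "turtle art": "TurtleArt",
}


def parse_open_command(recognized_text):
    """Return the activity name for an 'open ...' phrase, or None if it is not understood."""
    words = recognized_text.lower().split()
    if not words or words[0] != "open":
        return None
    # Accept both "open write" and "open write activity".
    if words[-1] == "activity":
        words = words[:-1]
    spoken_name = " ".join(words[1:])
    return ACTIVITY_NAMES.get(spoken_name)
```

:     With this table, the phrases &amp;quot;Open Write Activity&amp;quot; and &amp;quot;Open turtle art&amp;quot; both resolve to an activity, while an unrelated phrase yields nothing and can simply be ignored.&lt;br /&gt;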
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Task Division:&lt;br /&gt;
:     As I already mentioned, many features can be implemented around speech recognition. I have divided my proposal into the following parts:&lt;br /&gt;
&lt;br /&gt;
:    a) My first priority this summer is to enable &amp;quot;Sugar Voice Control&amp;quot;. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1. Testing Pocket Sphinx on Sugar&lt;br /&gt;
:    2. Studying more about Gnome Voice Control.&lt;br /&gt;
:    3. Sugarizing the Gnome Voice Control.&lt;br /&gt;
:    4. A command line interface that will start speech recognition in the background and will start taking &amp;quot;Speech Commands&amp;quot;.&lt;br /&gt;
 &lt;br /&gt;
:    b) After Sugar Voice Control has been implemented successfully, we can look into providing speech-recognized text to unmodified Sugar activities, so that activities like Write can take their input from either the keyboard or the microphone. This includes:&lt;br /&gt;
&lt;br /&gt;
:    1.  Providing a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops recognition. &lt;br /&gt;
&lt;br /&gt;
:    2.  A keyboard shortcut like Alt+S for starting speech recognition&lt;br /&gt;
&lt;br /&gt;
:    3. Speech recognition control panel for controlling the various parameters.&lt;br /&gt;
&lt;br /&gt;
:    c) The last part is creating an API that gives activity developers easy access to speech recognition. &lt;br /&gt;
&lt;br /&gt;
:    My aim is to achieve at least part a) this summer and, if time permits, to implement part b) as well. Part c) can be taken care of later.&lt;br /&gt;
&lt;br /&gt;
:    I will present a more detailed timeline.  &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a great deal about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam of Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in the process gained a decent understanding of how speech processing works internally. Because the project documentation is private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months for my summer vacation, from the end of May to August, so I can devote all my energy to this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still cannot be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater, I got confused about which sign to use. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place a sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50775</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50775"/>
		<updated>2010-04-03T09:17:07Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you assign. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service, and Ubuntu Sugar Remix.&lt;br /&gt;
:       For details, please visit: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system, and automatic feedback generation. For more details, please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has the potential to become an excellent educational platform. One problem I see in the current version of Sugar is the lack of features that would let physically challenged users interact with the system easily, which limits our ability to reach these children. Since the technology exists, why should we restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims to integrate speech recognition into Sugar, which will open up a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You speak, and the system automatically converts the audio into editable text, which can be used for the many purposes described in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. A child who cannot operate the keypad and touchpad could open activities just by saying &amp;quot;Open Write Activity&amp;quot;. They could even type into Write and other activities simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, which lets you control Windows entirely by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson taught in any educational system. With speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks the words correctly they should be recognized, and otherwise they should not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact just by speaking.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       The speech recognition system will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. A child who is physically challenged, and therefore unable to operate the system, could open an activity (such as Write) just by saying “Open write activity”. With speech recognition running in the background, he/she could then interact with the activity using simple spoken commands. For example, he/she could start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar would thus become accessible to physically challenged users, which would be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities built around speech recognition can help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. It will make it possible to develop more interactive activities for children, helping make Sugar a more useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make their activities more useful. For example, the Write activity could be modified to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them in Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM; the run-time requirement of Pocket Sphinx is around 20 MB, whereas that of Sphinx 3 is more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully with English, Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users from different parts of the world, speaking different languages. I collected this information from a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, on the top right-hand side) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls like sliders, palette buttons, and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the key shortcut must be made in the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. Therefore I can concentrate entirely on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater or lesser, I became confused in selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots, like :, in front of the greater number and one dot, like ., in front of the lesser number. For example, if I have to place a sign between 2___ 5, I would put one dot in front of 2 and two dots in front of 5, like this: 2. : 5 &lt;br /&gt;
Now, on joining these dots, we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50774</id>
		<title>Summer of Code/2010/speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/2010/speech-recognition&amp;diff=50774"/>
		<updated>2010-04-03T09:00:21Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: Created page with &amp;#039;{{TOCright}} ====About you====  Q.1:     &amp;#039;&amp;#039;&amp;#039;What is your name?  A:       Chirag Jain  ----  Q.2:   &amp;#039;&amp;#039;&amp;#039;What is your email address?  A:       chiragjain1989{AT}gmail{DOT}com  ---- …&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I will be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and the Ubuntu Sugar Remix.&lt;br /&gt;
:       Please visit the details at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features like automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       One more thing I would like to mention is that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is entirely about how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that would let physically challenged users interact with the system easily. This limits our ability to reach this section of children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially the physically challenged).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes that I will show in the remaining part of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How Speech Recognition can help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who is not able to operate the keypad and touchpad: he/she can now open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;, etc. They can even type into the Write activity and others by simply speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control the whole of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words, and a child&#039;s speech should be properly recognized only if he/she speaks those words correctly. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact just by speaking.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With a speech recognition system, we will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and therefore unable to interact with the system: he/she can now open activities (like the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity through simple spoken commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation through oral testing. Oral testing is a method of giving users feedback on their pronunciation by recognizing their speech. Thus a child who says the word “Apple” correctly should be recognized, and one who does not should not be. This is only one example; we can create numerous activities around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers would be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. The developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux which is the core of sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory during run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration in Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them in Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices like PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users from different parts of the world, speaking different languages. I collected this information from a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, on the top right-hand side) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls like sliders, palette buttons, and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut like &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the key shortcut must be made in the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project that I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during this time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August. Therefore I can concentrate entirely on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater or lesser, I became confused in selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots, like :, in front of the greater number and one dot, like ., in front of the lesser number. For example, if I have to place a sign between 2___ 5, I would put one dot in front of 2 and two dots in front of 5, like this: 2. : 5 &lt;br /&gt;
Now, on joining these dots, we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50773</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50773"/>
		<updated>2010-04-03T08:58:51Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that let physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for the many purposes I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who cannot operate the keypad and touchpad: that child could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized properly, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users interact just by speaking.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With the speech recognition system, we will be meeting the needs of two audiences: non-technical end users and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus cannot interact with the system: that child could open an activity (such as the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity using simple spoken commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who speaks the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be built around speech recognition. This will make it possible to develop more interactive activities for children, helping to make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
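&lt;br /&gt;
:       As a rough illustration of the kind of interface this could mean for activity developers, here is a minimal Python sketch that maps recognized phrases to callbacks. The names VoiceCommands, register, dispatch and normalize, and the returned strings, are hypothetical; they are not part of Sugar or Sphinx, and the recognizer is assumed to deliver plain text.&lt;br /&gt;

```python
# Hypothetical sketch: none of these names come from Sugar or Sphinx.
def normalize(text):
    """Lower-case the recognized text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


class VoiceCommands:
    """Maps spoken phrases to callbacks supplied by an activity."""

    def __init__(self):
        self._handlers = {}

    def register(self, phrase, callback):
        """Associate a phrase (case-insensitive) with a callback."""
        self._handlers[normalize(phrase)] = callback

    def dispatch(self, recognized_text):
        """Run the callback for the phrase; return None if it is unknown."""
        handler = self._handlers.get(normalize(recognized_text))
        if handler is None:
            return None
        return handler()


# Example registrations; the returned strings stand in for real actions.
commands = VoiceCommands()
commands.register("Open Write Activity", lambda: "launch:write")
commands.register("Start typing", lambda: "mode:dictation")
```

:       An activity would call dispatch with each phrase the recognizer produces; phrases that were never registered are simply ignored.&lt;br /&gt;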
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which underlies Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory at run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been successfully tested on Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can thus target a wide range of users from different parts of the world speaking different languages. I collected this information in discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls such as sliders, palette buttons and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and in doing so gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.   &lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they often used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots in front of the greater number, like :, and one dot in front of the lesser number, like . For example, if I have to place a sign between 2___ 5, I put one dot in front of 2 and two dots in front of 5, like this: 2. : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50772</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50772"/>
		<updated>2010-04-03T08:45:15Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I would be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the past year. As a Software Engineer, Products and Services, at SEETA, New Delhi, India http://seeta.in, I manage the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       I am currently implementing collaboration support in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in Singapore in June 2010. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that let physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
&lt;br /&gt;
:      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
&lt;br /&gt;
A:      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for the many purposes I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
&lt;br /&gt;
A:       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who cannot operate the keypad and touchpad: that child could open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They could even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
:       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized properly, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
:       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users interact just by speaking.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
A:       With the speech recognition system, we will be meeting the needs of two audiences: non-technical end users and activity developers.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
:       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus cannot interact with the system: that child could open an activity (such as the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity using simple spoken commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
&lt;br /&gt;
:        Activities can be developed around speech recognition that help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who speaks the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be built around speech recognition. This will make it possible to develop more interactive activities for children, helping to make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
:       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
&lt;br /&gt;
:       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider the Write activity, for example: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and over which we can develop the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
:       1.      It should be capable of running on Linux, which underlies Sugar.&lt;br /&gt;
:       2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
:       3.      It should not consume a lot of memory at run time.&lt;br /&gt;
:       4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
:       One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
:       http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
:       The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
:       1.      Sphinx 3 &lt;br /&gt;
:       2.      Pocket Sphinx&lt;br /&gt;
:       3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
:        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices currently targeted by Sugar, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and is designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, is developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
:       Sphinx engines require training data sets and language models for recognizing speech, so they can be set up to recognize many languages. At present they have been successfully tested on Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can thus target a wide range of users from different parts of the world speaking different languages. I collected this information in discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
:      We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls such as sliders, palette buttons and combo boxes will be used within the palette to achieve the desired effect.&lt;br /&gt;
   &lt;br /&gt;
:      A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
:     [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a lot about the Sugar platform and how its development works.&lt;br /&gt;
&lt;br /&gt;
:     I have also been working with the MILLEE http://millee.org project under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during that time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:      [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-)&lt;br /&gt;
 &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:       Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots next to the greater number, like : and one dot next to the lesser number, like . For example, if I have to place a sign between 2___ 5, then I put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Now, on joining these dots, we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50771</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50771"/>
		<updated>2010-04-03T08:39:02Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I will be honored to work with any mentor you provide. :-)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
:       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
:       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
:       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
:       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:       Currently I am implementing collaboration in Listen and Spell.&lt;br /&gt;
&lt;br /&gt;
:       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
:       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
:       Details are available at: &lt;br /&gt;
:       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
:       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
:       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
:       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
:       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
:       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project.&lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes, as I will show in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine that a child who is not able to operate the keyboard and touchpad can now open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
       Implementing speech recognition will give activity developers an opportunity to create more interactive activities, where users can interact just by speaking.&lt;br /&gt;
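&lt;br /&gt;
       As a rough illustration of the spoken commands mentioned above, the following Python sketch maps a recognized phrase to an action. It is only a sketch under assumptions: the function name parse_command and the small command vocabulary are hypothetical and are not part of Sugar or Sphinx, and the recognizer that produces the text is not shown.&lt;br /&gt;

```python
# Hypothetical sketch: turn recognized text into an (action, target) pair.
# The command vocabulary below is an assumption made for illustration only.

def parse_command(recognized_text):
    """Return (action, target) for a recognized phrase, or None if unknown."""
    words = recognized_text.lower().split()
    if not words:
        return None
    # "open write activity" -> ("open", "write")
    if words[0] == "open" and len(words) >= 2:
        target = words[1:]
        # Drop a trailing "activity" so "open write" and
        # "open write activity" mean the same thing.
        if len(target) >= 2 and target[-1] == "activity":
            target = target[:-1]
        return ("open", " ".join(target))
    # "start typing" -> ("dictate", "")
    if words == ["start", "typing"]:
        return ("dictate", "")
    return None


if __name__ == "__main__":
    for phrase in ["Open Write Activity", "Open turtle art", "Start typing"]:
        print(phrase, "->", parse_command(phrase))
```

       In a real integration, the resulting pair would be handed to whatever component launches the activity; that part is left out here.&lt;br /&gt;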
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
       With a speech recognition system, we will be fulfilling the needs of two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
       For end users, speech recognition can act as a medium for controlling Sugar. Imagine that a child who is physically challenged, and thus not able to interact with the system, can now open activities (such as the Write activity) just by saying “Open write activity”. Then he/she can interact with the activity, with speech recognition running in the background, by saying simple commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
        Activities can also be developed around speech recognition to help children improve their pronunciation through oral testing. Oral testing is a method of giving users feedback on their pronunciation by recognizing their speech. Thus a child speaking the word “Apple” correctly should be recognized, and otherwise not. This is only one example; we can create numerous activities around speech recognition. This will make it possible to develop more interactive activities for children, which can help make Sugar a useful educational tool.&lt;br /&gt;
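&lt;br /&gt;
        The oral-testing idea can be sketched in a few lines of Python. This is a minimal sketch under assumptions: the names normalize and check_pronunciation are hypothetical, and hypothesis simply stands for whatever text the recognition engine returned for the child&#039;s utterance. A real activity would probably also need a confidence score from the engine, which is not modelled here.&lt;br /&gt;

```python
# Hypothetical sketch of an oral-testing check. The recognizer itself is not
# shown; `hypothesis` stands for whatever text the engine returned.

def normalize(text):
    """Lower-case the text and keep only letters, so "Apple." matches "apple"."""
    return "".join(ch for ch in text.lower() if ch.isalpha())


def check_pronunciation(target_word, hypothesis):
    """Return True when the recognized text matches the target word."""
    expected = normalize(target_word)
    return expected != "" and expected == normalize(hypothesis)


if __name__ == "__main__":
    print(check_pronunciation("Apple", "apple"))
    print(check_pronunciation("Apple", "ample"))
```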
&lt;br /&gt;
       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
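&lt;br /&gt;
       To give a feel for what such an interface could look like, here is a minimal Python sketch of a callback-based listener. Everything in it is an assumption for illustration: the class SpeechListener and its methods connect, start, stop and feed do not exist in Sugar or Sphinx, and the recognition backend is replaced by direct calls to feed.&lt;br /&gt;

```python
# Hypothetical sketch of an activity-facing interface. None of these names
# exist in Sugar or Sphinx; they only illustrate the callback idea.

class SpeechListener:
    """Dispatches recognized text to callbacks registered by an activity."""

    def __init__(self):
        self._callbacks = []
        self.listening = False

    def connect(self, callback):
        """Register a function that receives each recognized phrase."""
        self._callbacks.append(callback)

    def start(self):
        """Begin delivering recognized phrases to the callbacks."""
        self.listening = True

    def stop(self):
        """Stop delivering recognized phrases."""
        self.listening = False

    def feed(self, recognized_text):
        """Called by the recognition backend; ignored while stopped."""
        if not self.listening:
            return
        for callback in self._callbacks:
            callback(recognized_text)


if __name__ == "__main__":
    document = []                      # stands in for the Write activity text
    listener = SpeechListener()
    listener.connect(document.append)
    listener.start()
    listener.feed("hello world")       # a real backend would call this
    listener.stop()
    listener.feed("ignored")           # dropped because listening is off
    print(document)
```

       Keeping the activity side down to a single registered callback is one possible design choice; it would let an existing activity accept spoken input without knowing which engine produced the text.&lt;br /&gt;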
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and over which we can develop the entire framework. The major requirements for such an engine are:&lt;br /&gt;
&lt;br /&gt;
1.      It should be capable of running on Linux, on which Sugar is built.&lt;br /&gt;
2.      It should be open source so that we can modify it according to our needs.&lt;br /&gt;
3.      It should not consume a lot of memory during run time.&lt;br /&gt;
4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in several versions. &lt;br /&gt;
&lt;br /&gt;
http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
1.      Sphinx 3 &lt;br /&gt;
2.      Pocket Sphinx&lt;br /&gt;
3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
        Sphinx 4 is the latest version and is written entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Integrating Sphinx 4 into Sugar does not seem feasible because it is written in Java, which leaves two options: Sphinx 3 or Pocket Sphinx. The choice between them can only be made by experimenting with both in Sugar. It also depends on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time memory requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, was developed to run on desktops and consumes a considerable amount of memory. So Pocket Sphinx, at least, can be used in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
       Sphinx engines require training data sets and language models to recognize speech, so they can be set up to recognize many languages. At present they have been tested successfully on Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. Thus we can target a wide range of users in different parts of the world who speak different languages. I collected this information in discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
       We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops recognition. Hovering over the button exposes a Sugar palette that displays the speech recognition parameters the user can modify. Sugar controls such as sliders, palette buttons and combo boxes will be used within the palette.&lt;br /&gt;
A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
[TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken and successfully completed many projects. During this time I have learned a lot about the Sugar platform and how its development works.&lt;br /&gt;
:     I have also been working with the MILLEE http://millee.org project under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010, and during that time I gained a decent knowledge of how speech processing works internally. Because the project documentation is kept private, I am not able to share it here.&lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, I have found Google to be my best teacher. :-) &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots next to the greater number, like : and one dot next to the lesser number, like . For example, if I have to place a sign between 2___ 5, then I put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Now, on joining these dots, we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50770</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50770"/>
		<updated>2010-04-03T08:36:38Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:        I will be honored to work with any mentor you provide. :-)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
       My major contributions are:&lt;br /&gt;
&lt;br /&gt;
       1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity by visiting the following links:&lt;br /&gt;
       &lt;br /&gt;
       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
       Currently I am implementing collaboration in Listen and Spell.&lt;br /&gt;
&lt;br /&gt;
       2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
       Details are available at: &lt;br /&gt;
       http://launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
       3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project.&lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach these children. But when we have the technology, why restrict ourselves? &lt;br /&gt;
      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
      Although the title is self-explanatory, I would still like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for countless purposes, as I will show in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine that a child who is not able to operate the keyboard and touchpad can now open activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
       Correct Pronunciation is the first lesson given in any educational system. With the help of Speech recognition, we can develop activities to conduct automatic oral testing. We can create language models, for particular set of words and if a child is speaking them correctly then they should be properly recognized or not. &lt;br /&gt;
&lt;br /&gt;
       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
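       As a rough illustration of the command handling described above, the following Python sketch maps recognized text such as &amp;quot;Open Write Activity&amp;quot; to an action. The function name parse_command, the KNOWN_ACTIVITIES table and the bundle ids in it are assumptions made for illustration, not part of Sugar or Sphinx; the recognizer is assumed to deliver plain text.&lt;br /&gt;

```python
# Hypothetical sketch: map recognized text to a Sugar action.
# The activity names and bundle ids below are illustrative assumptions.
KNOWN_ACTIVITIES = {
    "write": "org.laptop.AbiWordActivity",
    "turtle art": "org.laptop.TurtleArtActivity",
}


def parse_command(recognized_text):
    """Return (action, argument) for a recognized utterance, or None."""
    words = recognized_text.lower().split()
    if not words or words[0] != "open":
        return None
    # Drop a trailing "activity" so "open write activity" matches "write".
    name_words = words[1:]
    if name_words and name_words[-1] == "activity":
        name_words = name_words[:-1]
    bundle_id = KNOWN_ACTIVITIES.get(" ".join(name_words))
    if bundle_id is None:
        return None
    return ("launch", bundle_id)
```

       Returning None for anything unrecognized lets the caller ignore background speech instead of acting on a guess.&lt;br /&gt;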
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
       With a speech recognition system, we will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus unable to interact with the system: he/she can now open activities (such as the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity using simple spoken commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
        Activities built around speech recognition can also help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, helping make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can build the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
3.      It should not consume a lot of memory at run time.&lt;br /&gt;
4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
1.      Sphinx 3 &lt;br /&gt;
2.      Pocket Sphinx&lt;br /&gt;
3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, was developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
       Sphinx engines require training data sets and language models to recognize speech, so they can be set up to recognize many languages. At present they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can therefore target a wide range of users in different parts of the world who speak different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
       We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette.   &lt;br /&gt;
A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
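The start/stop behaviour of the button can be sketched without any toolkit code. In this minimal Python sketch the class name RecognitionToggle and the two callbacks are hypothetical; in a real implementation the callbacks would start and stop the chosen Sphinx engine, and on_clicked would be connected to both the frame button and the keyboard shortcut.&lt;br /&gt;

```python
class RecognitionToggle:
    """Tracks whether background recognition is running (hypothetical helper)."""

    def __init__(self, start_engine, stop_engine):
        # Both arguments are plain callables supplied by the caller.
        self._start_engine = start_engine
        self._stop_engine = stop_engine
        self.active = False

    def on_clicked(self):
        """Flip the state and return the new value."""
        if self.active:
            self._stop_engine()
        else:
            self._start_engine()
        self.active = not self.active
        return self.active
```

Keeping the state in one small object means the button and the shortcut cannot disagree about whether recognition is running.&lt;br /&gt;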
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
[TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010 and gained a decent knowledge of how speech processing works internally. Because the project documentation is private, I am not able to share it here.   &lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-) &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused in selecting the right sign. So one day I approached my teacher, who removed my confusion with a nice method. She told me to put two dots (like :) next to the greater number and one dot (like .) next to the lesser number. For example, to place the sign between 2___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50769</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50769"/>
		<updated>2010-04-03T08:35:17Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning to late at night.&lt;br /&gt;
:        I will be honored to work with any mentor you provide. :-)   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       Yes, I have been actively involved in open source projects for the last year. As a Software Engineer, Products and Services at SEETA, New Delhi, India http://seeta.in, I am managing the design and development of speech-related projects. Please visit my profile at http://seeta.in/j/team.html &lt;br /&gt;
&lt;br /&gt;
:      My Major contributions are:&lt;br /&gt;
&lt;br /&gt;
:      1) Lead Developer of Listen and Spell activity&lt;br /&gt;
&lt;br /&gt;
       Listen and Spell is an activity that helps children learn and revise English words through fun and engaging methodologies. Learn more about the activity at the following links:&lt;br /&gt;
       &lt;br /&gt;
       http://activities.sugarlabs.org/en-US/sugar/addon/4234&lt;br /&gt;
       http://seeta.in/wiki/index.php?title=Listen_Spell&lt;br /&gt;
       http://seeta.in/j/products/listen-spell.html&lt;br /&gt;
 &lt;br /&gt;
:      Currently I am working on implementing collaboration in Listen and Spell. &lt;br /&gt;
&lt;br /&gt;
:      2) Sugar packaging for Lucid&lt;br /&gt;
&lt;br /&gt;
       Recently I have taken on some packaging work for Sugar artwork, the Sugar presence service and Ubuntu Sugar Remix.&lt;br /&gt;
       Details are available at: &lt;br /&gt;
       launchpad.net/~sugarteam/+archive/ppa&lt;br /&gt;
&lt;br /&gt;
:      3) ShruthLaikh &lt;br /&gt;
&lt;br /&gt;
       This is another major project that I have undertaken at SEETA. It is a more advanced version of Listen and Spell, with features such as automatic database adaptation, a user profile system and automatic feedback generation. For more details please visit: &lt;br /&gt;
       http://seeta.in/j/products/shruthlaikh.html&lt;br /&gt;
       http://seeta.in/wiki/index.php?title=Shruthlaikh&lt;br /&gt;
&lt;br /&gt;
       I would also like to mention that my first international paper submission was recently accepted at the PyCon Asia Pacific conference, to be held in June 2010 in Singapore. http://pycon.sit.rp.sg/conference-1 &lt;br /&gt;
       The paper is based entirely on how Python and its batteries helped in the development of the ShruthLaikh project. &lt;br /&gt;
      &lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Recognition&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:    Sugar has all the potential to become an excellent educational platform. One particular problem I see in the current version of Sugar is the lack of features that help physically challenged users interact with the system easily. This limits our ability to reach these children. But when the technology exists, why restrict ourselves? &lt;br /&gt;
      My project for this summer aims at integrating speech recognition into Sugar, which will open a whole new set of opportunities for both activity developers and end users (especially physically challenged users).&lt;br /&gt;
&lt;br /&gt;
Q.10:  &#039;&#039;&#039;What is Speech Recognition?&lt;br /&gt;
      Although the title is self-explanatory, I would like to mention that speech recognition is the process of converting spoken words into text. You just speak, and the system automatically converts the audio into editable text, which can be used for many purposes that I describe in the rest of this proposal.&lt;br /&gt;
&lt;br /&gt;
Q.11:  &#039;&#039;&#039;How can Speech Recognition help Sugar become better?&lt;br /&gt;
       As I mentioned previously, speech recognition can help physically challenged children interact with a system running Sugar. Imagine a child who cannot operate the keyboard or touchpad opening activities just by saying &amp;quot;Open Write Activity&amp;quot; or &amp;quot;Open turtle art&amp;quot;. They can even type into the Write activity and others simply by speaking the appropriate commands. This is more or less like the Microsoft Speech Recognition system, where you can control all of Windows just by speaking commands.&lt;br /&gt;
&lt;br /&gt;
       Correct pronunciation is the first lesson given in any educational system. With the help of speech recognition, we can develop activities that conduct automatic oral testing. We can create language models for a particular set of words; if a child speaks them correctly they should be recognized, and otherwise not. &lt;br /&gt;
&lt;br /&gt;
       Implementation of Speech Recognition will provide activity developers an opportunity to create more interactive activities, where users can interact by just speaking words.  &lt;br /&gt;
         &lt;br /&gt;
Q.12:  &#039;&#039;&#039;Who are you making it for and why do they need it?&lt;br /&gt;
&lt;br /&gt;
       With a speech recognition system, we will serve two types of audience: end users, who are not technical, and activity developers.&lt;br /&gt;
&lt;br /&gt;
       &#039;&#039;&#039;End users (Non technical)&lt;br /&gt;
       &lt;br /&gt;
       For end users, speech recognition can act as a medium for controlling Sugar. Imagine a child who is physically challenged and thus unable to interact with the system: he/she can now open activities (such as the Write activity) just by saying “Open write activity”. With speech recognition running in the background, he/she can then interact with the activity using simple spoken commands. For example, he/she can start typing by saying “Start typing” and then speaking the words to be written into the document. Sugar will thus become accessible to physically challenged users, which will be a boon to them.&lt;br /&gt;
        Activities built around speech recognition can also help children improve their pronunciation through oral testing. Oral testing gives users feedback on their pronunciation by recognizing their speech: a child who says the word “Apple” correctly should be recognized, and otherwise not. This is only one example; numerous activities can be created around speech recognition. This will make it possible to develop more interactive activities for children, helping make Sugar a useful educational tool.&lt;br /&gt;
&lt;br /&gt;
       &#039;&#039;&#039;Activity Developers (Technical)&lt;br /&gt;
       Activity developers will be primarily interested in the APIs provided for speech recognition. We will provide simple, easy-to-use interfaces that give developers full control over speech recognition. Developers of existing activities can also integrate speech recognition to make them more useful. Consider, for example, the Write activity: we can modify it to take typing input from the speech recognition system instead of the keyboard. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.13:  &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
       For a speech recognition system, we require a speech recognition engine that can be integrated into Sugar and on top of which we can build the entire framework. The major requirements of such an engine are:&lt;br /&gt;
&lt;br /&gt;
1.      It should be capable of running on Linux, which is the core of Sugar.&lt;br /&gt;
2.      It should be open source so that we can modify it to suit our needs.&lt;br /&gt;
3.      It should not consume a lot of memory at run time.&lt;br /&gt;
4.      It should be an efficient speech recognizer.&lt;br /&gt;
&lt;br /&gt;
One speech recognition engine that fulfills nearly all of these requirements is Sphinx. Sphinx is an open source speech recognition engine developed at CMU and is one of the top-class speech recognizers. It has been developed primarily for Linux and comes in different versions. &lt;br /&gt;
&lt;br /&gt;
http://www.speech.cs.cmu.edu/&lt;br /&gt;
&lt;br /&gt;
The currently available versions are:&lt;br /&gt;
&lt;br /&gt;
1.      Sphinx 3 &lt;br /&gt;
2.      Pocket Sphinx&lt;br /&gt;
3.      Sphinx 4 &lt;br /&gt;
&lt;br /&gt;
        Sphinx 4 is the latest version and has been developed entirely in Java. Sphinx 3 and Pocket Sphinx are older versions but are still widely used. Using Sphinx 4 for integration into Sugar does not seem feasible because it is written in Java. So we are left with two options: Sphinx 3 or Pocket Sphinx. The decision between these two can only be made by experimenting with them on Sugar. It will also depend on the devices Sugar currently targets, so the main focus will be on OLPC XO laptops. The XOs have 256 MB of RAM, and the run-time requirement of Pocket Sphinx is around 20 MB. At this time I am not sure about the requirements of Sphinx 3, but they should be more than 30 MB. Pocket Sphinx is lightweight and designed primarily for embedded devices such as PDAs. Sphinx 3, on the other hand, was developed to run on desktops and consumes a considerable amount of memory. So at least Pocket Sphinx can be implemented in Sugar, and the feasibility of Sphinx 3 will be tested soon.&lt;br /&gt;
  &lt;br /&gt;
:      &#039;&#039;&#039;Language Support&lt;br /&gt;
&lt;br /&gt;
       Sphinx engines require training data sets and language models to recognize speech, so they can be set up to recognize many languages. At present they have been tested successfully with Chinese, Spanish, Dutch, German, Hindi, Italian, Icelandic and Russian. We can therefore target a wide range of users in different parts of the world who speak different languages. I collected this information in a discussion with a Sphinx developer on IRC, and I am also testing Sphinx 3 and Pocket Sphinx myself.&lt;br /&gt;
 &lt;br /&gt;
:      &#039;&#039;&#039;GUI considerations&lt;br /&gt;
&lt;br /&gt;
       We can provide a speech recognition button in the Sugar frame (for example, at the top right) which, when clicked, starts recognizing speech in the background. Clicking the same button again stops the recognition process. Hovering over the button exposes a Sugar palette displaying the speech recognition parameters that the user can modify. Sugar controls such as sliders, palette buttons, and combo boxes will be used within the palette.   &lt;br /&gt;
A keyboard shortcut such as &amp;lt;Alt+S&amp;gt; can also be provided for starting speech recognition. The corresponding hooks for the shortcut must be added to the Sugar UI source code.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
[TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I have been working as a Software Engineer at SEETA http://seeta.in for the last 10 months. SEETA works in collaboration with Sugar Labs, and we have undertaken many projects and completed them successfully. During this time I have gained a lot of knowledge about the Sugar platform and how its development works.    &lt;br /&gt;
:     I have also been working with the MILLEE project http://millee.org under the guidance of Prof. Matthew Kam, Carnegie Mellon University (CMU). One project I would specifically like to mention is Voice Activity Detection on Java-based cell phones, which aimed at detecting human speech in WAV audio files recorded on mobile phones. I completed this project between Dec 2009 and Feb 2010 and gained a decent knowledge of how speech processing works internally. Because the project documentation is private, I am not able to share it here.   &lt;br /&gt;
:     I will have a break of almost 2.5 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:      Ever since I got my first computer and a high-speed broadband connection, Google has been my best teacher. :-) &lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused in selecting the right sign. So one day I approached my teacher, who removed my confusion with a nice method. She told me to put two dots (like :) next to the greater number and one dot (like .) next to the lesser number. For example, to place the sign between 2___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50629</id>
		<title>Speech-recognition</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-recognition&amp;diff=50629"/>
		<updated>2010-04-02T09:55:25Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: Created page with &amp;#039;Chirag Jain&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Chirag Jain&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27459</id>
		<title>Summer of Code/Interviews</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27459"/>
		<updated>2009-04-15T05:09:14Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;We would like to interview the candidates for Google Summer of Code under Sugar Labs. Please sign up for a time below. If you have been told the name of your provisional prospective mentor, and your mentor has put his/her preferred times below, please try to find a time which works for your mentor. (Note that mentor assignments are still in flux; if your provisional prospective mentor changes, we will try to accommodate your chosen time or reschedule at a convenient time for you.)&lt;br /&gt;
&lt;br /&gt;
Interviews will be conducted over IRC; even hours on #sugar-interview0 and half-hours on #sugar-interview1. Please treat these like real rooms, i.e., do not lurk unless you are a mentor. If you have some quick pre-interview questions, we will be in #sugar too.&lt;br /&gt;
&lt;br /&gt;
If you have specifically been told to sign up to interview, please sign up here. Even if your provisional prospective mentor has not indicated availability, put your available times, and your mentor will renegotiate with you later if necessary. We have invited the 9-10 proposals which are currently ranked highest, for the 4-6 slots we will probably get. However, we &#039;&#039;do&#039;&#039; want to interview you even if you have not specifically been invited; although your proposal is a long shot in that case, you may impress us, and in any case we&#039;d like to talk to you about participating in Sugar outside of GSoC.&lt;br /&gt;
&lt;br /&gt;
Like many other organizations participating in GSoC, we have received more good applications than we will have funding from Google for. Almost all of the applications we&#039;ve received are things that we&#039;d really love to have you working on, even if we don&#039;t have space for you in GSoC. If you are interested in finishing your project and would like a mentor assigned even though we can&#039;t give you a slot, you can also contact us on the sugar-devel or gsoc mailing list and we&#039;ll work something out.&lt;br /&gt;
&lt;br /&gt;
== Sunday Apr 1 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Guido van Rossum, Linus Torvalds, Bill Gates ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=1&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: [[User:example|example]] 18:11, 13 April 2009 (UTC) (Spanish interview preferred) (just an example - I used &amp;lt;nowiki&amp;gt;~~~~&amp;lt;/nowiki&amp;gt; to add a signature, and added any special requests after.)&lt;br /&gt;
&lt;br /&gt;
== Wed Apr 15 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Luis G. Lira, [[User:Aa|aa]], [[User:Tomeu|Tomeu]], [[User:Bobbyp|Bobby]] ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=15&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: Vamsi Krishna Davuluri (iwikiwi)&lt;br /&gt;
&lt;br /&gt;
* 1430 UTC: Eduardo Silva (edsiper)&lt;br /&gt;
&lt;br /&gt;
* 1500 UTC: Lucian Branescu (lucian1900)&lt;br /&gt;
&lt;br /&gt;
* 1530 UTC: Deepank Gupta (deepank)&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 (Wed 15 in the western hemisphere) ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, [[User:Aa|aa]], [[User:BryanWB|BryanWB]], [[User:Nrp|Nirav Patel]], [[User:Bobbyp|Bobby]] ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=0&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 0000 UTC]:&lt;br /&gt;
* 0030 UTC:&lt;br /&gt;
&lt;br /&gt;
* 0100 UTC:Felipe López Toledo (subzero)  ?Can we move this to 0100 UTC? I usually just wake up at this time. [[User:BryanWB|BryanWB]]- sure (subzero).&lt;br /&gt;
&lt;br /&gt;
* 0130 UTC:&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Sayamindu Dasgupta, Assim Deodia, [[User:Tomeu|Tomeu]], ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=18&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1800 UTC]: Satya Komaragiri (mavu)&lt;br /&gt;
&lt;br /&gt;
* 1830 UTC: [http://wiki.sugarlabs.org/go/speech-synthesis chirag jain (chirag)]&lt;br /&gt;
&lt;br /&gt;
* 1900 UTC: [http://wiki.sugarlabs.org/go/Marbles Puneet_Girdhar]&lt;br /&gt;
&lt;br /&gt;
* 1930 UTC: Kartik Rustagi&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27419</id>
		<title>Summer of Code/Interviews</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27419"/>
		<updated>2009-04-14T16:43:51Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;We would like to interview the candidates for Google Summer of Code under Sugar Labs. Please sign up for a time below. If you have been told the name of your provisional prospective mentor, and your mentor has put his/her preferred times below, please try to find a time which works for your mentor. (Note that mentor assignments are still in flux; if your provisional prospective mentor changes, we will try to accommodate your chosen time or reschedule at a convenient time for you.)&lt;br /&gt;
&lt;br /&gt;
Interviews will be conducted over IRC; even hours on #sugar-interview0 and half-hours on #sugar-interview1. Please treat these like real rooms, i.e., do not lurk unless you are a mentor. If you have some quick pre-interview questions, we will be in #sugar too.&lt;br /&gt;
&lt;br /&gt;
If you have specifically been told to sign up to interview, please sign up here. Even if your provisional prospective mentor has not indicated availability, put your available times, and your mentor will renegotiate with you later if necessary. We have invited the 9-10 proposals which are currently ranked highest, for the 4-6 slots we will probably get. However, we &#039;&#039;do&#039;&#039; want to interview you even if you have not specifically been invited; although your proposal is a long shot in that case, you may impress us, and in any case we&#039;d like to talk to you about participating in Sugar outside of GSoC.&lt;br /&gt;
&lt;br /&gt;
Like many other organizations participating in GSoC, we have received more good applications than we will have funding from Google for. Almost all of the applications we&#039;ve received are things that we&#039;d really love to have you working on, even if we don&#039;t have space for you in GSoC. If you are interested in finishing your project and would like a mentor assigned even though we can&#039;t give you a slot, you can also contact us on the sugar-devel or gsoc mailing list and we&#039;ll work something out.&lt;br /&gt;
&lt;br /&gt;
== Sunday Apr 1 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Guido van Rossum, Linus Torvalds, Bill Gates ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=1&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: [[User:example|example]] 18:11, 13 April 2009 (UTC) (Spanish interview preferred) (just an example - I used &amp;lt;nowiki&amp;gt;~~~~&amp;lt;/nowiki&amp;gt; to add a signature, and added any special requests after.)&lt;br /&gt;
&lt;br /&gt;
== Wed Apr 15 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Luis G. Lira, [[User:Aa|aa]], [[User:Tomeu|Tomeu]], ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=15&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: Vamsi Krishna Davuluri (iwikiwi)&lt;br /&gt;
&lt;br /&gt;
* 1430 UTC: Eduardo Silva (edsiper)&lt;br /&gt;
&lt;br /&gt;
* 1500 UTC: Lucian Branescu (lucian1900)&lt;br /&gt;
&lt;br /&gt;
* 1530 UTC: Deepank Gupta (deepank)&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 (Wed 15 in the western hemisphere) ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, [[User:Aa|aa]], [[User:BryanWB|BryanWB]], [[User:Nrp|Nirav Patel]] ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=0&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 0000 UTC]:Felipe López Toledo (subzero)  ?Can we move this to 0100 UTC? I usually just wake up at this time. [[User:BryanWB|BryanWB]]&lt;br /&gt;
* 0030 UTC:&lt;br /&gt;
&lt;br /&gt;
* 0100 UTC:&lt;br /&gt;
&lt;br /&gt;
* 0130 UTC:&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Sayamindu Dasgupta, Assim Deodia, [[User:Tomeu|Tomeu]], ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=18&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1800 UTC]: Satya Komaragiri (mavu)&lt;br /&gt;
&lt;br /&gt;
* 1830 UTC: [http://wiki.sugarlabs.org/go/speech-synthesis chirag jain (chirag)]&lt;br /&gt;
&lt;br /&gt;
* 1900 UTC: [http://wiki.sugarlabs.org/go/Marbles Puneet_Girdhar]&lt;br /&gt;
&lt;br /&gt;
* 1930 UTC:&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27418</id>
		<title>Summer of Code/Interviews</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Summer_of_Code/Interviews&amp;diff=27418"/>
		<updated>2009-04-14T16:42:12Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;We would like to interview the candidates for Google Summer of Code under Sugar Labs. Please sign up for a time below. If you have been told the name of your provisional prospective mentor, and your mentor has put his/her preferred times below, please try to find a time which works for your mentor. (Note that mentor assignments are still in flux; if your provisional prospective mentor changes, we will try to accommodate your chosen time or reschedule at a convenient time for you.)&lt;br /&gt;
&lt;br /&gt;
Interviews will be conducted over IRC; even hours on #sugar-interview0 and half-hours on #sugar-interview1. Please treat these like real rooms, i.e., do not lurk unless you are a mentor. If you have some quick pre-interview questions, we will be in #sugar too.&lt;br /&gt;
&lt;br /&gt;
If you have specifically been told to sign up to interview, please sign up here. Even if your provisional prospective mentor has not indicated availability, put your available times, and your mentor will renegotiate with you later if necessary. We have invited the 9-10 proposals which are currently ranked highest, for the 4-6 slots we will probably get. However, we &#039;&#039;do&#039;&#039; want to interview you even if you have not specifically been invited; although your proposal is a long shot in that case, you may impress us, and in any case we&#039;d like to talk to you about participating in Sugar outside of GSoC.&lt;br /&gt;
&lt;br /&gt;
Like many other organizations participating in GSoC, we have received more good applications than we will have funding from Google for. Almost all of the applications we&#039;ve received are things that we&#039;d really love to have you working on, even if we don&#039;t have space for you in GSoC. If you are interested in finishing your project and would like a mentor assigned even though we can&#039;t give you a slot, you can also contact us on the sugar-devel or gsoc mailing list and we&#039;ll work something out.&lt;br /&gt;
&lt;br /&gt;
== Sunday Apr 1 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Guido van Rossum, Linus Torvalds, Bill Gates ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=1&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: [[User:example|example]] 18:11, 13 April 2009 (UTC) (Spanish interview preferred) (just an example - I used &amp;lt;nowiki&amp;gt;~~~~&amp;lt;/nowiki&amp;gt; to add a signature, and added any special requests after.)&lt;br /&gt;
&lt;br /&gt;
== Wed Apr 15 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Luis G. Lira, [[User:Aa|aa]], [[User:Tomeu|Tomeu]], ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=15&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=14&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1400 UTC]: Vamsi Krishna Davuluri (iwikiwi)&lt;br /&gt;
&lt;br /&gt;
* 1430 UTC: Eduardo Silva (edsiper)&lt;br /&gt;
&lt;br /&gt;
* 1500 UTC: Lucian Branescu (lucian1900)&lt;br /&gt;
&lt;br /&gt;
* 1530 UTC: Deepank Gupta (deepank)&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 (Wed 15 in the western hemisphere) ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, [[User:Aa|aa]], [[User:BryanWB|BryanWB]], [[User:Nrp|Nirav Patel]] ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=0&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 0000 UTC]:Felipe López Toledo (subzero)  ?Can we move this to 0100 UTC? I usually just wake up at this time. [[User:BryanWB|BryanWB]]&lt;br /&gt;
* 0030 UTC:&lt;br /&gt;
&lt;br /&gt;
* 0100 UTC:&lt;br /&gt;
&lt;br /&gt;
* 0130 UTC:&lt;br /&gt;
&lt;br /&gt;
== Thurs Apr 16 ==&lt;br /&gt;
&lt;br /&gt;
Mentors attending: Jameson Quinn, Sayamindu Dasgupta, Assim Deodia, [[User:Tomeu|Tomeu]], ...&lt;br /&gt;
&lt;br /&gt;
* [http://www.timeanddate.com/counters/customcounter.html?day=16&amp;amp;month=04&amp;amp;year=2009&amp;amp;hour=18&amp;amp;min=0&amp;amp;sec=0&amp;amp;p0=0 1800 UTC]: Satya Komaragiri (mavu)&lt;br /&gt;
&lt;br /&gt;
* 1830 UTC: [http://wiki.sugarlabs.org/go/speech-synthesis chiragjain1989 (chirag)]&lt;br /&gt;
&lt;br /&gt;
* 1900 UTC: [http://wiki.sugarlabs.org/go/Marbles Puneet_Girdhar]&lt;br /&gt;
&lt;br /&gt;
* 1930 UTC:&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27239</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27239"/>
		<updated>2009-04-12T05:23:40Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
First of all, I apologize for answering this question so late. I was not aware of this talk page. (That is not an excuse; the fault is mine.)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker simply speaks each keyboard character the user types: the letters a-z, the digits 0-9, special characters such as * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys such as Tab, Alt, Control and Shift.&lt;br /&gt;
&lt;br /&gt;
  &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of the child, and whenever a key is pressed the facility says how to pronounce it. This creates interest in the child and turns a playful activity into a learning tool. Besides the letters, the child can also easily learn the names of the special characters.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere, in any window or in any activity such as the Write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS engine to be spoken. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will be system-wide and will run in the background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
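&lt;br /&gt;
The word-buffering step described above can be sketched in Python as follows. This is only an illustration and is not the contents of keyboard_speaker.py: the WordBuffer class and the speak callback are hypothetical names, and a plain list stands in for the TTS engine so that the sketch runs without any audio setup.&lt;br /&gt;

```python
class WordBuffer:
    """Collects typed characters and releases a whole word when space is pressed."""

    def __init__(self, speak):
        # speak is any callable taking one string.
        # It is a hypothetical stand-in for the real TTS call.
        self.speak = speak
        self.chars = []

    def key_pressed(self, char):
        if char == " ":
            # Space ends the current word: hand it over and start again.
            word = "".join(self.chars)
            self.chars = []
            if word:
                self.speak(word)
        else:
            self.chars.append(char)


# Simulate a user typing "hello world " one key at a time.
spoken = []
keyboard = WordBuffer(spoken.append)
for char in "hello world ":
    keyboard.key_pressed(char)
print(spoken)
```

In a real system the characters would come from a keyboard hook instead of a string, and speak would pass each word on to the TTS engine.&lt;br /&gt;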
&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This facility will help the child learn to type the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. So if the child misspells a word, the speaker will pronounce the misspelled word, it will not match the sound the child remembers, and the child can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
Now these two options can be given in the GUI under the keyboard speaker option.&lt;br /&gt;
&lt;br /&gt;
The first option is very easy to implement.&lt;br /&gt;
&lt;br /&gt;
For the second option I have written a sample keyboard_speaker.py for demonstration purposes only. It would be great if you could test this script and give your comments.&lt;br /&gt;
&lt;br /&gt;
The zip archive containing keyboard_speaker.py can be downloaded from&lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27238</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27238"/>
		<updated>2009-04-12T05:21:37Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:      I would be honored to work with any mentor you assign.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I started college, but I heard a lot about it from my seniors. I then started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential time complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know more about open source, I want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       &#039;&#039;&#039;Speech Synthesis&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims to create a framework that generates speech in core Sugar. I want to implement speech as a basic functionality in Sugar.&lt;br /&gt;
:      To make this clearer, I will use some scenarios to elaborate my proposal. Imagine that a window containing some text is open in Sugar. The framework will provide speech for the text in that window: the user can select any text and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First the user selects the text to listen to, then presses either a keyboard shortcut (a key combination such as Alt+s) or a button provided in Sugar, like the Home button. From now on I will call this button the speech button. It can be a permanent button that shows up when the mouse pointer moves to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      With this tool the user can configure the speech: increase or decrease the volume, and change the language, accent, pitch, male or female voice, rate of speech and so on.&lt;br /&gt;
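&lt;br /&gt;
:      As a rough sketch of how such settings could reach the TTS engine, the Python function below turns them into a command line for the espeak tool. It assumes the espeak options -v (voice and language), -a (amplitude), -p (pitch) and -s (speed in words per minute). The function name espeak_args and the default values are placeholders for illustration and are not part of the proposal.&lt;br /&gt;

```python
def espeak_args(text, voice="en", volume=100, pitch=50, rate=175):
    """Build an espeak command line from the chosen speech settings."""
    # Clamp values to the ranges espeak documents for these options.
    volume = max(0, min(200, volume))  # -a amplitude, 0 to 200
    pitch = max(0, min(99, pitch))     # -p pitch, 0 to 99
    return ["espeak", "-v", voice, "-a", str(volume),
            "-p", str(pitch), "-s", str(rate), text]


# Example: an English female voice variant with an out-of-range volume.
args = espeak_args("hello", voice="en+f3", volume=250)
print(args)
```

:      The resulting list could be passed to subprocess.run on a machine where espeak is installed. The sketch only builds the arguments, so it runs without audio hardware.&lt;br /&gt;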
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text, and the user selects the text and presses the speech button. &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;A separate window containing the selected text will open up. The captioning will be shown in that separate window while the framework is reading the text.&#039;&#039;&#039; &lt;br /&gt;
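&lt;br /&gt;
:      One building block for this captioning is knowing where each word sits in the selected text. The Python sketch below computes those positions. The function name word_spans is hypothetical, and synchronizing the highlight with the audio is outside the scope of this sketch.&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return (start, end, word) for every whitespace-separated word in text."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"\S+", text)]


# Each tuple tells the caption window which character range to color next.
spans = word_spans("read with me")
for start, end, word in spans:
    print(start, end, word)
```

:      The caption window could color the characters from start to end while the matching word is being spoken.&lt;br /&gt;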
 &lt;br /&gt;
:     One more thing I am aiming for is a keyboard speaker. The configuration tool will give the user an option to turn the keyboard speaker on or off. When the keyboard speaker is on, the framework will speak each key as the user presses it: pressing the Tab key speaks &#039;Tab&#039;, pressing Caps Lock speaks &#039;caps lock&#039;, and so on.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;More details of keyboard speaker&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
     Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker simply speaks each keyboard character the user types: the letters a-z, the digits 0-9, special characters such as * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys such as Tab, Alt, Control and Shift.&lt;br /&gt;
&lt;br /&gt;
 ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of the child, and whenever a key is pressed the facility says how to pronounce it. This creates interest in the child and turns a playful activity into a learning tool. Besides the letters, the child can also easily learn the names of the special characters.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. The words can be typed anywhere, in any window or in any activity such as the Write activity. To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS engine to be spoken. The main advantage is that this facility will be system-wide and will run in the background without interfering with any other activity.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
   ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
This facility will help the child learn to type the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. So if the child misspells a word, the speaker will pronounce the misspelled word, it will not match the sound the child remembers, and the child can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
Now these two options can be given in the GUI under the keyboard speaker option.&lt;br /&gt;
&lt;br /&gt;
I have implemented a sample keyboard_speaker.py which works system-wide. It can easily be tested on Sugar. The zip archive can be downloaded from the link below: &lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;br /&gt;
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The icon reader is simple. While a blind user is browsing the XO, it lets the user keep track of the current position of the mouse pointer. Suppose the mouse pointer is currently on the Home button. When the user presses a predefined keyboard key, the framework will speak &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key easy for a blind user to find by touch, in the same way that the j and f keys have a slight projection. This functionality can be a boon for blind users.   &lt;br /&gt;
&lt;br /&gt;
:     To summarize, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Providing speech for text opened in any window in any activity in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Providing a configuration panel with a GUI from which the speech configuration can be changed.&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Karaoke-style coloring of the text being spoken.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Keyboard speaker.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Icon reader.&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
:     Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     Eye-tracking research on Same Language Subtitling (SLS) shows that &#039;&#039;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience if done with speech. The literacy rate can be increased by 6-10% if speech is included with text, because our brains remember sounds more easily than text. So I am making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    In addition, the XO will become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students can learn much faster if speech accompanies the text or words they read. So including this framework in Sugar will make it more effective.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind students and children will also be able to use the XO, which will be a boon for them. Blind children also want to study.&lt;br /&gt;
&lt;br /&gt;
:    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion are: &lt;br /&gt;
&lt;br /&gt;
:    Some rough ideas for the implementation:&lt;br /&gt;
&lt;br /&gt;
:    The basic structure of my speech synthesis framework is shown below.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                         -----------------------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   At the lowest level is the speech engine (espeak), which produces the speech.&lt;br /&gt;
&lt;br /&gt;
*   There are two options for the layer over the espeak TTS engine: speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   Both of these use espeak. Listen and Spell uses speechd. However, when I discussed this with alsroot on IRC, he advised against speechd because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
 &lt;br /&gt;
*   PyGTK can be used for the GUI.&lt;br /&gt;
&lt;br /&gt;
*   To get the user-selected text, my idea is to use the clipboard module, which handles copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   A small code snippet that I have prepared for demonstration purposes is shown below. To try it, first download espeak.py from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    First select some text and then run the code from a terminal. The code will speak the selected text. This is the basic behaviour we want to achieve in Sugar.&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak  # espeak.py downloaded from the link above&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # The PRIMARY selection holds the text currently highlighted by the user&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   You can select text anywhere in Sugar. Although this code uses espeak directly, in the future I will use the gstreamer plugin.&lt;br /&gt;
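&lt;br /&gt;
:   As a rough sketch of the gstreamer route, the snippet below only builds a gst-launch style pipeline description for the selected text and does not play anything. The element name espeak, its text, pitch and rate properties, and the helper names quote and pipeline_description are assumptions made for illustration and should be checked against the installed plugin.&lt;br /&gt;
&lt;br /&gt;
```python
def quote(value):
    """Wrap a property value in double quotes, escaping embedded ones."""
    return '"' + value.replace('\\', '\\\\').replace('"', '\\"') + '"'


def pipeline_description(text, pitch=0, rate=0):
    """Return a gst-launch style description string; nothing is played."""
    # Assumed element and property names; verify them with gst-inspect.
    source = 'espeak text=%s pitch=%d rate=%d' % (quote(text), pitch, rate)
    # '!' links the speech source to the default audio output element.
    return source + ' ! autoaudiosink'


if __name__ == '__main__':
    print(pipeline_description('Hello from Sugar'))
```
&lt;br /&gt;
:   Keeping the description as a plain string makes it easy to inspect or log before it is handed to gstreamer.&lt;br /&gt;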
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&#039;&#039;&#039;&lt;br /&gt;
:     During this period I will remain in constant touch with my mentor and the Sugar community. I will remain active on IRC and the mailing list to discuss the design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new tools, StarDict and Orca, which could be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be completely clear on my approach. For now, I am providing a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&#039;&#039;&#039;&lt;br /&gt;
*     Implement the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6 - June 15&#039;&#039;&#039;&lt;br /&gt;
*     Discuss the GUI design for the configuration manager on IRC.&lt;br /&gt;
*     Finalize the GUI design of the configuration manager.&lt;br /&gt;
*     Release snapshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&#039;&#039;&#039;&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described into the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&#039;&#039;&#039;&lt;br /&gt;
*     Implement the keyboard speaker.&lt;br /&gt;
*     Release the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26 - July 7&#039;&#039;&#039;&lt;br /&gt;
*     Implement the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&#039;&#039;&#039;&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&#039;&#039;&#039;&lt;br /&gt;
*    Midterm evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&#039;&#039;&#039;&lt;br /&gt;
*    Test the framework on the XO.&lt;br /&gt;
*    Ask for bug reports and suggestions for further improvement.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&#039;&#039;&#039;&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13&#039;&#039;&#039; &lt;br /&gt;
*    Ask for feedback.&lt;br /&gt;
*    Prepare for the beta release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&#039;&#039;&#039;&lt;br /&gt;
*    Start the documentation work.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&#039;&#039;&#039;&lt;br /&gt;
*    Continue working on the beta release.&lt;br /&gt;
*    Continue working on the framework to make it available in official Sugar distributions. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:     &lt;br /&gt;
:    I already have considerable coding experience from open source events such as the AI Challenge, for which I wrote the simulator code in just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:    Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:    I have solved many complex problems on the SPOJ programming contest site.&lt;br /&gt;
:    http://www.spoj.pl&lt;br /&gt;
:    http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:    Another reason I can complete the project is that I will have a break of almost three months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
:    I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from my college have been associated with OLPC development work, for example:&lt;br /&gt;
 &lt;br /&gt;
*    Food Force, which is still under development. We recently achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the last year. &lt;br /&gt;
*    Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (Senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*    Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    These examples show that I have many helpful seniors with a lot of experience who are ready to help me in every way they can. I can therefore get guidance quickly on technical design or other work, or if I get stuck anywhere. My chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would, by my estimate, make Sugar 10-15% more effective. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very fast. They will also be able to listen to a story or any other text rather than just reading it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it, and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;According to Assim Deodia (Mentor of Sugar Labs for GSoC 2009)&lt;br /&gt;
:     &#039;&#039;&#039;assim.deodia@gmail.com&lt;br /&gt;
:     This proposal has great potential since speech synthesis is a long desired component of Sugar. Activities like Speak and Listen and Spell are already using speech synthesis, and it would be very useful to have more speech-enabled activities. Various surveys have shown that voice + text based learning is much more efficient than text-only learning. If, as promised, captioning is also achieved, this would enhance the learning manifold.&lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate the efforts of Sugar Labs if they are planning this. I think that my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute to this one as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC projects (such as Food Force) and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater, I became confused when selecting the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     I think I have already covered most things. I don&#039;t know whether you are going to accept this proposal or not, but one thing I can assure you is that I have the determination to achieve whatever I have mentioned. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable, so please always consider this project in future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
for my Comment on this:&lt;br /&gt;
&lt;br /&gt;
http://wiki.sugarlabs.org/go/Summer_of_Code/Application_review_notes#Competing_proposals&lt;br /&gt;
&lt;br /&gt;
Please visit:  &lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;br /&gt;
&lt;br /&gt;
and also my talk page&lt;br /&gt;
&lt;br /&gt;
Regards&lt;br /&gt;
&lt;br /&gt;
Chirag Jain&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27201</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27201"/>
		<updated>2009-04-11T19:07:21Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:      I will be honored to work with any mentor you provide.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I joined college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also wrote an open source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also participate actively on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know a lot about open source, I want to gain some real-world experience in open source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       &#039;&#039;&#039;Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims to create a framework that will generate speech in core Sugar. I want to implement speech as a basic functionality of Sugar.&lt;br /&gt;
:      Let me be more specific, using some use cases to elaborate my proposal. Imagine that a window containing some text is open in Sugar. I will provide speech for the text in that window: the user will be free to listen to any text he has selected, and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First he selects the text he wants to listen to; then he simply presses a keyboard shortcut (a key combination such as Alt+S) or a button provided in Sugar, like the Home button. From now on I will call this button the speech button. It can be made a permanent button that shows up when the mouse pointer moves to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. The GUI must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      With this tool you can configure the speech: increase or decrease the volume, and change the language, accent, pitch, male or female voice, rate of speech, and so on.&lt;br /&gt;
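&lt;br /&gt;
:      As a rough illustration of how such settings could be passed to the speech engine, the sketch below maps them onto espeak command line options (-v voice, -a amplitude, -p pitch, -s speed in words per minute). The function name espeak_args, the default values and the choice of the +f2 and +m2 voice variants are assumptions made for this example, not part of the framework.&lt;br /&gt;
&lt;br /&gt;
```python
def espeak_args(text, language='en', female=False,
                volume=100, pitch=50, rate=160):
    """Build an espeak argument list from configuration-tool settings."""
    # espeak voice names accept a variant suffix such as +f2 or +m2.
    voice = language + ('+f2' if female else '+m2')
    # Clamp to the ranges espeak documents: amplitude 0-200, pitch 0-99.
    volume = max(0, min(200, volume))
    pitch = max(0, min(99, pitch))
    return ['espeak', '-v', voice, '-a', str(volume),
            '-p', str(pitch), '-s', str(rate), text]


if __name__ == '__main__':
    print(espeak_args('Hello', language='hi', female=True, rate=120))
```
&lt;br /&gt;
:      The resulting list could be handed to subprocess.call() to speak the text, provided espeak is installed.&lt;br /&gt;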
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text, and the user selects the text and presses the speech button. &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;A separate window containing the selected text will open up. The captioning will be achieved in that separate window while the framework is reading the text.&#039;&#039;&#039; &lt;br /&gt;
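&lt;br /&gt;
:    One possible way to drive the captioning is to compute the position of every word in the selected text up front and then highlight one span at a time while the engine speaks. The sketch below shows only that bookkeeping; the helper names word_spans and highlight are hypothetical, the asterisk marker merely stands in for coloring, and synchronization with the speech engine is not shown.&lt;br /&gt;
&lt;br /&gt;
```python
import re


def word_spans(text):
    """Return (start, end, word) for each word, in reading order."""
    return [(m.start(), m.end(), m.group())
            for m in re.finditer(r'\S+', text)]


def highlight(text, span, marker='*'):
    """Return the text with the word in the given span wrapped in markers."""
    start, end, _word = span
    return text[:start] + marker + text[start:end] + marker + text[end:]


if __name__ == '__main__':
    story = 'the cat sat'
    for span in word_spans(story):
        # In the real window this step would recolor the word instead.
        print(highlight(story, span))
```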
 &lt;br /&gt;
:     One more thing I am aiming at is a keyboard speaker. In the configuration tool, the user will have an option to turn the keyboard speaker on or off. When it is turned on, the framework will speak each key as the user presses it: if the user presses the Tab key the framework will speak &#039;Tab&#039;, on pressing Caps Lock it will speak &#039;caps lock&#039;, and so on.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;More details of keyboard speaker&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
     Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the letters a-z, the digits 0-9, special characters such as * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys such as Tab, Alt, Control and Shift.&lt;br /&gt;
&lt;br /&gt;
 ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and whenever he presses a key the facility tells him how to pronounce it. This will also spark the child&#039;s interest, so this playful activity becomes a learning tool. Besides letters, the child can easily learn the names of the special characters.&lt;br /&gt;
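&lt;br /&gt;
A minimal sketch of the lookup behind this option is given below. It assumes that keys arrive either as single characters or as key names such as Caps_Lock; the table SPOKEN_NAMES and the function spoken_name are hypothetical names used only for this example.&lt;br /&gt;
&lt;br /&gt;
```python
# Spoken names for keys that cannot simply be read out as they are.
SPOKEN_NAMES = {
    '*': 'asterisk',
    '#': 'hash',
    chr(38): 'ampersand',  # chr(38) is the ampersand character
    ' ': 'space',
    '\t': 'tab',
}


def spoken_name(key):
    """Return the words to speak for a single key or key name."""
    if key in SPOKEN_NAMES:
        return SPOKEN_NAMES[key]
    # Names such as Caps_Lock become 'caps lock'; letters and digits
    # are returned as they are, in lower case.
    return key.replace('_', ' ').lower()


if __name__ == '__main__':
    for key in ['a', '7', '*', 'Caps_Lock']:
        print(spoken_name(key))
```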
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. The words can be typed anywhere, in any window or activity, such as the Write activity. To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until the space key is pressed; at that point the entire word will be sent to the TTS engine to be spoken. The main advantage is that this facility will be system-wide and will run in the background without interfering with any other activity.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
   ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
This facility will help the child learn and type the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. If he types a word incorrectly, the speaker will speak the misspelled word, which will not match the sound he has heard before, so he can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
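&lt;br /&gt;
The buffering described for this option can be sketched as follows. This is only an illustration under stated assumptions: the class name WordSpeaker is hypothetical, the speak callback stands in for whatever TTS call is finally used, and the keyboard hook that would feed key_pressed is not shown.&lt;br /&gt;
&lt;br /&gt;
```python
class WordSpeaker:
    """Collect typed characters and hand each finished word to a callback."""

    def __init__(self, speak):
        self._speak = speak   # hypothetical TTS callback
        self._chars = []

    def key_pressed(self, char):
        """Feed one typed character into the buffer."""
        if char.isspace():
            # Space, Tab or Enter ends the current word.
            self.flush()
        elif char == '\b':
            # Backspace removes the last buffered character, if any.
            if self._chars:
                self._chars.pop()
        else:
            self._chars.append(char)

    def flush(self):
        """Speak the buffered word, if there is one, and clear the buffer."""
        if self._chars:
            self._speak(''.join(self._chars))
            self._chars = []


if __name__ == '__main__':
    speaker = WordSpeaker(print)  # print stands in for the speech engine
    for ch in 'hello world ':
        speaker.key_pressed(ch)
```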
&lt;br /&gt;
These two options can be offered in the GUI under the keyboard speaker option.&lt;br /&gt;
&lt;br /&gt;
I have implemented a sample keyboard_speaker.py that works system-wide and can easily be tested on Sugar. The file can be downloaded from the link below: &lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;br /&gt;
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? Here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The idea of the icon reader is simple: it lets a blind user keep track of the current position of the mouse pointer while browsing the XO. Suppose the mouse pointer is currently over the Home button. When the user presses a predefined keyboard key, the framework will speak &#039;Home&#039;. Similarly, if the pointer is over the desktop, the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, they could make a simple change to the XO hardware so that the predefined key can be recognized by touch, in the same way that the f and j keys have a slight ridge. This functionality could be a boon for blind users.   &lt;br /&gt;
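&lt;br /&gt;
:     To make the icon reader idea concrete, the sketch below finds the label to speak for a given pointer position by testing it against a list of labelled rectangles. How Sugar would actually report the pointer position and the widget under it is not covered here; the function label_at and the region list are hypothetical and exist only for illustration.&lt;br /&gt;
&lt;br /&gt;
```python
def label_at(x, y, regions):
    """Return the label of the first region containing (x, y), else None."""
    for label, (left, top, width, height) in regions:
        inside_x = x >= left and left + width > x
        inside_y = y >= top and top + height > y
        if inside_x and inside_y:
            return label
    return None


if __name__ == '__main__':
    # Made-up screen layout: (label, (left, top, width, height)).
    regions = [('Home', (0, 0, 50, 50)), ('Journal', (60, 0, 50, 50))]
    print(label_at(10, 10, regions))
```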
&lt;br /&gt;
:     To summarize, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Providing speech for text open in any window of any activity in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Providing a configuration panel with a GUI from which the speech configuration can be changed.&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Karaoke-style coloring of the text being spoken.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Keyboard speaker.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039;Icon reader.&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
:     Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     According to eye-tracking research, &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it is done with speech. The literacy rate may increase by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    In addition, the XO will become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread literacy. As I have already mentioned, students learn faster when speech accompanies the text they read, so including this framework in Sugar will make it a more effective learning tool.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind children will be able to use the XO, which will be a real boon for them. Blind children want to study too.&lt;br /&gt;
&lt;br /&gt;
:    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of the discussion are: &lt;br /&gt;
&lt;br /&gt;
:    Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
:    The basic structure of the speech synthesis framework is shown below.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                         -----------------------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   On the bottom level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   There are two options for the layer over the espeak TTS engine: the speech dispatcher, which was a GSoC project last year, and the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
*   Both of these use espeak. Listen and Spell uses speechd. However, when I discussed this with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
 &lt;br /&gt;
*   PyGTK can be used for the GUI.&lt;br /&gt;
&lt;br /&gt;
*   To get the text selected by the user, my idea is to use the clipboard module, which handles copy and paste. With this module the entire selected text can be sent to the speech framework to be spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   A small code snippet that I have prepared for demonstration purposes is shown below. You can try it, but first please download the espeak.py code from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    First select some text and then run the code from a terminal. The code will speak the text. This is the most basic thing we want to achieve in Sugar.&lt;br /&gt;
&lt;br /&gt;
                         import gtk  # PyGTK&lt;br /&gt;
                         from espeak import espeak  # espeak.py downloaded from the link above&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # The PRIMARY selection holds the text currently highlighted by the user&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()  # None when no text could be retrieved&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   You can select the text anywhere in Sugar. Although this code uses espeak directly, in future I will use the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will remain in constant touch with my mentor and the Sugar community. I will remain active on IRC and the mailing list to discuss the design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new things, STARDICT and ORCA. These can be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be absolutely clear about my approach. For now I am providing a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Implement the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6- June 15     &lt;br /&gt;
*     Discuss the GUI design for configuration manager on IRC.&lt;br /&gt;
*     Design the finalized GUI of the configuration manager.&lt;br /&gt;
*     Release screenshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implementation of the keyboard speaker.&lt;br /&gt;
*     Releasing the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26- July 7&lt;br /&gt;
*     Implement the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Mid-term evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Test it on the XO.&lt;br /&gt;
*    Ask for bug reports and suggestions for further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Preparation for the beta version release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue working on the beta release.&lt;br /&gt;
*    I will continue working on this to make it available in the official Sugar distros. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:     &lt;br /&gt;
:    I already have considerable coding experience from open source events such as the AI Challenge, whose simulator code I prepared in just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    At school I also wrote many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:    Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:    I have solved many complex problems on the SPOJ programming contest site.&lt;br /&gt;
:    http://www.spoj.pl&lt;br /&gt;
:    http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:    Another reason I can comfortably complete the project is that I will have a break of almost three months during my summer vacation, from the end of May to August. I can therefore concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
:    I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from my college have been associated with OLPC development work, for example:&lt;br /&gt;
 &lt;br /&gt;
*    Food Force, which is still under development. We have recently achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the past year. &lt;br /&gt;
*    Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (my senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*    Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    What I am trying to convey with these examples is that I have many helpful seniors with a lot of experience who are ready to help me in every way they can. I can therefore get guidance and ready help with technical design or any other work if I get stuck, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very fast. They will also be able to listen to a story or any other text rather than just read it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;According to Assim Deodia (Mentor of Sugar Labs for GSoC 2009)&lt;br /&gt;
:     &#039;&#039;&#039;assim.deodia@gmail.com&lt;br /&gt;
:     This proposal has great potential since speech synthesis is a long-desired component of Sugar. Activities like speak and listen-spell are already using speech synthesis and it would be very useful to have more speech-enabled activities. Various surveys have shown that voice + text based learning is much more efficient than text-only learning. If, as promised, captioning is also achieved, this would enhance the learning multifold.&lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate the efforts of Sugar Labs if they are planning this. I think that my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Google is also a very good option.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, there were some teachers who believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. Even when I could work out which number was greater, I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a neat method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign in 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     I think I have already covered most things. I don&#039;t know whether you are going to accept it or not, but one thing I can assure you of is that I have the determination to achieve everything I have mentioned. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable, so please keep this project in mind in future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
for my Comment on this:&lt;br /&gt;
&lt;br /&gt;
http://wiki.sugarlabs.org/go/Summer_of_Code/Application_review_notes#Competing_proposals&lt;br /&gt;
&lt;br /&gt;
Please visit:  &lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;br /&gt;
&lt;br /&gt;
and also my talk page&lt;br /&gt;
&lt;br /&gt;
Regards&lt;br /&gt;
&lt;br /&gt;
Chirag Jain&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27200</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27200"/>
		<updated>2009-04-11T18:57:49Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
Well, first of all I want to apologize for answering this question so late. The main reason is that I was unaware of this talk page. (I don&#039;t offer this as an excuse; I know the fault is mine.)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the letters a-z, the digits 0-9, special characters like * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys like tab, alt, control and shift.&lt;br /&gt;
&lt;br /&gt;
  &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and when he presses any key or symbol the facility tells him how to pronounce it. This will also create interest in the child, and this playful activity will become a learning tool. Besides the alphabet, the child can also easily learn the names of the special characters.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere, in any window or in any activity such as the Write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS engine to be spoken. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will be system-wide and will run in the background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
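&lt;br /&gt;
The word buffering described above can be sketched roughly as follows. This is only an illustration and not the keyboard_speaker.py script: the WordBuffer class and its feed and flush methods are hypothetical names, the speak callback stands in for whatever TTS call is used, and treating any whitespace as a word boundary and dropping a character on backspace are assumptions that go beyond the plan above.&lt;br /&gt;
&lt;br /&gt;

```python
class WordBuffer(object):
    """Collect typed characters and hand each finished word to a callback."""

    def __init__(self, speak):
        # speak is a hypothetical callback, e.g. the speak method of a TTS object.
        self._speak = speak
        self._chars = []

    def feed(self, char):
        """Process one typed character."""
        if char.isspace():
            # Assumption: any whitespace (space, tab, newline) ends the word.
            self.flush()
        elif char == "\b":
            # Assumption: backspace removes the last buffered character.
            if self._chars:
                self._chars.pop()
        else:
            self._chars.append(char)

    def flush(self):
        """Pass the buffered word, if any, to the callback and start a new word."""
        if self._chars:
            self._speak("".join(self._chars))
            self._chars = []


if __name__ == "__main__":
    spoken = []
    buf = WordBuffer(spoken.append)  # collect the words instead of speaking them
    for ch in "helo\blo world ":
        buf.feed(ch)
    print(spoken)
```

&lt;br /&gt;
Keeping the buffer separate from the keyboard hook and from the speech engine means the same logic could be fed by any keystroke source and tested without audio.&lt;br /&gt;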
&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This facility will help the child type and learn the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. If the child types a word incorrectly, the speaker will speak the misspelled word, it will not match the sound he remembers, and he can then easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
These two options can be offered in the GUI under the keyboard speaker option.&lt;br /&gt;
&lt;br /&gt;
The first option is very easy to implement.&lt;br /&gt;
&lt;br /&gt;
For the second option I have written a sample keyboard_speaker.py for demonstration purposes only. It uses espeak.py.&lt;br /&gt;
It would be great if you could test this script and give your comments.&lt;br /&gt;
&lt;br /&gt;
espeak.py &lt;br /&gt;
&lt;br /&gt;
http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
keyboard_speaker.py&lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis/downloads/list&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27199</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27199"/>
		<updated>2009-04-11T18:55:44Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
Well, first of all I want to apologize for answering this question so late. The main reason is that I was unaware of this talk page. (I don&#039;t offer this as an excuse; I know the fault is mine.)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the letters a-z, the digits 0-9, special characters like * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys like tab, alt, control and shift.&lt;br /&gt;
&lt;br /&gt;
  &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and when he presses any key or symbol the facility tells him how to pronounce it. This will also create interest in the child, and this playful activity will become a learning tool. Besides the alphabet, the child can also easily learn the names of the special characters.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere, in any window or in any activity such as the Write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS engine to be spoken. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will be system-wide and will run in the background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This facility will help the child type and learn the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. If the child types a word incorrectly, the speaker will speak the misspelled word, it will not match the sound he remembers, and he can then easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
These two options can be offered in the GUI under the keyboard speaker option.&lt;br /&gt;
&lt;br /&gt;
The first option is very easy to implement.&lt;br /&gt;
&lt;br /&gt;
For the second option I have written a sample keyboard_speaker.py for demonstration purposes only. It uses espeak.py.&lt;br /&gt;
It would be great if you could test this script and give your comments.&lt;br /&gt;
&lt;br /&gt;
espeak.py &lt;br /&gt;
&lt;br /&gt;
http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
keyboard_speaker.py&lt;br /&gt;
&lt;br /&gt;
http://code.google.com/p/speech-synthesis&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27136</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=27136"/>
		<updated>2009-04-10T18:17:09Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:      I will be honored to work with any mentor you assign.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I entered college, but then I heard a lot about it from my seniors. I started participating in coding events, and my first open source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also wrote an open source Sudoku solver in C++ using backtracking. The algorithm has exponential time complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know a lot about open source, I want to gain some hands-on experience in open source development. GSoC is an opportunity where I can apply my technical skills, learn new things and at the same time contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       &#039;&#039;&#039;Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims to create a framework that will generate speech in core Sugar. I want to implement speech as a basic functionality in Sugar.&lt;br /&gt;
:      Let me make this clearer with a scenario. Imagine that a window containing some text is open in Sugar. My framework will provide speech for the text in that window: the user will be free to listen to the text he has selected, and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First he has to select the text he wants to listen to; then he can simply press a keyboard shortcut (a key combination like Alt+s or something else) or a button provided in Sugar, like the home button. From now on I will call this button the speech button. It can be made a permanent button that shows up when the mouse pointer is moved to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI, simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      With this tool you can configure the speech: increase or decrease the volume, and change the language, accent, pitch, male or female voice, rate of speech and so on.&lt;br /&gt;
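&lt;br /&gt;
:      As a rough sketch of how such settings could be turned into a call to the espeak command line tool, the function below builds an argument list from them. The names build_espeak_command and clamp are hypothetical, and the sketch targets the espeak command line options -v (voice), -a (amplitude), -p (pitch) and -s (speed) rather than the gstreamer plugin. The value ranges and defaults follow the espeak help text as I understand it and should be checked against the installed version.&lt;br /&gt;
&lt;br /&gt;

```python
def clamp(value, low, high):
    """Keep value inside the closed interval [low, high]."""
    return max(low, min(high, value))


def build_espeak_command(text, language="en", variant=None,
                         volume=100, pitch=50, rate=175):
    """Return an argument list for the espeak command line tool.

    language is an espeak voice name such as "en" or "en-us".
    variant is an optional voice variant such as "f3" (a female voice).
    Ranges below are assumptions taken from the espeak help text.
    """
    voice = language if variant is None else "%s+%s" % (language, variant)
    return [
        "espeak",
        "-v", voice,                          # language, accent and variant
        "-a", str(clamp(volume, 0, 200)),     # amplitude (volume)
        "-p", str(clamp(pitch, 0, 99)),       # pitch
        "-s", str(clamp(rate, 80, 450)),      # speed in words per minute
        text,
    ]


if __name__ == "__main__":
    print(build_espeak_command("Hello from Sugar", language="en-us", variant="f3"))
```

&lt;br /&gt;
:      If espeak is installed, the returned list could be passed to subprocess.call to speak the text. The command is built as a list so that the selected text never passes through a shell.&lt;br /&gt;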
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button. &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;A separate window containing the selected text will open up. The captioning will be achieved in that separate window while the framework is reading the text.&#039;&#039;&#039; &lt;br /&gt;
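&lt;br /&gt;
:      One small building block for this kind of captioning is knowing where each word starts and ends in the selected text, so that the word currently being spoken can be colored. The sketch below is only an illustration: word_spans and mark_word are hypothetical helper names, square brackets stand in for the coloring, and how the highlighting would be synchronized with the speech engine is not covered here.&lt;br /&gt;
&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return a (start, end) character span for every word in text."""
    return [(m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def mark_word(text, index, left="[", right="]"):
    """Return text with the word at position index wrapped in markers.

    In a real caption window the span would be colored instead of bracketed.
    """
    start, end = word_spans(text)[index]
    return text[:start] + left + text[start:end] + right + text[end:]


if __name__ == "__main__":
    sentence = "The cat sat"
    # Step through the words in order, as a caption window would while speaking.
    for i in range(len(word_spans(sentence))):
        print(mark_word(sentence, i))
```

&lt;br /&gt;
:      Working with character spans rather than split words keeps the original spacing intact, which matters when the spans are applied to the text shown in the caption window.&lt;br /&gt;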
 &lt;br /&gt;
:     One more thing I am aiming at is a keyboard speaker. In the configuration tool, the user will have an option to turn the keyboard speaker on or off. If the keyboard speaker is turned on, then whenever the user presses a key, the framework will speak it. For example, if the user presses the Tab key the framework will speak &#039;Tab&#039;, on pressing Caps Lock it will speak &#039;caps lock&#039;, and so on.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;More details of keyboard speaker&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
     Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the letters a-z, the digits 0-9, special characters like *(asterisk), &amp;amp;(ampersand), #(hash) etc., and other keys like tab, alt, control, shift etc.&lt;br /&gt;
&lt;br /&gt;
 ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
The child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and when he presses any key or symbol the facility tells him how to pronounce it. This will also create an interest in the child, and this playful activity will become a learning tool. Besides the alphabet, the child can also learn the names of the special characters easily.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. The words can be typed anywhere, in any window or in any activity such as the Write activity. To achieve this I will hook the keyboard, that is, tap the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS for speaking. The main advantage is that this facility will be system-level and will run in the background without interfering with any other activity.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
   ADVANTAGE&lt;br /&gt;
&lt;br /&gt;
This facility will help the child to type and learn the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. So if he types the word incorrectly, the speaker will speak the wrongly typed word, it will not match the sound he has heard before, and he can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
Now these two options can be given in the GUI under the keyboard speaker option&lt;br /&gt;
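&lt;br /&gt;
The two modes described above can be sketched in a few lines. This is only an illustration under assumptions: the class, the key-name table and the speak callback are hypothetical, and the pending word is kept in memory. Capturing the real keystrokes system-wide is not shown.&lt;br /&gt;

```python
# Hypothetical sketch: key names and the speak callback are illustrative only.
KEY_NAMES = {"*": "asterisk", "#": "hash", "\t": "tab", " ": "space"}


class KeyboardSpeaker(object):
    def __init__(self, speak, mode="characters"):
        self.speak = speak      # callable taking the text to be spoken
        self.mode = mode        # "characters" or "words"
        self.buffer = []        # letters of the word typed so far

    def on_key(self, char):
        if self.mode == "characters":
            # Speak every key at once, using a readable name where one is known.
            self.speak(KEY_NAMES.get(char, char))
        elif char.isspace():
            # Whitespace ends the word: speak what was collected and reset.
            if self.buffer:
                self.speak("".join(self.buffer))
            self.buffer = []
        else:
            self.buffer.append(char)
```

Passing a list's append method as the speak callback makes it easy to check what would be spoken without any sound hardware.&lt;br /&gt;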
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The icon reader is simple. It lets a blind user browsing the XO keep track of the current position of the mouse pointer. Suppose the mouse pointer is currently on the Home button. When the user presses a pre-defined keyboard key, the framework will speak &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a simple change in the XO hardware could make that predefined key easy for a blind user to find by touch, just as the j and f keys have a slight projection. This functionality can be a boon for blind users.   &lt;br /&gt;
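&lt;br /&gt;
:     The lookup behind the icon reader could work roughly as below. The region table is made up for illustration; a real version would have to ask Sugar which widget is under the pointer, which is not shown here.&lt;br /&gt;

```python
# Hypothetical screen regions: (left, top, width, height, spoken label).
REGIONS = [(0, 0, 60, 60, "Home"), (60, 0, 60, 60, "Journal")]


def label_at(x, y, regions=REGIONS, default="desktop"):
    """Return the label of the first region containing the point (x, y)."""
    for left, top, width, height, label in regions:
        if x in range(left, left + width) and y in range(top, top + height):
            return label
    # The pointer is not over any known icon.
    return default
```

:     The returned label is the text that would be passed to the speech engine when the predefined key is pressed.&lt;br /&gt;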
&lt;br /&gt;
:     Here again are the main features of my proposal:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Key board speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     According to eye-tracking research, &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; (SLS stands for same-language subtitling.) &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience if done with speech. The literacy rate can be increased by 6-10% if speech is included with text, because our brain remembers sounds more easily than text. So I am making this framework for children in the 3-15 age group so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    Not only this, XO will now become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students can learn very fast if speech accompanies the text or words they read. So including this framework in Sugar will make it more effective.&lt;br /&gt;
&lt;br /&gt;
:    Not only this, blind students and children can now also use the XO, which will be a boon for them. Blind children also want to study...&lt;br /&gt;
&lt;br /&gt;
:    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion were: &lt;br /&gt;
&lt;br /&gt;
:    Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
:    Below I am showing the basic structure of my framework i.e., the speech synthesizing framework.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                         -----------------------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   On the bottom level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   There are two options for a layer over the TTS engine espeak: one is the speech dispatcher, which was created as a GSoC project last year, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the only practical option, and the best one.&lt;br /&gt;
 &lt;br /&gt;
*   For the GUI pyGtk can be used.&lt;br /&gt;
&lt;br /&gt;
*   To get the user-selected text, my idea is to use the clipboard module, which takes care of copy and paste. Using this module, the entire selected text can be sent to the speech framework so that it can speak it out.&lt;br /&gt;
&lt;br /&gt;
*   For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   A small code snippet which I have prepared for demonstration purpose is shown below. You can try it. But first please download the espeak.py code from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    First select some text and then run the code through the terminal. The code will speak the text. This is the very basic thing that we want to achieve in Sugar.&lt;br /&gt;
&lt;br /&gt;
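                         # Speak the text currently highlighted with the mouse (PyGTK 2).&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak  # espeak.py from the link above&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # PRIMARY is the X11 selection that holds the highlighted text.&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()  # None if no text could be retrieved&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;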
&lt;br /&gt;
:   You can select the text anywhere in Sugar. Although this code uses espeak directly, in future I will be using the gstreamer plugin.&lt;br /&gt;
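&lt;br /&gt;
:   For the gstreamer route, a pipeline description could be built as sketched below. This assumes that the espeak gstreamer plugin provides an element named espeak with text, voice, pitch and rate properties; those names should be checked against the installed plugin. The helper itself is hypothetical and only builds a string.&lt;br /&gt;

```python
def espeak_pipeline(text, voice="default", pitch=0, rate=0):
    """Build a gst-launch style description (element and property names are assumed)."""
    # Escape backslashes and double quotes so the text stays one quoted value.
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return 'espeak text="%s" voice=%s pitch=%d rate=%d ! autoaudiosink' % (
        escaped, voice, pitch, rate)
```

:   The resulting string is in the form accepted by the gst-launch tool and by the parse_launch function of the gstreamer Python bindings.&lt;br /&gt;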
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will remain in constant touch with my mentor and sugar community. I will remain active on IRC and mailing list to discuss the design details and further improvements that can be incorporated in this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new things called STARDICT and ORCA. These can be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be absolutely clear about my further approach. For now I am providing a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Work on implementing the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6- June 15     &lt;br /&gt;
*     Discuss the GUI design for configuration manager on IRC.&lt;br /&gt;
*     Design the finalized GUI of the configuration manager.&lt;br /&gt;
*     Will release the snapshots of the GUI on wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implementation of the keyboard speaker.&lt;br /&gt;
*     Releasing the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26- July 7&lt;br /&gt;
*     Implementation of the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8-July12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Mid term evaluation. Will release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Will test it on XO.&lt;br /&gt;
*    Ask for bugs and further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Will port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Preparation for the beta version release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue Working on beta release.&lt;br /&gt;
*    I will continue working on this to make it available in official sugar distros. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:     &lt;br /&gt;
:    I already have a lot of coding experience from open source events such as the AI challenge, whose simulator code I prepared in a span of just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    In school also I prepared a lot of small projects in C++ like digital diary, Sudoku solver, library manager, telephone directory etc.&lt;br /&gt;
:    Link: http://code.google.come/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:    I have solved a lot of complex problems at SPOJ programming contest site.&lt;br /&gt;
:    http://www.spoj.pl&lt;br /&gt;
:    http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:    Another reason that I can easily complete the project is that I will be getting almost 3 months break during my summer vacations right from the end of May to August. Therefore I can concentrate entirely on this project with all my energies.&lt;br /&gt;
&lt;br /&gt;
:    I am currently pursuing my B.E. in Computer science from Netaji Subash Institute of Technology, New Delhi. A lot of students from my college have been associated with OLPC for development work. Like:&lt;br /&gt;
 &lt;br /&gt;
*    Food Force, which is still under development. Recently we achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both are my seniors) have been involved with this OLPC project for the last year. &lt;br /&gt;
*    Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (Senior) and has recently achieved progress to remove speech dispatcher dependencies from it.&lt;br /&gt;
*    Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    What I am trying to convey with these examples is that I have many helpful seniors who have a lot of experience and who are ready to help me in every possible way. So I can get guidance and ready help with technical design or other work, or if I get stuck anywhere. My chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more effective. When children in the 3-15 age group who are learning languages hear the speech again and again, they will be able to learn very fast. Not only this, they will now be able to hear a story or any other text rather than just read it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child  listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words..&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;According to Assim Deodia (Mentor of Sugar Labs for GSoC 2009)&lt;br /&gt;
:     &#039;&#039;&#039;assim.deodia@gmail.com&lt;br /&gt;
:     This proposal has great potential, since speech synthesis is a long-desired component of Sugar. Activities like Speak and Listen and Spell are already using speech synthesis, and it would be very useful to have more speech-enabled activities. Various surveys have shown that voice + text based learning is much more efficient than text-only learning. If, as promised, captioning is also achieved, this would enhance the learning manifold.&lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this, and I think that my home town, which is still backward and has many primary schools, would be the best place to set up this pilot. I have many friends in my home town who are involved in such activities, and they would love to contribute here too. I also have a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC through projects (like Food Force) and who are ready to help me out in every possible way.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Google is also a very good option. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can mail my progress reports to sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, there were some teachers who believed in education through entertainment, so they always performed entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. Even if I could make out which number was greater or lesser, I got confused when selecting the right sign. So one day I approached my teacher. She removed my confusion with a nice method. She told me to put two dots next to the number which is greater, like : and one dot next to the number which is lesser, like . For example, if I have to place a sign between 2___ 5, then I put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
On joining these dots we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     I think I have already made most things clear. I don&#039;t know whether you are going to accept it or not, but one thing I can assure you is that I have the determination to achieve whatever I have mentioned. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable. So please always consider this project in future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27135</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27135"/>
		<updated>2009-04-10T18:14:27Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
Well first of all I want to apologize that I am answering this question so late. The main reason is my unawareness of this talk page. (Don&#039;t take it as an excuse I know I am my own culprit)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the alphabets a-z, digits 0-9, special characters like *(asterisk), &amp;amp;(ampersand), #(hash) etc, other keys like tab, alt, control, shift etc.&lt;br /&gt;
&lt;br /&gt;
  &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
The child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and when he presses any key or symbol the facility tells him how to pronounce it. This will also create an interest in the child, and this playful activity will become a learning tool. Besides the alphabet, the child can also learn the names of the special characters easily.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere in any window or in any activity like write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will be hooking the keyboard or tapping the keystrokes. I will store the characters typed by the user until space is pressed. As the user presses the space, the entire word will be sent to TTS for speaking. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will be system side and will run in background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This facility will help the child to type and learn the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. So if he types the word incorrectly, the speaker will speak the wrongly typed word, it will not match the sound he has heard before, and he can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;br /&gt;
&lt;br /&gt;
Now these two options can be given in the GUI under the keyboard speaker option.&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27134</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27134"/>
		<updated>2009-04-10T18:12:03Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
Well first of all I want to apologize that I am answering this question so late. The main reason is my unawareness of this talk page. (Don&#039;t take it as an excuse I know I am my own culprit)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker will simply speak the keyboard characters typed by the user. It will speak all the alphabets a-z, digits 0-9, special characters like *(asterisk), &amp;amp;(ampersand), #(hash) etc, other keys like tab, alt, control, shift etc.&lt;br /&gt;
&lt;br /&gt;
  &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
The child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and when he presses any key or symbol the facility tells him how to pronounce it. This will also create an interest in the child, and this playful activity will become a learning tool. Besides the alphabet, the child can also learn the names of the special characters easily.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere in any window or in any activity like write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will be hooking the keyboard or tapping the keystrokes. I will store the characters typed by the user until space is pressed. As the user presses the space, the entire word will be sent to TTS for speaking. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will be system side and will run in background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039;ADVANTAGE&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This facility will help the child to type or learn the correct spelling of the word. It is natural for a human mind to memorize the sound of a word more easily than its exact spelling. So if he types the word incorrect then as the speaker will speak the wrongly typed word and it will not match with the sound he heard, he can easily correct the spelling. In this manner the speech can be incorporated in the existing sugar write activity.&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27133</id>
		<title>Talk:Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Talk:Speech-synthesis&amp;diff=27133"/>
		<updated>2009-04-10T18:08:51Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This would be an important addition to Sugar&#039;s capabilities. &lt;br /&gt;
&lt;br /&gt;
What does it take to make a GStreamer plugin? &lt;br /&gt;
&lt;br /&gt;
Can you explain better how the keyboard speaker would work?&lt;br /&gt;
&lt;br /&gt;
[[User:Homunq|Homunq]] 00:39, 31 March 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
My answers:&lt;br /&gt;
&lt;br /&gt;
Well, first of all I want to apologize for answering this question so late. The main reason is that I was unaware of this talk page. (Don&#039;t take it as an excuse; I know the fault is mine.)&lt;br /&gt;
&lt;br /&gt;
Keyboard speaker:&lt;br /&gt;
&lt;br /&gt;
My idea is to use the keyboard speaker in two different ways.&lt;br /&gt;
&lt;br /&gt;
1. speaking characters:&lt;br /&gt;
&lt;br /&gt;
In this option, the speaker simply speaks the keyboard characters typed by the user. It will speak all the letters a-z, the digits 0-9, special characters such as * (asterisk), &amp;amp; (ampersand) and # (hash), and other keys such as Tab, Alt, Control and Shift.&lt;br /&gt;
&lt;br /&gt;
===============&lt;br /&gt;
  ADVANTAGE&lt;br /&gt;
===============&lt;br /&gt;
&lt;br /&gt;
A child using this facility can easily learn and memorize the alphabet. The symbols are in front of him, and whenever he presses a key the facility tells him how to pronounce it. This will also create interest in the child, so that a playful activity becomes a learning tool. Besides the alphabet, the child can also easily learn the names of the special characters.&lt;br /&gt;
&lt;br /&gt;
2. speaking words:&lt;br /&gt;
&lt;br /&gt;
In this option, the facility will speak the words typed by the user. &#039;&#039;&#039;The words can be typed anywhere, in any window or in any activity such as the Write activity.&#039;&#039;&#039;&lt;br /&gt;
To achieve this I will hook the keyboard to capture the keystrokes. I will store the characters typed by the user until space is pressed. When the user presses space, the entire word will be sent to the TTS engine to be spoken. &lt;br /&gt;
&#039;&#039;&#039;The main advantage is that this facility will work system-wide and will run in the background without interfering with any other activity.&#039;&#039;&#039;&lt;br /&gt;
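&lt;br /&gt;
A minimal sketch of this buffering step in Python, assuming the keystrokes arrive as a plain sequence of characters. The name iter_spoken_words is hypothetical; the actual keyboard hook and the call into the TTS are not shown.&lt;br /&gt;

```python
def iter_spoken_words(keystrokes):
    """Yield each completed word from a stream of typed characters.

    Characters are buffered until a space arrives; the buffered word is
    then handed out (in the real facility it would be sent to the TTS).
    """
    buffer = []
    for char in keystrokes:
        if char == " ":
            if buffer:  # ignore repeated spaces
                yield "".join(buffer)
                buffer = []
        else:
            buffer.append(char)


if __name__ == "__main__":
    # Stand-in for the TTS call: just print what would be spoken.
    for word in iter_spoken_words("the cat sat "):
        print(word)
```

In the real facility each yielded word would be passed to the TTS instead of being printed. A word is only released once space is pressed, so text still in the buffer is not spoken.&lt;br /&gt;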
&lt;br /&gt;
&lt;br /&gt;
==============&lt;br /&gt;
    ADVANTAGE&lt;br /&gt;
==============&lt;br /&gt;
&lt;br /&gt;
This facility will help the child to type and learn the correct spelling of a word. It is natural for the human mind to memorize the sound of a word more easily than its exact spelling. So if he types a word incorrectly, the speaker will speak the wrongly typed word, it will not match the sound he has heard, and he can easily correct the spelling. In this manner speech can be incorporated into the existing Sugar Write activity.&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=User:Chiragjain1989&amp;diff=25537</id>
		<title>User:Chiragjain1989</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=User:Chiragjain1989&amp;diff=25537"/>
		<updated>2009-04-05T07:27:50Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;   ==SPEECH SYNTHESIS IN SUGAR CORE==&lt;br /&gt;
   ===INTRODUCTION===&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
:   The main aim of Sugar is to spread the fruit of literacy across the world. They have come up with a low-cost laptop designed for children. Sugar is the GUI of that laptop. Sugar Labs is participating in this year&#039;s Google Summer of Code program. The efforts Sugar has made towards its goal have always impressed me.&lt;br /&gt;
&lt;br /&gt;
:   This year, one of Sugar&#039;s project ideas is speech synthesis, an enhancement of last year&#039;s OLPC GSoC speech dispatcher project. That project was handled by my college senior Mr. Hemant. I appreciate his efforts, as the project was a really tough one. &lt;br /&gt;
&lt;br /&gt;
:   But due to changes in the Sugar architecture, the speech dispatcher has become a system daemon. So there is a need to develop a new API that fits easily into the existing Sugar framework. One good option is a gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
:   This year for GSoC, I have proposed speech synthesis in core Sugar. My basic idea is to use the gstreamer plugin to provide speech capability to all the existing Sugar activities. My aim is to speak any text, wherever it is selected. A button or a keyboard shortcut will be assigned for this.&lt;br /&gt;
&lt;br /&gt;
   ====current status====&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;March 23- April 3&lt;br /&gt;
&lt;br /&gt;
:    Right now I have already applied for the GSoC. My proposal can be seen at http://wiki.sugarlabs.org/go/speech-synthesis&lt;br /&gt;
&lt;br /&gt;
:    For any suggestions or errors please do mail me at chiragjain1989@gmail.com&lt;br /&gt;
&lt;br /&gt;
:    Right now I am preparing some code related to this framework that I can show in my application proposal.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039; March 30&lt;br /&gt;
:    The code I have prepared is the base of my framework. Using this code, the framework can speak any selected text. The code is shown below. First download espeak.py from the following link:&lt;br /&gt;
&lt;br /&gt;
   http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:  After that you can try this code:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
                        import gtk&lt;br /&gt;
                        from espeak import espeak  # espeak.py downloaded from the link above&lt;br /&gt;
                        obj = espeak()&lt;br /&gt;
                        # PRIMARY is the X selection that holds the currently highlighted text&lt;br /&gt;
                        clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                        text = clip.wait_for_text()&lt;br /&gt;
                        if text is None:&lt;br /&gt;
                            # nothing is highlighted anywhere&lt;br /&gt;
                            obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                        else:&lt;br /&gt;
                            obj.speak(text)&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=25536</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=25536"/>
		<updated>2009-04-05T05:47:36Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:     What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:      I would be honored to work with any mentor you assign.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also made an open-source Sudoku solver in C++ using a backtracking method. The algorithm has exponential time complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know a lot about open source, I want to gain some real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and at the same time contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       &#039;&#039;&#039;Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims at creating a framework that will generate speech in core Sugar. I want to implement speech as a basic functionality in Sugar.&lt;br /&gt;
:      Let me make this clearer with a scenario. Imagine that any window containing some text is open in Sugar. I will provide speech for the text in that window: the user will be free to listen to the text he has selected, and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First he has to select the text he wants to listen to. Then he can simply press a keyboard shortcut (a key combination such as Alt+s) or a button provided in Sugar, like the home button. From now on I will call this button the speech button. It can be made a permanent button that shows up when the mouse pointer is moved to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. Simple, because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      With this tool you can configure the speech: you can increase or decrease the volume, and change the language, the accent, the pitch, the male or female voice, the rate of speech and so on.&lt;br /&gt;
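&lt;br /&gt;
:      As a rough sketch of how such settings could reach the speech engine, the Python function below turns them into arguments for the espeak command-line tool. It assumes espeak&#039;s -v (voice), -s (speed in words per minute), -p (pitch, 0-99) and -a (amplitude, 0-200) options and the +f3 female voice variant. The function name, parameter names and default values are hypothetical and chosen only for illustration.&lt;br /&gt;

```python
def build_espeak_command(text, language="en", female=False,
                         rate=150, pitch=50, volume=100):
    """Return an argument list for the espeak command-line tool.

    -v selects the voice (language plus an optional variant),
    -s is the speed in words per minute, -p the pitch (0-99)
    and -a the amplitude (0-200).
    """
    # "+f3" is one of espeak's female voice variants.
    voice = language + ("+f3" if female else "")
    return [
        "espeak",
        "-v", voice,
        "-s", str(rate),
        "-p", str(pitch),
        "-a", str(volume),
        text,
    ]


if __name__ == "__main__":
    # Show the command that the configuration would produce.
    print(build_espeak_command("hello", language="hi", female=True))
```

:      The resulting list could be handed to subprocess.call to actually produce the speech. Once the gstreamer plugin is used, the same settings would be set on the plugin instead.&lt;br /&gt;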
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button. &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;A separate window containing the selected text will open up. The captioning will be achieved in that separate window while the framework is reading the text.&#039;&#039;&#039; &lt;br /&gt;
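&lt;br /&gt;
:    To color one word at a time, the captioning window needs the character range of each word in the selected text. A minimal Python sketch of that step follows; the name word_spans is hypothetical, and synchronizing the highlight with the speech engine is not shown.&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return (start, end, word) for every whitespace-separated word.

    start and end are character offsets into text, so a caption window
    can highlight text[start:end] while that word is being spoken.
    """
    return [(m.start(), m.end(), m.group())
            for m in re.finditer(r"\S+", text)]


if __name__ == "__main__":
    for start, end, word in word_spans("read  this"):
        print(start, end, word)
```

:    The offsets refer to the original text, so extra spaces or line breaks between words do not shift the highlight.&lt;br /&gt;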
 &lt;br /&gt;
:     One more thing I am aiming at is a keyboard speaker. In the configuration tool, the user will have the option of turning the keyboard speaker on or off. If the keyboard speaker is turned on, then as the user presses any keyboard key, the framework will speak it. For example, if the user presses the Tab key the framework will speak &#039;Tab&#039;, on pressing Caps Lock it will speak &#039;caps lock&#039;, and so on.&lt;br /&gt;
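&lt;br /&gt;
:     A small Python sketch of how key names could be turned into the phrase to speak. It assumes the keys are reported with X-style names such as Tab, Caps_Lock, Shift_L or numbersign; the table and the function name spoken_name are hypothetical.&lt;br /&gt;

```python
# Names that read badly if spoken as-is; everything else is derived.
OVERRIDES = {
    "numbersign": "hash",
    "Shift_L": "shift",
    "Shift_R": "shift",
    "Control_L": "control",
    "Control_R": "control",
    "Alt_L": "alt",
    "Alt_R": "alt",
}


def spoken_name(key_name):
    """Turn a key name such as 'Caps_Lock' into the phrase to speak."""
    if key_name in OVERRIDES:
        return OVERRIDES[key_name]
    # 'Caps_Lock' becomes 'caps lock', 'Tab' becomes 'tab'.
    return key_name.replace("_", " ").lower()


if __name__ == "__main__":
    for name in ["Tab", "Caps_Lock", "numbersign", "a"]:
        print(spoken_name(name))
```

:     Only the exceptions need to be listed in the table, so new keys are spoken without further changes.&lt;br /&gt;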
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The icon reader is simple. As a blind user browses the XO, he can keep track of the current position of the mouse pointer. Suppose the mouse pointer is currently on the Home button. When the user presses a predefined keyboard key, the framework will speak &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, they could make a simple change in the XO hardware so that the predefined key can be found by touch, in the same way that the j and f keys have a slight projection. This functionality can be a boon for blind users.   &lt;br /&gt;
&lt;br /&gt;
:     Here again are the main features of my proposal:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     Eye-tracking research reports that &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience if done with speech. The literacy rate can be increased by 6-10% if speech is included with text, because our brains remember sounds more easily than text. So I am making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    Not only this, XO will now become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students can learn very fast if speech accompanies the text or words they read. So including this framework in Sugar will make it more effective.&lt;br /&gt;
&lt;br /&gt;
:    Not only this, blind students and children will now also be able to use the XO, which will be a boon for them. Blind children also want to study.&lt;br /&gt;
&lt;br /&gt;
:    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed a lot with alsroot, assimd and besmac on IRC about this project. The main points of discussion are: &lt;br /&gt;
&lt;br /&gt;
:    Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
:    Below I am showing the basic structure of my framework i.e., the speech synthesizing framework.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                         -----------------------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   On the bottom level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   There are two options for a layer over the TTS engine espeak: one is the speech dispatcher, which was created as a GSoC project last year, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore using the gstreamer plugin is the best option.&lt;br /&gt;
 &lt;br /&gt;
*   For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   To get the user-selected text, my idea is to use the clipboard module, which takes care of copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   A small code snippet which I have prepared for demonstration purpose is shown below. You can try it. But first please download the espeak.py code from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    First select some text and then run the code from a terminal. The code will speak the text. This is the very basic thing we want to achieve in Sugar.&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak  # espeak.py downloaded from the link above&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # PRIMARY is the X selection that holds the currently highlighted text&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             # nothing is highlighted anywhere&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   You can select the text anywhere in Sugar. Although this code uses espeak directly, in future I will be using the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will remain in constant touch with my mentor and sugar community. I will remain active on IRC and mailing list to discuss the design details and further improvements that can be incorporated in this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new things called STARDICT and ORCA. These can be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be absolutely clear about my further approach. For now, I am providing a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Work on implementing the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6- June 15     &lt;br /&gt;
*     Discuss the GUI design for configuration manager on IRC.&lt;br /&gt;
*     Design the finalized GUI of the configuration manager.&lt;br /&gt;
*     Will release the snapshots of the GUI on wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implementation of the keyboard speaker.&lt;br /&gt;
*     Releasing the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26- July 7&lt;br /&gt;
*     implementation of the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Mid term evaluation. Will release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Will test it on XO.&lt;br /&gt;
*    Ask for bugs and further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Will port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Preparation for the beta version release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue Working on beta release.&lt;br /&gt;
*    I will continue working on this to make it available in official sugar distros. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:     &lt;br /&gt;
:    I already have a lot of coding experience from open-source events such as the AI Challenge, whose simulator code I prepared in just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    In school I also prepared a lot of small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:    Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:    I have solved a lot of complex problems at SPOJ programming contest site.&lt;br /&gt;
:    http://www.spoj.pl&lt;br /&gt;
:    http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:    Another reason that I can easily complete the project is that I will be getting almost 3 months break during my summer vacations right from the end of May to August. Therefore I can concentrate entirely on this project with all my energies.&lt;br /&gt;
&lt;br /&gt;
:    I am currently pursuing my B.E. in Computer science from Netaji Subash Institute of Technology, New Delhi. A lot of students from my college have been associated with OLPC for development work. Like:&lt;br /&gt;
 &lt;br /&gt;
*    Food Force, which is still under development. Recently we have achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the last year. &lt;br /&gt;
*    Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (Senior) and has recently achieved progress to remove speech dispatcher dependencies from it.&lt;br /&gt;
*    Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    What I am trying to convey with these examples is that I have many helpful seniors with a lot of experience, who are ready to help me in every way they can. So I can get guidance and ready help with technical design or other work, or if I get stuck anywhere. My chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very fast. Not only this, they will now be able to hear a story or any other text rather than just reading it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child  listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;According to Assim Deodia (Mentor of Sugar Labs for GSoC 2009)&lt;br /&gt;
:     &#039;&#039;&#039;assim.deodia@gmail.com&lt;br /&gt;
:     This proposal is of great potential since speech synthesis is a long desired component of sugar. Activities like speak and listen-spell are already using speech synthesis and it would be very useful to have more speech enabled activities. Various surveys have shown that voice + text based learning is much more efficient than text-only learning. If, as promised, captioning is also achieved, this would enhance the learning multifold.&lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this. I think that my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends in my home town who are involved in such activities and who would love to contribute here too. I also have a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several very helpful seniors who are already associated with OLPC on projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option.&lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   We want to make sure that you can set up a [[Development Team#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in third or fourth standard, I always confused the less-than and greater-than signs. I could work out which number was greater or smaller, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign in 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     I think I have already covered most things. I don&#039;t know whether you will accept this proposal, but I can assure you that I have the determination to achieve everything I have described. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable, so please keep this project in mind for future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25421</id>
		<title>Category:2009 GSoC applications</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25421"/>
		<updated>2009-04-04T09:21:36Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Below is the list of student applications. Please copy Summer of Code/Student application template and name your page with your project title, not using your name and not making it a subpage (no / in title). Please apply also in [http://socghop.appspot.com melange, google&#039;s web app]; if you do not apply there &#039;&#039;&#039;before&#039;&#039;&#039; April 3, we will not be able to accept your applications.&lt;br /&gt;
&lt;br /&gt;
[[Category: GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25420</id>
		<title>Category:2009 GSoC applications</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25420"/>
		<updated>2009-04-04T09:19:18Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Below is the list of student applications. Please copy Summer of Code/Student application template and name your page with your project title, not using your name and not making it a subpage (no / in title). Please apply also in [http://socghop.appspot.com melange, google&#039;s web app]; if you do not apply there &#039;&#039;&#039;before&#039;&#039;&#039; April 3, we will not be able to accept your applications.&lt;br /&gt;
&lt;br /&gt;
[[Category: GSoC]]&lt;br /&gt;
[[http://wiki.sugarlabs.org/go/Marbles]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25419</id>
		<title>Category:2009 GSoC applications</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25419"/>
		<updated>2009-04-04T09:17:05Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Below is the list of student applications. Please copy Summer of Code/Student application template and name your page with your project title, not using your name and not making it a subpage (no / in title). Please apply also in [http://socghop.appspot.com melange, google&#039;s web app]; if you do not apply there &#039;&#039;&#039;before&#039;&#039;&#039; April 3, we will not be able to accept your applications.&lt;br /&gt;
&lt;br /&gt;
[[Category: GSoC]]&lt;br /&gt;
[[Marbles]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25418</id>
		<title>Category:2009 GSoC applications</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Category:2009_GSoC_applications&amp;diff=25418"/>
		<updated>2009-04-04T09:14:53Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Below is the list of student applications. Please copy Summer of Code/Student application template and name your page with your project title, not using your name and not making it a subpage (no / in title). Please apply also in [http://socghop.appspot.com melange, google&#039;s web app]; if you do not apply there &#039;&#039;&#039;before&#039;&#039;&#039; April 3, we will not be able to accept your applications.&lt;br /&gt;
&lt;br /&gt;
[[Marbles]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=25017</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=25017"/>
		<updated>2009-04-03T12:42:26Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:      Collaborating with a mentor in any time zone should not be a problem.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I entered college, but I heard a lot about it from my seniors. I then started participating in coding events; my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case time complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know more about open source, I want to gain some real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things, and at the same time contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What am I making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims to create a framework that generates speech in core Sugar. I want to make speech a basic functionality of Sugar.&lt;br /&gt;
:      To make this clearer, I will use some scenarios to elaborate on my proposal. Imagine that a window containing some text is open in Sugar. The framework will provide speech for the text in that window: the user is free to select any part of the text, and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First the user selects the text to listen to, then either presses a keyboard shortcut (a key combination such as Alt+S) or clicks a button provided in Sugar, similar to the Home button. From here on I will call this the speech button. It could be a permanent button that appears when the mouse pointer is moved to the top-left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI, simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      This tool lets the user configure the speech: increase or decrease the volume, and change the language, accent, pitch, voice (male or female), rate of speech, and so on.&lt;br /&gt;
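&lt;br /&gt;
:      As a rough illustration of the settings such a tool might store, here is a minimal sketch in Python 3. The SpeechConfig class, its field names and the clamp helper are hypothetical and not part of Sugar. The flags -v, -a, -p and -s are options of the espeak command-line program; the speed limits used here are an assumption.&lt;br /&gt;

```python
# Hypothetical settings holder; the names are illustrative, not part of Sugar.
from dataclasses import dataclass


def clamp(value, low, high):
    """Keep value inside the inclusive range low..high."""
    return max(low, min(high, value))


@dataclass
class SpeechConfig:
    voice: str = "en"      # espeak voice name, for example "en" or "hi"
    variant: str = ""      # optional variant such as "f3" (female) or "m2" (male)
    amplitude: int = 100   # espeak -a, volume from 0 to 200
    pitch: int = 50        # espeak -p, from 0 to 99
    speed: int = 160       # espeak -s, words per minute (limits assumed: 80 to 450)

    def to_espeak_args(self, text):
        """Build an argument list for the espeak command-line program."""
        voice = self.voice
        if self.variant:
            voice = voice + "+" + self.variant
        return [
            "espeak",
            "-v", voice,
            "-a", str(clamp(self.amplitude, 0, 200)),
            "-p", str(clamp(self.pitch, 0, 99)),
            "-s", str(clamp(self.speed, 80, 450)),
            text,
        ]
```

:      The GUI would only need to edit these five values. The resulting list could be handed to subprocess.call, or the same values could be mapped onto whichever speech back end is finally chosen.&lt;br /&gt;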
&lt;br /&gt;
:      I will also provide karaoke-style coloring (captioning) of the words being spoken. For example, suppose the Write activity is open in Sugar with some text, and the user selects the text and presses the speech button. &#039;&#039;&#039;A separate window containing the selected text will then open, and the captioning will take place in that window while the framework reads the text.&#039;&#039;&#039; &lt;br /&gt;
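&lt;br /&gt;
:      The captioning idea can be sketched as follows (Python 3). The function names word_spans and caption_frames are hypothetical. The sketch only computes which characters belong to each word and marks the current word with square brackets; coloring the word in a GTK window and keeping it in step with the speech engine are not shown.&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return (start, end, word) for each whitespace-separated word in text."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"\S+", text)]


def caption_frames(text):
    """Yield the text once per word, with the current word wrapped in [ ]."""
    for start, end, word in word_spans(text):
        yield text[:start] + "[" + word + "]" + text[end:]
```

:      A caption window could step through these frames, or use the start and end offsets directly to apply a highlight to the word being spoken.&lt;br /&gt;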
 &lt;br /&gt;
:     One more feature I am aiming for is a keyboard speaker. The configuration tool will offer an option to turn the keyboard speaker on or off. When it is on, the framework will speak each key as the user presses it: pressing the Tab key makes it say &#039;Tab&#039;, pressing Caps Lock makes it say &#039;caps lock&#039;, and so on.&lt;br /&gt;
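&lt;br /&gt;
:     One possible way to decide what the keyboard speaker says is a lookup from key names to spoken phrases, sketched below in Python 3. The table and the function spoken_name are hypothetical; the key names follow the GDK naming style (Tab, Caps_Lock, Return), and capturing the key press events themselves is not shown.&lt;br /&gt;

```python
# Hypothetical lookup table from GDK-style key names to the words to be spoken.
SPOKEN_KEY_NAMES = {
    "Tab": "tab",
    "Caps_Lock": "caps lock",
    "Return": "enter",
    "BackSpace": "backspace",
    "space": "space",
    "Escape": "escape",
}


def spoken_name(key_name):
    """Return the phrase to speak for a key name, or None to stay silent."""
    if key_name in SPOKEN_KEY_NAMES:
        return SPOKEN_KEY_NAMES[key_name]
    if len(key_name) == 1 and key_name.isprintable():
        return key_name  # plain letters and digits are spoken as themselves
    # Fall back to reading the key name with underscores turned into spaces.
    return key_name.replace("_", " ").lower() or None
```

:     The returned phrase would then be passed to the speech framework whenever the keyboard speaker option is turned on.&lt;br /&gt;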
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The idea of the icon reader is simple: it lets a blind user browsing the XO keep track of the current position of the mouse pointer. Suppose the pointer is currently on the Home button. When the user presses a predefined keyboard key, the framework will speak &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key identifiable by touch, just as the J and F keys have a slight raised bump. This functionality could be a boon for blind users.   &lt;br /&gt;
&lt;br /&gt;
:     To summarize, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke-style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     Eye-tracking research has shown that &#039;&#039;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it involves speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    In addition, the XO will become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students learn much faster when speech accompanies the text or words they read, so including this framework will make Sugar more effective.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind students and children will be able to use the XO, which will be a boon for them. Blind children want to study too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of the discussion are: &lt;br /&gt;
&lt;br /&gt;
:    Some rough implementation ideas:&lt;br /&gt;
&lt;br /&gt;
:    The basic structure of my speech synthesis framework is shown below.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;At the lowest level is the speech engine (espeak), which produces the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed this with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Using the gstreamer plugin is therefore the only remaining option, and the best one.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;PyGTK can be used for the GUI.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To get the user-selected text, my idea is to use the clipboard module, which handles copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I have prepared for demonstration purposes is shown below. To try it, first download espeak.py from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;First select some text and then run the code from a terminal. The code will speak the text. This is the basic behavior we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         # Speak the current text selection (requires PyGTK and espeak.py).&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # PRIMARY holds whatever text is currently highlighted with the mouse.&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select text anywhere in Sugar. I wrote this code using espeak directly, but in future I will use the gstreamer plugin.&lt;br /&gt;
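&lt;br /&gt;
:   For the gstreamer route, a minimal sketch (Python 3) of how a pipeline description might be assembled is given below. It assumes that the espeak element of the gstreamer plugin accepts text, voice, pitch and rate properties; the helper name espeak_pipeline is hypothetical. Only the description string is built here, nothing is played.&lt;br /&gt;

```python
def espeak_pipeline(text, voice="en", pitch=0, rate=0):
    """Return a gst-launch style description that speaks text through espeak."""
    # Escape backslashes and double quotes so the text survives pipeline parsing.
    quoted = text.replace("\\", "\\\\").replace('"', '\\"')
    return 'espeak text="{0}" voice={1} pitch={2} rate={3} ! autoaudiosink'.format(
        quoted, voice, pitch, rate
    )
```

:   The resulting string could be handed to the pipeline parser of the GStreamer Python bindings, with the selected text taken from the clipboard as in the snippet above.&lt;br /&gt;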
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will stay in constant touch with my mentor and the Sugar community. I will remain active on IRC and the mailing list to discuss design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two tools that are relatively new to me, StarDict and Orca, which could be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be completely clear about my approach. For now, here is a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Work on implementing the command-line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command-line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6 - June 15     &lt;br /&gt;
*     Discuss the GUI design for the configuration manager on IRC.&lt;br /&gt;
*     Finalize the GUI design of the configuration manager.&lt;br /&gt;
*     Release snapshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implement the keyboard speaker.&lt;br /&gt;
*     Release the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26 - July 7&lt;br /&gt;
*     Implement the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Midterm evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Test it on the XO.&lt;br /&gt;
*    Ask for bug reports and suggestions for further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Prepare for the beta release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue working on the beta release.&lt;br /&gt;
*    Continue working on the framework to make it available in official Sugar distributions. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi.&lt;br /&gt;
:    I already have some coding experience from open-source events such as the AI Challenge, whose simulator code I wrote in just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    In school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:    Another reason I can complete the project comfortably is that I will have a break of almost three months during my summer vacation, from the end of May to August, so I can concentrate on this project with all my energy.&lt;br /&gt;
&lt;br /&gt;
:    Many students from my college have been associated with OLPC development work, for example:&lt;br /&gt;
 &lt;br /&gt;
*    &#039;&#039;&#039;Food Force, which is still under development. We recently achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the last year. &lt;br /&gt;
*    &#039;&#039;&#039;Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (a senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*    &#039;&#039;&#039;Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    With these examples I want to convey that I have many helpful, experienced seniors who are ready to help me in every way they can. I can therefore get plenty of guidance and prompt help with technical design or any other work, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would make Sugar 10-15% more effective. When children aged 3-15 who are learning a language hear the speech again and again, they will be able to learn it very quickly. They will also be able to hear a story or any other text rather than just read it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;According to Assim Deodia (Mentor of Sugar Labs for GSoC 2009)&lt;br /&gt;
:     &#039;&#039;&#039;assim.deodia@gmail.com&lt;br /&gt;
:     This proposal has great potential, since speech synthesis is a long-desired component of Sugar. Activities like Speak and Listen-Spell already use speech synthesis, and it would be very useful to have more speech-enabled activities. Various surveys have shown that voice + text based learning is much more efficient than text-only learning. If, as promised, captioning is also achieved, this would enhance learning manifold.&lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several very helpful seniors who are already associated with OLPC projects (such as Food Force) and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
A:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screenshot is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. I could work out which number was greater or smaller, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5&lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:     I think I have already covered most things. I don&#039;t know whether you are going to accept this proposal, but I can assure you that I have the determination to achieve everything I have mentioned. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable, so please keep this project in mind for future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24857</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24857"/>
		<updated>2009-04-02T15:22:59Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:       Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:       chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:       chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:       Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:       I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night,&lt;br /&gt;
:      so collaborating with any mentor shouldn&#039;t be a problem.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:       I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest.&lt;br /&gt;
:      1)   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      2)   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      3)   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
&lt;br /&gt;
:      Now that I know more about open source, I want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things, and at the same time contribute something to society.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:       Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making? &lt;br /&gt;
   &lt;br /&gt;
A:   My project aims to create a framework that generates speech in core Sugar. I want to implement speech as a basic functionality of Sugar.&lt;br /&gt;
:      To make this clearer, here is a use case. Imagine that any window containing some text is open in Sugar. The framework will provide speech for the text in that window: the user can select any text and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple.&lt;br /&gt;
&lt;br /&gt;
:      First the user selects the text to listen to, and then either presses a keyboard shortcut (a key combination such as Alt+S) or clicks a button provided in Sugar, like the Home button. From here on I will call this the speech button. It can be made a permanent button that shows up when the mouse pointer moves to the top-left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      This tool configures the speech: the user can increase or decrease the volume and change the language, accent, pitch, male or female voice, rate of speech, and so on.&lt;br /&gt;
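&lt;br /&gt;
:      As a rough sketch of how such settings might reach the speech engine, the Python snippet below turns a settings dictionary into an argument list for the espeak command line tool. The function name build_espeak_command, the dictionary keys and the default values are hypothetical and chosen only for illustration; the flags -a (amplitude), -p (pitch), -s (speed in words per minute) and -v (voice, for example en+f3 for a female English variant) are espeak options. The snippet only builds the command and does not run it.&lt;br /&gt;

```python
# Hypothetical helper: map configuration-tool settings to espeak arguments.
# The default values below are illustrative, not taken from the proposal.
DEFAULTS = {'volume': 100, 'pitch': 50, 'rate': 175, 'voice': 'en'}


def build_espeak_command(text, settings=None):
    """Return the argument list that would make espeak speak `text`."""
    merged = dict(DEFAULTS)
    merged.update(settings or {})
    return [
        'espeak',
        '-a', str(merged['volume']),  # amplitude, 0 to 200
        '-p', str(merged['pitch']),   # pitch, 0 to 99
        '-s', str(merged['rate']),    # speed in words per minute
        '-v', merged['voice'],        # voice name, e.g. 'hi' or 'en+f3'
        text,
    ]
```

:      On a machine with espeak installed, passing the returned list to subprocess.call should speak the text with the chosen settings; the final framework is expected to go through the gstreamer plugin instead.&lt;br /&gt;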
&lt;br /&gt;
:      I will also provide karaoke-style coloring, or captioning, of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button. A separate window containing the selected text then opens, and the captioning takes place in that window while the framework reads the text.&lt;br /&gt;
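&lt;br /&gt;
:      The bookkeeping behind such captioning can be sketched in a few lines of Python. This is only an illustration under assumptions: the function names word_spans and highlight are hypothetical, and a real implementation would color the word in a text widget and advance it as the speech engine reports progress, which is not shown here.&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return (start, end, word) for each whitespace-separated word in text."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r'\S+', text)]


def highlight(text, index, marker='*'):
    """Return text with the index-th word wrapped in marker characters."""
    start, end, word = word_spans(text)[index]
    return text[:start] + marker + word + marker + text[end:]
```

:      The character offsets returned by word_spans are what a caption window would need in order to color exactly the word currently being spoken.&lt;br /&gt;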
 &lt;br /&gt;
:     One more thing I am aiming for is a keyboard speaker. The configuration tool will give the user an option to turn the keyboard speaker on or off. When it is on, the framework speaks each key as the user presses it: pressing the Tab key makes it say &#039;Tab&#039;, pressing Caps Lock makes it say &#039;caps lock&#039;, and so on.&lt;br /&gt;
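&lt;br /&gt;
:     A minimal sketch of the lookup behind the keyboard speaker might look like the following. The table SPOKEN_NAMES and the function spoken_label are hypothetical names; the key names are modelled on X keysym names such as Caps_Lock, on the assumption that the key press handler reports keys that way.&lt;br /&gt;

```python
# Hypothetical table of spoken names for keys that are not plain characters.
SPOKEN_NAMES = {
    'Tab': 'Tab',
    'Caps_Lock': 'caps lock',
    'Return': 'enter',
    'BackSpace': 'backspace',
    'space': 'space',
}


def spoken_label(key_name, enabled=True):
    """Return the words to speak for a key press, or None when disabled."""
    if not enabled:
        return None
    if key_name in SPOKEN_NAMES:
        return SPOKEN_NAMES[key_name]
    # Fall back to the key name itself, e.g. 'a' or 'F1'.
    return key_name.replace('_', ' ')
```

:     The enabled flag stands in for the on/off option in the configuration tool; the returned string would be handed to the speech engine.&lt;br /&gt;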
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The icon reader is simple: it lets a blind user browsing the XO keep track of the current position of the mouse pointer. Suppose the pointer is currently on the Home button. When the user presses a predefined keyboard key, the framework speaks &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework speaks &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key easy for a blind user to find by touch, just as the F and J keys have a slight raised bump. This functionality could be a boon for blind users.&lt;br /&gt;
&lt;br /&gt;
:     To summarize, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     According to eye-tracking research on Same Language Subtitling (SLS), &quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&quot;&lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it is done with speech. The literacy rate can be increased by 6-10% if speech accompanies the text, because our brains remember sounds more easily than text. So I am making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    Beyond this, the XO will become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students can learn much faster if speech accompanies the text or words they read, so including this framework will make Sugar more effective.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind students and children will be able to use the XO, which will be a boon for them. Blind children want to study too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of that discussion, which are rough implementation ideas, follow.&lt;br /&gt;
&lt;br /&gt;
:    The basic structure of the speech synthesis framework is shown below.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;At the bottom level is the speech engine (espeak), which produces the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: the speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. But when I discussed this with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the only practical option, and the best one.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To get the user-selected text, my idea is to use the clipboard module, which takes care of copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I prepared for demonstration is shown below. You can try it, but first please download espeak.py from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;First select some text and then run the code through a terminal. The code will speak the text. This is the very basic behavior we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
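                         # Speak the current X selection; needs PyGTK and espeak.py (linked above)&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak&lt;br /&gt;
                         obj=espeak()&lt;br /&gt;
                         # PRIMARY is the X selection holding the currently highlighted text&lt;br /&gt;
                         clip=gtk.Clipboard(display=gtk.gdk.display_get_default(),selection=&quot;PRIMARY&quot;)&lt;br /&gt;
                         text=clip.wait_for_text()&lt;br /&gt;
                         # wait_for_text() returns None when nothing is selected&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&quot;Sorry! No text is selected&quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;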
&lt;br /&gt;
:   &#039;&#039;&#039;You can select text anywhere in Sugar. Although this code uses espeak directly, in future I will use the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will stay in constant touch with my mentor and the Sugar community. I will be active on IRC and the mailing list to discuss design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new things, STARDICT and ORCA, which could be very useful for Sugar.&lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be completely clear about my approach. For now, here is a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Implement the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command line interface.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6- June 15     &lt;br /&gt;
*     Discuss the GUI design for configuration manager on IRC.&lt;br /&gt;
*     Design the finalized GUI of the configuration manager.&lt;br /&gt;
*     Release snapshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implementation of the keyboard speaker.&lt;br /&gt;
*     Releasing the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26- July 7&lt;br /&gt;
*     implementation of the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8-July12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Midterm evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Will test it on XO.&lt;br /&gt;
*    Ask for bugs and further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Preparation for the beta version release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue Working on beta release.&lt;br /&gt;
*    I will continue working on this to make it available in official sugar distros. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:    I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi.&lt;br /&gt;
:    I already have some coding experience from open-source events such as the AI Challenge, whose simulator code I wrote in just 15 days.&lt;br /&gt;
:    Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:    At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:    Another reason I can comfortably complete the project is that I have an almost three-month break for my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
:    Many students from my college have been involved in development work for OLPC. For example:&lt;br /&gt;
 &lt;br /&gt;
*    &#039;&#039;&#039;Food Force, which is still under development. We recently achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the past year.&lt;br /&gt;
*    &#039;&#039;&#039;Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (a senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*    &#039;&#039;&#039;Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
&lt;br /&gt;
:    What I am trying to convey with these examples is that I have many experienced seniors who are ready to help me in every way they can. I can get guidance and ready help with technical design or any other kind of work, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:     In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very quickly. Moreover, they will be able to listen to a story or any other text rather than only reading it. One more potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
:   The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several very helpful seniors who are already associated with OLPC projects (such as Food Force) and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
A:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:     My development environment screenshot is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:     Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:     When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused by the less-than and greater-than signs. I could work out which number was greater or smaller, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5&lt;br /&gt;
Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:    I think I have already covered most things. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable. So if for any reason you decide not to select me, please still consider this project for future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24856</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24856"/>
		<updated>2009-04-02T14:48:42Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:     Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:     chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:     chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:     chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:     Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:     I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night, so&lt;br /&gt;
:      collaborating with a mentor in any time zone would not be a problem.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:     I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      I also actively participate at SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:      Now that I know more about open source, I want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things, and at the same time contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:     Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What I am making?    &lt;br /&gt;
A:     My project aims to create a framework that generates speech in core Sugar. I want to make speech a basic functionality of Sugar.&lt;br /&gt;
:      To make this clearer, I will use a few scenarios to elaborate my proposal. Imagine that a window containing some text is open in Sugar. The framework will provide speech for the text in that window: the user is free to select any part of the text, and the framework will speak the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      Doing this is very simple for the user.&lt;br /&gt;
&lt;br /&gt;
:      First the user selects the text to listen to, then presses either a keyboard shortcut (a key combination such as Alt+S) or a button provided in Sugar, similar to the home button. From here on I will call this the speech button. The button can be made permanent, so that it shows up when the mouse pointer moves to the top-left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      This tool lets the user configure the speech: increase or decrease the volume, and change the language, accent, pitch, voice (male or female), rate of speech, and so on.&lt;br /&gt;
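&lt;br /&gt;
:      As a rough illustration of the settings listed above, the sketch below maps a configuration dictionary onto espeak command-line options. The flags -a (amplitude), -p (pitch), -s (speed in words per minute) and -v (voice) belong to the espeak command-line tool; the function name, the dictionary keys and the default values are assumptions made for this example and are not part of the proposal.&lt;br /&gt;

```python
# Hypothetical helper: turns speech settings into an espeak command line.
# The option names (-a, -p, -s, -v) are espeak's; everything else is assumed.

DEFAULTS = {
    "volume": 100,     # espeak amplitude, 0 to 200
    "pitch": 50,       # espeak pitch, 0 to 99
    "rate": 160,       # speaking rate in words per minute (assumed default)
    "language": "en",  # espeak voice name, e.g. "en" or "hi"
    "variant": "",     # optional voice variant, e.g. "f2" for a female voice
}


def clamp(value, low, high):
    """Keep value inside the closed range [low, high]."""
    return max(low, min(high, value))


def build_espeak_command(text, settings=None):
    """Return the argument list for running espeak with the given settings."""
    config = dict(DEFAULTS)
    config.update(settings or {})

    # espeak selects a voice variant with "language+variant", e.g. "en+f2".
    voice = config["language"]
    if config["variant"]:
        voice = voice + "+" + config["variant"]

    return [
        "espeak",
        "-a", str(clamp(config["volume"], 0, 200)),
        "-p", str(clamp(config["pitch"], 0, 99)),
        "-s", str(config["rate"]),
        "-v", voice,
        text,
    ]
```

:      On a system where espeak is installed, the returned list could be passed to subprocess.call; the configuration GUI would only need to edit the dictionary.&lt;br /&gt;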
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button. A separate window containing the selected text then opens, and the words are highlighted in that window as the framework reads them.&lt;br /&gt;
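&lt;br /&gt;
:      One way to think about the karaoke-style captioning described above is as a list of character ranges, one per word, that the caption window highlights in turn. The sketch below only computes those ranges and marks one word with asterisks; the function names are hypothetical, and how each range is synchronized with the audio is left open here, since that depends on the speech engine reporting word boundaries.&lt;br /&gt;

```python
def word_spans(text):
    """Return (start, end) character offsets for each word in text."""
    spans = []
    start = None
    for index, char in enumerate(text):
        if char.isspace():
            # Whitespace closes the word that is currently open, if any.
            if start is not None:
                spans.append((start, index))
                start = None
        elif start is None:
            # First non-space character opens a new word.
            start = index
    if start is not None:
        spans.append((start, len(text)))
    return spans


def highlight(text, span, marker="*"):
    """Return text with the word at span wrapped in marker characters."""
    start, end = span
    return text[:start] + marker + text[start:end] + marker + text[end:]


if __name__ == "__main__":
    sentence = "Sugar can speak"
    for span in word_spans(sentence):
        print(highlight(sentence, span))
```

:      In a real caption window the asterisks would be replaced by a text colour change on the same character range.&lt;br /&gt;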
 &lt;br /&gt;
:     One more feature I am aiming for is a keyboard speaker. The configuration tool will give the user an option to turn the keyboard speaker on or off. When it is turned on, the framework speaks the name of every key the user presses: pressing the Tab key makes it say &#039;Tab&#039;, pressing Caps Lock makes it say &#039;caps lock&#039;, and so on.&lt;br /&gt;
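&lt;br /&gt;
:     The keyboard speaker described above needs a mapping from key names to the words that should be spoken. A minimal sketch follows, assuming the key names arrive in the style GDK uses (for example "Caps_Lock" or "BackSpace"); the table contents and the function name are illustrative assumptions, and the result would be handed to whatever speech backend the framework ends up using.&lt;br /&gt;

```python
# Hypothetical lookup table: GDK-style key names and the words to speak.
SPOKEN_NAMES = {
    "Tab": "tab",
    "Caps_Lock": "caps lock",
    "Return": "enter",
    "BackSpace": "backspace",
    "space": "space",
}


def spoken_label(key_name):
    """Return the words the keyboard speaker should say for a key name."""
    if key_name in SPOKEN_NAMES:
        return SPOKEN_NAMES[key_name]
    if len(key_name) == 1:
        # Letters and digits are spoken as they are.
        return key_name
    # Fallback for unlisted keys: "Page_Up" becomes "page up".
    return key_name.replace("_", " ").lower()
```

:     Keeping the table separate from the lookup makes it easy to translate the spoken names for other languages.&lt;br /&gt;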
&lt;br /&gt;
:     Having described so many features of this framework, I thought: why not make it useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The idea of the icon reader is simple: as a blind user browses the XO, it lets them keep track of the current position of the mouse pointer. Suppose the pointer is currently on the Home button. When the user presses a predefined keyboard key, the framework speaks &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework speaks &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a small change to the XO hardware could make that predefined key recognizable by touch, just as the F and J keys have a slight ridge. This functionality could be a boon for blind users.&lt;br /&gt;
&lt;br /&gt;
:     To recap, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech for text open in any window of any activity in Sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration panel with a GUI from which the speech settings can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke-style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     Eye-tracking research on Same Language Subtitling (SLS) reports that &#039;&#039;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it involves speech. The literacy rate can reportedly be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    In addition, the XO would become a boon for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students can learn much faster if speech accompanies the text or words they read, so including this framework will make Sugar more effective.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind students and children would be able to use the XO, which would be a boon for them. Blind children want to study too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of that discussion, along with some rough implementation ideas, are given here.&lt;br /&gt;
&lt;br /&gt;
:    The basic structure of the speech synthesis framework is shown below.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;At the bottom level is the speech engine (espeak), which produces the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for the layer on top of the espeak TTS engine: speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed this with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To get the text the user has selected, my idea is to use the clipboard module, which takes care of copy and paste. With this module the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I have prepared for demonstration purposes is shown below. You can try it, but first please download espeak.py from the following link:&lt;br /&gt;
&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;First select some text, then run the code from a terminal. The code will speak the selected text. This is the most basic behavior we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak  # espeak.py from the Listen and Spell repository linked above&lt;br /&gt;
                         obj=espeak()&lt;br /&gt;
                         # PRIMARY holds the text currently selected with the mouse&lt;br /&gt;
                         clip=gtk.Clipboard(display=gtk.gdk.display_get_default(),selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text=clip.wait_for_text()  # returns None when nothing is selected&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select text anywhere in Sugar. Although this code uses espeak directly, in the future I will use the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will stay in constant touch with my mentor and the Sugar community. I will remain active on IRC and the mailing list to discuss the design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new tools, StarDict and Orca, which could be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be completely clear about my approach. For now, I am providing a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Implement the command-line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it from the command-line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6 - June 15     &lt;br /&gt;
*     Discuss the GUI design for the configuration manager on IRC.&lt;br /&gt;
*     Finalize the GUI design of the configuration manager.&lt;br /&gt;
*     Release screenshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implement the keyboard speaker.&lt;br /&gt;
*     Release the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26 - July 7&lt;br /&gt;
*     Implement the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Midterm evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Test it on the XO.&lt;br /&gt;
*    Ask for bug reports and suggestions for further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback.&lt;br /&gt;
*    Prepare for the beta release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue working on the beta release.&lt;br /&gt;
*    Continue working on the framework to make it available in official Sugar distributions. &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi.&lt;br /&gt;
:   I already have some coding experience from real events such as the AI Challenge, whose simulator code I wrote in just 15 days.&lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
:   In school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project comfortably is that I will have a break of almost three months during my summer vacation, from the end of May to August, so I can concentrate all my energy on this project.&lt;br /&gt;
&lt;br /&gt;
:   Many students from my college have been associated with OLPC development work, for example:&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. Collaboration support was recently achieved in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the past year. &lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (my senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
:   With these examples I want to convey that I have many helpful seniors with a lot of experience who are ready to help me in every way they can. I can therefore get guidance and quick help with technical design or any other kind of work, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more effective. When children aged 3-15 who are learning a language hear speech again and again, they will be able to learn it very quickly. They will also be able to listen to a story or any other text rather than just read it. One more potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child  listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words..&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I greatly appreciate that Sugar Labs is planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute to this one as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Google is also a great option.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I kept confusing the less-than and greater-than signs. Even when I could tell which number was greater, I was unsure which sign to choose. One day I approached my teacher, and she cleared up my confusion with a neat method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining the dots then gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:    I think I have already covered most things. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable. So if for any reason you decide not to select me, please still consider this project for future Sugar development.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24854</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24854"/>
		<updated>2009-04-02T14:47:49Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:     Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:     chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:     chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:     chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:     Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:     I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night, so&lt;br /&gt;
:      collaborating with a mentor in any time zone would not be a problem.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:     I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:      I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:      I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:      Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:      I also participate actively on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:      My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:      Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:      Having learned a lot about open source, I now want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:     Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
A:     My project aims to create a framework that provides speech synthesis as a basic functionality in core Sugar.&lt;br /&gt;
:      Let me make this clearer with a scenario. Imagine that a window containing some text is open in Sugar. The framework will provide speech for the text in that window: the user selects the text he wants to hear, and the framework speaks the complete selection.&lt;br /&gt;
&lt;br /&gt;
:      How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:      First he selects the text he wants to hear. Then he either presses a keyboard shortcut (a key combination such as Alt+s or something else) or clicks a button provided in Sugar, similar to the home button. From now on I will call this the speech button. It can be a permanent button that appears when the mouse pointer is moved to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:      I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:      With this tool the user can configure the speech: increase or decrease the volume, and change the language, accent, pitch, voice (male or female), rate of speech, etc.&lt;br /&gt;
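&lt;br /&gt;
:      As a rough illustration of how settings from such a tool could reach the speech engine, the sketch below builds an espeak command line from a settings dictionary. The function name build_espeak_command, the setting keys and the fallback values are assumptions made for this example. The flags -v (voice), -a (amplitude), -p (pitch) and -s (speed in words per minute) are options of the espeak command line tool, and a variant suffix such as +f3 selects a female-sounding voice.&lt;br /&gt;

```python
def clamp(value, low, high):
    """Keep value inside the inclusive range [low, high]."""
    return max(low, min(high, value))


def build_espeak_command(text, settings):
    """Return an argument list for the espeak command line tool.

    `settings` is a hypothetical dictionary filled in by the
    configuration GUI; missing keys fall back to the values below.
    """
    language = settings.get("language", "en")
    # A variant suffix changes the voice character; +f3 is a female variant.
    variant = "+f3" if settings.get("female", False) else ""
    return [
        "espeak",
        "-v", language + variant,
        "-a", str(clamp(settings.get("volume", 100), 0, 200)),  # amplitude
        "-p", str(clamp(settings.get("pitch", 50), 0, 99)),     # pitch
        "-s", str(settings.get("rate", 175)),                   # words per minute
        text,
    ]


command = build_espeak_command("Hello", {"language": "hi", "female": True, "pitch": 120})
print(command)
```

:      On a machine where espeak is installed, the resulting list could be handed to subprocess.call() to actually produce the speech.&lt;br /&gt;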
&lt;br /&gt;
:      I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button. A separate window containing the selected text then opens, and the captioning is shown in that window while the framework reads the text. &lt;br /&gt;
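&lt;br /&gt;
:      One possible way to drive such captioning is to first split the selected text into words together with their character offsets, so that the window can highlight one span at a time. The sketch below is only an illustration: word_spans and highlight are hypothetical helpers, square brackets stand in for the coloring, and how the highlight is kept in step with the audio is left open.&lt;br /&gt;

```python
import re


def word_spans(text):
    """Return (start, end, word) for every whitespace-separated word."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"\S+", text)]


def highlight(text, span):
    """Mark the word at `span` with square brackets, as a stand-in for coloring."""
    start, end, _ = span
    return text[:start] + "[" + text[start:end] + "]" + text[end:]


sentence = "Sugar can speak"
for span in word_spans(sentence):
    # Each step shows the text with the next word marked.
    print(highlight(sentence, span))
```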
 &lt;br /&gt;
:     One more feature I am aiming at is a keyboard speaker. The configuration tool will offer an option to turn the keyboard speaker on or off. When it is on, the framework speaks the name of each key the user presses: pressing the Tab key speaks &#039;Tab&#039;, pressing Caps Lock speaks &#039;caps lock&#039;, and so on.&lt;br /&gt;
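&lt;br /&gt;
:     The mapping from a pressed key to the phrase that is spoken could look roughly like the sketch below. The table SPOKEN_NAMES and the function spoken_name are hypothetical, and the key names are assumed to follow the GDK style (Tab, Caps_Lock, Page_Up); capturing the key presses themselves is not shown.&lt;br /&gt;

```python
# Hypothetical table of spoken names for keys that have no printable character.
SPOKEN_NAMES = {
    "Tab": "Tab",
    "Caps_Lock": "caps lock",
    "Return": "enter",
    "BackSpace": "backspace",
    "space": "space",
}


def spoken_name(key_name):
    """Return the phrase to speak for a key name such as 'Tab' or 'a'."""
    if key_name in SPOKEN_NAMES:
        return SPOKEN_NAMES[key_name]
    if len(key_name) == 1:
        return key_name  # letters, digits and punctuation are spoken as-is
    # Fall back to a readable form of unknown names, e.g. 'Page_Up'.
    return key_name.replace("_", " ").lower()


for key in ["Tab", "Caps_Lock", "a", "Page_Up"]:
    print(spoken_name(key))
```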
&lt;br /&gt;
:     Having described these features, I thought: why not make the framework useful for blind users too? So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:     ICON READER. The idea is simple: it lets a blind user browsing the XO keep track of the current position of the mouse pointer. Suppose the pointer is on the Home button. When the user presses a predefined keyboard key, the framework speaks &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework speaks &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:     If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key recognizable by touch, just as the j and f keys have a slight projection. This functionality can be a boon for blind users.   &lt;br /&gt;
&lt;br /&gt;
:     To summarize, the main features of my proposal are:&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Karaoke-style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*     &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:     According to eye-tracking research on same-language subtitling (SLS), &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it involves speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this framework for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:    In addition, the XO will become a boon for blind children.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruits of literacy. As I have already mentioned, students learn much faster when speech accompanies the text they read, so including this framework in Sugar will make it more effective.&lt;br /&gt;
&lt;br /&gt;
:    In addition, blind children will be able to use the XO, which will be a boon for them. Blind children want to study too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:    I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of that discussion, along with some rough implementation ideas, are given below.&lt;br /&gt;
&lt;br /&gt;
:    The diagram below shows the basic structure of the speech synthesis framework.&lt;br /&gt;
&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;At the bottom level is the speech engine (espeak), which produces the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for the layer over the espeak TTS engine: the speech dispatcher (speechd), which was a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed this with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To get the user-selected text, my idea is to use the clipboard module, which handles copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I have prepared for demonstration purposes is shown below. To try it, first download espeak.py from the following link:&lt;br /&gt;
    http://git.sugarlabs.org/projects/listen-spell/repos/mainline/blobs/master/espeak.py&lt;br /&gt;
    &#039;&#039;&#039;Then select some text and run the code from a terminal. The code will speak the selected text. This is the most basic thing we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak  # espeak.py from Listen and Spell (link above)&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # The PRIMARY selection holds whatever text is currently highlighted.&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()  # None if nothing is selected&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;The text can be selected anywhere in Sugar. Although this snippet uses espeak directly, in the future I will use the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10: &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:    &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:     During this period I will remain in constant touch with my mentor and the Sugar community. I will be active on IRC and the mailing list to discuss design details and further improvements that can be incorporated into this project.&lt;br /&gt;
&lt;br /&gt;
:     I will also study two relatively new tools, StarDict and Orca, which could be very useful for Sugar. &lt;br /&gt;
&lt;br /&gt;
:     By the end of this period I will be completely clear about my approach. For now, here is a rough plan.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Implement the command line interface of the framework.&lt;br /&gt;
*     Complete the basic architecture, in which the user can select text and listen to it through the command line interface.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6 - June 15     &lt;br /&gt;
*     Discuss the GUI design for the configuration manager on IRC.&lt;br /&gt;
*     Finalize the GUI design of the configuration manager.&lt;br /&gt;
*     Release screenshots of the GUI on the wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Implement the configuration manager.&lt;br /&gt;
*     Link the various options I have already described to the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Implement the keyboard speaker.&lt;br /&gt;
*     Release the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26 - July 7&lt;br /&gt;
*     Implement the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Midterm evaluation. Release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Test it on the XO.&lt;br /&gt;
*    Ask for bug reports and suggestions for further improvement.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback.&lt;br /&gt;
*    Prepare for the beta release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Start the documentation work.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    Continue working on the beta release.&lt;br /&gt;
*    Keep working on the framework to make it available in official Sugar distributions. &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi.&lt;br /&gt;
:   I already have some coding experience from real events such as the AI Challenge, whose simulator code I wrote in just 15 days.&lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
In school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project comfortably is that I will have a break of almost three months during my summer vacation, from the end of May to August. I can therefore concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
:   Many students from my college have been associated with OLPC development work. For example:&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. We recently achieved collaboration in Food Force. Mr. Deepank and Mr. Mohit Taneja (both my seniors) have been involved with this OLPC project for the last year. &lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started at GSoC 2008 by Mr. Assim Deodia (a senior) and has recently made progress in removing its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was completed by my senior Mr. Hemant Goyal at GSoC 2008.&lt;br /&gt;
:   What I am trying to convey with these examples is that I have many experienced seniors who are ready to help me in every way they can. I can get guidance and prompt help with technical design or any other kind of work, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruits of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning a language hear the speech again and again, they will be able to learn it very quickly. They will also be able to listen to a story or any other text rather than only reading it. One more potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children. I am very much encouraged that someone is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I greatly appreciate that Sugar Labs is planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute here too. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC projects (such as Food Force) and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still cannot be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I used to get confused between the less-than and greater-than signs. Even when I could work out which number was greater, I got confused about which sign to choose. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign in 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:    I think I have already mentioned most things. One last point I would like to emphasize is that speech synthesis in Sugar is indispensable. So if for any reason you decide not to give me this chance, please still consider this project for future Sugar development.   &lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24706</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24706"/>
		<updated>2009-04-01T15:46:52Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night,&lt;br /&gt;
:so collaborating with any mentor should not be a problem.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I entered college, but I heard a lot about it from my seniors. I then started participating in coding events; my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for that event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential time complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.come/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also participate actively on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Having learned a lot about open source, I now want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar. That is, I want to create a framework that provides speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:    Let me make this clearer with a scenario. Imagine that a window containing some text is open in Sugar. The framework will provide speech for the text in that window. The user is free to choose what to hear: the framework will speak either the complete text contained in the window or only the text the user has selected.&lt;br /&gt;
&lt;br /&gt;
:    How the user does this is very simple. &lt;br /&gt;
&lt;br /&gt;
:    First he selects the text he wants to hear. Then he either presses a keyboard shortcut (a key combination such as Alt+s) or clicks a button provided in Sugar, similar to the home button. From now on I will call this the speech button. It can be a permanent button that appears when the mouse pointer is moved to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:    I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:    With this tool the user can configure the speech: increase or decrease the volume, and change the language, accent, pitch, voice (male or female), rate of speech, etc.&lt;br /&gt;
&lt;br /&gt;
:    I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button, and the framework starts speaking the selected text. &lt;br /&gt;
&lt;br /&gt;
:    The word currently being spoken will be captioned so that the user can keep track of it.&lt;br /&gt;
&lt;br /&gt;
:    One more feature I am including is a keyboard speaker. The configuration tool will offer an option to turn the keyboard speaker on or off. When it is on, the framework speaks the name of each key the user presses: pressing the Tab key speaks &#039;Tab&#039;, pressing Caps Lock speaks &#039;caps lock&#039;, and so on.&lt;br /&gt;
&lt;br /&gt;
:    Now after describing so many features of this framework, I think that why not to make it a useful thing for the blind users. So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:    ICON READER. Now what is icon reader is simple. As the blind user is browsing the XO, he can keep track of the current position of the mouse pointer. Suppose at present the mouse pointer is at Home button. Then as the user presses a pre-defined keyboard key the framework will speak &#039;Home&#039;. Similarly if pointer is at desktop the framework will speak &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:    If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key identifiable by touch, in the same way that the J and F keys have a slight raised bump. This functionality can be a boon for blind users.&lt;br /&gt;
&lt;br /&gt;
:    Here again are the main features of my proposal:&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Karaoke-style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:   According to eye-tracking research, &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS [Same-Language Subtitling]. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when combined with speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. So I am making this framework for children aged 3-15, to make language learning easier for them.&lt;br /&gt;
&lt;br /&gt;
:    Beyond this, the XO can now become useful for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students learn much faster when speech accompanies the text they read, so including this framework will make Sugar more effective.&lt;br /&gt;
&lt;br /&gt;
:    Beyond this, blind students and children will also be able to use the XO, which will be a boon for them. Blind children also want to study.&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:   I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion are:&lt;br /&gt;
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;On the top level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the TTS engine espeak: one is speech dispatcher, which was created as a GSoC project last year, and the other is the gstreamer plugin.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the only workable option, and the best one.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To get the user-selected text, my idea is to use the clipboard module, which takes care of copy and paste. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I have prepared for demonstration is shown below. You can copy and paste it to try it: first select some text, then run the code from a terminal, and it will speak the text. This is the most basic thing we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak&lt;br /&gt;
                         # Speech engine object used to speak the text&lt;br /&gt;
                         obj = espeak()&lt;br /&gt;
                         # The PRIMARY selection holds the currently highlighted text&lt;br /&gt;
                         clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text = clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select the text anywhere in Sugar. Although this code uses espeak directly, in future I will use the gstreamer plugin.&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   &#039;&#039;&#039;April 21 - May 22&lt;br /&gt;
:    During this period I will remain in constant touch with my mentor and sugar community. I will remain active on IRC and mailing list to discuss the design details and further improvements that can be incorporated in this project.&lt;br /&gt;
&lt;br /&gt;
:    I will also study two relatively new tools, STARDICT and ORCA. These can be very useful for Sugar.&lt;br /&gt;
&lt;br /&gt;
:    By the end of this period I will be completely clear on my approach. For now, here is a rough plan.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;May 24 - June 5&lt;br /&gt;
*     Will work on implementing the command line interface of the framework.&lt;br /&gt;
*     Will complete the basic architecture in which the user can select text and listen to it from the command line interface.&lt;br /&gt;
*     Will discuss the UI design for configuration manager on IRC.  &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 6 - June 15&lt;br /&gt;
*     Will work on implementing the keyboard reader. (Command line)&lt;br /&gt;
*     Will design the finalized GUI of the configuration manager.&lt;br /&gt;
*     Will release the snapshots of the GUI on wiki page.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 16 - June 25&lt;br /&gt;
*     Will implement the configuration manager.&lt;br /&gt;
*     Will link the various options I already described in the GUI.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 18 - June 25&lt;br /&gt;
*     Will release the basic configuration manager. &lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;June 26 - July 7&lt;br /&gt;
*     Will implement the keyboard speaker.&lt;br /&gt;
*     Work on the implementation of the icon reader.&lt;br /&gt;
&lt;br /&gt;
:     &#039;&#039;&#039;July 8 - July 12&lt;br /&gt;
*     Will finalize an alpha release of the framework.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 13&lt;br /&gt;
*    Mid term evaluation. Will release the alpha version.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 14 - July 23&lt;br /&gt;
*    Will test it on XO.&lt;br /&gt;
*    Ask for bugs and further improvements.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;July 24 - August 3&lt;br /&gt;
*    Will port the framework to Windows.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 4 - August 13 &lt;br /&gt;
*    Ask for feedback&lt;br /&gt;
*    Preparation for the beta version release.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 14 - August 22&lt;br /&gt;
*    Will release the beta version&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;August 23 onwards&lt;br /&gt;
*    I will continue working on this to make it available in official sugar distros. &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days.&lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
 &lt;br /&gt;
In school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have an almost three-month summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
:   Many students from my college have been associated with OLPC development work, for example:&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;Food Force which is still in its developing phase. We are working hard to achieve collaboration in Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started at GSoC 2008 and is still in progress to remove speech dispatcher dependencies from it.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was completed by my senior at GSoC 2008.&lt;br /&gt;
:   With these examples I am trying to say that I have many helpful seniors with a lot of experience who are ready to help me in every way they can. I can get guidance and help readily, so my chances of getting stuck at any point are very low.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar will be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning languages hear the speech again and again, they will learn much faster. Beyond this, they will be able to hear a story or any other text rather than just read it. One more potential advantage is for blind students, who cannot read text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &amp;quot;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&amp;quot;&lt;br /&gt;
:    This means that people in developing nations like India can learn a language or text faster if the same-language subtitling model is employed.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child  listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words..&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children.&lt;br /&gt;
 &lt;br /&gt;
:   &#039;&#039;&#039;I am very much encouraged that Chirag Jain is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still cannot be resolved, I can always ask on IRC.&lt;br /&gt;
:   Google is also a great option.&lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
A:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in third or fourth standard, I always confused the less-than and greater-than signs. Even when I could tell which number was greater, I got confused when picking the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots next to the greater number, like : and one dot next to the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5&lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24701</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24701"/>
		<updated>2009-04-01T14:42:59Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:   Collaborating with any mentor would not be a problem.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I entered college, but then I heard a lot about it from my seniors. I started participating in coding events, and my first open source event was the AI Challenge organized during our technical fest.&lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open source Sudoku solver in C++ using backtracking. The algorithm has exponential time complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.come/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also actively participate at SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Now, having learned a lot about open source, I want to gain hands-on experience in open source development. GSoC is an opportunity to apply my technical skills, learn new things and at the same time contribute something to society.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar. That is, I want to create a framework that provides speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:    Let me make this clearer with a usage scenario. Imagine any window containing some text is open in Sugar. I will provide speech for the text in that window, and of course the user is free to listen only to the text he has selected. That is, the framework will speak either the complete text in the window or the user-selected text.&lt;br /&gt;
&lt;br /&gt;
:    How the user does this is very simple.&lt;br /&gt;
&lt;br /&gt;
:    First the user selects the text he wants to listen to, then he simply presses a keyboard shortcut (a key combination like Alt+S) or a button provided in Sugar, like the Home button. From now on I will call this the speech button. It can be made permanent, showing up when the mouse pointer moves to the top left corner of the screen.&lt;br /&gt;
&lt;br /&gt;
:    I will also provide a configuration management tool with a simple GUI. It must be simple because our target users are children aged 3-15.&lt;br /&gt;
&lt;br /&gt;
:    This tool configures the speech: the user can increase or decrease the volume and change the language, &#039;&#039;&#039;accent, pitch, male or female voice, rate of speech&#039;&#039;&#039;, etc.&lt;br /&gt;
&lt;br /&gt;
:    I will also provide karaoke-style coloring or captioning of the words being spoken. For example, suppose the Write activity is open in Sugar with some text. The user selects the text and presses the speech button, and the framework starts speaking the selected text.&lt;br /&gt;
&lt;br /&gt;
:    The word currently being spoken will be highlighted so that the user can keep track of it.&lt;br /&gt;
&lt;br /&gt;
:    One more feature I am including is a keyboard speaker. The configuration tool will offer an option to turn the keyboard speaker on or off. When it is on, the framework speaks every key the user presses: pressing the Tab key speaks &#039;Tab&#039;, pressing Caps Lock speaks &#039;caps lock&#039;, and so on.&lt;br /&gt;
&lt;br /&gt;
:    Having described these features, I would also like to make the framework useful for blind users. So here is an idea.&lt;br /&gt;
&lt;br /&gt;
:    ICON READER. The icon reader lets a blind user browsing the XO keep track of the current position of the mouse pointer. Suppose the pointer is on the Home button: when the user presses a predefined keyboard key, the framework speaks &#039;Home&#039;. Similarly, if the pointer is on the desktop, the framework speaks &#039;desktop&#039;.&lt;br /&gt;
&lt;br /&gt;
:    If Sugar Labs likes this idea, a simple change to the XO hardware could make that predefined key identifiable by touch, in the same way that the J and F keys have a slight raised bump. This functionality can be a boon for blind users.&lt;br /&gt;
&lt;br /&gt;
:    Here again are the main features of my proposal:&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Providing speech to text opened in any window in any activity in sugar.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Providing a configuration Panel with GUI from which speech configuration can be changed.  &lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Karaoke-style coloring of the text being spoken.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Keyboard speaker.&lt;br /&gt;
&lt;br /&gt;
*    &#039;&#039;&#039; Icon reader.  &lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Who are you making it for? &lt;br /&gt;
&lt;br /&gt;
:   According to eye-tracking research, &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS [Same-Language Subtitling]. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when combined with speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. So I am making this framework for children aged 3-15, to make language learning easier for them.&lt;br /&gt;
&lt;br /&gt;
:    Beyond this, the XO can now become useful for blind children too.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;Why do they need it?&lt;br /&gt;
&lt;br /&gt;
:    The main aim of Sugar is to spread the fruit of literacy. As I have already mentioned, students learn much faster when speech accompanies the text they read, so including this framework will make Sugar more effective.&lt;br /&gt;
&lt;br /&gt;
:    Beyond this, blind students and children will also be able to use the XO, which will be a boon for them. Blind children also want to study.&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:   I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion are:&lt;br /&gt;
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;On the top level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: one is speech dispatcher, which was set up as one of last year&#039;s GSoC projects, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the remaining and better option.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Then comes a command line tool, triggered by a button or keyboard shortcut, that speaks the selected text.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Then comes the GUI for the configuration management tool of the speech. It will configure the sound qualities like volume, language, accent, pitch etc.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI pyGtk can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in the Sugar core, my idea is to use the clipboard module, which takes care of copy and paste in Sugar. Using this module, the entire selected text can be sent to the speech framework to be spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) which is always present, so that when a user selects any text in the current window and presses the button, the text is spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet which I have prepared for demonstration purposes is shown below. You can copy and paste it to try it. First select some text and then run the code through a terminal. The code will speak the text. This is the very basic behaviour we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak&lt;br /&gt;
                         obj=espeak()&lt;br /&gt;
                         # PRIMARY is the X selection holding the currently highlighted text&lt;br /&gt;
                         clip=gtk.Clipboard(display=gtk.gdk.display_get_default(),selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text=clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select text anywhere in Sugar. Be sure to install espeak first. Although I have written this code using espeak directly, in future I will be using the gstreamer plugin.&lt;br /&gt;
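:   As a rough illustration of the command line tool and the configuration settings described above, the sketch below builds an espeak command from user settings. It assumes the espeak binary is installed and relies on its -v (voice), -p (pitch), -s (speed) and -a (amplitude) options; the function names and default values are hypothetical and are not part of Sugar.&lt;br /&gt;

```python
import subprocess


def build_espeak_command(text, voice="en", pitch=50, speed=175, volume=100):
    """Return the argument list for one espeak invocation.

    -v selects the voice (language or accent), -p the pitch (0-99),
    -s the speed in words per minute and -a the amplitude (0-200).
    """
    if not text or not text.strip():
        # Mirror the demo above: say something helpful when nothing is selected.
        text = "Sorry! No text is selected"
    # Clamp the values to the ranges espeak accepts.
    pitch = max(0, min(99, int(pitch)))
    volume = max(0, min(200, int(volume)))
    return ["espeak", "-v", voice, "-p", str(pitch),
            "-s", str(int(speed)), "-a", str(volume), text]


def speak(text, **settings):
    """Run espeak with the given settings (hypothetical helper)."""
    subprocess.call(build_espeak_command(text, **settings))


if __name__ == "__main__":
    speak("Hello from Sugar", voice="en", pitch=60)
```

:   A configuration GUI would only need to store these few values and pass them to such a helper each time the read button is pressed.&lt;br /&gt;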
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force which is still in its developing phase. My seniors are working hard to achieve collaboration in Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started by my senior Assim at GSoC 2008 and he is still working on it to remove speech dispatcher dependencies from it.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was handled by my senior Hemant at GSoC 2008.&lt;br /&gt;
:   What I am trying to say with these examples is that I have many helpful seniors with a lot of experience who are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost 3 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more effective. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very fast. They will also be able to listen to a story or any other text rather than only reading it. One more potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &amp;quot;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&amp;quot;&lt;br /&gt;
:    This means that people in developing nations like India can learn the language or text faster if the Same-Language Subtitling model is employed.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child  listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words..&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children.&lt;br /&gt;
 &lt;br /&gt;
:   &#039;&#039;&#039;I am very much encouraged that Chirag Jain is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if they are planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends in my home town who are involved in such activities and who would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option. &lt;br /&gt;
:   I can also post the problem on the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could work out which number was greater, I became confused about choosing the right sign. So one day I approached my teacher. She cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the smaller number. For example, to place the sign between 2___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2. : 5 &lt;br /&gt;
:   On joining these dots we get the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24681</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24681"/>
		<updated>2009-04-01T11:56:22Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until midnight.&lt;br /&gt;
:Collaborating with any mentor wouldn&#039;t be a problem.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also actively participate at SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Now that I know much more about open source, I want to gain some real-world experience in open source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis in core sugar&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar, that is, to provide speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:   According to eye-tracking research on Same Language Subtitling (SLS), &#039;&#039;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;Who are you making it for, and why do they need it? &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience if done with speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. So I am making this activity for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:   I discussed a lot with alsroot, assimd and besmac on IRC about this project. The main points of discussion are:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The main aim of this speech synthesis project is to integrate speech into core Sugar. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Integrating speech into core Sugar means providing a speech generator as a basic functionality in Sugar. Thus, whenever a window containing text is open in Sugar, the selected text can be read out by an application running in the background.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The other aim is to develop a GUI for speech configuration which will also act as a configuration management tool.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Now in this tool, basic facilities like changing the volume, pitch, voice, accent, language etc can be included.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Accent according to locale is another important feature that we aim for in speech synthesis. espeak already provides different accents for different languages. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Another nice idea is a keyboard speaker: whenever the user presses a key, the activity speaks it out. &lt;br /&gt;
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;On the top level is the speech engine (espeak) producing the speech.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: one is speech dispatcher, which was set up as one of last year&#039;s GSoC projects, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the remaining and better option.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Then comes a command line tool, triggered by a button or keyboard shortcut, that speaks the selected text.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Then comes the GUI for the configuration management tool of the speech. It will configure the sound qualities like volume, language, accent, pitch etc.&lt;br /&gt;
 &lt;br /&gt;
*   &#039;&#039;&#039;For the GUI pyGtk can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in the Sugar core, my idea is to use the clipboard module, which takes care of copy and paste in Sugar. Using this module, the entire selected text can be sent to the speech framework to be spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) which is always present, so that when a user selects any text in the current window and presses the button, the text is spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet which I have prepared for demonstration purposes is shown below. You can copy and paste it to try it. First select some text and then run the code through a terminal. The code will speak the text. This is the very basic behaviour we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
                         import gtk&lt;br /&gt;
                         from espeak import espeak&lt;br /&gt;
                         obj=espeak()&lt;br /&gt;
                         # PRIMARY is the X selection holding the currently highlighted text&lt;br /&gt;
                         clip=gtk.Clipboard(display=gtk.gdk.display_get_default(),selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
                         text=clip.wait_for_text()&lt;br /&gt;
                         if text is None:&lt;br /&gt;
                             obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
                         else:&lt;br /&gt;
                             obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select text anywhere in Sugar. Be sure to install espeak first. Although I have written this code using espeak directly, in future I will be using the gstreamer plugin.&lt;br /&gt;
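:   The keyboard speaker idea listed above (store the keystrokes in a file, then send the file to the speech generator) could be sketched roughly as follows. This is only an illustration: it assumes keys arrive as names such as those GTK reports (for example space or Return), and the function names and the SPOKEN_NAMES table are hypothetical. The resulting string would then be handed to the speech engine.&lt;br /&gt;

```python
import os
import tempfile

# Hypothetical mapping from key names to the words that should be spoken.
SPOKEN_NAMES = {"Return": "enter", "BackSpace": "backspace", "space": "space"}


def log_keystroke(path, key_name):
    """Append one key name to the log file, one name per line."""
    with open(path, "a") as log:
        log.write(key_name + "\n")


def keystrokes_to_utterance(path):
    """Read the logged key names and join them into one string to speak."""
    with open(path) as log:
        names = [line.strip() for line in log if line.strip()]
    # Keys without a special spoken name are read out as they are.
    return ", ".join(SPOKEN_NAMES.get(name, name) for name in names)


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "keystrokes.log")
    for name in ["h", "i", "space", "Return"]:
        log_keystroke(demo_path, name)
    # The printed string is what would be sent to the speech generator.
    print(keystrokes_to_utterance(demo_path))
```

:   Keeping the log as plain key names, one per line, means the same file can be replayed or inspected without any special tools.&lt;br /&gt;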
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force which is still in its developing phase. My seniors are working hard to achieve collaboration in Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started by my senior Assim at GSoC 2008 and he is still working on it to remove speech dispatcher dependencies from it.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was handled by my senior Hemant at GSoC 2008.&lt;br /&gt;
:   What I am trying to say with these examples is that I have many helpful seniors with a lot of experience who are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost 3 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more effective. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn very fast. They will also be able to listen to a story or any other text rather than only reading it. One more potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &amp;quot;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&amp;quot;&lt;br /&gt;
:    This means that people in developing nations like India can learn the language or text faster if the Same-Language Subtitling model is employed.&lt;br /&gt;
&lt;br /&gt;
:    &#039;&#039;&#039;According to Philip Wagner &amp;lt;Philip5147@aol.com&amp;gt;&lt;br /&gt;
:    &amp;quot;I am Philip Wagner.  I am a member of the Education Team for Sugar Labs. I was a teacher for seventeen years in Africa and in the  United States.&lt;br /&gt;
The speech synthesizer is very important to help children learn to speak, read and write languages.&lt;br /&gt;
Some research states that a child needs to hear words pronounced between one hundred and one thousand times before the child knows the word.  This is a monumental task for a teacher to be repeating words enough times for the children to learn.  The speech synthesizer is a tool which helps the teacher in doing some of that repetition. The more words a child  knows the more learning can take place inside the child&#039;s brain.  We think with words.  If we do not have the words necessary to do the thinking then things don&#039;t progress as well.&lt;br /&gt;
For writing, the child listens to a story that the speech synthesizer very patiently repeats as many times as the child needs it and then the child writes the story in his or her own words.&lt;br /&gt;
The hearing, seeing, and writing of the words helps the student for reading.&lt;br /&gt;
There are many more words that are used in books and on the internet than we use to speak with.&lt;br /&gt;
In English we use about ninety thousand words for speaking. An estimate of the total words in English is more than five-hundred-thousand.  One dictionary has five-hundred-thousand entries.&lt;br /&gt;
The more a child learns the more the child will learn. We cannot depend on teachers to teach enough words to children.&lt;br /&gt;
 &lt;br /&gt;
:   &#039;&#039;&#039;I am very much encouraged that Chirag Jain is working toward preparing the speech synthesizer in Sugar.&amp;quot; &lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if this is being planned. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option.&lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when choosing the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24680</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24680"/>
		<updated>2009-04-01T11:40:39Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:Collaborating with any mentor wouldn&#039;t be a big deal.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I started college, but I heard a lot about it from my seniors. I then started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case time complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.come/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also actively participate at SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Now that I know a lot more about open source, I want to gain some real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis in core sugar&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar; that is, I want to provide speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:   Eye-tracking research on Same-Language Subtitling (SLS) has shown that &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;Who are you making it for, and why do they need it? &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it is done with speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this activity for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:   I discussed a lot with alsroot, assimd and besmac on IRC about this project. The main points of discussion are:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The main aim of speech synthesis in Sugar is to integrate speech into core Sugar. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Integrating speech into core Sugar means providing a speech generator as a basic functionality. If any window containing text is open in Sugar, the selected text can be read out by an application running in the background.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The other aim is to develop a GUI for speech configuration, which will also act as a configuration management tool.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;This tool can offer basic facilities such as changing the volume, pitch, voice, accent, language, etc.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Choosing the accent according to the locale is another important feature we are aiming for. eSpeak already provides different accents for different languages. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Another nice idea is a keyboard speaker: whenever the user presses a key, the activity speaks it out. &lt;br /&gt;
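&lt;br /&gt;
:   A minimal sketch of how the configuration options listed above (voice or accent, pitch, speed and volume) might be passed to the espeak command-line tool. The function name build_espeak_command and the default values are assumptions made for illustration and are not part of the proposal; -v, -p, -s and -a are the espeak options for voice, pitch, speed and amplitude.&lt;br /&gt;

```python
def clamp(value, low, high):
    """Keep value inside the closed range [low, high]."""
    return max(low, min(high, value))


def build_espeak_command(text, voice="en", pitch=50, speed=150, amplitude=100):
    """Return the argument list for one call to the espeak command-line tool.

    -v selects the voice (language/accent), -p the pitch (0-99),
    -s the speed in words per minute, -a the amplitude (0-200).
    The defaults here are illustrative assumptions.
    """
    return [
        "espeak",
        "-v", voice,
        "-p", str(clamp(pitch, 0, 99)),
        "-s", str(speed),
        "-a", str(clamp(amplitude, 0, 200)),
        text,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # even when espeak is not installed.
    print(build_espeak_command("Hello Sugar", voice="hi", pitch=70))
```

:   A configuration GUI would only need to store these few values and rebuild the command whenever the user changes a setting; the resulting list could be handed to subprocess.call to actually speak.&lt;br /&gt;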
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
                                                     --------------&lt;br /&gt;
                                                         Speech  &lt;br /&gt;
                                                        (Level 1)&lt;br /&gt;
                                                     ---------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                        Espeak (TTS)&lt;br /&gt;
                                                         (Level 2)&lt;br /&gt;
                                                    -----------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                     gstreamer Plugin&lt;br /&gt;
                                                        (Level 3)&lt;br /&gt;
                                                    ------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           V&lt;br /&gt;
                                                   -------------------&lt;br /&gt;
                                                    Command Line Tool&lt;br /&gt;
                                            (To produce speech of the selected text)&lt;br /&gt;
                                            (In user selected languages and accents)&lt;br /&gt;
                                                   --------------------&lt;br /&gt;
                                                           |&lt;br /&gt;
                                                           |&lt;br /&gt;
                                 ---------------------------------------------------------&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 |                                                        |&lt;br /&gt;
                                 V                                                        V&lt;br /&gt;
                    ------------------------                                      ---------------------&lt;br /&gt;
                    Button/Keyboard shortcut                                 GUI for Configuration management &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: one is speech dispatcher, which was created as a GSoC project last year, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better option.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in core Sugar, my idea is to use the clipboard module, which takes care of copy and paste in Sugar. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) that is always present, so that if the user selects any text in the current window and presses the button, the text is spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I have prepared for demonstration purposes is shown below. You can copy and paste it to try it: first select some text, then run the code from a terminal, and it will speak the text. This is the very basic behaviour we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
:   import gtk&lt;br /&gt;
:   from espeak import espeak&lt;br /&gt;
:   # Create the speech object and read the current PRIMARY selection (the highlighted text).&lt;br /&gt;
:   obj = espeak()&lt;br /&gt;
:   clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
:   text = clip.wait_for_text()&lt;br /&gt;
:   # wait_for_text() returns None when nothing is selected.&lt;br /&gt;
:   if text is None:&lt;br /&gt;
:       obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
:   else:&lt;br /&gt;
:       obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select the text anywhere in Sugar. Be sure to install espeak first. Although I have written this code using espeak directly, in future I will use the gstreamer plugin.&lt;br /&gt;
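&lt;br /&gt;
:   The keyboard-speaker idea mentioned above could be sketched roughly as follows. This is only an illustration: the names SPOKEN_NAMES, spoken_name and keys_to_speech are hypothetical, and the sketch turns key presses directly into words to speak instead of going through a file.&lt;br /&gt;

```python
# Hypothetical spoken names for keys that have no obvious pronunciation.
SPOKEN_NAMES = {
    " ": "space",
    "\n": "enter",
    "\t": "tab",
    ".": "dot",
    ",": "comma",
}


def spoken_name(key):
    """Return the word the speech engine should say for a single key press."""
    if key in SPOKEN_NAMES:
        return SPOKEN_NAMES[key]
    if key.isupper():
        # Distinguish capital letters so the child hears the difference.
        return "capital " + key.lower()
    return key


def keys_to_speech(keys):
    """Turn a sequence of key presses into a list of utterances, one per key."""
    return [spoken_name(key) for key in keys]


if __name__ == "__main__":
    for utterance in keys_to_speech("Hi there."):
        print(utterance)
```

:   Each utterance in the returned list could then be passed to the same speak call used in the snippet above.&lt;br /&gt;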
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. My seniors are working hard to add collaboration to Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started by my senior Assim at GSoC 2008, and he is still working on it to remove its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was handled by my senior Hemant at GSoC 2008.&lt;br /&gt;
:   What I am trying to say with these examples is that I have many helpful seniors who have a lot of experience and are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager, a telephone directory, etc.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost 3 months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruit of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning languages hear the speech again and again, they will be able to learn it very fast. Moreover, they will be able to listen to a story or any other text rather than just read it. Another potential advantage is for blind students, who can&#039;t read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
:    According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &#039;&#039;&#039;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&lt;br /&gt;
:    This means that people in developing nations like India can learn a language or a text faster if the Same-Language Subtitling model is employed.&lt;br /&gt;
[ONE MORE PARA]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate Sugar Labs&#039; efforts if this is being planned. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is also a good option.&lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   My development environment screen shot is attached on the right side.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater or lesser, I got confused when choosing the right sign. So one day I approached my teacher, and she removed my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24679</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24679"/>
		<updated>2009-04-01T11:36:53Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:Collaborating with any mentor wouldn&#039;t be a big deal.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I started college, but I heard a lot about it from my seniors. I then started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case time complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.come/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also actively participate at SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   Currently I am at world rank 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Now that I know a lot more about open source, I want to gain some real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and, at the same time, contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis in core sugar&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;My project description. What are you making?    &lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar; that is, I want to provide speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:   Eye-tracking research on Same-Language Subtitling (SLS) has shown that &amp;quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;Who are you making it for, and why do they need it? &lt;br /&gt;
&lt;br /&gt;
:    Language learning can be a great experience when it is done with speech. The literacy rate can be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this activity for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
:   I discussed a lot with alsroot, assimd and besmac on IRC about this project. The main points of discussion are:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The main aim of speech synthesis in Sugar is to integrate speech into core Sugar. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Integrating speech into core Sugar means providing a speech generator as a basic functionality. If any window containing text is open in Sugar, the selected text can be read out by an application running in the background.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The other aim is to develop a GUI for speech configuration, which will also act as a configuration management tool.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;This tool can offer basic facilities such as changing the volume, pitch, voice, accent, language, etc.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Choosing the accent according to the locale is another important feature we are aiming for. eSpeak already provides different accents for different languages. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Another nice idea is a keyboard speaker: whenever the user presses a key, the activity speaks it out. &lt;br /&gt;
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
:                                                    --------------&lt;br /&gt;
:                                                        Speech  &lt;br /&gt;
:                                                       (Level 1)&lt;br /&gt;
:                                                    ---------------&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          V&lt;br /&gt;
:                                                   -----------------&lt;br /&gt;
:                                                       Espeak (TTS)&lt;br /&gt;
:                                                        (Level 2)&lt;br /&gt;
:                                                   -----------------&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          V&lt;br /&gt;
:                                                   ------------------&lt;br /&gt;
:                                                    gstreamer Plugin&lt;br /&gt;
:                                                       (Level 3)&lt;br /&gt;
:                                                   ------------------&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          V&lt;br /&gt;
:                                                  -------------------&lt;br /&gt;
:                                                   Command Line Tool&lt;br /&gt;
:                                           (To produce speech of the selected text)&lt;br /&gt;
:                                           (In user selected languages and accents)&lt;br /&gt;
:                                                  --------------------&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                                          |&lt;br /&gt;
:                                ---------------------------------------------------------&lt;br /&gt;
:                                |                                                        |&lt;br /&gt;
:                                |                                                        |&lt;br /&gt;
:                                V                                                        V&lt;br /&gt;
:                   ------------------------                                      ---------------------&lt;br /&gt;
:                   Button/Keyboard shortcut                                 GUI for Configuration management&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;PyGTK can be used for the GUI.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in core Sugar, my idea is to use the clipboard module, which handles copy and paste in Sugar. With this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) that is always available, so that when the user selects any text in the current window and presses the button, the text is spoken aloud.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;A small code snippet that I prepared for demonstration purposes is shown below. You can copy and paste it to try it: first select some text, then run the code from a terminal, and it will speak the text. This is the basic behaviour we want to achieve in Sugar.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
:   import gtk&lt;br /&gt;
:   from espeak import espeak&lt;br /&gt;
:   obj = espeak()&lt;br /&gt;
:   # PRIMARY holds the text currently selected with the mouse&lt;br /&gt;
:   clip = gtk.Clipboard(display=gtk.gdk.display_get_default(), selection=&amp;quot;PRIMARY&amp;quot;)&lt;br /&gt;
:   text = clip.wait_for_text()&lt;br /&gt;
:   if text is None:&lt;br /&gt;
:       # wait_for_text() returns None when nothing is selected&lt;br /&gt;
:       obj.speak(&amp;quot;Sorry! No text is selected&amp;quot;)&lt;br /&gt;
:   else:&lt;br /&gt;
:       obj.speak(text)&lt;br /&gt;
&lt;br /&gt;
:   &#039;&#039;&#039;You can select the text anywhere in Sugar. Be sure to install espeak first. Although I wrote this code using espeak directly, in future I will use the gstreamer plugin.&lt;br /&gt;
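&lt;br /&gt;
:   As a further rough sketch, the settings that the configuration GUI would manage could be turned into a call to the espeak command-line tool. The function name and the default values below are my own assumptions for illustration; only the options -v (voice), -p (pitch), -s (speed) and -a (amplitude) come from espeak itself.&lt;br /&gt;

```python
# Hypothetical helper: builds the argument list for the espeak command-line tool.
# The function name and the default values are assumptions made for illustration.

def build_espeak_command(text, voice="en", pitch=50, speed=170, volume=100):
    """Return the argv list that would ask espeak to speak the given text."""
    # Fall back to a spoken notice when nothing usable was selected.
    if not text or not text.strip():
        text = "Sorry! No text is selected"
    return [
        "espeak",
        "-v", voice,        # voice, which also selects language and accent
        "-p", str(pitch),   # pitch
        "-s", str(speed),   # speed in words per minute
        "-a", str(volume),  # amplitude (volume)
        text,
    ]
```

:   For example, build_espeak_command(&amp;quot;Hello Sugar&amp;quot;, voice=&amp;quot;en-us&amp;quot;) returns a list beginning with espeak that could then be passed to subprocess.call.&lt;br /&gt;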
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. My seniors are working hard to add collaboration to Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. My senior Assim started this project at GSoC 2008 and is still working on it to remove its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. My senior Hemant handled this project at GSoC 2008.&lt;br /&gt;
:   These examples show that I have many helpful, experienced seniors who are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost three months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning a language hear the speech again and again, they will learn it much faster. They will also be able to listen to a story or any other text rather than only reading it. A further potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
:    According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &#039;&#039;&#039;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&lt;br /&gt;
:    This means that people in developing nations like India can learn the language or text faster if the Same-Language Subtitling model is employed.&lt;br /&gt;
[ONE MORE PARA]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I greatly appreciate Sugar Labs planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still cannot be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching with Google is also a good option.&lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always confused the less-than and greater-than signs. I could work out which number was greater or lesser, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots, like :, next to the greater number and one dot, like ., next to the lesser number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24671</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24671"/>
		<updated>2009-04-01T09:04:08Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: /* About your project */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:Collaborating with any mentor would not be a problem.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also participate actively on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   My current world rank is 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Having learned a lot about open source, I now want to gain real-world experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and contribute something to society at the same time. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis in core sugar&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;Describe your project in 10-20 sentences. What are you making? Who are you making it for, and why do they need it?    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar, that is, to provide speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:   Eye-tracking research on Same-Language Subtitling (SLS) reports that &#039;&#039;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&#039;&#039; &lt;br /&gt;
:    Hence language learning can be a great experience when it involves speech. The literacy rate can increase by 6-10% when speech accompanies text, because our brains remember sounds more easily than text. I am therefore making this activity for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:   I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion were:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The main aim is to integrate speech synthesis into core Sugar. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Integrating speech into core Sugar means providing a speech generator as a basic functionality in Sugar. If any window containing text is open in Sugar, the selected text can be read out by an application running in the background.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The other aim is to develop a GUI for speech configuration which will also act as a configuration management tool.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;This tool can include basic facilities such as changing the volume, pitch, voice, accent and language.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Choosing the accent according to the locale is another important feature that we aim for in speech synthesis. espeak already provides different accents for different languages. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Another nice idea is a keyboard speaker: whenever the user presses a key, the activity speaks it aloud. &lt;br /&gt;
&lt;br /&gt;
:   Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: speech dispatcher, which was created as a GSoC project last year, and the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. However, when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. The gstreamer plugin is therefore the better choice.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;PyGTK can be used for the GUI.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in core Sugar, my idea is to use the clipboard module, which handles copy and paste in Sugar. With this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) that is always available, so that when the user selects any text in the current window and presses the button, the text is spoken aloud.&lt;br /&gt;
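&lt;br /&gt;
:   The keyboard speaker idea from the list above can be sketched roughly as follows. This is only an illustration under assumptions: the names SPOKEN_NAMES, spoken_form and speak_keys are hypothetical, and the sketch hands each key to a caller-supplied speak function as it arrives instead of storing the keystrokes in a file first.&lt;br /&gt;

```python
# Hypothetical sketch of the keyboard speaker: map each typed character to the
# words that would be handed to the speech generator. Nothing here is existing
# Sugar code; the table and function names are illustrative assumptions.

SPOKEN_NAMES = {
    " ": "space",
    "\n": "enter",
    "\t": "tab",
    ".": "dot",
    ",": "comma",
}

def spoken_form(key):
    """Return the words to speak for a single typed character."""
    if key in SPOKEN_NAMES:
        return SPOKEN_NAMES[key]
    if key.isupper():
        return "capital " + key.lower()
    return key

def speak_keys(keys, speak):
    """Call speak once per key, in the order the keys were typed."""
    for key in keys:
        speak(spoken_form(key))
```

:   In a real activity the speak argument would be whatever function sends text to the speech framework; for a quick check it can simply be the append method of a list.&lt;br /&gt;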
&lt;br /&gt;
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. My seniors are working hard to add collaboration to Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. My senior Assim started this project at GSoC 2008 and is still working on it to remove its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. My senior Hemant handled this project at GSoC 2008.&lt;br /&gt;
:   These examples show that I have many helpful, experienced seniors who are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. At school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost three months during my summer vacation, from the end of May to August, so I can concentrate entirely on this project.&lt;br /&gt;
&lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would be like making Sugar 10-15% more efficient. When children aged 3-15 who are learning a language hear the speech again and again, they will learn it much faster. They will also be able to listen to a story or any other text rather than only reading it. A further potential advantage is for blind students, who cannot read the text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
:    According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &#039;&#039;&#039;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&lt;br /&gt;
:    This means that people in developing nations like India can learn the language or text faster if the Same-Language Subtitling model is employed.&lt;br /&gt;
[ONE MORE PARA]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I greatly appreciate Sugar Labs planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and would love to contribute. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have several helpful seniors who are already associated with OLPC through projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still cannot be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching with Google is also a good option.&lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   A screenshot of my development environment is shown on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always confused the less-than and greater-than signs. I could work out which number was greater or lesser, but I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots, like :, next to the greater number and one dot, like ., next to the lesser number. For example, to place the sign between 2 ___ 5, I would put one dot next to 2 and two dots next to 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
	<entry>
		<id>https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24662</id>
		<title>Speech-synthesis</title>
		<link rel="alternate" type="text/html" href="https://wiki.sugarlabs.org/index.php?title=Speech-synthesis&amp;diff=24662"/>
		<updated>2009-04-01T06:01:06Z</updated>

		<summary type="html">&lt;p&gt;Chiragjain1989: /* You and the community */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====About you====&lt;br /&gt;
&lt;br /&gt;
Q.1:   &#039;&#039;&#039;What is your name?&lt;br /&gt;
&lt;br /&gt;
A:   Chirag Jain&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.2:   &#039;&#039;&#039;What is your email address?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989{AT}gmail{DOT}com&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.3:   &#039;&#039;&#039;What is your Sugar Labs wiki username?&lt;br /&gt;
&lt;br /&gt;
A:   chiragjain1989&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.4:   &#039;&#039;&#039;What is your IRC nickname?&lt;br /&gt;
&lt;br /&gt;
A:   chirag&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.5:   &#039;&#039;&#039;What is your primary language? (We have mentors who speak multiple languages and can match you with one of them if   you&#039;d prefer.)&lt;br /&gt;
&lt;br /&gt;
A:   Hindi and English&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.6:   &#039;&#039;&#039;Where are you located, and what hours do you tend to work? (We also try to match mentors by general time zone if possible.)&lt;br /&gt;
&lt;br /&gt;
A:   I am located in Delhi, India (GMT+5:30). I can work from early morning until late at night.&lt;br /&gt;
:Collaborating with any mentor would not be a problem.   &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.7:   &#039;&#039;&#039;Have you participated in an open-source project before? If so, please send us URLs to your profile pages for those  projects, or some other demonstration of the work that you have done in open-source. If not, why do you want to work on an open-source project this summer?&lt;br /&gt;
&lt;br /&gt;
A:   I was not aware of open source before I entered college, but I then heard a lot about it from my seniors. I started participating in coding events, and my first open-source event was the AI Challenge organized during our technical fest. &lt;br /&gt;
:   I wrote the simulator code for the event.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/artificial-intelligence.&lt;br /&gt;
&lt;br /&gt;
:   I also wrote an open-source Sudoku solver in C++ using backtracking. The algorithm has exponential worst-case time complexity.&lt;br /&gt;
 &lt;br /&gt;
:   Link: http://code.google.com/p/sudoku-crazy&lt;br /&gt;
&lt;br /&gt;
:   I also actively participate on the SPOJ programming contest site: http://www.spoj.pl/&lt;br /&gt;
:   My current world rank there is 756.&lt;br /&gt;
&lt;br /&gt;
:Link: http://www.spoj.pl/users/chiragjain1989&lt;br /&gt;
:   Now that I know more about open source, I want to gain some hands-on experience in open-source development. GSoC is an opportunity to apply my technical skills, learn new things and at the same time contribute something to society. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====About your project====&lt;br /&gt;
&lt;br /&gt;
Q.8:   &#039;&#039;&#039;What is the name of your project?&lt;br /&gt;
&lt;br /&gt;
A:   Speech Synthesis in core sugar&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.9:   &#039;&#039;&#039;Describe your project in 10-20 sentences. What are you making? Who are you making it for, and why do they need it?    What technologies (programming languages, etc.) will you be using?&lt;br /&gt;
&lt;br /&gt;
A:   I want to integrate speech into core Sugar, that is, to provide speech synthesis as a basic functionality in Sugar.&lt;br /&gt;
:   According to eye-tracking research on Same-Language Subtitling (SLS), &quot;viewers naturally synchronize the auditory and textual information while watching a film song with SLS. When SLS is integrated into popular TV entertainment, reading happens automatically and subconsciously.&quot; &lt;br /&gt;
:    Hence language learning can be a great experience when it is combined with speech. I believe the literacy rate could be increased by 6-10% if speech accompanies text, because our brains remember sounds more easily than text. So I am making this for children aged 3-15, so that learning a language becomes easier for them.&lt;br /&gt;
&lt;br /&gt;
:   I discussed this project at length with alsroot, assimd and besmac on IRC. The main points of discussion were:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The main aim is to integrate speech synthesis into core Sugar. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Integrating speech into core Sugar means providing a speech generator as a basic functionality. Thus, if any window containing text is open in Sugar, the selected text can be read out by an application running in the background.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The other aim is to develop a GUI for speech configuration, which will also act as a configuration management tool.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;This tool can include basic facilities such as changing the volume, pitch, voice, accent and language.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Accent according to locale is another important feature that we aim for. espeak already provides different accents for different languages. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Another nice idea is a keyboard speaker: whenever the user presses a key, the activity speaks it out. &lt;br /&gt;
&lt;br /&gt;
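:   As a rough sketch of what the configuration tool would store, the settings listed in the points above could map onto espeak command-line options as shown here. The SpeechSettings class and both helper functions are hypothetical names, and the example only builds the argument list for the espeak command-line tool to stay self-contained; it is not the gstreamer route.&lt;br /&gt;

```python
from dataclasses import dataclass


@dataclass
class SpeechSettings:
    """Hypothetical container for the options the configuration GUI would edit."""
    voice: str = "en"    # espeak voice name (-v); language and accent are chosen here
    pitch: int = 50      # espeak -p, accepted range 0 to 99
    speed: int = 170     # espeak -s, words per minute
    volume: int = 100    # espeak -a (amplitude), accepted range 0 to 200


def clamp(value, low, high):
    """Keep a slider value inside the range espeak accepts."""
    return max(low, min(high, value))


def espeak_command(text, settings):
    """Build the argument list for the espeak command-line tool."""
    return [
        "espeak",
        "-v", settings.voice,
        "-p", str(clamp(settings.pitch, 0, 99)),
        "-s", str(settings.speed),
        "-a", str(clamp(settings.volume, 0, 200)),
        text,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so no audio device is needed.
    print(espeak_command("Hello Sugar", SpeechSettings(voice="hi", pitch=60)))
```

:   The resulting list could be handed to subprocess.call on a machine with espeak installed. Out-of-range slider values are clamped rather than rejected, so the GUI never produces an invalid command.&lt;br /&gt;
&lt;br /&gt;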
:   Some rough ideas of implementation:&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;There are two options for a layer over the espeak TTS engine: one is speech dispatcher, which was a GSoC project last year, and the other is the gstreamer plugin. &lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;Both of these use espeak. Listen and Spell uses speechd. But when I discussed it with alsroot on IRC, he told me that using speechd is a bad idea because it has become a system daemon and requires root privileges to work. Therefore the gstreamer plugin is the better choice.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the GUI, PyGTK can be used.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;To implement speech in core Sugar, my idea is to use the clipboard module, which handles copy and paste in Sugar. Using this module, the entire selected text can be sent to the speech framework to be spoken.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;For the keyboard speaker, we can simply store the keystrokes in a file and then send the file to the speech generator.&lt;br /&gt;
&lt;br /&gt;
*   &#039;&#039;&#039;The basic idea is to provide a read button in core Sugar (like the home button) that is always available, so that if the user selects any text in the current window and presses the button, the text is spoken aloud.&lt;br /&gt;
&lt;br /&gt;
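:   The keyboard speaker idea above could be sketched as follows. This is only an illustration: the key names are written in the style of GDK key names, and the table and both function names are assumptions, not part of Sugar or espeak.&lt;br /&gt;

```python
# Spoken names for a few non-printing keys. The names on the left are written
# in the style of GDK key names; the table itself is illustrative only.
SPOKEN_KEY_NAMES = {
    "space": "space",
    "Return": "enter",
    "BackSpace": "backspace",
    "Tab": "tab",
}


def spoken_form(key_name):
    """Return the words to speak for a single key press."""
    if key_name in SPOKEN_KEY_NAMES:
        return SPOKEN_KEY_NAMES[key_name]
    if len(key_name) == 1 and key_name.isupper():
        # Say that a letter is upper case instead of relying on intonation.
        return "capital " + key_name.lower()
    return key_name


def keystrokes_to_utterance(key_names):
    """Join the spoken forms of buffered key presses into one string."""
    return ", ".join(spoken_form(name) for name in key_names)


if __name__ == "__main__":
    print(keystrokes_to_utterance(["H", "i", "space", "Return"]))
```

:   The returned string is what would be written to the keystroke file or passed to the speech generator. Keys missing from the table are spoken by their own name.&lt;br /&gt;
&lt;br /&gt;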
----   &lt;br /&gt;
  &lt;br /&gt;
Q.10:   &#039;&#039;&#039;What is the timeline for development of your project? The Summer of Code work period is 7 weeks long, May 23 - August 10; tell us what you will be working on each week. (As the summer goes on, you and your mentor will adjust your schedule, but it&#039;s good to have a plan at the beginning so you have an idea of where you&#039;re headed.) Note that you should probably plan to have something &amp;quot;working and 90% done&amp;quot; by the midterm evaluation (July 6-13); the last steps always take longer than you think, and we will consider cancelling projects which are not mostly working by then.&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.11:   &#039;&#039;&#039;Convince us, in 5-15 sentences, that you will be able to successfully complete your project in the timeline you have described. This is usually where people describe their past experiences, credentials, prior projects, schoolwork, and that sort of thing, but be creative. Link to prior work or other resources as relevant.&lt;br /&gt;
&lt;br /&gt;
A:  I am currently pursuing my B.E. in Computer Science at Netaji Subhas Institute of Technology, New Delhi. Many students from this college have been associated with OLPC development work, for example: &lt;br /&gt;
*   &#039;&#039;&#039;Food Force, which is still under development. My seniors are working hard to add collaboration to Food Force.&lt;br /&gt;
*   &#039;&#039;&#039;Listen and Spell. This project was started by my senior Assim at GSoC 2008, and he is still working on removing its speech dispatcher dependencies.&lt;br /&gt;
*   &#039;&#039;&#039;Speech dispatcher. This project was handled by my senior Hemant at GSoC 2008.&lt;br /&gt;
:   These examples show that I have many helpful seniors with a lot of experience who are ready to help me in every way they can.&lt;br /&gt;
:   I have already described some of my past achievements, such as the AI Challenge, whose simulator code I wrote in just 15 days. In school I also built many small projects in C++, such as a digital diary, a Sudoku solver, a library manager and a telephone directory.&lt;br /&gt;
:   Another reason I can complete the project is that I will have a break of almost 3 months during my summer vacation, from the end of May to August. I can therefore concentrate entirely on this project.&lt;br /&gt;
  &lt;br /&gt;
====You and the community====&lt;br /&gt;
&lt;br /&gt;
Q.12:   &#039;&#039;&#039;If your project is successfully completed, what will its impact be on the Sugar Labs community? Give 3 answers, each 1-3 paragraphs in length. The first one should be yours. The other two should be answers from members of the Sugar Labs community, at least one of whom should be a Sugar Labs GSoC mentor. Provide email contact information for non-GSoC mentors.&lt;br /&gt;
&lt;br /&gt;
A:   In my view, the main aim of Sugar Labs is to spread the fruits of literacy in developing nations. It is a common experience that we learn faster by listening to things than by reading them. Providing speech in core Sugar would, in my estimate, make Sugar 10-15% more effective. When children aged 3-15 who are learning languages hear speech again and again, they will be able to learn much faster. They will also be able to hear a story or any other text rather than just read it. One more potential advantage is for blind students, who can&#039;t read text but can learn the language by listening to it and feeling the words.&lt;br /&gt;
:    According to Edward Cherlin &amp;lt;echerlin@gmail.com&amp;gt;&lt;br /&gt;
:    &#039;&#039;&#039;call our text coloring engine to mark the word being spoken. That&#039;s designed for the pre-literate, on the model of Same-Language Subtitling in India.&lt;br /&gt;
:    This means that people in developing nations like India can learn the language or text faster if the same-language subtitling model is employed.&lt;br /&gt;
[ONE MORE PARA]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.13:   &#039;&#039;&#039;Sugar Labs will be working to set up a small (5-30 unit) Sugar pilot near each student project that is accepted to GSoC so that you can immediately see how your work affects children in a deployment. We will make arrangements to either supply or find all the equipment needed. Do you have any ideas on where you would like your deployment to be, who you would like to be involved, and how we can help you and the community in your area begin it?&lt;br /&gt;
&lt;br /&gt;
A:   I would greatly appreciate the efforts of Sugar Labs in planning this. I think my home town, which is still underdeveloped and has many primary schools, would be the best place to set up this pilot. I have many friends there who are involved in such activities and who would love to contribute here as well. There is also a primary school near my home where we can easily test the activity.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.14:   &#039;&#039;&#039;What will you do if you get stuck on your project and your mentor isn&#039;t around?&lt;br /&gt;
&lt;br /&gt;
A:   I have some very helpful seniors who are already associated with OLPC on projects such as Food Force and who are ready to help me in every way they can.&lt;br /&gt;
:   If the problem still can&#039;t be resolved, I can always ask on IRC.&lt;br /&gt;
:   Searching on Google is another good option. &lt;br /&gt;
:   I can also post the problem to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.15:   &#039;&#039;&#039;How do you propose you will be keeping the community informed of your progress and any problems or questions you might have over the course of the project?&lt;br /&gt;
&lt;br /&gt;
:   I will regularly post my progress reports on my wiki page.&lt;br /&gt;
:   Link: http://wiki.sugarlabs.org/go/chiragjain1989&lt;br /&gt;
:   I can also mail my progress reports to the Sugar mailing list.&lt;br /&gt;
&lt;br /&gt;
====Miscellaneous====&lt;br /&gt;
[[Image:chirag.png|thumb|right|My Screenshot with my email address]]&lt;br /&gt;
Q.16:   &#039;&#039;&#039;We want to make sure that you can set up a [[DevelopmentTeam#Development_systems|development environment]] before the summer starts. Please send us a link to a screenshot of your Sugar development environment with the following modification: when you hover over the XO-person icon in the middle of Home view, the drop-down text should have your email in place of &amp;quot;Restart.&amp;quot; See the image on the right for an example. It&#039;s normal to need assistance with this, so please visit our IRC channel, #sugar on irc.freenode.net, and ask for help.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
A:   A screenshot of my development environment is attached on the right.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.17:   &#039;&#039;&#039;What is your t-shirt size? (Yes, we know Google asks for this already; humor us.)&lt;br /&gt;
&lt;br /&gt;
A:   Extra Large&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.18:   &#039;&#039;&#039;Describe a great learning experience you had as a child.&lt;br /&gt;
 &lt;br /&gt;
A:   When I was in primary school, some of my teachers believed in education through entertainment, so they always used entertaining activities to teach us. For example, when I was in the third or fourth standard, I always got confused between the less-than and greater-than signs. Even when I could tell which number was greater, I got confused when choosing the right sign. So one day I approached my teacher, and she cleared up my confusion with a nice method. She told me to put two dots (:) next to the greater number and one dot (.) next to the lesser number. For example, to place a sign between 2 ___ 5, I would put one dot after 2 and two dots before 5, like this: 2 . : 5 &lt;br /&gt;
:   Joining these dots gives the correct less-than sign.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Q.19:   &#039;&#039;&#039;Is there anything else we should have asked you or anything else that we should know that might make us like you or your project more?&lt;br /&gt;
&lt;br /&gt;
A:   [TODO]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;noinclude&amp;gt;[[Category:2009 GSoC applications]]&amp;lt;/noinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:GSoC]]&lt;/div&gt;</summary>
		<author><name>Chiragjain1989</name></author>
	</entry>
</feed>