Difference between revisions of "Sugar Labs/SOM"

From Sugar Labs
Jump to navigation Jump to search
 
(232 intermediate revisions by 5 users not shown)
Line 1: Line 1:
 +
{{TOCright}}
 
== Kohonen Self Organising Maps ==
 
== Kohonen Self Organising Maps ==
  
 
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
 
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
Self Organising Maps (SOMs) can act as 2D spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related words), and label size is a rough guide to basic frequency of a feature. There are many "correct" map layouts for the same set of data, each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each is a continuous surface and wraps around north/south and east/west (surface of a torus), so if you find an interesting label to one side, remember to check it's neighbours on the opposite side.
+
Self Organising Maps (SOMs) can be used as 2d spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size indicates guide to basic frequency of a feature. There are many "correct" 2d map layouts for the same set of data (due to the multidimensional nature of the data), each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each map is a continuous, tillable surface, and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one edge, remember to check it's neighbours on the opposite side.
  
 
== What Do They Show? ==
 
== What Do They Show? ==
  
Well, you could just treat them like tag clouds, showing the top 200 word features used on the list for a given week/month, but the maps also hold spacial information. Word features that appear close together on the map were used closely (on average) in text content from the list. A height metaphor is also used to indicate the features with the strongest mean associations - the map auto centres on the highest pink peak features, these words have the strongest associations with all the rest of the features on the map; word features in the blue and green areas have weaker mean associations relative to the pink highs, but should not be considered negatively as they will often be tightly associated with surrounding neighbours.
+
Well, you could just treat them as tag clouds, showing the top two hundred word features used on the mail-list for a given week/month, but the maps also hold spacial information that provides context. Word features that appear close together on the map were used closely (on average) in email content from the mail-list. A height metaphor is used to indicate the combination of feature association strength and feature frequency; word features in the blue and green areas have weaker mean associations and frequency relative to the hot orange and pink highs, but should not be considered negatively as they are still in the top ~200 terms, and will often be tightly associated with surrounding neighbours.
  
 
== SOM Related Research Papers ==
 
== SOM Related Research Papers ==
Line 17: Line 18:
 
== It's An Education Project Mailing List ==
 
== It's An Education Project Mailing List ==
  
Weekly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list] - most recent maps shown first.
+
Monthly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
  
<gallery>
+
<gallery widths="275" heights="150" perrow="2">
Image:2008-November-15-21-som.jpg|'''2008 Nov 15th-21st'''
+
File:2012-July-som.png|'''2012 July''' (31 emails)
Image:2008-November-8-14-som.jpg|'''2008 Nov 8th-14th'''<br>Run-up to Sugarcamp meeting. ''Note: Base relief detail added to highlight neighbour deltas.''
+
File:2012-June-som.png|'''2012 June''' (81 emails)
Image:2008-November-1-7-som.jpg|'''2008 Nov 1st-7th'''
 
Image:2008-October-25-31-som.jpg|'''2008 October 25th-31st'''
 
Image:2008-October-18-24-som.jpg|'''2008 October 18th-24th'''
 
Image:2008-October-11-17-som.jpg|'''2008 October 11th-17th'''
 
Image:2008-October-4-10-som.jpg|'''2008 October 4th-10th'''
 
Image:2008-Sept-27-Oct-3-som.jpg|'''2008 Sept 27th Oct 3rd'''
 
Image:2008-September-20-26-som.jpg|'''2008 Sept 20th-26th'''
 
Image:2008-September-13-19-som.jpg|'''2008 Sept 13th-19th'''<br>Discussions about deployment feedback; an interesting side cluster regarding example mock-up PR material.
 
Image:2008-September-6-12-som.jpg|'''2008 Sept 6th-12th'''
 
Image:2008-August-30-September-5-som.jpg|'''2008 Aug 30th - Sept 5th'''
 
Image:2008-August-23-29-som.jpg|'''2008 August 23rd-29th'''
 
Image:2008-August-16-22-som.jpg|'''2008 August 16th-22nd'''
 
Image:2008-August-9-15-som.jpg|'''2008 August 9th-15th'''
 
Image:2008-August-02-08-som.jpg|'''2008 August 2nd-8th'''<br>Sugar and Work surrounded by OLPC, lab, education, development, and community.
 
Image:2008-July-26-August-1-som.jpg|'''2008 Jul 26th - Aug 1st'''<br>A fairly quiet traffic week with a good educational focus (how to best use/apply/approach technology tools for education).
 
Image:2008-July-19-25-som.jpg|'''2008 July 19th-25th'''<br>Prominent verbs are: doing; programming; learning; and education; math; etc
 
Image:2008-July-12-18-som.jpg|'''2008 July 12th-18th'''<br>Some prominent themes on mathematics education.
 
Image:2008-July-05-11-som.jpg|'''2008 July 5th-11th'''<br>Good educational focus; lesson plans, teachers, learning.
 
Image:2008-June-28-July-4-som.jpg|'''2008 Jun 28th - Jul 4th'''<br>Quiet again. Tail-end of Etoys discussion.
 
Image:2008-June-21-27-som.jpg|'''2008 June 21st-27th'''<br>List activity picked up this week; the ongoing Smalltallk/Debian drama is featured.
 
Image:2008-June-14-20-som.jpg|'''2008 June 14th-20th'''<br>A low traffic week. Most of it wiki admin related.
 
Image:2008-June-07-13-som.jpg|'''2008 June 7th-13th'''<br>The discussion seems to have been focused on features ("needs"), documentation and governance.
 
Image:2008-May-31-June-06-som.jpg|'''2008 May 31st - Jun 6th'''<br>Discussion seems to have been focused on Sugar development: what is used, needed, made and to be made.
 
Image:2008-May-24-30-som.jpg|'''2008 May 24th-30th'''<br>The Sugar Lab wiki activity start to snowball.
 
Image:2008-May-17-23-som.jpg|'''2008 May 17th-23rd'''
 
Image:2008-May-10-16-som.jpg|'''2008 May 10th-16th'''<br>Some overflow from dual boot Linux/Windows announcement.
 
Image:2008-May-01-09-som.jpg|'''2008 May 1st-9th'''<br>Lots of discussion around the direction of Sugar, OLPC and the XO platform.
 
 
</gallery>
 
</gallery>
  
 
== Sugar Mailing List ==
 
== Sugar Mailing List ==
  
Monthly maps generated with posts from the [http://lists.laptop.org/listinfo/sugar Sugar mailing list] - most recent maps shown first.
+
Monthly maps generated with posts from the [http://lists.laptop.org/listinfo/sugar Sugar mailing list]. Most recent maps shown first - for older maps see the [[Sugar_Labs/SOM/Sugar|Sugar map history]] page.
  
 
<gallery>
 
<gallery>
Image:2008-July-Sugar-som.jpg|'''2008 July'''
+
Image:2009-April-Sugar devel som.jpg|'''2009 April'''
Image:2008-June-Sugar-som.jpg|'''2008 June'''
+
Image:2009-March-Sugar-devel-som.jpg|'''2009 March'''
Image:2008-May-Sugar-som.jpg|'''2008 May'''
+
Image:2009-February-Sugar-devel-som.jpg|'''2009 February'''
Image:2008-April-Sugar-som.jpg|'''2008 April'''
+
Image:2009-January-Sugar-devel-som.jpg|'''2009 January'''
Image:2008-March-Sugar-som.jpg|'''2008 March'''
+
Image:2008-December-Sugar-devel-som.jpg|'''2008 December'''
Image:2008-February-Sugar-som.jpg|'''2008 February'''
+
Image:2008-November-Sugar-devel-som.jpg|'''2008 November'''
Image:2008-January-Sugar-som.jpg|'''2008 January'''
+
Image:2008-October-Sugar-devel-som.jpg|'''2008 October'''
Image:2007-December-Sugar-som.jpg|'''2007 December'''
+
Image:2008-September-Sugar-devel-som.jpg|'''2008 September'''
Image:2007-November-Sugar-som.jpg|'''2007 November'''
 
Image:2007-October-Sugar-som.jpg|'''2007 October'''
 
Image:2007-September-Sugar-som.jpg|'''2007 September'''
 
Image:2007-August-Sugar-som.jpg|'''2007 August'''
 
Image:2007-July-Sugar-som.jpg|'''2007 July'''
 
Image:2007-June-Sugar-som.jpg|'''2007 June'''
 
Image:2007-May-Sugar-som.jpg|'''2007 May'''
 
Image:2007-April-Sugar-som.jpg|'''2007 April'''
 
Image:2007-March-Sugar-som.jpg|'''2007 March'''
 
Image:2007-February-Sugar-som.jpg|'''2007 February'''
 
Image:2007-January-Sugar-som.jpg|'''2007 January'''
 
Image:2006-December-Sugar-som.jpg|'''2006 December'''
 
Image:2006-November-Sugar-som.jpg|'''2006 November'''
 
Image:2006-October-Sugar-som.jpg|'''2006 October'''
 
Image:2006-September-Sugar-som.jpg|'''2006 September'''
 
Image:2006-August-som.jpg|'''2006 August'''
 
Image:2006-July-Sugar-som.jpg|'''2006 July'''
 
 
</gallery>
 
</gallery>
 +
 +
== Technology in Education Academic Research Papers ==
 +
 +
A selection of SOMs for technology in education [http://wiki.laptop.org/go/Academic_papers research papers] relating to the One Laptop Per Child project can be found on the laptop.org wiki.
  
 
== Future ==
 
== Future ==
  
The mapping algorithms and visualisation style will continue to be refined, details will be posted here on any significant modifications. The code base was originally designed for bulk text documents from a single author, tested on works of literature from Project Gutenberg.
+
The mapping algorithms and visualisation style will continue to be refined, details will be posted on any significant modifications (see comments under images for changes). The code base was originally designed for bulk text documents from a single author, tested on works of literature from Project Gutenberg.

Latest revision as of 22:23, 6 August 2012

Kohonen Self Organising Maps

Legend

Self Organising Maps (SOMs) can be used as 2d spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size indicates guide to basic frequency of a feature. There are many "correct" 2d map layouts for the same set of data (due to the multidimensional nature of the data), each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each map is a continuous, tillable surface, and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one edge, remember to check it's neighbours on the opposite side.

What Do They Show?

Well, you could just treat them as tag clouds, showing the top two hundred word features used on the mail-list for a given week/month, but the maps also hold spacial information that provides context. Word features that appear close together on the map were used closely (on average) in email content from the mail-list. A height metaphor is used to indicate the combination of feature association strength and feature frequency; word features in the blue and green areas have weaker mean associations and frequency relative to the hot orange and pink highs, but should not be considered negatively as they are still in the top ~200 terms, and will often be tightly associated with surrounding neighbours.

SOM Related Research Papers

Self-Organizing Maps and Constructive Learning SOM from text.

Self-Organizing Maps and Constructive Learning (PDF) Honkela T., Leinonen T., Lonka K., Raike A. (2000): Self-Organizing Maps and Constructive Learning. Proceedings of ICEUT'2000, International Conference on Educational Uses of Communication and Information Technologies, Beijing, China. August 21-25, 2000, pp. 339-343.

In this article, the use of the self-organizing map (SOM) is approached on the basis of current theories of learning. Possibilities of computer and networked platforms that aim at helping human learning are also inspected. It is shown how the SOM can be considered a model of constructive learning. The area of constructive learning is outlined and two cases of using the self-organizing map in computer supported learning environments are presented.


It's An Education Project Mailing List

Monthly maps generated with posts from the IAEP mailing list. Most recent maps shown first - for older maps please see the IAEP map history map archive page.

Sugar Mailing List

Monthly maps generated with posts from the Sugar mailing list. Most recent maps shown first - for older maps see the Sugar map history page.

Technology in Education Academic Research Papers

A selection of SOMs for technology in education research papers relating to the One Laptop Per Child project can be found on the laptop.org wiki.

Future

The mapping algorithms and visualisation style will continue to be refined, details will be posted on any significant modifications (see comments under images for changes). The code base was originally designed for bulk text documents from a single author, tested on works of literature from Project Gutenberg.