Difference between revisions of "Sugar Labs/SOM"

From Sugar Labs
Jump to navigation Jump to search
(ADD)
(Added Sugar map gallery)
Line 26: Line 26:
  
 
<gallery>
 
<gallery>
Image:2008-June-Sugar-som.jpg
+
Image:2008-June-Sugar-som.jpg|2008 June
Image:2008-May-Sugar-som.jpg
+
Image:2008-May-Sugar-som.jpg|2008 May
Image:2008-April-Sugar-som.jpg
+
Image:2008-April-Sugar-som.jpg|2008 April
Image:2008-March-Sugar-som.jpg
+
Image:2008-March-Sugar-som.jpg|2008 March
Image:2008-February-Sugar-som.jpg
+
Image:2008-February-Sugar-som.jpg|2008 February
Image:2008-January-Sugar-som.jpg
+
Image:2008-January-Sugar-som.jpg|2008 January
Image:2007-December-Sugar-som.jpg
+
Image:2007-December-Sugar-som.jpg|2007 December
Image:2007-November-Sugar-som.jpg
+
Image:2007-November-Sugar-som.jpg|2007 November
Image:2007-October-Sugar-som.jpg
+
Image:2007-October-Sugar-som.jpg|2007 October
Image:2007-September-Sugar-som.jpg
+
Image:2007-September-Sugar-som.jpg|2007 September
Image:2007-August-Sugar-som.jpg
+
Image:2007-August-Sugar-som.jpg|2007 August
Image:2007-July-Sugar-som.jpg
+
Image:2007-July-Sugar-som.jpg|2007 July
Image:2007-June-Sugar-som.jpg
+
Image:2007-June-Sugar-som.jpg|2007 June
Image:2007-May-Sugar-som.jpg
+
Image:2007-May-Sugar-som.jpg|2007 May
 
+
Image:2007-April-Sugar-som.jpg|2007 April
Image:2007-March-Sugar-som.jpg
+
Image:2007-March-Sugar-som.jpg|2007 March
Image:2007-February-Sugar-som.jpg
+
Image:2007-February-Sugar-som.jpg|2007 February
Image:2007-January-Sugar-som.jpg
+
Image:2007-January-Sugar-som.jpg|2007 January
Image:2006-December-Sugar-som.jpg
+
Image:2006-December-Sugar-som.jpg|2006 December
Image:2006-November-Sugar-som.jpg
+
Image:2006-November-Sugar-som.jpg|2006 November
Image:2006-October-Sugar-som.jpg
+
Image:2006-October-Sugar-som.jpg|2006 October
Image:2006-September-Sugar-som.jpg
+
Image:2006-September-Sugar-som.jpg|2006 September
 
+
Image:2006-August-som.jpg|2006 August
Image:2006-July-Sugar-som.jpg
+
Image:2006-July-Sugar-som.jpg|2006 July
 
</gallery>
 
</gallery>
  

Revision as of 13:56, 13 July 2008

Kohonen Self Organising Maps

Legend

Self Organising Maps (SOMs) can act as 2D spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related words), and label size is a rough guide to basic frequency of a feature. There are many "correct" map layouts for the same set of data, each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each is a continuous surface and wraps around north/south and east/west (surface of a torus), so if you find an interesting label to one side, remember to check it's neighbours on the opposite side.

What Do They Show?

Well, you could just treat them like tag clouds, showing the top 200 word features used on the list for a given week/month, but the maps also hold spacial information. Word features that appear close together on the map were used closely (on average) in text content from the list. A height metaphor is also used to indicate the features with the strongest mean associations - the map auto centres on the highest pink peak features, these words have the strongest associations with all the rest of the features on the map; word features in the blue and green areas have weaker mean associations relative to the pink highs, but should not be considered negatively as they will often be tightly associated with surrounding neighbours.

It's An Education Project Mailing List (Weekly)

Sugar Mailing List (Monthly)

Future

The mapping algorithms and visualisation style will continue to be refined, details will be posted here on any significant modifications. The code base was originally designed for bulk text documents from a single author, tested on works of literature from Project Gutenberg.