Changes

481 bytes removed ,  22:23, 6 August 2012
Line 3: Line 3:     
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
 
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
Self Organising Maps (SOMs) can be used as 2D spatial summary visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic-like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms); and label size gives a guide to the basic frequency of a feature. There are many "correct" 2D map layouts for the same set of data (due to the multidimensional nature of the data); each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words and stemming the remaining words, the maps currently pick the week's/month's top ~200 features by frequency. Each is a continuous, tileable surface and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one side, remember to check its neighbours on the opposite side.
+
Self Organising Maps (SOMs) can be used as 2D spatial summary visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic-like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms); and label size gives a guide to the basic frequency of a feature. There are many "correct" 2D map layouts for the same set of data (due to the multidimensional nature of the data); each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words and stemming the remaining words, the maps currently pick the week's/month's top ~200 features by frequency. Each map is a continuous, tileable surface, and wraps around north/south and east/west (surface of a torus); so if you find an interesting label near one edge, remember to check its neighbours on the opposite side.
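The wrap-around behaviour described above can be illustrated with a small sketch. It assumes a rectangular grid of map cells addressed as (row, column); the function name and grid size are hypothetical and are not taken from the tool that generated these maps.

```python
import math


def toroidal_distance(a, b, rows, cols):
    """Distance between grid cells a and b on a map that wraps on both axes.

    a and b are (row, col) tuples. The grid layout is an assumption made
    for illustration; it is not taken from the map-generation software.
    """
    # Plain offsets along each axis.
    dr = abs(a[0] - b[0])
    dc = abs(a[1] - b[1])
    # On a torus, going "the other way round" may be shorter.
    dr = min(dr, rows - dr)
    dc = min(dc, cols - dc)
    return math.hypot(dr, dc)
```

For example, on a 10 by 10 grid the cells (0, 0) and (0, 9) sit on opposite edges of the flat image, yet `toroidal_distance((0, 0), (0, 9), 10, 10)` treats them as immediate neighbours.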
    
== What Do They Show? ==
 
== What Do They Show? ==
Line 18: Line 18:  
== It's An Education Project Mailing List ==
 
== It's An Education Project Mailing List ==
   −
Weekly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
+
Monthly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
    
<gallery widths="275" heights="150" perrow="2">
 
<gallery widths="275" heights="150" perrow="2">
File:2011-Aug-27-Sept-2-som.jpg|'''2011 Aug 27th-Sept 2nd''' (40 emails)
+
File:2012-July-som.png|'''2012 July''' (31 emails)
File:2011-Aug-20-26-som.jpg|'''2011 Aug 20th-26th''' (33 emails)
+
File:2012-June-som.png|'''2012 June''' (81 emails)
File:2011-Aug-13-19-som.jpg|'''2011 Aug 13th-19th''' (21 emails)
  −
File:2011-Aug-6-12-som.jpg|'''2011 Aug 6th-12th''' (28 emails)
  −
File:2011-Jul-30-Aug-5-som.jpg|'''2011 Jul 30th-Aug 5th''' (29 emails)
  −
File:2011-July-23-29-som.jpg|'''2011 July 23rd-29th''' (41 emails)
  −
File:2011-July-16-22-som.jpg|'''2011 July 16th-22nd''' (45 emails)
  −
File:2011-July-9-15-som.jpg|'''2011 July 9th-15th''' (63 emails). ''Includes improved support for hyphenated terms.''
   
</gallery>
 
</gallery>
  