Changes

472 bytes removed, 22:23, 6 August 2012
Line 3: Line 3:     
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
 
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
Self Organising Maps (SOMs) can be used as 2D spatial summary visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic-like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size gives a rough guide to the basic frequency of a feature. There are many "correct" 2D map layouts for the same set of data (due to the multidimensional nature of the data); each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words and stemming the remaining words, the maps currently pick the week's/month's top ~200 features by frequency. Each is a continuous, tileable surface and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one side, remember to check its neighbours on the opposite side.
+
Self Organising Maps (SOMs) can be used as 2D spatial summary visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic-like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size gives a rough guide to the basic frequency of a feature. There are many "correct" 2D map layouts for the same set of data (due to the multidimensional nature of the data); each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words and stemming the remaining words, the maps currently pick the week's/month's top ~200 features by frequency. Each map is a continuous, tileable surface, and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one edge, remember to check its neighbours on the opposite side.
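The wrap-around described above can be illustrated with a small sketch. This is not the code used to generate these maps; it only assumes a rectangular grid of cells addressed as (x, y), and the names <code>torus_distance</code> and <code>wrapped_neighbours</code> are hypothetical helpers chosen for the example.

```python
def torus_distance(a, b, width, height):
    """Shortest straight-line distance between grid cells a and b
    on a map that wraps east/west and north/south (a torus).
    Assumes cells are (x, y) tuples; illustrative only."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    # On each axis, going "around the edge" may be shorter than going across.
    dx = min(dx, width - dx)
    dy = min(dy, height - dy)
    return (dx * dx + dy * dy) ** 0.5


def wrapped_neighbours(cell, width, height):
    """The four adjacent cells (west, east, north, south), wrapping at the edges."""
    x, y = cell
    steps = ((-1, 0), (1, 0), (0, -1), (0, 1))
    return [((x + sx) % width, (y + sy) % height) for sx, sy in steps]
```

On a 10-cell-wide map, a label in column 0 and a label in column 9 of the same row are direct neighbours under this measure, which is why a label near one edge should be read together with the labels on the opposite edge.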
    
== What Do They Show? ==
 
== What Do They Show? ==
Line 18: Line 18:  
== It's An Education Project Mailing List ==
 
== It's An Education Project Mailing List ==
   −
Weekly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
+
Monthly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
    
<gallery widths="275" heights="150" perrow="2">
 
<gallery widths="275" heights="150" perrow="2">
File:2011-July-9-15-som.jpg|'''2011 July 9th-15th''' (63 emails). ''Includes improved support for hyphenated terms.''
+
File:2012-July-som.png|'''2012 July''' (31 emails)
File:2011-Jul-2-8-som.jpg|'''2011 Jul 2nd-8th''' (54 emails)
+
File:2012-June-som.png|'''2012 June''' (81 emails)
File:2011-Jun-25-Jul-1-som.jpg|'''2011 Jun 25th-Jul 1st''' (51 emails)
  −
File:2011-Jun-18-24-som.jpg|'''2011 Jun 18th-24th''' (43 emails)
  −
File:2011-Jun-11-17-som.jpg|'''2011 Jun 11th-17th''' (105 emails)
  −
File:2011-Jun-4-10-som.jpg|'''2011 Jun 4th-10th''' (80 emails)
  −
File:2011-May-28-Jun-3-som.jpg|'''2011 May 28th-Jun 3rd''' (57 emails)
  −
File:2011-May-21-27-som.jpg|'''2011 May 21st-27th''' (34 emails)
   
</gallery>
 
</gallery>
  