Changes

716 bytes removed, 22:23, 6 August 2012
Line 3: Line 3:     
[[Image:SOM_legend.jpg|thumb|172px|Legend]]
 
Self Organising Maps (SOMs) can be used as 2d spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size indicates guide to basic frequency of a feature. There are many "correct" 2d map layouts for the same set of data (due to the multidimensional nature of the data), each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each is a continuous, tillable surface and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one side, remember to check it's neighbours on the opposite side.
+
Self Organising Maps (SOMs) can be used as 2D spatial summary visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic-like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms); and label size gives a guide to the basic frequency of a feature. There are many "correct" 2D map layouts for the same set of data (due to the multidimensional nature of the data); each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words and applying word stemming, the maps currently pick the week's/month's top ~200 features by frequency. Each map is a continuous, tileable surface that wraps around north/south and east/west (the surface of a torus), so if you find an interesting label at one edge, remember to check its neighbours on the opposite side.
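The wrap-around described above can be illustrated with a small sketch. It is not taken from the software that generates these maps; the function name `torus_distance` and the grid size are assumptions made for illustration. It shows how the distance between two cells is measured on a grid whose edges join up, which is why a label at one edge can be a close neighbour of a label at the opposite edge.

```python
def torus_distance(a, b, width, height):
    """Distance between grid cells a and b on a map that wraps on both axes.

    a and b are (x, y) tuples with 0 <= x < width and 0 <= y < height.
    This is an illustrative helper, not part of the map-generation code.
    """
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    # On a torus the shorter route may cross the east/west or north/south edge.
    dx = min(dx, width - dx)
    dy = min(dy, height - dy)
    return (dx * dx + dy * dy) ** 0.5


# Two cells at opposite east/west edges of a hypothetical 10x10 map
# are direct neighbours once the wrap-around is taken into account.
left_edge = (0, 4)
right_edge = (9, 4)
print(torus_distance(left_edge, right_edge, 10, 10))
```

On a flat map the same two cells would be nine cells apart, so the wrap-around changes which labels count as neighbours.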
    
== What Do They Show? ==
 
Line 18: Line 18:  
== It's An Education Project Mailing List ==
 
   −
Weekly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history]] page.
+
Monthly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
    
<gallery widths="275" heights="150" perrow="2">
 
<gallery widths="275" heights="150" perrow="2">
File:2011-May-21-27-som.jpg|'''2011 May 21st-27th''' (34 emails)
+
File:2012-July-som.png|'''2012 July''' (31 emails)
File:2011-May-14-20-som.jpg|'''2011 May 14th-20th''' (74 emails). ''Includes a new process to group associated terms together before clustering. A good example on this map is it finding Sugar Labs the community, being distinct from Sugar the software, previously the single term Sugar would have gravitated both community & software clusters towards each other.''
+
File:2012-June-som.png|'''2012 June''' (81 emails)
Image:2011-May-7-13-som.jpg|'''2011 May 7th-13th''' (75 emails)
  −
Image:2011-April-30-May-6-som.jpg|'''2011 April 30th-May 6th''' (49 emails)
  −
Image:2011-April-23-29-som.jpg|'''2011 April 23rd-29th''' (102 emails)
  −
Image:2011-April-16-22-som.jpg|'''2011 April 16th-22nd''' (79 emails)
  −
Image:2011-Apr-9-15-som.jpg|'''2011 Apr 9th-15th''' (59 emails)
  −
Image:2011-Apr-2-8-som.jpg|'''2011 Apr 2nd-8th''' (58 emails)
   
</gallery>
 
</gallery>
  