Line 3: |
Line 3: |
| | | |
| [[Image:SOM_legend.jpg|thumb|172px|Legend]] | | [[Image:SOM_legend.jpg|thumb|172px|Legend]] |
− | Self Organising Maps (SOMs) can be used as 2d spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size indicates guide to basic frequency of a feature. There are many "correct" 2d map layouts for the same set of data (due to the multidimensional nature of the data), each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each is a continuous, tillable surface and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one side, remember to check it's neighbours on the opposite side. | + | Self Organising Maps (SOMs) can be used as 2d spatial summariser visualisations of multidimensional data. In the maps shown here, text distance metrics are generated from the weekly/monthly content on some of the more active mailing lists. Using a geographic like landscape metaphor, the height (colour gradient) indicates features with strong associations to all other features; proximity represents association between specific features (e.g. related terms), and label size indicates guide to basic frequency of a feature. There are many "correct" 2d map layouts for the same set of data (due to the multidimensional nature of the data), each map generation will usually settle into a slightly different set of local minima, but the associations are no less valid for each. After removing linguistic junk words, and word stemming, the maps currently pick the weeks/months top ~200 features by frequency. Each map is a continuous, tillable surface, and wraps around north/south and east/west (surface of a torus); so if you find an interesting label to one edge, remember to check it's neighbours on the opposite side. |
| | | |
| == What Do They Show? == | | == What Do They Show? == |
Line 18: |
Line 18: |
| == It's An Education Project Mailing List == | | == It's An Education Project Mailing List == |
| | | |
− | Weekly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page.
| + | Monthly maps generated with posts from the [http://lists.sugarlabs.org/listinfo/iaep IAEP mailing list]. Most recent maps shown first - for older maps please see the [[Sugar_Labs/SOM/IAEP|IAEP map history map archive]] page. |
| | | |
| <gallery widths="275" heights="150" perrow="2"> | | <gallery widths="275" heights="150" perrow="2"> |
− | File:2011-Aug-13-19-som.jpg|'''2011 Aug 13th-19th''' (21 emails) | + | File:2012-July-som.png|'''2012 July''' (31 emails) |
− | File:2011-Aug-6-12-som.jpg|'''2011 Aug 6th-12th''' (28 emails)
| + | File:2012-June-som.png|'''2012 June''' (81 emails) |
− | File:2011-Jul-30-Aug-5-som.jpg|'''2011 Jul 30th-Aug 5th''' (29 emails)
| |
− | File:2011-July-23-29-som.jpg|'''2011 July 23rd-29th''' (41 emails)
| |
− | File:2011-July-16-22-som.jpg|'''2011 July 16th-22nd''' (45 emails)
| |
− | File:2011-July-9-15-som.jpg|'''2011 July 9th-15th''' (63 emails). ''Includes improved support for hyphenated terms.'' | |
− | File:2011-Jul-2-8-som.jpg|'''2011 Jul 2nd-8th''' (54 emails)
| |
− | File:2011-Jun-25-Jul-1-som.jpg|'''2011 Jun 25th-Jul 1st''' (51 emails)
| |
| </gallery> | | </gallery> |
| | | |