GEO statistics (by type)

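I've been looking through GEO as well. The full data set is 641 GB (compressed), and the metadata alone, with the expression values stripped out, is 2.6 GB (uncompressed), so just handling it is a chore.
The metadata includes a data type field, so I counted how many entries there are of each type.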

17891 Expression profiling by array
1151 Genome binding/occupancy profiling by genome tiling array
569 Genome binding/occupancy profiling by high throughput sequencing
502 Non-coding RNA profiling by array
501 Genome variation profiling by genome tiling array
364 Non-coding RNA profiling by high throughput sequencing
350 Expression profiling by genome tiling array
337 Genome variation profiling by array
321 Genome variation profiling by SNP array
288 Expression profiling by high throughput sequencing
218 Expression profiling by SAGE
216 Other
182 SNP genotyping by SNP array
163 Methylation profiling by genome tiling array
93 Genome binding/occupancy profiling by array
92 Non-coding RNA profiling by genome tiling array
88 Methylation profiling by array
63 Methylation profiling by high throughput sequencing
40 Expression profiling by RT-PCR
38 Protein profiling by protein array
19 Expression profiling by MPSS
18 Third-party reanalysis
15 Genome variation profiling by high throughput sequencing
8 Genome binding/occupancy profiling by SNP array
7 Expression profiling by SNP array
4 Protein profiling by Mass Spec
2 Methylation profiling by SNP array
1 rrp6 delta, trf4 delta vs WT
1 Reference design

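A single series can have more than one type, so such a series is counted once per type in the list above. Counting those repeats gives 23542 rows; counting each series only once gives 22431 rows, a difference of 1111.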
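
For reference, here is a minimal sketch of how a tally like this could be produced. It is not the script used for the numbers above. It assumes the metadata is in GEO's SOFT text format, where each series starts with a `^SERIES = GSE...` line and each of its types appears on a `!Series_type = ...` line. The function name `count_series_types` and the sample accessions are made up for illustration.

```python
from collections import Counter


def count_series_types(lines):
    """Tally series types from SOFT-style metadata lines (assumed layout).

    Returns (per-type counts, total type rows, number of distinct series).
    """
    type_counts = Counter()
    series_seen = set()
    current_series = None
    for raw in lines:
        line = raw.strip()
        key, _, value = line.partition("=")
        key, value = key.strip(), value.strip()
        if key == "^SERIES":
            # A new series record begins here.
            current_series = value
        elif key == "!Series_type" and current_series is not None:
            # A series with two types contributes two rows.
            type_counts[value] += 1
            series_seen.add(current_series)
    return type_counts, sum(type_counts.values()), len(series_seen)


# Hypothetical sample: three series, one of which has two types.
sample = [
    "^SERIES = GSE0001",
    "!Series_type = Expression profiling by array",
    "^SERIES = GSE0002",
    "!Series_type = Expression profiling by array",
    "!Series_type = Non-coding RNA profiling by array",
    "^SERIES = GSE0003",
    "!Series_type = Methylation profiling by array",
]

type_counts, total_rows, distinct_series = count_series_types(sample)
for name, n in type_counts.most_common():
    print(n, name)
print("rows counting repeats:", total_rows)
print("distinct series:", distinct_series)
```

On real data the input would be the lines of the metadata file rather than a list, and the gap between the two totals is the number of extra type rows that come from multi-type series.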