Download
All GEMLeR datasets are available as compressed .zip archives.
GEMLeR v1.0 (Released 10th October 2008):
| GEMLeR_ARFF_short.zip (9 OVA + 36 AP datasets, ARFF format, 706 MB) |
| GEMLeR_CSV_short.zip (9 OVA + 36 AP datasets, CSV format, 688 MB) |
Both archives contain the following:
AP ("all-paired") Datasets
| Filename | Num. Samples | Num. Genes | Num. Class1 / Num. Class2 |
| --- | --- | --- | --- |
| AP_Breast_Colon.arff | 630 | 10937 | 344 / 286 |
| AP_Breast_Kidney.arff | 604 | 10937 | 344 / 260 |
| AP_Breast_Lung.arff | 470 | 10937 | 344 / 126 |
| AP_Breast_Omentum.arff | 421 | 10937 | 344 / 77 |
| AP_Breast_Ovary.arff | 542 | 10937 | 344 / 198 |
| AP_Breast_Prostate.arff | 413 | 10937 | 344 / 69 |
| AP_Breast_Uterus.arff | 468 | 10937 | 344 / 124 |
| AP_Colon_Kidney.arff | 546 | 10937 | 286 / 260 |
| AP_Colon_Lung.arff | 412 | 10937 | 286 / 126 |
| AP_Colon_Omentum.arff | 363 | 10937 | 286 / 77 |
| AP_Colon_Ovary.arff | 484 | 10937 | 286 / 198 |
| AP_Colon_Prostate.arff | 355 | 10937 | 286 / 69 |
| AP_Colon_Uterus.arff | 410 | 10937 | 286 / 124 |
| AP_Endometrium_Breast.arff | 405 | 10937 | 61 / 344 |
| AP_Endometrium_Colon.arff | 347 | 10937 | 61 / 286 |
| AP_Endometrium_Kidney.arff | 321 | 10937 | 61 / 260 |
| AP_Endometrium_Lung.arff | 187 | 10937 | 61 / 126 |
| AP_Endometrium_Omentum.arff | 138 | 10937 | 61 / 77 |
| AP_Endometrium_Ovary.arff | 259 | 10937 | 61 / 198 |
| AP_Endometrium_Prostate.arff | 130 | 10937 | 61 / 69 |
| AP_Endometrium_Uterus.arff | 185 | 10937 | 61 / 124 |
| AP_Lung_Kidney.arff | 386 | 10937 | 126 / 260 |
| AP_Lung_Uterus.arff | 250 | 10937 | 126 / 124 |
| AP_Omentum_Kidney.arff | 337 | 10937 | 77 / 260 |
| AP_Omentum_Lung.arff | 203 | 10937 | 77 / 126 |
| AP_Omentum_Ovary.arff | 275 | 10937 | 77 / 198 |
| AP_Omentum_Prostate.arff | 146 | 10937 | 77 / 69 |
| AP_Omentum_Uterus.arff | 201 | 10937 | 77 / 124 |
| AP_Ovary_Kidney.arff | 458 | 10937 | 198 / 260 |
| AP_Ovary_Lung.arff | 324 | 10937 | 198 / 126 |
| AP_Ovary_Uterus.arff | 322 | 10937 | 198 / 124 |
| AP_Prostate_Kidney.arff | 329 | 10937 | 69 / 260 |
| AP_Prostate_Lung.arff | 195 | 10937 | 69 / 126 |
| AP_Prostate_Ovary.arff | 267 | 10937 | 69 / 198 |
| AP_Prostate_Uterus.arff | 193 | 10937 | 69 / 124 |
| AP_Uterus_Kidney.arff | 384 | 10937 | 124 / 260 |
OVA ("one-versus-all") Datasets
| Filename | Num. Samples | Num. Genes | Num. Class1 / Num. Class2 |
| --- | --- | --- | --- |
| OVA_Breast.arff | 1545 | 10937 | 344 / 1201 |
| OVA_Colon.arff | 1545 | 10937 | 286 / 1259 |
| OVA_Endometrium.arff | 1545 | 10937 | 61 / 1484 |
| OVA_Kidney.arff | 1545 | 10937 | 260 / 1285 |
| OVA_Lung.arff | 1545 | 10937 | 126 / 1419 |
| OVA_Omentum.arff | 1545 | 10937 | 77 / 1468 |
| OVA_Ovary.arff | 1545 | 10937 | 198 / 1347 |
| OVA_Prostate.arff | 1545 | 10937 | 69 / 1476 |
| OVA_Uterus.arff | 1545 | 10937 | 124 / 1421 |
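The two tables are tied together by simple arithmetic: each AP dataset combines the samples of two tissues, and each OVA dataset sets one tissue against all the others. The following Python sketch re-derives the sample counts from the per-tissue totals in the "Num. Class1" column of the OVA table. It is an illustration only. The names `tissue_counts`, `ap_sizes`, and `ova_sizes` are made up for this example and are not part of GEMLeR, and the sketch makes no assumption about the order of the tissue names in the file names.

```python
from itertools import combinations

# Per-tissue sample counts, copied from the "Num. Class1" column of the OVA table.
tissue_counts = {
    "Breast": 344, "Colon": 286, "Endometrium": 61,
    "Kidney": 260, "Lung": 126, "Omentum": 77,
    "Ovary": 198, "Prostate": 69, "Uterus": 124,
}

# Every OVA dataset contains all samples.
total_samples = sum(tissue_counts.values())

# AP ("all-paired"): one dataset per unordered pair of tissues.
# Keys are frozensets so that lookups do not depend on tissue order.
ap_sizes = {
    frozenset(pair): tissue_counts[pair[0]] + tissue_counts[pair[1]]
    for pair in combinations(tissue_counts, 2)
}

# OVA ("one-versus-all"): (samples of the tissue, samples of all other tissues).
ova_sizes = {
    tissue: (count, total_samples - count)
    for tissue, count in tissue_counts.items()
}

print(len(ap_sizes), "AP datasets,", len(ova_sizes), "OVA datasets")
print("Breast vs. Colon:", ap_sizes[frozenset({"Breast", "Colon"})], "samples")
print("OVA Endometrium:", ova_sizes["Endometrium"])
```

A check of this kind can be useful after loading a file, to confirm that the number of rows read matches the "Num. Samples" column.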
GEMLeR Full Datasets
GEMLeR FULL v1.0 (Released 30th October 2008):
| GEMLeR_AP_CSV_full.zip (36 AP datasets, CSV format, 1.30 GB) |
| GEMLeR_OVA_CSV_full.zip (9 OVA datasets, CSV format, 1.46 GB) |
GEMLeR FULL contains all datasets at their original dimensionality (54681 probes).
You may run into problems with the full datasets on some non-64-bit operating systems with less than 3 GB of RAM, because the OVA datasets demand a lot of memory when building complex classifiers.
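To get a rough feel for the memory demand, the size of one dense copy of an OVA expression matrix can be estimated from the sample and probe counts given on this page. The sketch below assumes 8 bytes per value (64-bit floats) and ignores all overhead from the loader, the class labels, and the classifier itself, so real usage will be higher. The function name `dense_matrix_bytes` is hypothetical.

```python
def dense_matrix_bytes(n_samples: int, n_features: int, bytes_per_value: int = 8) -> int:
    """Bytes needed for one dense copy of the matrix, ignoring all overhead."""
    return n_samples * n_features * bytes_per_value


OVA_SAMPLES = 1545    # samples in every OVA dataset
SHORT_GENES = 10937   # genes in the short (v1.0) datasets
FULL_PROBES = 54681   # probes in the GEMLeR FULL datasets

for label, width in [("short", SHORT_GENES), ("full", FULL_PROBES)]:
    size_mib = dense_matrix_bytes(OVA_SAMPLES, width) / 2**20
    print(f"OVA {label}: about {size_mib:.0f} MiB per dense float64 copy")
```

Under these assumptions a single full OVA matrix takes on the order of 0.6 GiB, so a tool that holds a few working copies at once can plausibly exhaust the memory available to a 32-bit process.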
