# Datasets

| Task           | Format     | Dataset                                                                                                                            | Train (X, y)               | Validation (X, y)          | Test (X, y)                | Total size |
| -              | -          | -                                                                                                                                  | -                          | -                          | -                          | -          |
| Classification | libsvm     | [a1a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a1a)                                                     | (1605, 119), (1605,)       |                            | (30956, 123), (30956,)     |   2MiB     |
| Classification | libsvm     | [a2a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a2a)                                                     | (2265, 119), (2265,)       |                            | (30296, 123), (30296,)     |   2MiB     |
| Classification | libsvm     | [a3a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a3a)                                                     | (3185, 122), (3185,)       |                            | (29376, 123), (29376,)     |   2MiB     |
| Classification | libsvm     | [a4a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a4a)                                                     | (4781, 122), (4781,)       |                            | (27780, 123), (27780,)     |   2MiB     |
| Classification | libsvm     | [a5a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a5a)                                                     | (6414, 122), (6414,)       |                            | (26147, 123), (26147,)     |   2MiB     |
| Classification | libsvm     | [a6a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a6a)                                                     | (11220, 122), (11220,)     |                            | (21341, 123), (21341,)     |   2MiB     |
| Classification | libsvm     | [a7a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a7a)                                                     | (16100, 122), (16100,)     |                            | (16461, 123), (16461,)     |   2MiB     |
| Classification | libsvm     | [a8a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a8a)                                                     | (22696, 123), (22696,)     |                            | (9865, 122), (9865,)       |   2MiB     |
| Classification | libsvm     | [a9a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#a9a)                                                     | (32561, 123), (32561,)     |                            | (16281, 122), (16281,)     |   3MiB     |
| Classification | libsvm     | [australian](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#australian)                                       | (690, 14), (690,)          |                            |                            | 113KiB     |
| Classification | libsvm     | [australian_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#australian)                                 | (690, 14), (690,)          |                            |                            |  69KiB     |
| Classification | libsvm     | [breast-cancer](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#breast-cancer)                                 | (683, 10), (683,)          |                            |                            |  85KiB     |
| Classification | libsvm     | [breast-cancer_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#breast-cancer)                           | (683, 10), (683,)          |                            |                            |  85KiB     |
| Classification | libsvm     | [cod-rna](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#cod-rna)                                             | (59535, 8), (59535,)       | (271617, 8), (271617,)     | (157413, 8), (157413,)     |  37MiB     |
| Classification | libsvm     | [colon-cancer](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#colon-cancer)                                   | (62, 2000), (62,)          |                            |                            |   2MiB     |
| Classification | libsvm     | [covtype.binary](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#covtype.binary)                               | (581012, 54), (581012,)    |                            |                            |  86MiB     |
| Classification | libsvm     | [covtype.binary.scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#covtype.binary)                         | (581012, 54), (581012,)    |                            |                            |  68MiB     |
| Classification | libsvm     | [diabetes](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#diabetes)                                           | (768, 8), (768,)           |                            |                            |  74KiB     |
| Classification | libsvm     | [diabetes_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#diabetes)                                     | (768, 8), (768,)           |                            |                            |  67KiB     |
| Classification | libsvm     | [duke-breast-cancer](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#duke%20breast-cancer)                     | (38, 7129), (38,)          | (4, 7129), (4,)            |                            |   4MiB     |
| Classification | libsvm     | [fourclass](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#fourclass)                                         | (862, 2), (862,)           |                            |                            |  24KiB     |
| Classification | libsvm     | [fourclass_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#fourclass)                                   | (862, 2), (862,)           |                            |                            |  23KiB     |
| Classification | libsvm     | [german.numer](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#german.numer)                                   | (1000, 24), (1000,)        |                            |                            | 279KiB     |
| Classification | libsvm     | [german.numer_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#german.numer)                             | (1000, 24), (1000,)        |                            |                            | 162KiB     |
| Classification | libsvm     | [gisette](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#gisette)                                             | (6000, 5000), (6000,)      |                            | (1000, 5000), (1000,)      | 283MiB     |
| Classification | libsvm     | [heart](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#heart)                                                 | (270, 13), (270,)          |                            |                            |  42KiB     |
| Classification | libsvm     | [heart_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#heart)                                           | (270, 13), (270,)          |                            |                            |  27KiB     |
| Classification | libsvm     | [ijcnn1](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#ijcnn1)                                               | (35000, 22), (35000,)      | (14990, 22), (14990,)      | (91701, 22), (91701,)      |  21MiB     |
| Classification | libsvm     | [ionosphere](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#ionosphere)                                       | (351, 34), (351,)          |                            |                            | 100KiB     |
| Classification | libsvm     | [leukemia](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#leukemia)                                           | (38, 7129), (38,)          |                            | (34, 7129), (34,)          |   7MiB     |
| Classification | libsvm     | [madelon](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#madelon)                                             | (2000, 500), (2000,)       |                            | (600, 500), (600,)         |  10MiB     |
| Classification | libsvm     | [mushrooms](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#mushrooms)                                         | (8124, 112), (8124,)       |                            |                            | 859KiB     |
| Classification | libsvm     | [news20.binary](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#news20.binary)                                 | (19996, 1355191), (19996,) |                            |                            | 134MiB     |
| Classification | libsvm     | [phishing](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#phishing)                                           | (11055, 68), (11055,)      |                            |                            |   3MiB     |
| Classification | libsvm     | [rcv1.binary](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#rcv1.binary)                                     | (20242, 47236), (20242,)   |                            | (677399, 47236), (677399,) |   1GiB     |
| Classification | libsvm     | [real-sim](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#real-sim)                                           | (72309, 20958), (72309,)   |                            |                            |  86MiB     |
| Classification | libsvm     | [skin_nonskin](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#skin_nonskin)                                   | (245057, 3), (245057,)     |                            |                            |   4MiB     |
| Classification | libsvm     | [splice](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#splice)                                               | (1000, 60), (1000,)        |                            | (2175, 60), (2175,)        |   2MiB     |
| Classification | libsvm     | [sonar](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#sonar)                                                 | (208, 60), (208,)          |                            |                            | 152KiB     |
| Classification | libsvm     | [svmguide1](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#svmguide1)                                         | (3089, 4), (3089,)         |                            | (4000, 4), (4000,)         | 432KiB     |
| Classification | libsvm     | [svmguide3](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#svmguide3)                                         | (1243, 22), (1243,)        |                            | (41, 22), (41,)            | 308KiB     |
| Classification | libsvm     | [w1a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w1a)                                                     | (2477, 300), (2477,)       |                            | (47272, 300), (47272,)     |   3MiB     |
| Classification | libsvm     | [w2a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w2a)                                                     | (3470, 300), (3470,)       |                            | (46279, 300), (46279,)     |   3MiB     |
| Classification | libsvm     | [w3a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w3a)                                                     | (4912, 300), (4912,)       |                            | (44837, 300), (44837,)     |   3MiB     |
| Classification | libsvm     | [w4a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w4a)                                                     | (7366, 300), (7366,)       |                            | (42383, 300), (42383,)     |   3MiB     |
| Classification | libsvm     | [w5a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w5a)                                                     | (9888, 300), (9888,)       |                            | (39861, 300), (39861,)     |   3MiB     |
| Classification | libsvm     | [w6a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w6a)                                                     | (17188, 300), (17188,)     |                            | (32561, 300), (32561,)     |   3MiB     |
| Classification | libsvm     | [w7a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w7a)                                                     | (24692, 300), (24692,)     |                            | (25057, 300), (25057,)     |   3MiB     |
| Classification | libsvm     | [w8a](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#w8a)                                                     | (49749, 300), (49749,)     |                            | (14951, 300), (14951,)     |   4MiB     |
| Regression     | libsvm     | [abalone](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#abalone)                                             | (4177, 8), (4177,)         |                            |                            | 253KiB     |
| Regression     | libsvm     | [abalone_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#abalone)                                       | (4177, 8), (4177,)         |                            |                            | 362KiB     |
| Regression     | libsvm     | [bodyfat](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#bodyfat)                                             | (252, 14), (252,)          |                            |                            |  28KiB     |
| Regression     | libsvm     | [bodyfat_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#bodyfat)                                       | (252, 14), (252,)          |                            |                            |  43KiB     |
| Regression     | libsvm     | [cadata](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#cadata)                                               | (20640, 8), (20640,)       |                            |                            |   5MiB     |
| Regression     | libsvm     | [cpusmall](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#cpusmall)                                           | (8192, 12), (8192,)        |                            |                            | 684KiB     |
| Regression     | libsvm     | [cpusmall_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#cpusmall)                                     | (8192, 12), (8192,)        |                            |                            |   1MiB     |
| Regression     | libsvm     | [E2006-log1p](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#E2006-log1p)                                     | (16087, 4272227), (16087,) |                            | (3308, 4272226), (3308,)   |   3GiB     |
| Regression     | libsvm     | [E2006-E2006-tfidf](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#E2006-tfidf)                               | (16087, 150360), (16087,)  |                            | (3308, 150358), (3308,)    | 596MiB     |
| Regression     | libsvm     | [eunite2001](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#eunite2001)                                       | (336, 16), (336,)          |                            | (31, 16), (31,)            |  34KiB     |
| Regression     | libsvm     | [mg](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#mg)                                                       | (1385, 6), (1385,)         |                            |                            | 149KiB     |
| Regression     | libsvm     | [mg_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#mg)                                                 | (1385, 6), (1385,)         |                            |                            | 105KiB     |
| Regression     | libsvm     | [mpg](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#mpg)                                                     | (392, 7), (392,)           |                            |                            |  19KiB     |
| Regression     | libsvm     | [mpg_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#mpg)                                               | (392, 7), (392,)           |                            |                            |  27KiB     |
| Regression     | libsvm     | [pyrim](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#pyrim)                                                 | (74, 27), (74,)            |                            |                            |  17KiB     |
| Regression     | libsvm     | [pyrim_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#pyrim)                                           | (74, 27), (74,)            |                            |                            |  11KiB     |
| Regression     | libsvm     | [space_ga](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#space_ga)                                           | (3107, 6), (3107,)         |                            |                            | 552KiB     |
| Regression     | libsvm     | [space_ga_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#space_ga)                                     | (3107, 6), (3107,)         |                            |                            | 246KiB     |
| Regression     | libsvm     | [triazines](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#triazines)                                         | (186, 60), (186,)          |                            |                            |  98KiB     |
| Regression     | libsvm     | [triazines_scale](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#triazines)                                   | (186, 60), (186,)          |                            |                            |  62KiB     |
| Regression     | libsvm     | [YearPredictionMSD](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#YearPredictionMSD)                         | (463715, 90), (463715,)    |                            | (51630, 90), (51630,)      | 601MiB     |
| Regression     | uci        | [yacht](https://archive.ics.uci.edu/ml/datasets/Yacht+Hydrodynamics)                                                               | (308, 6), (308,)           |                            |                            |  11KiB     |
| Regression     | skl        | [boston-housing](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_boston.html)                              | (506, 13), (506,)          |                            |                            |   0B       |
| Regression     | skl        | [california-housing](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.fetch_california_housing.html)             | (20640, 8), (20640,)       |                            |                            |   0B       |
| Regression     | uci        | [concrete](https://archive.ics.uci.edu/ml/datasets/Concrete+Compressive+Strength)                                                  | (1030, 8), (1030,)         |                            |                            | 122KiB     |
| Regression     | uci        | [energy](https://archive.ics.uci.edu/ml/datasets/Energy+efficiency)                                                                | (768, 8), (768,)           |                            |                            |  74KiB     |
| Regression     | uci        | [naval-propulsion](https://archive.ics.uci.edu/ml/datasets/Condition+Based+Maintenance+of+Naval+Propulsion+Plants)                 | (11934, 16), (11934,)      |                            |                            |   3MiB     |
| Regression     | uci        | [power-plant](https://archive.ics.uci.edu/ml/datasets/Combined+Cycle+Power+Plant)                                                  | (9568, 4), (9568,)         |                            |                            |   4MiB     |
| Classification | skl        | [digits](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_digits.html)                                      | (1797, 64), (1797,)        |                            |                            |   0B       |
| Clustering     | other      | [faithful](http://www.stat.cmu.edu/~larry/all-of-statistics/=data/faithful.dat)                                                    | (272, 2)                   |                            |                            |   6KiB     |
| Recommendation | other      | [movielens-100k](https://files.grouplens.org/datasets/movielens/ml-100k.zip)                                                       | (944, 1683)                |                            |                            |  15MiB     |
| Recommendation | other      | [movielens-1m](https://files.grouplens.org/datasets/movielens/ml-1m.zip)                                                           | (6041, 3953)               |                            |                            |  24MiB     |
