{"regular": {"description": "Given a partial description like \"she opened the hood of the car,\"\nhumans can reason about the situation and anticipate what might come\nnext (\"then, she examined the engine\"). SWAG (Situations With Adversarial Generations)\nis a large-scale dataset for this task of grounded commonsense\ninference, unifying natural language inference and physically grounded reasoning.\n\nThe dataset consists of 113k multiple choice questions about grounded situations\n(73k training, 20k validation, 20k test).\nEach question is a video caption from LSMDC or ActivityNet Captions,\nwith four answer choices about what might happen next in the scene.\nThe correct answer is the (real) video caption for the next event in the video;\nthe three incorrect answers are adversarially generated and human verified,\nso as to fool machines but not humans. SWAG aims to be a benchmark for\nevaluating grounded commonsense NLI and for learning representations.\n\nThe full data contain more information,\nbut the regular configuration will be more interesting for modeling\n(note that the regular data are shuffled). The test set for leaderboard submission\nis under the regular configuration.\n", "citation": "@inproceedings{zellers2018swagaf,\n    title={SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference},\n    author={Zellers, Rowan and Bisk, Yonatan and Schwartz, Roy and Choi, Yejin},\n    booktitle = \"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)\",\n    year={2018}\n}\n", "homepage": "https://rowanzellers.com/swag/", "license": "Unknown", "features": {"video-id": {"dtype": "string", "id": null, "_type": "Value"}, "fold-ind": {"dtype": "string", "id": null, "_type": "Value"}, "startphrase": {"dtype": "string", "id": null, "_type": "Value"}, "sent1": {"dtype": "string", "id": null, "_type": "Value"}, "sent2": {"dtype": "string", "id": null, "_type": "Value"}, "gold-source": {"dtype": "string", "id": null, "_type": "Value"}, "ending0": {"dtype": "string", "id": null, "_type": "Value"}, "ending1": {"dtype": "string", "id": null, "_type": "Value"}, "ending2": {"dtype": "string", "id": null, "_type": "Value"}, "ending3": {"dtype": "string", "id": null, "_type": "Value"}, "label": {"num_classes": 4, "names": ["0", "1", "2", "3"], "names_file": null, "id": null, "_type": "ClassLabel"}}, "post_processed": null, "supervised_keys": null, "builder_name": "swag", "config_name": "regular", "version": {"version_str": "0.0.0", "description": null, "major": 0, "minor": 0, "patch": 0}, "splits": {"train": {"name": "train", "num_bytes": 30274672, "num_examples": 73546, "dataset_name": "swag"}, "validation": {"name": "validation", "num_bytes": 8451771, "num_examples": 20006, "dataset_name": "swag"}, "test": {"name": "test", "num_bytes": 8417644, "num_examples": 20005, "dataset_name": "swag"}}, "download_checksums": {"https://raw.githubusercontent.com/rowanz/swagaf/master/data/train.csv": {"num_bytes": 28243333, "checksum": "5748b51126ac255c5a6f26e1ba473b51116d6c822aeb25e63ecba282c9d0e610"}, 
"https://raw.githubusercontent.com/rowanz/swagaf/master/data/val.csv": {"num_bytes": 7893588, "checksum": "c0497b2cd7f3e6b7df995524b1853f62285d60d110d659b19545ca80b2903234"}, "https://raw.githubusercontent.com/rowanz/swagaf/master/data/test.csv": {"num_bytes": 7817885, "checksum": "a689a1a4e892a65ca625c1f0fcf77bcce004b59ad1caeb134ca5ec080a711cb6"}}, "download_size": 43954806, "post_processing_size": null, "dataset_size": 47144087, "size_in_bytes": 91098893}, "full": {"description": "Given a partial description like \"she opened the hood of the car,\"\nhumans can reason about the situation and anticipate what might come\nnext (\"then, she examined the engine\"). SWAG (Situations With Adversarial Generations)\nis a large-scale dataset for this task of grounded commonsense\ninference, unifying natural language inference and physically grounded reasoning.\n\nThe dataset consists of 113k multiple choice questions about grounded situations\n(73k training, 20k validation, 20k test).\nEach question is a video caption from LSMDC or ActivityNet Captions,\nwith four answer choices about what might happen next in the scene.\nThe correct answer is the (real) video caption for the next event in the video;\nthe three incorrect answers are adversarially generated and human verified,\nso as to fool machines but not humans. SWAG aims to be a benchmark for\nevaluating grounded commonsense NLI and for learning representations.\n\nThe full data contain more information,\nbut the regular configuration will be more interesting for modeling\n(note that the regular data are shuffled). The test set for leaderboard submission\nis under the regular configuration.\n", "citation": "@inproceedings{zellers2018swagaf,\n    title={SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference},\n    author={Zellers, Rowan and Bisk, Yonatan and Schwartz, Roy and Choi, Yejin},\n    booktitle = \"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)\",\n    year={2018}\n}\n", "homepage": "https://rowanzellers.com/swag/", "license": "Unknown", "features": {"video-id": {"dtype": "string", "id": null, "_type": "Value"}, "fold-ind": {"dtype": "string", "id": null, "_type": "Value"}, "startphrase": {"dtype": "string", "id": null, "_type": "Value"}, "gold-ending": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-0": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-1": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-2": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-3": {"dtype": "string", "id": null, "_type": "Value"}, "gold-source": {"dtype": "string", "id": null, "_type": "Value"}, "gold-type": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-0-type": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-1-type": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-2-type": {"dtype": "string", "id": null, "_type": "Value"}, "distractor-3-type": {"dtype": "string", "id": null, "_type": "Value"}, "sent1": {"dtype": "string", "id": null, "_type": "Value"}, "sent2": {"dtype": "string", "id": null, "_type": "Value"}}, "post_processed": null, "supervised_keys": null, "builder_name": "swag", "config_name": "full", "version": "0.0.0", "splits": {"train": {"name": "train", "num_bytes": 34941649, "num_examples": 73546, "dataset_name": "swag"}, "validation": {"name": "validation", "num_bytes": 9832603, "num_examples": 20006, "dataset_name": "swag"}}, "download_checksums": 
{"https://raw.githubusercontent.com/rowanz/swagaf/master/data/train_full.csv": {"num_bytes": 31608559, "checksum": "2353de255a79d4e699f478a42454758062d9d36aac75a4035948915877e1a248"}, "https://raw.githubusercontent.com/rowanz/swagaf/master/data/val_full.csv": {"num_bytes": 8929065, "checksum": "59f4905390446352ffbdbb1ebcd88ae790df91fd59661c626eeddd7a4b184502"}}, "download_size": 40537624, "post_processing_size": null, "dataset_size": 44774252, "size_in_bytes": 85311876}}