Last updated: 2023-03-23 (Thu) 15:55:59 (400d)

OpenWebText

An open clone of the scraper for OpenAI's unreleased WebText dataset, the dataset used to train GPT-2.

https://github.com/jcpeterson/openwebtext