Andrej Karpathy on X: "FineWeb-Edu: High quality LLM dataset filtering the original 15 trillion FineWeb tokens to 1.3 trillion of the highest (educational) quality, as judged by a Llama 3 70B..."