TeraflopAI/SEC-EDGAR

Text Generation, Text Classification, EN

The TeraflopAI/SEC-EDGAR dataset is an English-language text generation and text classification resource released by TeraflopAI in 2025.

About TeraflopAI/SEC-EDGAR

Datamule, Teraflop AI, and Eventual collaborated to release the SEC-EDGAR dataset. The dataset contains 590 GB of data, comprising 8 million samples and 43 billion tokens drawn from all major filings in the SEC EDGAR database. The bulk data was collect...
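
To put the headline figures in perspective, the short sketch below derives rough per-sample averages from the three numbers quoted above. It assumes "GB" means decimal gigabytes (10^9 bytes) and treats the rounded totals as exact, so the results are order-of-magnitude estimates only. The function name `per_sample_averages` is a hypothetical helper, not part of any dataset tooling.

```python
# Totals quoted in the dataset description (rounded by the source).
TOTAL_BYTES = 590 * 10**9    # assumption: "GB" means decimal gigabytes
TOTAL_SAMPLES = 8 * 10**6    # "8 million samples"
TOTAL_TOKENS = 43 * 10**9    # "43 billion tokens"


def per_sample_averages(total_bytes, total_samples, total_tokens):
    """Return (mean tokens per sample, mean bytes per sample)."""
    return total_tokens / total_samples, total_bytes / total_samples


tokens_per_sample, bytes_per_sample = per_sample_averages(
    TOTAL_BYTES, TOTAL_SAMPLES, TOTAL_TOKENS
)

print(f"~{tokens_per_sample:,.0f} tokens per sample on average")
print(f"~{bytes_per_sample / 1000:,.1f} kB per sample on average")
```

Averages of this kind say nothing about the distribution: filing lengths may vary widely, so individual samples can be far shorter or longer than the mean.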

Details

Task: Text Generation, Text Classification
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: TeraflopAI
Year: 2025