bradfordlevy/BeanCounter
General NLPEnglish
Created by bradfordlevy at 2024, the bradfordlevy/BeanCounter is a General NLP dataset in English in Parquet format. With 16.8K downloads and 5 likes, it is actively used by the community and is a 10M<n<100M-scale dataset.
About bradfordlevy/BeanCounter
🫘🧮 BeanCounter
Datset Summary
BeanCounter is a low-toxicity, large-scale, and open dataset of business-oriented text. See Wang and Levy (2024) for details of the data collection, analysis, and some explorations of using the data for ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10M<n<100M
- Creator
- bradfordlevy
- Year
- 2024
- Downloads
- 16782
- Likes
- 5