Skip to content

bradfordlevy/BeanCounter

General NLPEnglish

Created by bradfordlevy at 2024, the bradfordlevy/BeanCounter is a General NLP dataset in English in Parquet format. With 16.8K downloads and 5 likes, it is actively used by the community and is a 10M<n<100M-scale dataset.

About bradfordlevy/BeanCounter

🫘🧮 BeanCounter Datset Summary BeanCounter is a low-toxicity, large-scale, and open dataset of business-oriented text. See Wang and Levy (2024) for details of the data collection, analysis, and some explorations of using the data for ...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
10M<n<100M
Creator
bradfordlevy
Year
2024
Downloads
16782
Likes
5
Download Homepage

Related General NLP datasets

FAQ