Question 1

What is the jedibear/s2orc_full dataset?

Accepted Answer

S2ORC Full — Semantic Scholar Open Research Corpus

A complete redistribution of the S2ORC dataset in Parquet format on Hugging Face, containing 14.5 million academic papers with full text, structured metadata, and citation information.

...

Question 2

Is jedibear/s2orc_full a benchmark?

Accepted Answer

jedibear/s2orc_full is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download jedibear/s2orc_full?

Accepted Answer

jedibear/s2orc_full is available at its source: https://huggingface.co/datasets/jedibear/s2orc_full.

Question 4

What license is jedibear/s2orc_full released under?

Accepted Answer

jedibear/s2orc_full is distributed under the odc-by license.

jedibear/s2orc_full

About jedibear/s2orc_full

Details

Related Text Generation, Feature Extraction, Text Classification datasets

FAQ