ai4privacy/pii-masking-300k

The ai4privacy/pii-masking-300k dataset is an English, French, and German (EN, FR, DE) resource published by ai4privacy in 2024. It is tagged for text classification, token classification, and a range of other tasks. It has 4.2K downloads and 106 likes. Its license is listed as "other", and it falls in the 100K<n<1M size category (between 100,000 and 1,000,000 rows).

About ai4privacy/pii-masking-300k

👉 Looking for the newest release? The current flagship is ai4privacy/pii-masking-openpii-1.5m: 1.6M samples, 30 languages, 19 PII classes, Asia Pacific extension.

Details

Task: Text Classification, Token Classification, Table Question Answering, Question Answering, Zero Shot Classification, Summarization, Feature Extraction, Text Generation, Translation, Fill Mask, Tabular Classification, Tabular To Text, Table To Text, Text Retrieval, Other
Language: EN, FR, DE
Format: Parquet
Rows / instances: N/A
Size: 100K<n<1M
Creator: ai4privacy
Year: 2024
License: other
Downloads: 4186
Likes: 106
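
Since the details above give a repository id and a Parquet format, a minimal sketch of how the rows might be inspected with the Hugging Face `datasets` library follows. It assumes the `datasets` package is installed, that the repository is reachable under the id shown on this page, and that a split named `train` exists; none of these is confirmed here. The helper name `iter_rows` is hypothetical.

```python
from typing import Any, Dict, Iterator

DATASET_ID = "ai4privacy/pii-masking-300k"  # repository id taken from this page


def iter_rows(split: str = "train", limit: int = 3) -> Iterator[Dict[str, Any]]:
    """Yield up to `limit` rows, streaming instead of downloading everything.

    Assumptions: the `datasets` package is installed, the repository is
    reachable, and a split called `split` exists (the name "train" is a guess).
    """
    # Imported lazily so this module can be imported without the dependency.
    from datasets import load_dataset

    stream = load_dataset(DATASET_ID, split=split, streaming=True)
    for index, row in enumerate(stream):
        if index >= limit:
            break
        yield row


# Example (needs network access):
# for row in iter_rows(limit=1):
#     print(sorted(row.keys()))  # check the column names before relying on them
```

Streaming is used here because the row count is listed as N/A; printing the keys of the first row is a cheap way to find out which columns the dataset actually provides. Because the license is listed as "other", the terms on the dataset page should be checked before use.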
FAQ