keithito/lj_speech
Automatic Speech RecognitionText To SpeechText To AudioEN
Created by keithito at 2022, the keithito/lj_speech is a automatic speech recognition dataset in EN in Parquet format.
About keithito/lj_speech
This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading
passages from 7 non-fiction books in English. A transcription is provided for each clip. Clips vary in length
from 1 to 10 seconds and have a...
Details
- Task
- Automatic Speech Recognition, Text To Speech, Text To Audio
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- keithito
- Year
- 2022