This page is unofficial. Please refer to the forum and official website for accurate information.
Sentence
- Contributor: Who knows grammar
- Tool: Sentence Collector
- To do: Find, add and review public domain sentences
- Goal: 1,800,000 sentences
Voice
- Contributor: Who can read a sentence
- Tool: Common Voice
- To do: Record and validate the voices
- Goal: 2,000 hours of validated voice
Training
- Contributor: Engineer
- Tool: Datasets
- To do: Find the right data for the purpose, train the system
- Goal: Create a system to understand speech
- Prepare: Download datasets
Working
- User: Everyone
- Tool: Application
- To do: Enter a voice, or a sentence for the app to read
- Goal: Depending on the user
- The end user of the dataset is the engineers who train the speech recognition system, although the final end user is everyone (or the system's target population).
- The origin of the dataset - voice recording - is that of sentence collection.
- ref: