[DRAFT] Workflow of Common Voice

This page is unofficial. Please refer to the forum and official website for accurate information.


  1. Sentence
    • Contributor: Who knows grammar
    • Tool: Sentence Collector
    • To do: Find, add and review public domain sentences
    • Goal: 1,800,000 sentences
  2. Voice
    • Contributor: Who can read a sentence
    • Tool: Common Voice
    • To do: Record and validate the voices
    • Goal: 2,000 hours of validated voice
  3. Training
    • Contributor: Engineer
    • Tool: Datasets
    • To do: Find the right data for the purpose, train the system
    • Goal: Create a system to understand speech
    • Prepare: Download datasets
  4. Working
    • User: Everyone
    • Tool: Application
    • To do: Enter a voice, or a sentence for the app to read
    • Goal: Depending on the user
    • Prepare: Choose an app

Sentence flow

  1. Find public domain texts
  2. Editing a sentence
    • ref. How-to on Sentence Collector
    • The easier to read and the shorter the better.
    • Remember, some people are not native speakers or have trouble reading out loud.
  3. Review a sentence
  4. Approved sentences are added to Common Voice
  5. Volunteer reads a sentence

Voice flow

  1. Datasets with additional clips are released
  2. User downloads the dataset
  3. Engineer uses dataset to train speech recognition system (or use it for other purposes)