[DRAFT] Task Setting in Common Voice

This page is unofficial. Please refer to the forum and official website for accurate information.


Overview of Common Voice (roughly)
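Order | Contributing part | Work                  | How to             | Development (technical) part      | Note
------+-------------------+-----------------------+--------------------+-----------------------------------+-----
1     | Text corpus       | Find                  | Dataset material   |                                   |
      |                   | Add                   | Sentence Collector | Development of Sentence Collector |
      |                   | Review                |                    |                                   |
2     | Voice corpus      | Recording             | Web app            | Development of Web app            |
      |                   | Validation            |                    |                                   |
3     | Model training    | Training              | Datasets (Goal)    |                                   |
4     | Working           | Enter (voice or text) | Apps               |                                   |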

How to contribute

Task setting before dataset generation

  1. Add Language
  2. Translate Web App

Task Setting in Collecting Text

  1. Find
    1. What? (Know what to collect)
    2. Where? How?
    3. If found: [Important] Check the rights status of the text (ja)
    4. How to add
  2. Add

    Memo

  3. Review

    Memo

    • Review decisions are kept when we move to the previous or next page
    • What to do with a sentence we can't decide on?: We can ignore it

    Question

    • What happens to a rejected sentence?
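
    Sketch: pre-checking sentences before adding them

    The Find and Add steps above could be supported by a small local check that runs before sentences are submitted for review. The sketch below is only an illustration: the function name `precheck`, the 14-word limit, and the "no digits" rule are assumptions made for this example, not the project's official rules, and counting words by whitespace only suits languages written with spaces (Japanese, for example, would need a different length measure).

```python
import re

# Hypothetical limit chosen for illustration; the real review rules are
# defined per language by the Common Voice project.
MAX_WORDS = 14
DIGITS = re.compile(r"\d")


def precheck(sentences, max_words=MAX_WORDS):
    """Split candidate sentences into (kept, flagged) lists.

    A sentence is flagged when it is empty, a duplicate, too long,
    or contains digits (which can be read aloud in more than one way).
    """
    seen = set()
    kept, flagged = [], []
    for raw in sentences:
        sentence = " ".join(raw.split())  # collapse runs of whitespace
        if not sentence:
            flagged.append((raw, "empty"))
        elif sentence in seen:
            flagged.append((raw, "duplicate"))
        elif len(sentence.split()) > max_words:
            flagged.append((raw, "too long"))
        elif DIGITS.search(sentence):
            flagged.append((raw, "contains digits"))
        else:
            kept.append(sentence)
        seen.add(sentence)
    return kept, flagged
```

    Flagged sentences are returned with a reason instead of being dropped silently, so a person can still decide whether to rewrite them (for example, spelling out a number) before adding them.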

Task Setting in Collecting Voice

  1. Recording
    1. Prepare an audio input device (microphone, etc.)
    2. Read the sentence
    3. Check that the voice was recorded
    4. Submit clips

    Memo

    • What should we pay attention to when recording?: See "How to record and validate in Common Voice" (Japanese page)
    • What to do with a sentence we can't or don't want to read?: Skip button
    • What to do with an inappropriate sentence?: Report button
    • We can submit fewer than 5 clips
    • We can delete clips partway through a recording session to end it early

    Question

    • How to deal with a microphone that doesn't work (can't input voice)?
  2. Validate
    1. First, listen to the voice without looking at the sentence
    2. Then look at the sentence and make sure it matches the voice

    Memo

    • What to do with a clip we can't decide on?: Skip button
    • What to do with inappropriate voices and sentences?: Report button
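
    Sketch: checking that a microphone captured sound

    Recording step 3 (check that the voice was recorded) and the open question about a microphone that doesn't work can be tested outside the web app. The sketch below assumes a test clip saved locally as a 16-bit PCM WAV file, which is not how the web app itself stores clips. The function names and the 0.01 threshold are hypothetical choices for this example.

```python
import array
import sys
import wave


def peak_level(path):
    """Return the peak amplitude of a 16-bit PCM WAV file as a 0.0-1.0 fraction."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        frames = wav.readframes(wav.getnframes())
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian
    if not samples:
        return 0.0
    return max(abs(s) for s in samples) / 32768.0


def looks_recorded(path, threshold=0.01):
    """Guess whether the clip contains anything louder than near-silence.

    The threshold is an arbitrary illustrative value, not a project setting.
    """
    return peak_level(path) >= threshold
```

    A clip whose peak level stays near zero usually means the wrong input device is selected or the microphone is muted. A clip that passes this check can still be too noisy or too quiet, so listening to it remains the real test.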