common-voice / community-playbook Public

👥 📕 Mozilla Common Voice Community Playbook V1.2

  • Thank you to our authors, reviewers and maintainers of the community playbook without your support this playbook wouldn’t be possible!
  • Learn more about how we maintain the playbook and the Playbook’s License
  • Content coordination: Hillary Juma, Common Voice Community Manager

Mozilla Common Voice is an initiative to help teach machines how real people talk. Everyone can take part in Common Voice! You don’t need technical experience, though if you do, that’s great too!

Playbook Pages

Pages About Localization Translating project tools and material to be understood by contributors in their language Text Corpus Gathering, validating and processing public domain sentences Voice Corpus Recording and validating voice clips to create a public domain dataset Communities Connect with the variety of communities participating in Common Voice Mobilization Resources and tips for mobilizing your community CC0 Waiver Process How to secure a cc0 license for text corpus

What is Common Voice?

Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology for our machines. But to create voice systems, developers need an extremely large amount of voice data.

Most of the data used by large companies isn’t available to the majority of people. We think that stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open and accessible to everyone.

Language communities advance a language from not having a presence in Common Voice at all to being able to generate a functional Speech to text (STT) model which is able to understand how people speak. Once the Common Voice website is localized by a language community, the process involves the 5 following steps.

Collecting and validating public domain sentences Recording and validating the recordings of the sentences Repeating this process to grow the size of the data Generating a dataset and using machine learning to train speech-to-text models using this dataset


Our community playbook is a living document of our communities history and knowledge. After reading the playbook chapters, you will understand:

  • The goals and ethos of the Common Voice project
  • The journey of language onto Common Voice
  • How to set up and maintain a language community as part of Common Voice

As you read the playbook….

🔨 Make sure you check the required skills for each section and look for people who can fit.

💬 Check the channels section to learn how to set up your local forums and chat to communicate with other people in your language.

Each chapter covers; Purpose – Who we are – Success – How to join – What we do – Roles – Channels

TLDR: Too Long Don’t Read

To help you quickly navigate the playbook, this list provides you with possible information you would like to have and the associated content.

Participation Guidelines

Common Voice communities are governed by Mozilla’s code of conduct and etiquette guidelines, we take this very seriously and no violations are tolerated.

We encourage you to please read Mozilla Community Participation Guidelines before contributing to this project.

For more information on how to report violations of the Community Participation Guidelines, please read our ‘How to Report’ page.


Mozilla Foundation stewards the overall Common Voice project and is the ultimate decision-maker for its direction and goals. It also oversees the development of some tools and channels described here to support our communities.

Common Voice language communities are self-organized, and you don’t need to ask for permission to participate or mobilize any of these communities in your language. All the data generated by communities is published under open licences.

Some community roles exist formally and informally, and they all should follow the Mozilla leadership shared agreements.

The Common Voice Team at Mozilla Foundation, share weekly updates to the community on discourse.

Common Voice Communities 🗺

Common Voice has a variety of communities that support the project in different important areas, they are usually grouped by language.

Language Communities

👥 A language’s journey onto Common Voice is made possible with communities of multidisciplinary teams of committed people. Roles vary from no coding needed to organizing roles. Our community mobilization resource and Community page can connect you to resources and existing language communities that can support you.

Leave a Reply

Your email address will not be published.