Copyright Issues in Japanese language Collector

Japanese language collector have the following problems:

Perhaps this is a problem with the corpus.

I went to the source page and checked the "Public Domain version" and it contains the above text. These sources are famous cartoons and games, and they are obviously not in the public domain. The "Public Domain version" file has a [Manga] flag, but some of the sentences are not. Honestly, I can't determine how much of the offending text is in the mix.

Three Non-Public Domain Sources

Japanese language collection, again.

Are the reviewers, users Rrock9312 and Rrock2139 the same person?


I checked common-voice/sentence-collector.json. Probably the current Japanese source text isn't all in the public domain. That's a shame.


This is insane, and a betrayal to the people.