speech-recognition
Here are 2,799 public repositories matching this topic...
- Updated Mar 9, 2022 - C++
- Updated May 3, 2022 - Shell
- Updated Apr 10, 2022 - HTML
Fedora & apt-get
Specs
- Leon version: latest
- OS (or browser) version: Fedora 30
- Node.js version: 10.16.3
- Complete "npm run check" output:
➡ Here is the diagnosis about your current setup
✔ Run
✔ Run modules
✔ Reply you by texting
❗ Amazon Polly text-to-speech
❗ Google Cloud text-to-speech
❗ Watson text-to-speech
❗ Offline text-to-speech
❗ Google Cloud speech-to-text
❗ Watson spee
- Updated Mar 26, 2021 - JavaScript
- Updated Apr 20, 2022 - Python
- Updated Mar 25, 2022 - C++
- Updated Apr 28, 2022 - Python
- Updated May 2, 2022 - Python
- Updated May 3, 2022 - Jupyter Notebook
- Updated May 3, 2022 - Python
- Updated Feb 22, 2022 - C
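Polyphonic characters are currently handled with pypinyin or g2pM, which have limited accuracy. We would like to build a polyphone prediction model based on BERT (or ERNIE). Put simply: suppose a language has 100 polyphonic characters, each with at most 3 pronunciations. You can then attach 100 three-way classifiers (simple fc layers are enough) after BERT, and at prediction time look up the classifier for the character in question and classify with it.
Reference paper:
tencent_polyphone.pdf
For data, you can use the data provided by https://github.com/kakaobrain/g2pM
Advanced: multi-task BERT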
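
The "one small classifier per polyphonic character" idea can be sketched as follows. This is only an illustration under stated assumptions, not the project's implementation: the encoder output is replaced by a random placeholder vector, the weights are untrained, and the names `make_head`, `predict_pronunciation`, `HIDDEN_SIZE`, and `polyphone_id` are hypothetical. It shows only the routing step, where the head matching the character is picked before classifying.

```python
import random

HIDDEN_SIZE = 8          # stand-in for the encoder hidden size (assumption; BERT-base uses 768)
NUM_POLYPHONES = 100     # number of polyphonic characters, as in the example above
MAX_PRONUNCIATIONS = 3   # each character has at most 3 pronunciations

rng = random.Random(0)   # fixed seed so the sketch is reproducible


def make_head():
    """Build one fc layer: a weight row and a bias per pronunciation class."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(HIDDEN_SIZE)]
               for _ in range(MAX_PRONUNCIATIONS)]
    biases = [0.0] * MAX_PRONUNCIATIONS
    return weights, biases


# One independent (untrained) classifier per polyphonic character.
heads = [make_head() for _ in range(NUM_POLYPHONES)]


def predict_pronunciation(hidden_state, polyphone_id):
    """Select the head for this character and return the argmax class index."""
    weights, biases = heads[polyphone_id]
    logits = [sum(w * h for w, h in zip(row, hidden_state)) + b
              for row, b in zip(weights, biases)]
    return max(range(MAX_PRONUNCIATIONS), key=logits.__getitem__)


# Placeholder for the encoder output at the position of the polyphonic character.
hidden_state = [rng.uniform(-1.0, 1.0) for _ in range(HIDDEN_SIZE)]
pred = predict_pronunciation(hidden_state, polyphone_id=42)
```

In a real model the placeholder vector would come from BERT or ERNIE, and the heads would be trained jointly with the encoder. Characters with fewer than 3 pronunciations would also need their unused classes masked out.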


Several tokenizers currently have no associated tests. I think that adding the test file for one of these tokenizers could be a very good way to make a first contribution to transformers.
Tokenizers concerned
not yet claimed
none
claimed