# speech-recognition
Here are 2,171 public repositories matching this topic...
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
machine-learning
embedded
deep-learning
offline
tensorflow
speech-recognition
neural-networks
speech-to-text
deepspeech
on-device
-
Updated
Jun 27, 2021 - C++
kaldi-asr/kaldi is the official location of the Kaldi project.
shell
c-plus-plus
cuda
speech
speech-recognition
speech-to-text
kaldi
speaker-verification
speaker-id
-
Updated
Jul 1, 2021 - Shell
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
machine-learning
natural-language-processing
deep-neural-networks
reinforcement-learning
computer-vision
deep-learning
optimization
machine-translation
deep-reinforcement-learning
medical-imaging
speech-recognition
artificial-neural-networks
pattern-recognition
probabilistic-graphical-models
bayesian-statistics
artificial-intelligence-algorithms
visual-recognition
graph-neural-networks
-
Updated
May 21, 2021
Open
Fedora & apt-get
2
AsterYujano
commented
Oct 5, 2019
Specs
- Leon version: latest
- OS (or browser) version: Fedora 30
- Node.js version: 10.16.3
- Complete "npm run check" output:
➡ Here is the diagnosis about your current setup
✔ Run
✔ Run modules
✔ Reply you by texting
❗ Amazon Polly text-to-speech
❗ Google Cloud text-to-speech
❗ Watson text-to-speech
❗ Offline text-to-speech
❗ Google Cloud speech-to-text
❗ Watson spee
-
Updated
Mar 26, 2021 - JavaScript
Facebook AI Research's Automatic Speech Recognition Toolkit
-
Updated
Jul 1, 2021 - C++
Speech recognition module for Python, supporting several engines and APIs, online and offline.
-
Updated
Feb 28, 2021 - Python
A Deep-Learning-Based Chinese Speech Recognition System
-
Updated
May 16, 2021 - Python
End-to-End Speech Processing Toolkit
deep-learning
chainer
end-to-end
machine-translation
pytorch
speech-synthesis
speech-recognition
kaldi
voice-conversion
speech-separation
speech-enhancement
speech-translation
-
Updated
Jul 4, 2021 - Python
NeMo: a toolkit for conversational AI
nlp
text-to-speech
deep-learning
neural-network
machine-translation
speech-synthesis
speech-recognition
speech-to-text
nmt
nlp-machine-learning
-
Updated
Jul 4, 2021 - Jupyter Notebook
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
-
Updated
Jan 8, 2021 - C
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
audio
deep-learning
tensorflow
paper
end-to-end
evaluation
cnn
lstm
speech-recognition
rnn
automatic-speech-recognition
feature-vector
data-preprocessing
phonemes
timit-dataset
layer-normalization
rnn-encoder-decoder
chinese-speech-recognition
-
Updated
May 25, 2021 - Python
A PyTorch-based Speech Toolkit
audio
transformers
pytorch
voice-recognition
speech-recognition
speech-to-text
language-model
speaker-recognition
speaker-verification
speech-processing
audio-processing
asr
speaker-diarization
speechrecognition
speech-separation
speech-enhancement
spoken-language-understanding
huggingface
speech-toolkit
speechbrain
-
Updated
Jul 4, 2021 - Python
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
-
Updated
Jul 4, 2021 - Python
-
Updated
Nov 20, 2018 - Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
deep-neural-networks
deep-learning
speech
dnn
pytorch
recurrent-neural-networks
lstm
gru
speech-recognition
rnn
kaldi
rnn-model
asr
lstm-neural-networks
multilayer-perceptron-network
timit
dnn-hmm
-
Updated
Mar 15, 2021 - Python
3
nshmyrev
commented
Aug 4, 2020
One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
roadmap
neural-network
cnn
dnn
tts
speech-synthesis
speech-recognition
rnn
seq2seq
automatic-speech-recognition
papers
language-model
attention-mechanism
speaker-verification
timit-dataset
acoustic-model
recognition-synthesis
-
Updated
Jun 30, 2021
Machine Learning Resources, Practice and Research
-
Updated
Apr 11, 2021 - Python
An attempt at tracking the state of the art and recent results (bibliography) in speech recognition.
-
Updated
Dec 21, 2020
-
Updated
Mar 3, 2020 - Python
Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word/sound, and once trained, Alias can take control of your home assistant by activating it for you.
raspberry-pi
machine-learning
hack
smarthome
microphone
speech-recognition
classification
alias
sound-synthesis
wakeword
-
Updated
Apr 5, 2020 - Python
Kalliope is a framework that will help you to create your own personal assistant.
linux
bot
home-automation
speech-synthesis
speech-recognition
personal-assistant
bot-creation
raspberry
speech-to-text
jarvis
-
Updated
Jun 11, 2021 - Python
DELTA is a deep learning based natural language and speech processing platform.
nlp
front-end
ops
deep-learning
text-classification
tensorflow
nlu
speech
inference
text-generation
speech-recognition
seq2seq
sequence-to-sequence
speaker-verification
asr
tensorflow-serving
emotion-recognition
custom-ops
serving
tensorflow-lite
-
Updated
Apr 16, 2021 - Python
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
text-to-speech
deep-learning
tensorflow
multi-node
speech-synthesis
speech-recognition
seq2seq
speech-to-text
neural-machine-translation
sequence-to-sequence
language-model
multi-gpu
float16
mixed-precision
-
Updated
May 11, 2021 - Python
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
swift
machine-learning
natural-language-processing
computer-vision
deep-learning
neural-network
artificial-intelligence
speech-recognition
gpgpu
awesome-list
-
Updated
Jul 30, 2018
A PaddlePaddle Speech to Text toolkit.
speech
transformer
speech-recognition
speech-to-text
conformer
deepspeech
ngram-language-model
ctc-decode
mandarin-language
-
Updated
Jul 3, 2021 - Jupyter Notebook
Open-Source Large Vocabulary Continuous Speech Recognition Engine
-
Updated
Jun 6, 2021 - C
deepxuexi
commented
Jun 25, 2019
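A very convenient Python recording program, tailor-made for MASR:
Press Enter to start recording, then press Enter again when you have finished speaking to stop recording and display the recognition result. The recording is saved under a file name taken from the recognized text, which makes it easy to compute the recognition accuracy later.
Code:
https://github.com/deepxuexi/ARFASR
If you find it useful, please give me a star. Thanks!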
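The naming scheme this comment describes (saving each recording under its recognized text) could be sketched roughly as follows. This is not taken from the ARFASR code; the helper name `transcript_to_filename`, the `.wav` extension, and the rule for handling repeated transcripts are all assumptions made for illustration.

```python
import re
from pathlib import Path


def transcript_to_filename(transcript: str, directory: str = ".", ext: str = ".wav") -> Path:
    """Build a file path whose stem is the recognized text (hypothetical helper)."""
    # Collapse whitespace and characters that are unsafe in file names into underscores.
    stem = re.sub(r'[\\/:*?"<>|\s]+', "_", transcript.strip()).strip("_")
    if not stem:
        # Assumption: an empty recognition result still needs some file name.
        stem = "empty"
    candidate = Path(directory) / f"{stem}{ext}"
    counter = 1
    # Avoid overwriting an earlier recording that produced the same transcript.
    while candidate.exists():
        candidate = Path(directory) / f"{stem}_{counter}{ext}"
        counter += 1
    return candidate
```

With files named this way, the recognition rate can later be estimated by comparing each file name against a manual transcription of the same audio.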
6
Add better error message to HubertForCTC, Wav2Vec2ForCTC if labels are bigger than vocab size.
Motivation
Following this issue: huggingface/transformers#12264 it is clear that an error message should be thrown if any of the labels are > self.config.vocab_size, or else silent errors can sneak into the training script. So w
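The kind of check this issue asks for might look like the minimal sketch below. It is written in plain Python rather than against the real model classes, and it is not the library's implementation. The function name `check_labels` and the `IGNORE_INDEX` padding value are assumptions. The sketch also treats a label equal to the vocabulary size as invalid, on the assumption that valid ids run from 0 to vocab_size - 1.

```python
from typing import Iterable

# Assumption: labels equal to this value mark padding and are skipped.
IGNORE_INDEX = -100


def check_labels(labels: Iterable[int], vocab_size: int) -> None:
    """Raise ValueError if any label cannot index a vocabulary of size vocab_size."""
    valid = [label for label in labels if label != IGNORE_INDEX]
    if not valid:
        return
    largest = max(valid)
    # Valid ids are assumed to be 0 .. vocab_size - 1, so vocab_size itself is out of range.
    if largest >= vocab_size:
        raise ValueError(
            f"Label value {largest} is out of range for vocab_size={vocab_size}"
        )
```

Calling such a check before computing the loss turns a silent training error into an immediate, descriptive failure.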