Updated Jun 17, 2023 - Python
speech-recognition
Here are 3,903 public repositories matching this topic...
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Updated Jun 9, 2023 - C++
Port of OpenAI's Whisper model in C/C++
Updated Jun 16, 2023 - C
Updated Jun 4, 2023 - TypeScript
kaldi-asr/kaldi is the official location of the Kaldi project.
Updated May 5, 2023 - Shell
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Updated Dec 19, 2022 - HTML
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Updated Jun 2, 2023 - Jupyter Notebook
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Updated Jun 13, 2023 - Python
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Updated Jun 9, 2023 - Python
NeMo: a toolkit for conversational AI
Updated Jun 16, 2023 - Python
End-to-End Speech Processing Toolkit
Updated Jun 16, 2023 - Python
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Updated Feb 16, 2023 - Python
Updated Oct 3, 2022 - JavaScript
Facebook AI Research's Automatic Speech Recognition Toolkit
Updated May 22, 2023 - C++
A PyTorch-based Speech Toolkit
Updated Jun 17, 2023 - Python
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Updated Jun 1, 2023 - Jupyter Notebook
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Updated May 6, 2023 - Jupyter Notebook
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Updated Jun 7, 2023 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
Updated Jun 12, 2023 - C++