-
Updated
Aug 3, 2021 - Shell
#
speech
Here are 950 public repositories matching this topic...
kaldi-asr/kaldi is the official location of the Kaldi project.
shell
c-plus-plus
cuda
speech
speech-recognition
speech-to-text
kaldi
speaker-verification
speaker-id
-
Updated
Mar 26, 2021 - JavaScript
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
natural-language-processing
computer-vision
deep-learning
neural-network
speech
recommendation
paddlepaddle
-
Updated
Aug 10, 2021 - Python
python
text-to-speech
deep-learning
speech
pytorch
tts
vocoder
tacotron
tensorflow2
tacotron2
melgan
speaker-encoder
dataset-analysis
glow-tts
multiband-melgan
gantts
-
Updated
Aug 10, 2021 - Jupyter Notebook
Code examples for new APIs of iOS 10.
ios
demo
metal
speech
cnn
swift-3
image-recognition
convolutional-neural-networks
ios10
uiviewpropertyanimator
swift-4
metal-performance-shaders
metal-cnn
-
Updated
Aug 11, 2021 - Swift
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
-
Updated
Aug 13, 2021 - Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
deep-neural-networks
deep-learning
speech
dnn
pytorch
recurrent-neural-networks
lstm
gru
speech-recognition
rnn
kaldi
rnn-model
asr
lstm-neural-networks
multilayer-perceptron-network
timit
dnn-hmm
-
Updated
Mar 15, 2021 - Python
python
text-to-speech
deep-learning
speech
pytorch
tts
vocoder
tacotron
speaker-encodings
tensorflow2
melgan
speaker-encoder
melgan-stft
multi-speaker-tts
glow-tts
hifigan
align-tts
tts-model
-
Updated
Aug 14, 2021 - Jupyter Notebook
WaveNet vocoder
-
Updated
Nov 2, 2020 - Python
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
-
Updated
Mar 19, 2018 - Python
DELTA is a deep learning based natural language and speech processing platform.
nlp
front-end
ops
deep-learning
text-classification
tensorflow
nlu
speech
inference
text-generation
speech-recognition
seq2seq
sequence-to-sequence
speaker-verification
asr
tensorflow-serving
emotion-recognition
custom-ops
serving
tensorflow-lite
-
Updated
Apr 16, 2021 - Python
A PaddlePaddle Speech to Any toolkit.
speech
transformer
speech-recognition
speech-to-text
conformer
deepspeech
ngram-language-model
speech-translation
ctc-decode
mandarin-language
-
Updated
Aug 12, 2021 - Jupyter Notebook
Python library and CLI tool to interface with Google Translate's text-to-speech API
-
Updated
Jul 30, 2021 - Python
Open-Source Large Vocabulary Continuous Speech Recognition Engine
-
Updated
Jul 8, 2021 - C
brightening-eyes
commented
Feb 20, 2018
hi,
as you know, in SoLoud, the number of filters are limited
we should implement more like different reverbs, fir and irr filters, (these could be used to implement HRTF support), Chorus, One Poll, One Zero, Pole Zero, Two Pole, Two Zero, etc
a library exists called stk under zlib license which already implemented these maybe we can implement some of these out
Open
Seek performance
5
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
-
Updated
Jun 7, 2018 - Python
Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple
text-to-speech
german
speech
pytorch
english
speech-recognition
spanish
colab
speech-to-text
pretrained-models
stt
asr
pretrained
onnx
stt-benchmark
enterprise-grade-stt
silero-models
tts-models
torch-hub
-
Updated
Aug 9, 2021 - Jupyter Notebook
A Python wrapper for Kaldi
python
wrapper
numpy
speech
feature-extraction
speech-recognition
kaldi
language-model
asr
openfst
clif
-
Updated
Jul 27, 2021 - Python
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
data-science
natural-language-processing
deep-neural-networks
deep-learning
neural-network
keras
voice
speech
emotion
python3
audio-files
speech-recognition
emotion-recognition
natural-language-understanding
speech-emotion-recognition
-
Updated
Mar 18, 2021 - Jupyter Notebook
Speech Enhancement Generative Adversarial Network in TensorFlow
deep-neural-networks
deep-learning
tensorflow
speech
gan
generative-model
generative-adversarial-networks
-
Updated
Jan 14, 2021 - Python
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
data
speech
dnn
lstm
speech-recognition
attention
vad
voice-detection
voice-activity-detection
bdnn
acam
speech-activity-detection
-
Updated
Jun 9, 2021 - MATLAB
python
machine-learning
projects
speech
artificial-intelligence
webcam
python-scripts
python-projects
-
Updated
Aug 10, 2021 - Jupyter Notebook
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
-
Updated
Aug 10, 2021 - TypeScript
alexa
node
speech
voice-recognition
speech-recognition
speech-to-text
voice-control
stt
hotword-detection
keyword-spotting
-
Updated
May 15, 2021 - JavaScript
A neural network for end-to-end speech denoising
machine-learning
deep-learning
end-to-end
speech
neural-networks
wavenet
speech-processing
speech-denoising
-
Updated
Jul 24, 2019 - Python
RodriSanchez1
commented
Aug 5, 2021
javafx
mp3
speech
audio-visualizer
audio-player
audio-recorder
spectrum-analyzer
audio-formats
web-browser
audio-processing
dropbox-client
stream-player
java-speech
java-stream-player
-
Updated
Aug 3, 2021 - Java
Improve this page
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."
TimeStretch
,TimeMasking
andFrequencyMasking
are implementation of SpecAugment, but it is not immediately clear from the documentation.We can add
Note: when adding images to documentation, we need to make sure that