#

speech

Here are 950 public repositories matching this topic...

kaldi-asr / kaldi

Star

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Aug 3, 2021
Shell

TalAter / annyang

Star

💬 Speech recognition for your site

demo gui tutorial voice speech speech-recognition speech-to-text hacktoberfest

Updated Mar 26, 2021
JavaScript

PaddlePaddle / models

Star

Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）

natural-language-processing computer-vision deep-learning neural-network speech recommendation paddlepaddle

Updated Aug 10, 2021
Python

mozilla / TTS

Star

🤖

💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Aug 10, 2021
Jupyter Notebook

shu223 / iOS-10-Sampler

Sponsor Star

Code examples for new APIs of iOS 10.

ios demo metal speech cnn swift-3 image-recognition convolutional-neural-networks ios10 uiviewpropertyanimator swift-4 metal-performance-shaders metal-cnn

Updated Aug 11, 2021
Swift

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Aug 13, 2021
Python

pytorch-kaldi

mravanelli / pytorch-kaldi

Star

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 15, 2021
Python

coqui-ai / TTS

Star

🐸

💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts vocoder tacotron speaker-encodings tensorflow2 melgan speaker-encoder melgan-stft multi-speaker-tts glow-tts hifigan align-tts tts-model

Updated Aug 14, 2021
Jupyter Notebook

readbeyond / aeneas

Star

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Updated Apr 16, 2021
Python

r9y9 / wavenet_vocoder

Sponsor Star

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Nov 2, 2020
Python

Kyubyong / tacotron

Star

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

tensorflow speech tts speech-synthesis-model

Updated Mar 19, 2018
Python

Delta-ML / delta

Star

DELTA is a deep learning based natural language and speech processing platform.

Updated Apr 16, 2021
Python

pytorch / audio

Star

Open

Mention SpecAugment in doc

mthrok commented Aug 9, 2021

📚 Documentation

TimeStretch, TimeMasking and FrequencyMasking are implementation of SpecAugment, but it is not immediately clear from the documentation.

We can add

Reference to the paper
Add example images

Note: when adding images to documentation, we need to make sure that

documentation is still comprehensive without the image

Read more

help wanted doc good first issue contributions welcome

Open

Add example codes for documentation

9

Open

Better structure dataset implementations

22

PaddlePaddle / DeepSpeech

Star

A PaddlePaddle Speech to Any toolkit.

speech transformer speech-recognition speech-to-text conformer deepspeech ngram-language-model speech-translation ctc-decode mandarin-language

Updated Aug 12, 2021
Jupyter Notebook

pndurette / gTTS

Star

Python library and CLI tool to interface with Google Translate's text-to-speech API

python cli text-to-speech python-library pypi speech tts gtts speech-api

Updated Jul 30, 2021
Python

julius-speech / julius

Star

Open-Source Large Vocabulary Continuous Speech Recognition Engine

recognition speech speech-recognition audio-processing

Updated Jul 8, 2021
C

soloud

jarikomppa / soloud

Star

Open

more filters should be implemented

1

brightening-eyes commented Feb 20, 2018

hi,
as you know, in SoLoud, the number of filters are limited
we should implement more like different reverbs, fir and irr filters, (these could be used to implement HRTF support), Chorus, One Poll, One Zero, Pole Zero, Two Pole, Two Zero, etc
a library exists called stk under zlib license which already implemented these maybe we can implement some of these out

Read more

help wanted good first issue

Open

Seek performance

5

Kyubyong / dc_tts

Star

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

speech tts speech-to-text

Updated Jun 7, 2018
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

text-to-speech german speech pytorch english speech-recognition spanish colab speech-to-text pretrained-models stt asr pretrained onnx stt-benchmark enterprise-grade-stt silero-models tts-models torch-hub

Updated Aug 9, 2021
Jupyter Notebook

pykaldi / pykaldi

Star

A Python wrapper for Kaldi

python wrapper numpy speech feature-extraction speech-recognition kaldi language-model asr openfst clif

Updated Jul 27, 2021
Python

praat / praat

Star

Praat: Doing Phonetics By Computer

speech phonetics acoustics

Updated Aug 14, 2021
C

MITESHPUTHRANNEU / Speech-Emotion-Analyzer

Star

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

data-science natural-language-processing deep-neural-networks deep-learning neural-network keras voice speech emotion python3 audio-files speech-recognition emotion-recognition natural-language-understanding speech-emotion-recognition

Updated Mar 18, 2021
Jupyter Notebook

santi-pdp / segan

Star

Speech Enhancement Generative Adversarial Network in TensorFlow

deep-neural-networks deep-learning tensorflow speech gan generative-model generative-adversarial-networks

Updated Jan 14, 2021
Python

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

Amazing-Python-Scripts

avinashkranjan / Amazing-Python-Scripts

Star

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

python machine-learning projects speech artificial-intelligence webcam python-scripts python-projects

Updated Aug 10, 2021
Jupyter Notebook

googleapis / nodejs-speech

Star

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

nodejs machine-learning speech speech-to-text

Updated Aug 10, 2021
TypeScript

evancohen / sonus

Star

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

alexa node speech voice-recognition speech-recognition speech-to-text voice-control stt hotword-detection keyword-spotting

Updated May 15, 2021
JavaScript

drethage / speech-denoising-wavenet

Star

A neural network for end-to-end speech denoising

machine-learning deep-learning end-to-end speech neural-networks wavenet speech-processing speech-denoising

Updated Jul 24, 2019
Python

cboard-org / cboard

Star

Open

Create a new Tour component for Board Component

1

RodriSanchez1 commented Aug 5, 2021

The Board Tour its inside the Board.component. We need to create a new component called BoardTour.component

Read more

bug good first issue

Open

Feature: reordering elements in the output box

Open

Check boards exported from Cboard are successfully import on coughdrop

4

Find more good first issues →

goxr3plus / XR3Player

Star

🎧

🎼 Advanced JavaFX Media Player

javafx mp3 speech audio-visualizer audio-player audio-recorder spectrum-analyzer audio-formats web-browser audio-processing dropbox-client stream-player java-speech java-stream-player

Updated Aug 3, 2021
Java

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."