#

speech-processing

Here are 517 public repositories matching this topic...

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jul 27, 2023
Python

pliang279 / awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

machine-learning natural-language-processing reinforcement-learning computer-vision deep-learning robotics healthcare reading-list representation-learning speech-processing multimodal-learning

Updated Jun 25, 2023

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jul 27, 2023
Jupyter Notebook

r9y9 / wavenet_vocoder

Sponsor

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Nov 2, 2020
Python

microsoft / torchscale

Foundation Architecture for (M)LLMs

machine-learning natural-language-processing translation computer-vision transformer speech-processing multimodal pretrained-language-model

Updated Jul 26, 2023
Python

r9y9 / deepvoice3_pytorch

Sponsor

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

python machine-learning end-to-end pytorch tts speech-synthesis speech-processing multi-speaker

Updated Jun 29, 2023
Python

awesome-diarization

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Jul 4, 2023

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jul 27, 2022

midas-research / audino

Open source audio annotation tool for humans

python machine-learning datasets speech-processing audio-processing annotation-tool audio-annotation

Updated Mar 4, 2023
JavaScript

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Jun 20, 2023
Python

drethage / speech-denoising-wavenet

A neural network for end-to-end speech denoising

machine-learning deep-learning end-to-end speech neural-networks wavenet speech-processing speech-denoising

Updated Jul 6, 2023
Python

nanahou / Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

deep-neural-networks signal-processing machine-learning-algorithms speech-processing speech-enhancement

Updated Dec 1, 2020
MATLAB

Ryuk17 / SpeechAlgorithms

Speech Algorithms

speech-processing

Updated Feb 28, 2023
C

haoheliu / voicefixer

Sponsor

General Speech Restoration

speech tts speech-synthesis super-resolution speech-processing vocoder speech-analysis denoise mel speech-enhancement dereverberation declipping

Updated Jun 20, 2023
Python

breizhn / DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

audio raspberry-pi deep-learning tensorflow keras speech-processing dns-challenge noise-reduction audio-processing real-time-audio speech-enhancement speech-denoising onnx tf-lite noise-suppression dtln-model

Updated Apr 26, 2022
Python

arjo129 / uSpeech

Speech recognition toolkit for the arduino

arduino speech-recognition signal speech-processing

Updated May 5, 2021
C++

Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

audio reproducible-research paper speech pytorch band speech-processing noise-reduction denoising speech-separation speech-enhancement narrow-band single-channel pretrained-model full-band sub-band

Updated Feb 23, 2023
Python

santi-pdp / pase

Problem Agnostic Speech Encoder

deep-learning pytorch unsupervised-learning speech-processing multi-task-learning waveform-analysis self-supervised-learning

Updated Jul 6, 2023
Python

huawei-noah / Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speech-synthesis speech-recognition speech-processing

Updated Jul 6, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."