🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
-
Updated
Jun 6, 2024
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
[AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
The Abuse Project Audio Dataset (TAPAD). Think MNIST for audio profanity.
Voice activity detection and speaker gender segmentation audiovisual corpus
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
LibriVox dataset for Bulgarian language TTS
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
Source code for baseline obtenience
This repository contains data preprocessing and analysis techniques for audio data.
ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.
This repository contains a custom Arabic digits (0-9) dataset contributed by multiple individuals and a neural network model designed to accurately recognize these digits.
Fine tuning Whisper-Small LLM for Hinglish Audio dataset
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Tabor category for AI2001, containing Tabor audio datasets.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Bell category for AI2001, containing Bell audio datasets.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Piano category for AI2001, containing Piano audio datasets.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Drums category for AI2001, containing Drums audio datasets.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Saxophone category for AI2001, containing Saxophone audio datasets.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The audio:sound effects category for AI2001, containing sound effect (SFX) datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎶️ The Audio:Instruments:Cannon category for AI2001, containing Cannon audio datasets.
Add a description, image, and links to the audio-dataset topic page so that developers can more easily learn about it.
To associate your repository with the audio-dataset topic, visit your repo's landing page and select "manage topics."