757 questions
-1
votes
1
answer
31
views
Can't hear Librosa chirp
I am using python under VS.
When running the following code
import librosa
from IPython.display import Audio
sr = 22050
y_sweep = librosa.chirp(fmin=librosa.note_to_hz('C3'),
fmax=...
0
votes
0
answers
21
views
generating dynamic sized mel spectrograms based on song duration
I have song files that varies in duration. I am trying to generate their mel spectrograms for training a CNN model afterwards. Since songs contain many frequencies over relatively short duration, ...
0
votes
0
answers
37
views
how to convert a python np.ndarray to Java Float[] array [duplicate]
I am using py4j for a project to connect my Python code to a JVM. Now this project requires me to pass an audio signal from the Python end to the Java end. I am using Librosa to create the audio ...
0
votes
0
answers
28
views
How to work with the librosa.amplitude_to_db function
I am working with liberosa and I need to get the estimate dB out of a few audio files. I thought I could easly use the librosa.amplitude_to_db function but I have a problem.
I don't knowe what I did ...
0
votes
0
answers
55
views
Error as_list() when trying to train my audio classification model
I have this google colab code and i want to train a model on mp3 files 3 seconds long, 44.10khz files that have 47.8kb and are 128kbps that are located in 6 folders and i keep getting this error when ...
1
vote
0
answers
71
views
Problem with a simple script in which the Librosa Python library does not work well for me
I have a problem with the script that I detail below;
My intention is to automate a click the moment an alarm sounds on my PC.
I have used the 'Sounddevice', 'Librosa', and 'PyAutoGui' libraries, in ...
0
votes
0
answers
15
views
How do visualise gini index from a mel spectogram?
i have plotted a spectogram from an audio file imported with librosa and converted it to a logarithmic scale:
mel_spectrogram = librosa.feature.melspectrogram(y=audio_data, sr=sr, n_fft=n_fft, ...
0
votes
1
answer
33
views
Librosa Split cannot separate audio
I am trying to create a filter to separate relevant audio from silence. I have been following the directions listed in this post: Find the best decibel threshold to split an audio into segments with ...
1
vote
0
answers
29
views
Can't save image in Ubuntu for Librosa
Below code is fine to operate in jupyter notebook in Windows.
However, images can't be saved in Jetson Orin Nano in Ubuntu 22.04 LTS.
I put audio files in "Record" folder.
I checked there ...
0
votes
0
answers
18
views
Specgram function in python not able to plot logarithmic y axis?
I am plotting the spectrogram for the electric field data. However the plot does not give the correct logarithmic y axis.
I have tried using specgram function in both matplotlib as well as scipy ...
0
votes
1
answer
58
views
How can I run MFCC with Librosa on a signal that is not audio?
I'm experimenting with MFCC as a signal processing technique to analyze the results of empirical mode decomposition on the original signal. This, imfs['0'], is an array of floating point values with ...
3
votes
1
answer
76
views
How to detect audio retakes using Python?
I have a lot of audio recordings for lectures where I say the same thing multiple times, mostly it's incomplete statements like:
"this is the part" (and then retrying)
"this is the part ...
0
votes
1
answer
33
views
How to Clear Old Audio File Data in Librosa?
I am trying to process about 700 .wav audio files by converting each one into a spectrogram and saving it in .jpg format.
During each iteration of processing an audio file, I attempt to delete the ...
1
vote
0
answers
48
views
Can't recognize audio file in Flask backend
I am having difficulty processing an audio blob I am sending to a Flask backend. I want to process the blob into a mel-spectrogram using librosa/pydub, but I get the following errors:
ERROR:root:...
2
votes
0
answers
634
views
Handling 'Too Large for Available Bit Count' Error When Reading Part of an MP3 File in Python
I am trying to read a specific part of an MP3 file, but I am encountering an error:
[src/libmpg123/layer3.c:INT123_do_layer3():1771] error: part2_3_length (1376) too large for available bit count (760)...