Plotting voice segments in a sound file

My project is to take a .wav file containing speech segments and write a script that labels the voice portions on a plot of the signal, based on its spectrum. Voice frequencies range from roughly 85 to 400 Hz. I've taken the FFT of the sample and the frequency distribution looks strange: very high at the low end and the high end, with almost nothing in the voice range. There isn't much other noise in the sample. Any advice would be appreciated. What I would like to do is measure frequency across time and label the parts that fall within speech frequencies as the speech portions.

4 comments

Jan on 11 May 2014
Please show us the code you use.
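Joseph on 11 May 2014
To start:
fs = 10000;                            % overwritten by the sample rate read from the file below
[sp, fs] = audioread('soundfile.wav'); % this particular sound file is 10 seconds long
spDFT = fft(sp);                       % two-sided DFT of the whole recording
stem(abs(spDFT))                       % magnitude plotted against bin index, not Hz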
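A possible reason the plot above looks "high at the low range and high range": `stem(abs(spDFT))` plots against bin index, and the DFT of a real signal is conjugate-symmetric, so the upper half of the plot mirrors the lower half. A DC offset in the recording would also put a large value in the first bin. The following is a minimal sketch of a one-sided spectrum on a frequency axis in Hz. It uses a synthetic signal (an assumption, standing in for `soundfile.wav`) so that it runs on its own.

```matlab
% Synthetic stand-in for the recording (assumption: swap in your own sp and fs)
fs = 10000;                      % sample rate in Hz
t  = (0:fs-1)'/fs;               % 1 second of sample times
sp = 0.5 + sin(2*pi*150*t);      % DC offset plus a 150 Hz tone

sp = sp - mean(sp);              % remove the DC offset so it does not dominate bin 1
N  = length(sp);
spDFT = fft(sp);

half = 1:floor(N/2)+1;           % bins from 0 Hz up to fs/2
f    = (half-1)' * fs / N;       % frequency of each kept bin in Hz
mag  = abs(spDFT(half)) / N;     % magnitude, scaled by the signal length

plot(f, mag)
xlim([0 1000])                   % zoom in on the low-frequency region
xlabel('Frequency (Hz)')
ylabel('|X(f)| / N')

[~, k] = max(mag);
peakHz = f(k);                   % frequency of the strongest component
```

With a real recording, `sp` may have two columns if the file is stereo; in that case one option is to analyze a single channel, for example `sp(:,1)`.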
Anveshkumar Kolluri on 28 Jun 2016
You can take the Fourier transform, zero out the components outside the voice band, and transform back, which leaves mainly the voiced part.
You can then plot the result to show only the voiced part.
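One way to read the suggestion above is as frequency-domain band-pass filtering: transform, zero the bins outside the voice band, and transform back. The sketch below illustrates that reading on a synthetic two-tone signal. The 85 to 400 Hz band comes from the question; the signal and the variable names are assumptions for illustration.

```matlab
% Synthetic test signal (assumption): one tone inside the band, one outside
fs = 10000;                                % sample rate in Hz
t  = (0:fs-1)'/fs;                         % 1 second of sample times
x  = sin(2*pi*150*t) + sin(2*pi*2000*t);   % 150 Hz (keep) plus 2000 Hz (remove)

N = length(x);
X = fft(x);

f = (0:N-1)' * fs / N;                     % bin frequencies from 0 up to just below fs
f = min(f, fs - f);                        % fold the mirrored upper half onto 0..fs/2

keep = (f >= 85) & (f <= 400);             % symmetric mask over the voice band
X(~keep) = 0;                              % zero every bin outside the band

y = real(ifft(X));                         % imaginary part is rounding error only

plot(t, x, t, y)
xlim([0 0.05])
legend('original', 'band-limited')
xlabel('Time (s)')
```

Zeroing bins like this acts as a brick-wall filter, which can ring around sharp transitions, and it only removes energy outside the band. It does not by itself mark where in time the speech occurs.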
Image Analyst on 29 Jun 2016
You forgot to attach 'soundfile.wav'. Why not just threshold the signal? Are there other noises just as loud as the voice but in a different frequency range?
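The thresholding idea can be sketched with short-time energy: split the signal into frames, compute the energy of each frame, and mark the frames above a threshold on the waveform plot. The frame length (25 ms), the threshold (10% of the peak frame energy), and the synthetic signal are all assumptions chosen for illustration. This only works if, as asked above, no other sounds are as loud as the voice.

```matlab
% Synthetic stand-in (assumption): quiet background with a loud burst from 0.5 s to 1.5 s
fs = 10000;                                   % sample rate in Hz
t  = (0:2*fs-1)'/fs;                          % 2 seconds of sample times
x  = 0.01*sin(2*pi*50*t);                     % low-level background
active = (t >= 0.5) & (t < 1.5);
x(active) = x(active) + sin(2*pi*150*t(active));

frameLen = round(0.025*fs);                   % 25 ms frames (assumed)
nFrames  = floor(length(x)/frameLen);
xs       = x(1:nFrames*frameLen);             % drop any partial trailing frame
frames   = reshape(xs, frameLen, nFrames);    % one frame per column
energy   = mean(frames.^2, 1);                % mean-square energy per frame

thresh   = 0.1 * max(energy);                 % assumed threshold
isSpeech = energy > thresh;                   % 1-by-nFrames logical

% Expand the per-frame decision back to one flag per sample
mask = reshape(repmat(isSpeech, frameLen, 1), [], 1);

tt = (0:length(xs)-1)'/fs;
xv = xs;
xv(~mask) = NaN;                              % NaN gaps hide the unlabeled samples

plot(tt, xs, 'b'); hold on
plot(tt, xv, 'r'); hold off
legend('signal', 'above threshold')
xlabel('Time (s)')
```

To label by frequency content rather than overall loudness, the same framing could be applied to a band-limited version of the signal, so that the energy measured in each frame comes only from the voice band.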

Answers (0)

Asked: 11 May 2014

Commented: 29 Jun 2016
