plotting voice segments in sound file

Joseph

11 Mayo 2014

0 Respuestas

Actualizado a las 29 Jun. 2016

5 Visualizaciones (30 días)

Iniciar sesión para responder a esta pregunta.

Follow Question

Iniciar sesión para responder a esta pregunta.

Follow Question

Mostrar comentarios más antiguos

0 votos

So my project is to take a .wav file with speech segments and create a script that will label the voice portions on the plot of the actual speech based on its spectrum. So, we know that voice frequencies range from 85-400 Hz. I've taken the FFT of the sample and the frequency distribution is strange. Very high at the low range and high range with almost nothing in the voice range. There's not a lot of other noise in the sample. any advice would be appreciated. What I would like to do is measure frequency across time and label parts that fall within in speech frequencies as the speech portions.

4 comentarios
Mostrar 2 comentarios más antiguos Ocultar 2 comentarios más antiguos

Anveshkumar Kolluri el 28 de Jun. de 2016

You can actually perform Fourier transform, which zeroes out the un-voiced signal and you are left with only voiced part.

Now you can plot the graph to get only the voiced part.

Image Analyst el 29 de Jun. de 2016

You forgot to attach 'soundfile.wav'. Why not just threshold the signal? Are there other noises just as loud as the voice but in a different frequency range?

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question