input shape to the LSTM net when doing inference for VAD tasks

Question

YUKAI SHEN el 7 de Mzo. de 2023

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/1924515-input-shape-to-the-lstm-net-when-doing-inference-for-vad-tasks

Respondida: Brian Hemmat el 7 de Mzo. de 2023

Hi, I am following this article to train a LSTM network for VAD tasks: https://www.mathworks.com/help/deeplearning/ug/voice-activity-detection-in-noise-using-deep-learning.html

My question is, when testing a trained LSTM network, as in the article did, the input data is not shaped as the training input as (#frames, #time_steps, #features), does this mean, when doing inference, the trained LSTM network will take each frame as a input independetly, and classify if this frame is noise or voice, so basically there is no hidden states used when doing inference, am I right?

Thank you in advance!

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Brian Hemmat el 7 de Mzo. de 2023

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/1924515-input-shape-to-the-lstm-net-when-doing-inference-for-vad-tasks#answer_1187540

I did not look at the dimensions you're discussing, but I can say that you are correct that the "streaming" code in the example classifies chunks independently. Note that it is calling classify and not classifyAndUpdateState.

Stay tuned for the R2023a release, where we have updated the example to maintain state (should be coming in the next few weeks).

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

input shape to the LSTM net when doing inference for VAD tasks

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

input shape to the LSTM net when doing inference for VAD tasks

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos