Hi, I have a question. In MATLAB, can I capture a live speech signal from a USB microphone and use it in my code at the same time? Is that possible?

speech signal from Microphone USB

Hamza Ashraf on 4 Aug 2020

Hi, I have trained a network to detect a specific sound. How can I use it now? I want to feed audio coming from the microphone to the network. Can you tell me how to do that?

Walter Roberson on 4 Aug 2020

Hamza: unfortunately we cannot tell from your postings which MATLAB version you are using, and we cannot tell whether you have the Audio System Toolbox. Also, are you using Windows 10 and do you have Data Acquisition Toolbox? (That combination gives you another option.)

Hamza Ashraf on 5 Aug 2020

I am using MATLAB R2017a, and yes, I have both Audio System Toolbox and Data Acquisition Toolbox. I am using Windows 8.1, but I can install Windows 10 if that is necessary.

Walter Roberson on 5 Aug 2020

https://www.mathworks.com/matlabcentral/answers/569851-real-time-audio-signal-from-microphone#answer_470941

Hamza Ashraf on 5 Aug 2020

Okay sir, that gives a real-time audio signal, but how can I provide it to the trained network so that it can check whether it is the specific sound or not?

Walter Roberson on 5 Aug 2020

You read one buffer-full of sound from the microphone. You then pass that buffer to predict() or classify() depending on exactly which kind of network you are using.

Hamza Ashraf on 5 Aug 2020

Sir, I cannot understand the link you posted above, so could you kindly give me example code so I can understand better?

Walter Roberson on 6 Aug 2020

fs = 22100;
FrameSize = 512;
adr = audioDeviceReader(fs, FrameSize, 'NumChannels', 2);
while true
    inbuf = adr();
    prediction = predict(YourTrainedNetwork, inbuf);
    if prediction >= 1
        fprintf('You said the secret word at %s\n', char(datetime));
    end
end

Hamza Ashraf on 7 Aug 2020

load('trained_net_001.mat')
%              audiofilename  %audio_label
[t1, ls] = data('noise.wav',      2);
YTest = classify(net, t1);
[t, ls] = data('noiseambulancewail.wav',1);
YTest = classify(net, t);
amount_wail = sum(YTest == categorical(ls'))/numel(ls)

Sir, this is how I test my network, but I don't understand this line of the code you posted above:

prediction = predict(YourTrainedNetwork, inbuf);

Please tell me how to replace this line for my case.

Walter Roberson on 7 Aug 2020

[t1, ls] = data('noise.wav', 2);

I do not recognize any function named data that accepts a file name as a handle. Are you loading a function handle as part of trained_net_001.mat ? What is the ls output -- is it the category (label) information associated with the file?

% Set Wail_category to the category your net predicts for a wail.
% '1' is an assumption based on the label passed to data() above.
Wail_category = '1';
fs = 22100;
FrameSize = 512;
adr = audioDeviceReader(fs, FrameSize, 'NumChannels', 2);
while true
    inbuf = adr();
    YTest = classify(net, inbuf);
    was_it_wail = YTest == Wail_category;   % true where the predicted class is the wail class
    if was_it_wail
        fprintf('Wail detected at time %s\n', char(datetime));
    end
end

Hamza Ashraf on 8 Aug 2020

function [specs, labels] = data(file_name, label)
    % Read an audio file and cut its log-magnitude spectrogram into
    % overlapping 30-column patches, each tagged with the given label.
    [wail_s, wail_fs] = audioread(file_name);
    [wail_freq, ~, ~] = spectrogram(wail_s, 512, 128, 512, wail_fs, 'yaxis');
    wail = 20*log10(abs(wail_freq));   % magnitude in dB
    window = 30;     % patch width in spectrogram columns
    overlap = 10;    % columns shared by consecutive patches
    size_wail = size(wail);
    copies = 5;      % each patch is stored this many times
    j = 1;
    % creating positive data samples
    xt = 1: window-overlap: (size_wail(2) - window + 1);
    for xms = 1:copies
        for i = xt
            specs(:,:,1,j) = wail(:,i:window+i-1);
            labels(j) = label;
            j = j+1;
        end
    end

    % this is how you draw the spectrogram.
    % surf(t,f,sl, 'EdgeColor', 'none')
    % colormap(jet);
end

This is the data function used to create the data samples for training the network, and yes, ls is the label.

I tried your code above, but I get the following error:

Error using DAGNetwork/predict>predictBatch (line 238)
Incorrect input size. The input images must have a size of [257 30 1].

Error in DAGNetwork/predict (line 118)
Y = predictBatch( ...

Error in DAGNetwork/classify (line 115)
scores = this.predict( X, varargin{:} );

Error in SeriesNetwork/classify (line 458)
[labels, scores] = this.UnderlyingDAGNetwork.classify(X, varargin{:});

Hamza Ashraf on 9 Aug 2020

Sir, are you there? Can you please help with this?

Walter Roberson on 9 Aug 2020

You will need to write a new function related to your data() function, that instead of reading the data from a file, accepts wail_s and wail_fs as inputs and does the same kind of calculation of specs (but does not assign labels.)

Then in my loop where I showed

    inbuf = adr();
    YTest = classify(net, inbuf);

You would instead

    inbuf = adr();
    sp = calculate_specs(inbuf, fs);
    YTest = classify(net, sp);
    

Question:

for i = xt

Are you sure you do not want

for i = 1:xt

?
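
One thing to check when writing calculate_specs: with the spectrogram settings used in data() (window 512, overlap 128), a single 512-sample frame from adr() yields only one spectrogram column, while the network input needs 30. The sketch below works out how many samples would have to be accumulated before one 257-by-30 patch can be formed. The zero-filled frames are only stand-ins for what adr() would return, and averaging the two channels to mono is an assumption, since spectrogram() expects a vector.

```matlab
FrameSize = 512;    % samples per frame returned by the device reader
nwin      = 512;    % spectrogram window length used in data()
noverlap  = 128;    % spectrogram overlap used in data()
ncols     = 30;     % spectrogram columns the network input expects

hop = nwin - noverlap;                          % new samples per spectrogram column
samples_needed = nwin + (ncols - 1) * hop;      % shortest signal giving ncols columns
frames_needed  = ceil(samples_needed / FrameSize);

% Simulated accumulation; each zeros(FrameSize, 2) stands in for inbuf = adr();
acc = zeros(0, 1);
for k = 1:frames_needed
    frame = zeros(FrameSize, 2);                % placeholder stereo frame
    acc = [acc; mean(frame, 2)];                % mix down to mono and append
end

% Number of columns spectrogram() would return for a signal of this length
n_columns = fix((numel(acc) - noverlap) / hop);
fprintf('%d samples needed, %d frames, %d columns\n', ...
        samples_needed, frames_needed, n_columns);
```

So roughly 23 frames of 512 samples (about half a second at the sample rate used above) would need to be collected before each call to classify().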

Hamza Ashraf on 10 Aug 2020

Okay, I got it, thank you. It is working now, but my trained network is not giving me correct results. I trained it on a large amount of data, about 200 audio files, but I am still not getting correct results. I don't know where I am going wrong. Can you help me with that? Here is the code for my convolutional neural network:

layers = [imageInputLayer([257 30 1])
          convolution2dLayer(10,3)
          reluLayer
          maxPooling2dLayer(2,'Stride',2)
          convolution2dLayer(5,2)
          reluLayer
          maxPooling2dLayer(2,'Stride',2)
          fullyConnectedLayer(2)
          fullyConnectedLayer(2)
          softmaxLayer
          classificationLayer()];
options = trainingOptions('sgdm','MaxEpochs',60, 'InitialLearnRate',0.0001);
net = trainNetwork(specs, categorical(labels), layers, options);
YTest = classify(net, tests);

Walter Roberson on 10 Aug 2020

It is not clear to me why you would use image layers to try to process features extracted from audio input.

Hamza Ashraf on 10 Aug 2020

See, in my data function the audio files are turned into spectrogram images; that is why I used image layers.

The technique I am using is explained in the following PDF document:

1803-153578318109-14.pdf

Walter Roberson on 10 Aug 2020

... Then your function that processes the audio from the microphone is going to need to create an image of the spectrogram, instead of just calculating the values for the spectrogram. This is not what your code reading from file did: your code reading from file calculated the values without writing out a spectrogram image.

Hamza Ashraf on 10 Aug 2020

function specs = dataoutput(wail_s, wail_fs)
    % Same patch extraction as data(), but takes the signal and sample
    % rate directly instead of reading a file, and assigns no labels.
    %[wail_s,wail_fs]=audioread(file_name);
    [wail_freq, ~, ~] = spectrogram(wail_s, 512, 128, 512, wail_fs, 'yaxis');
    wail = 20*log10(abs(wail_freq));   % magnitude in dB
    window = 30;     % patch width in spectrogram columns
    overlap = 10;    % columns shared by consecutive patches
    size_wail = size(wail);
    copies = 1;
    j = 1;
    % creating positive data samples
    xt = 1: window-overlap: (size_wail(2) - window + 1);
    for xms = 1:copies
        for i = xt
            specs(:,:,1,j) = wail(:,i:window+i-1);

            j = j+1;
        end
    end

    % this is how you draw the spectrogram.
    % surf(t,f,sl, 'EdgeColor', 'none')
    % colormap(jet);
end

This is my function that processes the audio from the microphone and gives the data to the network. Please look at it and tell me if something needs to be changed, or how I can get correct results.

Walter Roberson on 11 Aug 2020

If you have the Signal Processing Toolbox, then you could consider using buffer() instead of your current for xms / for i loops.

Your dataoutput() routine does appear to be consistent with your earlier data() routine. However, neither of them creates the spectrogram images that you indicate that your deep learning code relies on. You would have to imwrite() the images, possibly after capturing the image from the screen.
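
As a rough sketch of replacing the two loops: buffer() would need the Signal Processing Toolbox, so the version below does the same framing with plain index arithmetic instead. The random matrix is only a stand-in for the wail array, and the window and overlap values are taken from dataoutput() above. It relies on implicit expansion, which is assumed to be available (R2016b or later).

```matlab
window  = 30;                       % patch width in spectrogram columns
overlap = 10;                       % columns shared by consecutive patches
wail = rand(257, 70);               % stand-in for the 257-by-N dB spectrogram

step   = window - overlap;                          % distance between patch starts
starts = 1:step:(size(wail, 2) - window + 1);       % first column of each patch
idx    = (0:window-1)' + starts;                    % window-by-K matrix of column indices

% Gather all patches at once and stack them along the 4th dimension,
% matching the layout specs(:,:,1,j) produced by the loop version.
specs = reshape(wail(:, idx), size(wail, 1), window, 1, []);
disp(size(specs));
```

Each specs(:,:,1,k) then holds the same columns that the inner loop would have copied for the k-th value of xt.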

Hamza Ashraf on 11 Aug 2020

Okay sir. Since my data functions are extracting features from the audio input, which layers should I use in my network? Can you help me modify my network for that scenario?

Walter Roberson on 12 Aug 2020

You just might be able to continue using imageInputLayer, but instead of an image of a spectrogram, you would pass it the array of specs data.

Hamza Ashraf on 13 Aug 2020

From my data function it can be seen that I am passing the array of specs and labels, but I am still not getting accurate results.
