Why can future values affect NARX time series output?

Question

Kevin Johnson el 11 de Feb. de 2023

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/1910890-why-can-future-values-affect-narx-time-series-output

Comentada: Kevin Johnson el 11 de Feb. de 2023

I am using A NARX network with one target vector and several input vectors. I am trying to predict the target value each day based on the inputs provided that day. If I feed all of the input and target values into the network up to and including the current day, I get one output result. But if I also feed in additional future input and target values, I get a different output result.

I have used " rng 'default' " to keep the same seed for the random number generator. I suspect there is some normalization or something going on that is using the future values, but I can't locate it. I am not doing that myself.

Obviously, any prediction routine that uses future data is invalid.

Thoughts?

I checked this as follows: I substituted a linear regression model instead of the NARX, and got completely reproducible and identical results whether the future data was present or not. The inputs and targets fed to this routine were identical to those fed to the NARX.

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Walter Roberson el 11 de Feb. de 2023

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/1910890-why-can-future-values-affect-narx-time-series-output#answer_1169400

Movida: Walter Roberson el 11 de Feb. de 2023

Abrir en MATLAB Online

I am perhaps being naive on this, but the behaviour you describe for NARX is what I would expect.

Suppose you had a series of points and you were doing a polynomial fit on them and using it to estimate values:

rng(12346)

N = 25;

x = 1:25;

y = rand(1, N);

x20 = x(1:20);

y20 = y(1:20);

p20 = polyfit(x20, y20, 8);

yfit20 = polyval(p20, x20, y20);

plot(x, y, '.k');

hold on

plot(x20, yfit20, '-ob');

hold off

Now, would you expect the same prediction for the 20th location if you added more data in?

p_all = polyfit(x, y, 8);

yfit_all = polyval(p_all, x, y);

figure

plot(x, y, '.k');

hold on

plot(x20, yfit20, '-ob');

plot(x, yfit_all, ':+r');

hold off

Any system that uses any kind of aggregate information (mean, standard deviation, moments) is likely to produce a different output for a given location if you fed it more information.

7 comentarios
Mostrar 5 comentarios más antiguosOcultar 5 comentarios más antiguos

Walter Roberson el 11 de Feb. de 2023

But if I also feed in additional future input and target values, I get a different output result.

Perhaps I am not understanding you in what you mean by that.

If you are saying that given rng('default') that you expect that if you feed in only x(1:20) and y(1:20) that the prediction for x(21) should be the same as-if you had feed in all x(1:25) and y(1:25) and then asked to predict for x(21) -- if that is what you are saying, then I would remind you that when you have more input data then the random splits between training and test and validation are going to be different, and different number of iterations might be used to meet tolerance goals (training can stop for a number of different reasons, not just for having taken a fixed number of iterations).

Kevin Johnson el 11 de Feb. de 2023

That is what I am saying, yes. I can see what you mean. I was under the impression that the algorithm marches along in time, predicting at the leading edge - the "frontier". I see I am likely mistaken in that. I think my error is that I am training on the whole set in both variant examples (partial data vs. complete), and during training, the algorithm does indeed look backward and forward in the sense of data divison etc as you say. So instead I need to train the network, then separately simulate the network going forward in tme. My results were very good when I input all the data, and I see now that was probably essentially an in-sample result, which is almost always better than an out-of-sample result. Sound right?

Iniciar sesión para comentar.

Why can future values affect NARX time series output?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

7 comentarios
Mostrar 5 comentarios más antiguosOcultar 5 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Why can future values affect NARX time series output?

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

7 comentarios Mostrar 5 comentarios más antiguosOcultar 5 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

7 comentarios
Mostrar 5 comentarios más antiguosOcultar 5 comentarios más antiguos