Borrar filtros
Borrar filtros

Canoncorr Coefficients for large data

4 visualizaciones (últimos 30 días)
Matt Soldano
Matt Soldano el 10 de Mzo. de 2022
Respondida: Ayush Modi el 18 de En. de 2024
[A, B, r, U, V, stats] = canoncorr(XTrain,yTrain);
I am working with a large dataset, however our problem stems from the fact that we have over 50,000 observations and about 93 samples. We understand that the matrix will not be full rank when we run canoncorr, nonetheless we are curious as to how canoncorr picks specific observations for the coefficients (A and B) for each sample. To further explain, we created a LOOCV script to see which observations were picked for the coefficient matricies A and B. What we learned was that there were a handful of observations that were used on the training data for all of the samples, and only ~350 of the 50,000 observations were used for the training data.
Ultimate question: Does canoncorr randomly set the observations to 0 for the coefficients matrix when there are more observations than samples? Or is there a system for picking the best/most representative observations for the coefficients matrix?

Respuestas (1)

Ayush Modi
Ayush Modi el 18 de En. de 2024
Hi Matt,
As per my understanding, you would like to know how does "canoncorr" function selects the observations to set to 0 if input matrix is not a full rank matrix. I am assuming that:
  • Observations are the individual data points or samples in the dataset i.e. 50,000 observations.
  • Variables are the different measurements or features recorded for each observation i.e. 93 variables.
"canoncorr" function does not randomly set observations to zero for the coefficients matrix. It systematically computes the canonical coefficients for the variables, and any zeros in the coefficients matrices correspond to variables that are linearly dependent on others.
Please refer to the following MathWorks documentation for more information on "canoncorr" function:
Hope this helps!

Categorías

Más información sobre Dimensionality Reduction and Feature Extraction en Help Center y File Exchange.

Productos


Versión

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by