Unsupervised clustering of categorical data

10 visualizaciones (últimos 30 días)
Daniel Guignard
Daniel Guignard el 23 de Nov. de 2021
Respondida: Pratyush Roy el 1 de Dic. de 2021
Hi everyone,
I wanted to cluster a time serie dataset which has 30 timepoints and more than 50'000 rows. The dataset is categorical (from 1 to 6) which represent different categories.
The problem with my current clustergram method using the euclidian distance metrics, is that it will cluster the category 5 closer to 6. I don't want that, those categories are not somehow related. How is it possible to remove this bias in the clustering?
Hope my question is clear, thanks for your further help!
  2 comentarios
Image Analyst
Image Analyst el 23 de Nov. de 2021
Could be clearer if you attached a .mat file with your table, as many rows as will fit into 5 MB (attachment size limit).
Daniel Guignard
Daniel Guignard el 23 de Nov. de 2021
sure, here is the matrix

Iniciar sesión para comentar.

Respuestas (1)

Pratyush Roy
Pratyush Roy el 1 de Dic. de 2021
Hi Daniel,
The link here might be helpful for clustering categorical or non-numeric data.
Hope this helps!

Productos


Versión

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by