Creating numerical variables from categorical variables in an unbalanced dataset
2 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Grigorios Kyritsakas
el 12 de Feb. de 2020
Comentada: Grigorios Kyritsakas
el 17 de Feb. de 2020
Hello there,
I would like to apply Random Forrest method in a highly unbalanced dataset that includes both numerical and categoorical variables.In order to improve my classification results, before applying the method for classification I thought to create synthtic datasets using the SMOTE and the ADASYN algorithm. However, both methods work only with numerical variables, therefore, I would like to ask if you have any suggestion regarding the way to transform my categorical variables into numerical ones.
With many thanks in advance for your help
0 comentarios
Respuesta aceptada
Lei Hou
el 14 de Feb. de 2020
Hi Grigorios,
You can do something as the following.
catVar = categorical(["a" "b" "c" "b" "a"]);
numValue = [0.1 3 100]; % The order of numbers refers to the order of categories returned by categories(catVar)
numVar = numValue(catVar)
Hoping my solution helpful to you.
Más respuestas (0)
Ver también
Categorías
Más información sobre Probability Distributions en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!