Does this mean that my trees are grown too deep? Or can this be an indicator of perfect classification with probability of 100% or 0%?
Random forest prediction probabilities
10 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Memo Remo
el 13 de Abr. de 2021
Comentada: Memo Remo
el 22 de Jun. de 2021
Hi,
I trained a random forest model using MATLAB's "TreeBagger" function. However, when I use the "predict" function, my probabilities are all 0 or 1 except for a few predictions. Despite having 4000 observations, my roc curve has also only three data point. Can you suggest any solution for this problem?
Thanks in advance.
4 comentarios
Respuesta aceptada
Aditya Patil
el 10 de Mayo de 2021
Getting a probability of 1 suggests that the model has overfitted, and the observation is being predicted as belonging to the specific class by all trees.
You can overcome this issue by reducing the size of the trees. Few of the options that might help are,
- MinLeafSize: Set this to higher value
- MaxNumSplits: Set this to a lower value
Alternately, you may want to use a different approach entirely, by using SVMs or other classifiers.
Más respuestas (0)
Ver también
Categorías
Más información sobre Classification Ensembles en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!