How can I supply (trainsvm) my dataset if it was in text form?

Question

Muneera Abdulsalam el 2 de En. de 2017

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/318804-how-can-i-supply-trainsvm-my-dataset-if-it-was-in-text-form

Cerrada: MATLAB Answer Bot el 20 de Ag. de 2021

I am using trainsvm to train my dataset, I have the group names, and related text in different forms (doc, pdf).

1. How can I supply this text for svm to train it? In what form (extension) Matlab will accept the input? 2. Is there another function I should use before training to select features? If so, What choices does Matlab provide?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

La pregunta está cerrada.

Answer 1

mizuki el 3 de En. de 2017

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/318804-how-can-i-supply-trainsvm-my-dataset-if-it-was-in-text-form#answer_249219

Assuming that you use SVMTRAIN function.

1. Your variable should be either string array (which is released in R2016b) or cell array if your target vector contains text.

2. Preprocessing depends on your data, so do not care about it at first. There are so many, but PCA (principal component analysis) would be the easy one to use for preprocessing data.

If you have MATLAB whose version is R2015a or later, use classificationLearner app & its MATLAB code generation feature.

classification learner

https://www.mathworks.com/help/stats/classificationlearner-app.html https://www.youtube.com/watch?v=ufFitvEm83w

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Muneera Abdulsalam el 3 de En. de 2017

My question is that, I am having my data in a form of text, how can I tokenize, select features, and give this information back to the function? Is it done manually? or is there certain tools? will Classification Learner tokenize and select features for me?

Walter Roberson el 3 de En. de 2017

You need to write the tokenization code, which is not easy to get right for English text, especially when you consider quoted material and the multiple uses of "." and ","

How can I supply (trainsvm) my dataset if it was in text form?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Ver también

Etiquetas

Productos

Community Treasure Hunt

How can I supply (trainsvm) my dataset if it was in text form?

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

3 comentarios Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Ver también

Etiquetas

Productos

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo