How to remove top phrases from a word cloud

3 visualizaciones (últimos 30 días)
Pat Canny
Pat Canny el 2 de Jul. de 2020
Respondida: Pat Canny el 2 de Jul. de 2020
I have created a set of word clouds, but want to remove some common two-word phrases that appear in each.
I tried using removeWords, but it did not appear to work!
Is there a way to see what the top phrases are so that I could check if there are any extra spaces, for instance?

Respuesta aceptada

Pat Canny
Pat Canny el 2 de Jul. de 2020
The removeWords function only removes specific words. It won't remove phrases like you might expect.
The word clouds are showing n-grams, therefore you are looking remove common n-grams. To do so, use the removeNgrams function. Each phrase should be a 1x2 string array (see the "thou art" and "thou dost" example in the removeNgrams Documentation)
If you are interested in the top n-grams in a bag-of-n-grams model (which you used to create the word cloud), you can use the topkngrams function.
Alternatively (and this is not the preferred approach), you can extract the WordData from a word cloud object:
wc = wordcloud(bag);
wordData = wc.WordData;

Más respuestas (0)

Categorías

Más información sobre Modeling and Prediction en Help Center y File Exchange.

Productos


Versión

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by