Least Frequent Words in document

Charmaine Tan on 28 Nov 2018
Answered: Snehal on 29 Jan 2025
If I use topkwords to find the most frequent words, what code can I use to show the 10 least frequent words?

Answers (1)

Snehal on 29 Jan 2025
Hi,
I understand that you want to display the 10 least frequent words in a given piece of text.
This can be done with the 'topkwords' function itself. Pass the bag-of-words model to 'topkwords' with k set to Inf so that it returns every word along with its count. Then sort the resulting table by the 'Count' column in ascending order and take the first 10 rows.
The sample code below shows the steps:
% Sample text data
textData = "This is a sample text. This text is for testing if our approach can display the least frequent words correctly or not";
% topkwords needs a bag-of-words model, so tokenize the text and build one first
documents = tokenizedDocument(textData);
bag = bagOfWords(documents);
% Get every word with its count (k = Inf returns the whole vocabulary)
wordTable = topkwords(bag, Inf);
% Sort ascending by count so the least frequent words come first
sortedTable = sortrows(wordTable, 'Count');
% Select the 10 least frequent words (or fewer if the vocabulary is smaller)
numLeastFrequent = min(10, height(sortedTable));
leastFrequentWords = sortedTable.Word(1:numLeastFrequent);
leastFrequentCounts = sortedTable.Count(1:numLeastFrequent);
% Display the least frequent words
disp(leastFrequentWords);
"a" "sample" "." "for" "testing" "if" "our" "approach" "can" "display"
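Note that the output above counts "." as a word, and that many words are tied at a count of 1, so which ten you get depends on how the ties happen to be ordered. A minimal sketch of one way to handle both, assuming the Text Analytics Toolbox is available: erasePunctuation drops punctuation before counting, and sorting on 'Count' and then 'Word' breaks ties alphabetically. The variable names (cleanDocs, leastFrequent) and the alphabetical tie-break are my own choices for illustration, not something from the original question.

```matlab
% Sample text data (same as in the answer above)
textData = "This is a sample text. This text is for testing if our approach can display the least frequent words correctly or not";

% Tokenize, then remove punctuation so that "." is not counted as a word
documents = tokenizedDocument(textData);
cleanDocs = erasePunctuation(documents);

% Build the bag-of-words model and list every word with its count
bag = bagOfWords(cleanDocs);
wordTable = topkwords(bag, Inf);

% Sort by count first, then alphabetically by word, so ties are resolved
% in a predictable way (assumption: alphabetical order is acceptable)
sortedTable = sortrows(wordTable, {'Count', 'Word'});

% Keep at most 10 rows, in case the vocabulary has fewer than 10 words
k = min(10, height(sortedTable));
leastFrequent = sortedTable(1:k, :);

% Show the words together with their counts
disp(leastFrequent);
```

Keeping the result as a table shows each word next to its count, which makes the ties easy to see.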
Refer to the documentation for 'topkwords', 'bagOfWords' and 'sortrows' for more details.
Hope this helps.

Release

R2018b
