tf idf - How to create my own stop words list? -


i create stop words list non-english language. metrics better creating stop words list: term frequency entire document collection or tf-idf metrics?

you can use r this:

my.list <- unlist(read.table("c:/users/blabla/desktop/files/yourstopword.txt",                              stringsasfactors=false)) my.stops <- c(my.list) mycorpus <- tm_map(mycorpus, removewords, my.stops) 

Comments

Popular posts from this blog

java.util.scanner - How to read and add only numbers to array from a text file -

rewrite - Trouble with Wordpress multiple custom querystrings -

debian - 500 Error upon login into Plesk Admin - auth.php3? -