tf idf - How to create my own stop words list?
I want to create a stop word list for a non-English language. Which metric is better for building such a list: term frequency over the entire document collection, or tf-idf?
You can do this in R with the tm package:
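    library(tm)
    # Read your own stop words from a text file (one word per line)
    my.list <- unlist(read.table("c:/users/blabla/desktop/files/yourstopword.txt", stringsAsFactors = FALSE))
    my.stops <- c(my.list)
    # Remove those words from every document in the corpus
    mycorpus <- tm_map(mycorpus, removeWords, my.stops)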
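The snippet above assumes the stop word file already exists. As a rough sketch of how the two metrics from the question could be compared when building that file, the base R code below computes collection term frequency, document frequency and idf for a toy corpus. The sample documents, the variable names and the 0.6 cut-off are all made up for illustration and would need tuning on real data.

```r
# Toy corpus (hypothetical); replace with your own documents.
docs <- c(
  "the cat sat on the mat",
  "the dog sat on the log",
  "the bird flew over the house"
)

# Split each document into lowercase tokens on whitespace.
tokens <- strsplit(tolower(docs), "\\s+")

# Collection term frequency: total occurrences across all documents.
tf <- table(unlist(tokens))

# Document frequency: number of documents that contain each term.
df <- table(unlist(lapply(tokens, unique)))

# Inverse document frequency; a term found in every document gets 0.
idf <- log(length(docs) / df)

stats <- data.frame(
  term = names(tf),
  tf   = as.integer(tf),
  df   = as.integer(df[names(tf)]),
  idf  = as.numeric(idf[names(tf)]),
  stringsAsFactors = FALSE
)

# Sort by high term frequency first, then by low idf.
stats <- stats[order(-stats$tf, stats$idf), ]

# Keep terms that occur in at least 60% of documents (arbitrary cut-off).
candidates <- stats$term[stats$df / length(docs) >= 0.6]
print(stats)
print(candidates)
```

Terms with a high total frequency and a low idf are spread across most documents, so they are the natural stop word candidates. A term that is frequent in only a few documents will have a high idf and is probably content-bearing, which raw term frequency alone cannot tell you. The resulting candidates could be reviewed by hand and written out with writeLines() to the file that read.table() loads.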