I'm currently performing a topic modelling using LDA from text2vec
package. I managed to create a dtm matrix and then apply LDA
and its fit_transform
method with n_topics=50
.
While looking at the top words from each topic, a question popped into my mind. I plan to apply the model to new data afterwards and there's a possibility of occurence of new words, which were not encountered by the model before. Will the model still be able to assign each word to its respective topic? Moreover, will these words also be added to the topic, so that I will be able to locate them using get_top_words
?
Thank you for answering!