I need to find a common root word matched for all related words for a keyword extractor.
How to convert words into the same root using the python nltk lemmatizer?
- Eg:
- generalized, generalization -> general
- optimal, optimized -> optimize (maybe)
- configure, configuration, configured -> configure
The python nltk lemmatizer gives 'generalize', for 'generalized' and 'generalizing' when part of speech(pos) tag parameter is used but not for 'generalization'.
Is there a way to do this?