1. Should stop words be removed from corpus beforehand? My topic_model generates clusters with most frequent words like "the", "and", "to" and etc. 2. Is there any model to process long text without truncating them to maximum available for transformers?