Skip to content
This repository


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Python library for creating word clouds from text

branch: master

Fetching latest commit…


Cannot retrieve the latest commit at this time

Octocat-spinner-32 example
Octocat-spinner-32 weighwords
Octocat-spinner-32 .gitignore
Octocat-spinner-32 COPYING
Octocat-spinner-32 README.rst
Octocat-spinner-32 gpl-3.0.txt
Octocat-spinner-32 lgpl-3.0.txt

WeighWords is a Python library for creating word weights from text. It can be used to create word clouds.

WeighWords does not do visualization of word clouds. For that, you can paste its output into a tool like or the IBM Word-Cloud Generator.

Rather than use simple word frequency, it weighs words by statistical models known as parsimonious language models. These models are good at picking up the words that distinguish a text document from other documents in a collection. The downside to this is that you can't use WeighWords to make a word cloud of a single document; you need a bunch of document to compare to.



pip install weighwords


python install


D. Hiemstra, S. Robertson and H. Zaragoza (2004). Parsimonious Language Models for Information Retrieval. Proc. SIGIR'04.

R. Kaptein, D. Hiemstra and J. Kamps (2010). How different are Language Models and word clouds? Proc. ECIR.

Something went wrong with that request. Please try again.