Skip to content

matzech/word_cloud

 
 

Repository files navigation

Build Status licence DOI

word_cloud

A little word cloud generator in Python. Read more about it on the blog post or the website. The code is Python 2, but Python 3 compatible.

Installation

Fast install:

pip install wordcloud

If you are using conda, it might be even easier to use anaconda cloud:

conda install -c https://conda.anaconda.org/amueller wordcloud

For a manual install get this package:

wget https://github.com/amueller/word_cloud/archive/master.zip
unzip master.zip
rm master.zip
cd word_cloud-master

Install the package:

python setup.py install

Installation notes

worcloud depends on numpy>=1.5.1, pillow and matplotlib. To install it via pip, you will also need a C compiler.

Windows

If you're having trouble with pip installation on windows, you can find a .whl file at:

https://www.lfd.uci.edu/~gohlke/pythonlibs/#wordcloud

Ubuntu

If the installation of the package fails, due to a missing pyconfig.h file, you need to install the python-dev package.

For Python 2.*

sudo apt-get install python-dev

For Python 3.*

sudo apt-get install python3-dev
CentOS / RHEL

If the compilation via gcc of the package fails, due to a missing Python.h file, you need to install the python-devel package.

For Python 2.*

sudo yum install -y python-devel

For Python 3.*

sudo yum install -y python34-devel

Examples

Check out examples/simple.py for a short intro. A sample output is:

Constitution

Or run examples/masked.py to see more options. A sample output is:

Alice in Wonderland

Getting fancy with some colors: Parrot with rainbow colors

Command-line usage

The wordcloud_cli.py tool can be used to generate word clouds directly from the command-line:

$ wordcloud_cli.py --text mytext.txt --imagefile wordcloud.png

If you're dealing with PDF files, then pdftotext, included by default with many Linux distribution, comes in handy:

$ pdftotext mydocument.pdf - | wordcloud_cli.py --imagefile wordcloud.png

In the previous example, the - argument orders pdftotext to write the resulting text to stdout, which is then piped to the stdin of wordcloud_cli.py.

Use wordcloud_cli.py --help so see all available options.

Licensing

The wordcloud library is MIT licenced, but contains DroidSansMono.ttf, a true type font by Google, that is apache licensed. The font is by no means integral, and any other font can be used by setting the font_path variable when creating a WordCloud object.

About

A little word cloud generator in Python

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 97.2%
  • Shell 2.7%
  • Batchfile 0.1%