Code Monkey home page Code Monkey logo

word_cloud's Introduction

Build Status licence DOI

word_cloud

A little word cloud generator in Python. Read more about it on the blog post or the website. The code is Python 2, but Python 3 compatible.

Installation

Fast install:

pip install wordcloud

If you are using conda, it might be even easier to use anaconda cloud:

conda install -c https://conda.anaconda.org/amueller wordcloud

For a manual install get this package:

wget https://github.com/amueller/word_cloud/archive/master.zip
unzip master.zip
rm master.zip
cd word_cloud-master

Install the package:

python setup.py install

Installation notes

worcloud depends on numpy>=1.5.1, pillow and matplotlib. To install it via pip, you will also need a C compiler.

Windows

If you're having trouble with pip installation on windows, you can find a .whl file at:

http://www.lfd.uci.edu/~gohlke/pythonlibs/#wordcloud

Ubuntu

If the installation of the package fails, due to a missing pyconfig.h file, you need to install the python-dev package.

For Python 2.*

sudo apt-get install python-dev

For Python 3.*

sudo apt-get install python3-dev
CentOS / RHEL

If the compilation via gcc of the package fails, due to a missing Python.h file, you need to install the python-devel package.

For Python 2.*

sudo yum install -y python-devel

For Python 3.*

sudo yum install -y python34-devel

Examples

Check out examples/simple.py for a short intro. A sample output is:

Constitution

Or run examples/masked.py to see more options. A sample output is:

Alice in Wonderland

Getting fancy with some colors: Parrot with rainbow colors

Command-line usage

The wordcloud_cli.py tool can be used to generate word clouds directly from the command-line:

$ wordcloud_cli.py --text mytext.txt --imagefile wordcloud.png

If you're dealing with PDF files, then pdftotext, included by default with many Linux distribution, comes in handy:

$ pdftotext mydocument.pdf - | wordcloud_cli.py --imagefile wordcloud.png

In the previous example, the - argument orders pdftotext to write the resulting text to stdout, which is then piped to the stdin of wordcloud_cli.py.

Use wordcloud_cli.py --help so see all available options.

Used in

Reddit Cloud

Reddit Cloud is a Reddit bot which generates word clouds for comments in submissions and user histories. You can see it being operated on /u/WordCloudBot2 (top posting).

A Reddit Cloud sample

Chat Stats (Twitch.tv)

Chat Stats is a visualization program for Twitch streams, which generates word clouds for comments made by Twitch users in the chat. It also creates various charts and graphs pertaining to concurrent viewership and chat rate over time.

Chat Stats Sample

Twitter Word Cloud Bot

Twitter Word Cloud Bot is a twitter bot which generates word clouds for twitter users when it is mentioned with a particular hashtag. Here you can see it in action, while here you can see all the word clouds generated so far.

Stack Overflow Users Tag Cloud

Stackoverflow Tag Cloud generates tag clouds of users on Stack Overflow or any Stack Exchange site. If you are contributing to Stack Overflow community, it's an easy way to share your expertise with others through an image. Here's Stack Overflow's highest reputation user Jon Skeet's tag cloud -

Screenshot

[other]

Send a pull request to add yours here.

Issues

Using Pillow instead of PIL might might get you the TypeError: 'int' object is not iterable problem also showcased on the blog.

Licensing

The wordcloud library is MIT licenced, but contains DroidSansMono.ttf, a true type font by Google, that is apache licensed. The font is by no means integral, and any other font can be used by setting the font_path variable when creating a WordCloud object.

word_cloud's People

Contributors

amueller avatar boidolr avatar paul-nechifor avatar peter92 avatar petrushev avatar icyblade avatar mkcor avatar remram44 avatar langner avatar ianozsvald avatar defacto133 avatar vkolmakov avatar terrycojones avatar popcorncolonel avatar valentinarho avatar igorapm avatar droyed avatar cjmay avatar gustavoaragon avatar sedders123 avatar biogeek avatar lowks avatar piyushkhemka avatar laserson avatar sluetze avatar vaastav avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.