Code Monkey home page Code Monkey logo

cleanco's Introduction

cleanco - clean organization names

What is it / what does it do?

This is a Python package that processes company names, providing cleaned versions of the names by stripping away terms indicating organization type (such as "Ltd." or "Corp").

Using a database of organization type terms, It also provides an utility to deduce the type of organization, in terms of US/UK business entity types (ie. "limited liability company" or "non-profit").

Finally, the system uses the term information to suggest countries the organization could be established in. For example, the term "Oy" in company name suggests it is established in Finland, whereas "Ltd" in company name could mean UK, US or a number of other countries.

How do I install it?

Download it from this site and unzip the directory.

  • Mac: cd into it, and enter sudo python setup.py install along with your system password.
  • Windows: Same thing but without sudo.

How does it work?

Let's look at some sample code. First, create an instance of the module:

>>> from cleanco import cleanco

Prepare a string of a company name that you want to process:

>>> business_name = "Some Big Pharma, LLC"

Throw it into the instance:

>>> x = cleanco(business_name)

You can now get the company types:

>>> x.type()
['Limited Liability Company']

...the possible countries...

>>> x.country()
['United States of America', 'Philippines']

...and a clean version of the company name.

>>> x.clean_name()
'Some Big Pharma'

Are there bugs?

You better believe it. Please let me know or fork this project. I'm sure some of the company suffixes are way incorrect and I'm missing a lot more information.

Special thanks to:

cleanco's People

Contributors

petri avatar psolin avatar

Watchers

James Cloos avatar Christian R. Kounatze avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.