Cleanco

2.3 · active · verified Fri Apr 10

Cleanco is a Python library (current version 2.3) for processing company names. It strips terms that indicate organization type (such as 'Ltd.' or 'Corp.') from a name, deduces the business entity type (e.g., 'limited liability company'), and suggests countries where the company may be established. Releases are irregular, with recent updates in late 2023 and early 2024.

Install

pip install cleanco

Imports

from cleanco import basename, typesources, matches, countrysources

Quickstart

This example cleans a company name, deduces its business entity type, and suggests possible countries of origin using cleanco's primary functions.

from cleanco import basename, typesources, matches, countrysources

business_name = "Some Big Pharma, LLC"

# Strip organization-type terms from the name.
cleaned_name = basename(business_name)
print(f"Cleaned name: {cleaned_name}")

# Build the entity-type term sources, then match the original name against them.
classification_sources = typesources()
business_types = matches(business_name, classification_sources)
print(f"Business types: {business_types}")

# Build the country term sources and match the same way.
country_classification_sources = countrysources()
possible_countries = matches(business_name, country_classification_sources)
print(f"Possible countries: {possible_countries}")
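
The calls above hide how suffix matching works. As a rough illustration only, the idea can be sketched in plain Python. This is not cleanco's implementation: the TERMS table and the split_suffix and describe helpers are hypothetical names invented for this sketch, and the real library ships a much larger term list with more careful normalization.

```python
import re

# Hypothetical, tiny term table: suffix -> (entity type, example countries).
# These entries are illustrative only and are not taken from cleanco's data.
TERMS = {
    "llc": ("Limited Liability Company", ["United States of America"]),
    "ltd": ("Limited", ["United Kingdom"]),
    "inc": ("Incorporated", ["United States of America"]),
}


def split_suffix(name):
    """Return (base name, matched term key), or (name, None) if no term matches."""
    stripped = name.strip().rstrip(".")
    parts = re.split(r"[\s,]+", stripped)
    last = parts[-1].lower().replace(".", "")
    if len(parts) > 1 and last in TERMS:
        # Cut the final token, then drop the separator left behind.
        base = stripped[: len(stripped) - len(parts[-1])].rstrip(" ,")
        return base, last
    return name.strip(), None


def describe(name):
    """Combine the three steps: clean the name, look up type and countries."""
    base, key = split_suffix(name)
    entity_type, countries = TERMS.get(key, (None, []))
    return {"base": base, "type": entity_type, "countries": countries}


if __name__ == "__main__":
    print(describe("Some Big Pharma, LLC"))
```

The sketch only inspects the final token of the name, which keeps it short but misses multi-word terms and prefixes; it is meant to show the shape of the lookup, not to replace the library.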
