{"id":24796,"library":"urduhack","title":"UrduHack","description":"A Natural Language Processing (NLP) library for the Urdu language, providing tokenization, normalization, lemmatization, and POS tagging. The current version is 1.1.1, released in 2020. The project appears to be in maintenance mode, with the last commit in 2021.","status":"maintenance","version":"1.1.1","language":"python","source_language":"en","source_url":"https://github.com/urduhack/urduhack","tags":["nlp","urdu","natural-language-processing"],"install":[{"cmd":"pip install urduhack","lang":"bash","label":"Standard install"}],"dependencies":[],"imports":[{"note":"The library is imported as a top-level module; sub-modules like `urduhack.tokenization` are accessed directly.","wrong":"from urduhack import urduhack","symbol":"urduhack","correct":"import urduhack"}],"quickstart":{"code":"import urduhack\nfrom urduhack.tokenization import word_tokenizer\n# Download the models the tokenizer needs (first run only)\nurduhack.download()\n# Example: tokenize a sentence into words\ntext = \"اردو زبان پاکستان کی قومی زبان ہے۔\"\ntokens = word_tokenizer(text)\nprint(tokens)","lang":"python","description":"Basic usage: download the models, then call `word_tokenizer` from `urduhack.tokenization` on Urdu text."},"warnings":[{"fix":"Treat the output as a plain list of strings.","message":"`urduhack.tokenization.word_tokenizer` returns a list of token strings, not spaCy-like objects. Do not expect `.text` attributes.","severity":"gotcha","affected_versions":"all"},{"fix":"Consider alternatives such as `urdu-words` or `hazm` (which targets Persian), or fork the library to update its dependencies.","message":"The library has not been updated since 2021 and may not work with Python 3.10+. Dependencies such as TensorFlow may be outdated.","severity":"deprecated","affected_versions":">= 1.0.0"},{"fix":"Run `urduhack.download()` or let the functions auto-download.","message":"urduhack.normalize and urduhack.lemmatize require downloading models on first run. An internet connection is needed.","severity":"gotcha","affected_versions":"all"}],"env_vars":null,"last_verified":"2026-05-01T00:00:00.000Z","next_check":"2026-07-30T00:00:00.000Z","problems":[{"fix":"Run `pip install tensorflow` (or `tensorflow-cpu`) before using urduhack.","cause":"urduhack depends on TensorFlow, which is not installed automatically in some environments.","error":"ModuleNotFoundError: No module named 'tensorflow'"},{"fix":"Import the tokenizer from its sub-module: `from urduhack.tokenization import word_tokenizer` (or `sentence_tokenizer`).","cause":"The top-level `urduhack` module does not expose a `tokenize` function; the tokenizers live in `urduhack.tokenization`.","error":"AttributeError: module 'urduhack' has no attribute 'tokenize'"}],"ecosystem":"pypi","meta_description":null,"install_score":null,"install_tag":null,"quickstart_score":null,"quickstart_tag":null}