Unicode Slugify

Unicode Slugify is a slugifier that generates Unicode slugs. It was originally used on the Firefox Add-ons website to generate slugs for add-ons and add-on collections. Many of these add-ons and collections had Unicode characters and required more than simple transliteration.

Usage

from slugify import slugify, SLUG_OK

# Default usage: lowercase, spaces replaced with "-", only alphanumerics and "-_~" kept, Unicode preserved
slugify(u'Bän...g (bang)')
# u'bäng-bang'

# Keep capital letters and spaces
slugify(u'Bän...g (bang)', lower=False, spaces=True)
# u'Bäng bang'

# Replace non-ASCII characters with their "best" ASCII representation
slugify(u'北京 (capital of China)', only_ascii=True)
# u'bei-jing-capital-of-china'

# Allow some extra characters
slugify(u'北京 (capital of China)', ok=SLUG_OK+'()', only_ascii=True)
# u'bei-jing-(capital-of-china)'

# "snake_case" example
def snake_case(s):
    # "-" is not among the allowed characters, so the first allowed one ("_") is used to replace spaces
    return slugify(s, ok='_', only_ascii=True)
snake_case(u'北京 (capital of china)')
# u'bei_jing_capital_of_china'

# "CamelCase" example
def camel_case(s):
    return slugify(s.title(), ok='', only_ascii=True, lower=False)
camel_case(u'北京 (capital of china)')
# u'BeiJingCapitalOfChina'
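
To illustrate why Unicode slugs need more than transliteration, here is a minimal sketch of the general idea using only the Python standard library. It is not this library's implementation: `sketch_slugify` is a hypothetical helper, and the choice to keep characters by Unicode general category (letters and numbers) is an assumption made for illustration. It has no `only_ascii` mode, since that relies on unidecode.

```python
import re
import unicodedata


def sketch_slugify(text, ok="-_~", lower=True):
    """Hypothetical helper: keep Unicode letters/numbers and any char in `ok`."""
    kept = []
    # NFKC normalization folds compatibility forms (e.g. full-width letters).
    for ch in unicodedata.normalize("NFKC", text):
        major = unicodedata.category(ch)[0]  # "L" letter, "N" number, "Z" separator
        if major in "LN" or ch in ok:
            kept.append(ch)
        elif major == "Z":
            kept.append(" ")
        # Everything else (punctuation, symbols) is dropped.
    # Collapse runs of whitespace and hyphens into a single "-".
    slug = re.sub(r"[-\s]+", "-", "".join(kept).strip())
    return slug.lower() if lower else slug


print(sketch_slugify("Bän...g (bang)"))               # bäng-bang
print(sketch_slugify("北京 (capital of China)"))        # 北京-capital-of-china
print(sketch_slugify("Bän...g (bang)", lower=False))  # Bäng-bang
```

Filtering by category rather than by an ASCII whitelist is what lets non-Latin text such as 北京 survive in the slug unchanged.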

Thanks

Tomaz Solc, for unidecode: https://pypi.python.org/pypi/Unidecode

Owner

Mozilla