Normalizes files or standard input using a Unicode normalization form.
Project description
Usage:
unormalize [-f FORM] [-i EXT] [files...]
nfc [-i EXTENSION] [files...]
nfd [-i EXTENSION] [files...]
nfkc [-i EXTENSION] [files...]
nfkd [-i EXTENSION] [files...]
Options
-i EXTENSION -- Modify files inplace, saving back-ups with EXTENSION
-f FORM-- normalization form
Home-page: https://github.com/eddieantonio/unormalize
Author: Eddie Antonio Santos
Author-email: easantos@ualberta.ca
License: MIT
Download-URL: https://github.com/eddieantonio/unormalize
Description: **************************************************
unormalize - Filters that do Unicode normalization
**************************************************
Converts UTF-8 input to the desired UTF-8 in Unicode normalization form.
Read about the `Unicode Normalization Forms`_!
=====
Usage
=====
There are five executables included, that all have the exact same usage and
arguments:
- unormalize
- nfc
- nfd
- nfkc
- nfkd
You may either redirect or pipe input into `unormalize` (and its buddies), or
provide filenames as arguments.
-------
Options
-------
``-f FORM``/``--form=FORM``
Selects the normalization form: one of NFC, NFD, NFKC, or NFKD. The
equivalently named executables imply their respective normalization form;
``unormalize`` is equivilent to ``nfk`` without the ``--form`` arugment.
``-i EXTENSION``/``--in-place EXTENSION``
Filenames **must** be specified as arguments. If so, this opens them, and
converts them into the desired normalization form, in place. ``EXTENSION`` is
the extension given to back-ups of the original files.
========
Examples
========
Convert clipboard contents to NFC (OS X)::
$ pbpaste | nfc | pbcopy
Convert a file, in-place, to NFKD::
$ nfkd --in-place=.bak file.txt && rm file.txt.bak
Convert circled, variants, and half-widths to their compatible forms::
$ echo 'ℍ①カ' | nfkc
H1カ
=======
License
=======
© 2015 Eddie Antonio Santos. MIT Licensed.
.. _`Unicode Normalization Forms`: http://unicode.org/reports/tr15/
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Environment :: Console
Classifier: License :: OSI Approved :: MIT License
Classifier: Topic :: Text Processing
Classifier: Topic :: Utilities
unormalize [-f FORM] [-i EXT] [files...]
nfc [-i EXTENSION] [files...]
nfd [-i EXTENSION] [files...]
nfkc [-i EXTENSION] [files...]
nfkd [-i EXTENSION] [files...]
Options
-i EXTENSION -- Modify files inplace, saving back-ups with EXTENSION
-f FORM-- normalization form
Home-page: https://github.com/eddieantonio/unormalize
Author: Eddie Antonio Santos
Author-email: easantos@ualberta.ca
License: MIT
Download-URL: https://github.com/eddieantonio/unormalize
Description: **************************************************
unormalize - Filters that do Unicode normalization
**************************************************
Converts UTF-8 input to the desired UTF-8 in Unicode normalization form.
Read about the `Unicode Normalization Forms`_!
=====
Usage
=====
There are five executables included, that all have the exact same usage and
arguments:
- unormalize
- nfc
- nfd
- nfkc
- nfkd
You may either redirect or pipe input into `unormalize` (and its buddies), or
provide filenames as arguments.
-------
Options
-------
``-f FORM``/``--form=FORM``
Selects the normalization form: one of NFC, NFD, NFKC, or NFKD. The
equivalently named executables imply their respective normalization form;
``unormalize`` is equivilent to ``nfk`` without the ``--form`` arugment.
``-i EXTENSION``/``--in-place EXTENSION``
Filenames **must** be specified as arguments. If so, this opens them, and
converts them into the desired normalization form, in place. ``EXTENSION`` is
the extension given to back-ups of the original files.
========
Examples
========
Convert clipboard contents to NFC (OS X)::
$ pbpaste | nfc | pbcopy
Convert a file, in-place, to NFKD::
$ nfkd --in-place=.bak file.txt && rm file.txt.bak
Convert circled, variants, and half-widths to their compatible forms::
$ echo 'ℍ①カ' | nfkc
H1カ
=======
License
=======
© 2015 Eddie Antonio Santos. MIT Licensed.
.. _`Unicode Normalization Forms`: http://unicode.org/reports/tr15/
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Environment :: Console
Classifier: License :: OSI Approved :: MIT License
Classifier: Topic :: Text Processing
Classifier: Topic :: Utilities
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
unormalize-0.2.0.tar.gz
(4.6 kB
view hashes)
Built Distribution
Close
Hashes for unormalize-0.2.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f07b1e6154dbc631e5e9fcacf7f92a283458cfd58821ba310c1939b6c39781a7 |
|
MD5 | 66992085f2a9973896b5cb0a3db54500 |
|
BLAKE2b-256 | edc67fb244fe3d228c200691c0cebcd7a0090dae1b5f5353a6cf9478b00d26d0 |