utf8trans

NAME
SYNOPSIS
DESCRIPTION
OPTIONS
USAGE
LIMITATIONS
AUTHOR

NAME

utf8trans - Transliterate UTF-8 characters according to a table

SYNOPSIS

utf8trans charmap [file]...

DESCRIPTION

utf8trans transliterates characters in the specified files (or standard input, if none are specified) and writes the output to standard output. All input and output is in the UTF-8 encoding.

This program is usually used to render characters in Unicode text files as some markup escapes or ASCII transliterations. (It is not intended for general charset conversions.) It provides functionality similar to the character maps in XSLT 2.0 (XML Stylesheet Language Transformations, version 2.0).

OPTIONS

-m, --modify

Modifies the given files in-place with their transliterated output, instead of sending it to standard output. This option is useful for efficient transliteration of many files at once.

--help

Show brief usage information and exit.

--version

Show version and exit.

USAGE

The translation is done according to the rules in the character map, which is read from the file charmap. The file has the following format:

Each line represents a translation entry, except for blank lines and comment lines, which are ignored.

Any amount of whitespace (space or tab) may precede the start of an entry.

Comment lines begin with #. Everything on the same line is ignored.

Each entry consists of the Unicode codepoint of the character to translate, in hexadecimal, followed by one space or tab, followed by the translation string, up to the end of the line.

The translation string is taken literally, including any leading and trailing spaces (except the delimiter between the codepoint and the translation string), and all types of characters. The newline at the end is not included.
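The rules above can be sketched in a few lines of Python. This is only an illustration of the format as described here, not the actual utf8trans implementation: the function names parse_charmap and transliterate, the sample map, and the treatment of an entry with no delimiter (mapped to an empty string) are all assumptions made for the example.

```python
def parse_charmap(text):
    """Parse charmap text into a dict of character -> translation string.

    Illustrative sketch only; names and edge-case handling are assumptions.
    """
    table = {}
    for line in text.split("\n"):
        # Whitespace (space or tab) before the start of an entry is skipped.
        entry = line.lstrip(" \t")
        # Blank lines and comment lines are ignored.
        if not entry or entry.startswith("#"):
            continue
        # The hexadecimal codepoint runs up to the first space or tab.
        i = 0
        while i < len(entry) and entry[i] not in " \t":
            i += 1
        codepoint = int(entry[:i], 16)
        # Exactly one delimiter character is dropped; the rest of the line
        # is taken literally, including leading and trailing spaces.
        # Assumption: an entry without a delimiter maps to an empty string.
        table[chr(codepoint)] = entry[i + 1:]
    return table


def transliterate(text, table):
    """Replace every character that has an entry; pass others through."""
    return "".join(table.get(ch, ch) for ch in text)


# Hypothetical sample map, written as escaped strings so whitespace is visible.
SAMPLE_CHARMAP = (
    "# hypothetical sample map\n"
    "\n"
    "  A9 (c)\n"   # leading whitespace before the entry
    "2014\t--\n"   # tab used as the delimiter
    "2026 ...\n"
)

if __name__ == "__main__":
    table = parse_charmap(SAMPLE_CHARMAP)
    print(transliterate("\u00a9 2024 \u2014 etc\u2026", table))
    # prints "(c) 2024 -- etc..."
```

Because only the first delimiter character is removed, an entry such as "A0", two spaces, "x", one space yields the three-character translation " x ".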

The above format is intended to be restrictive, to keep utf8trans simple. If an XML-based format is desired, the xmlcharmap2utf8trans script that comes with the docbook2X distribution converts character maps in XSLT 2.0 format to the utf8trans format.

LIMITATIONS

utf8trans does not work with binary files, because malformed UTF-8 sequences in the input are replaced with U+FFFD characters. This limitation may be removed in the future. Null characters in the input, however, are handled correctly.
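The effect of this substitution can be illustrated with Python's own UTF-8 decoder in "replace" mode. This is an analogy, not utf8trans itself, and the exact number of U+FFFD characters utf8trans emits for a given malformed sequence is not stated here. The byte string is a made-up example.

```python
# Hypothetical input: valid UTF-8 text, one invalid byte (0xFF), and a null byte.
raw = b"caf\xc3\xa9 \xff\x00 end"

# Decoding with errors="replace" substitutes U+FFFD for the malformed byte,
# while the null byte is a valid character and passes through unchanged.
decoded = raw.decode("utf-8", errors="replace")

# Re-encoding does not restore the original bytes, which is why binary
# data cannot safely be passed through this kind of processing.
round_trip = decoded.encode("utf-8")

if __name__ == "__main__":
    print("\ufffd" in decoded)   # True
    print("\x00" in decoded)     # True
    print(round_trip == raw)     # False
```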

There is no way to include a newline or null in the substitution string.

AUTHOR

Steve Cheng <stevecheng@users.sourceforge.net>.
