NAME
konwert - interface for various character encoding conversions
SYNOPSIS
konwert FILTER [FILE]... [-o DEST | -O]
DESCRIPTION
Konwert filters multiple files through one or more filters. It filters the specified FILEs, or stdin if none are given.
A simple FILTER is the name of an executable file in the directory ~/.konwert/filters or in the system-wide one, normally /usr/share/konwert/filters. Such a program itself filters stdin to stdout.
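
The lookup of a simple filter can be pictured with the short sketch below. It is not konwert's actual code. The function name find_filter and the KONWERT_USER_DIR and KONWERT_SYS_DIR variables are hypothetical, and searching the per-user directory before the system-wide one is an assumption that the text above does not confirm.

```shell
# Hypothetical helper: resolve a simple filter name to an executable path.
# Assumption: the per-user directory is searched before the system-wide one.
# KONWERT_USER_DIR and KONWERT_SYS_DIR are illustrative overrides only;
# konwert itself is not known to read them.
find_filter() {
    name=$1
    user_dir=${KONWERT_USER_DIR:-"$HOME/.konwert/filters"}
    sys_dir=${KONWERT_SYS_DIR:-/usr/share/konwert/filters}
    for dir in "$user_dir" "$sys_dir"; do
        # A filter must be a regular file that is executable.
        if [ -f "$dir/$name" ] && [ -x "$dir/$name" ]; then
            printf '%s\n' "$dir/$name"
            return 0
        fi
    done
    return 1    # no such filter in either directory
}
```

Calling find_filter with a filter name prints the path of the first match and returns non-zero when neither directory has an executable of that name.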
The filtering rule can be more complex:
konwert FILTER1+FILTER2 means konwert FILTER1 | konwert FILTER2.
konwert FORMAT1-FORMAT2, if no such filter exists, tries to find an intermediate FORMAT3 such that both filters FORMAT1-FORMAT3 and FORMAT3-FORMAT2 exist.
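
The search for an intermediate format can be sketched as follows. This only illustrates the idea and is not konwert's implementation. It assumes that conversion filters are stored as files named SRC-DST in a single directory and that format names contain no hyphen. The function name find_pivot is hypothetical.

```shell
# Hypothetical helper: find an intermediate format between SRC and DST.
# Assumptions: filters are files named SRC-DST inside one directory, and
# format names themselves contain no '-'.
find_pivot() {
    dir=$1
    src=$2
    dst=$3
    for path in "$dir/$src"-*; do
        [ -e "$path" ] || continue    # the glob matched nothing
        mid=${path##*/}               # drop the directory part: "src-mid"
        mid=${mid#"$src"-}            # drop the "src-" prefix: "mid"
        # Accept the candidate only if the second half of the chain exists.
        if [ -e "$dir/$mid-$dst" ]; then
            printf '%s\n' "$mid"
            return 0
        fi
    done
    return 1    # no intermediate format connects src to dst
}
```

When a pivot is found, the conversion presumably corresponds to chaining the two filters with the + form described earlier.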
konwert FILTER/ARG/... passes arguments to the filter. Arguments can also be specified here: FORMAT1/ARGS-FORMAT2. The meaning of arguments depends on the particular filter.
konwert '(COMMAND ARGS...)' executes this arbitrary shell command. This is useful with the -o or -O options. The command cannot contain the string )+, which will terminate this filter's specification.
OPTIONS
-o DEST
output goes to this file/directory instead of stdout
-O
every input file is replaced with its translation
--help
display help and exit
--version
output version information and exit
Redirecting output to one of the source files with either -o or > instead of -O will corrupt it! Option -O creates a temporary file in /tmp and later copies it back onto the source.
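
The difference between the two approaches can be seen without konwert itself. The sketch below uses tr as a stand-in filter. The function names demo_filter and replace_in_place are hypothetical, and the temporary-file handling is only an assumed approximation of what -O does.

```shell
# Stand-in for a filter: upper-cases stdin (konwert itself is not invoked).
demo_filter() { tr 'a-z' 'A-Z'; }

# Sketch of in-place replacement: filter into a temporary file, then copy
# the result back onto the source.
replace_in_place() {
    src=$1
    tmpfile=$(mktemp "${TMPDIR:-/tmp}/inplace.XXXXXX") || return 1
    if demo_filter < "$src" > "$tmpfile"; then
        cat "$tmpfile" > "$src"    # copy back only after filtering succeeded
        status=0
    else
        status=1                   # leave the source untouched on failure
    fi
    rm -f "$tmpfile"
    return "$status"
}

# By contrast, `demo_filter < file > file` lets the shell truncate the file
# for writing before the filter has read anything from it.
```

Because the shell sets up the output redirection before the filter starts, redirecting onto the source leaves the filter with an already empty input, which is why the temporary file is needed.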
CHARACTER ENCODING CONVERSIONS
You can convert text between any two charsets, for example konwert cp437-iso2.
Characters unavailable in the target charset are replaced with approximations built from the available ones. An approximation need not be a single character.
The following character sets are currently supported:
utf8 = unicode
Unicode UTF-8
iso1 = isolatin1
ISO-8859-1 aka ISO Latin 1 (Western European)
iso2 = isolatin2
ISO-8859-2 aka ISO Latin 2 (Central European)
iso3 = isolatin3
ISO-8859-3 aka ISO Latin 3 (Esperanto)
iso4 = isolatin4
ISO-8859-4 aka ISO Latin 4 (Baltic)
iso5 = isolatincyr
ISO-8859-5 (Cyrillic)
iso6 = isolatinarabic
ISO-8859-6 (Arabic)
iso7 = isolatingreek
ISO-8859-7 (Greek)
iso8 = isolatinhebrew
ISO-8859-8 (Hebrew)
iso9 = isolatin5 = isolatintur
ISO-8859-9 aka ISO Latin 5 (Turkish)
iso10 = isolatin6 = isolatinnordic
ISO-8859-10 aka ISO Latin 6 (Nordic)
iso12 = isolatin7 = isolatinceltic
ISO-8859-12 aka ISO Latin 7 (Celtic) - Draft
iso13 = isolatin8 = isolatinbaltic
ISO-8859-13 aka ISO Latin 8 (Baltic) - Draft
iso14 = isolatin9 = isolatinsami
ISO-8859-14 aka ISO Latin 9 (S