Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

nkf(3) [suse man page]

NKF(3)							User Contributed Perl Documentation						    NKF(3)

NAME
NKF - Perl extension for Network Kanji Filter SYNOPSIS
use NKF; $output = nkf("-s",$input); DESCRIPTION
This is a Perl Extension version of nkf (Netowrk Kanji Filter ) 1.7. It converts the last argument and return converted result. Conversion details are specified by flags before the last argument. Flags: "b,u Output is bufferred (DEFAULT),Output is unbufferred" "j,s,e Outout code is JIS 7 bit (DEFAULT), Shift JIS, AT&T JIS (EUC)" "J,S,E Input assumption is JIS 7 bit , Shift JIS, AT&T JIS (EUC)" "t no conversion" "i_ Output sequence to designate JIS-kanji (DEFAULT B)" "o_ Output sequence to designate ASCII (DEFAULT B)" "r {de/en}crypt ROT13/47" "m[BQ] MIME decode [B:base64 stream,Q:quoted stream]" "l ISO8859-1 (Latin-1) support" "f Folding: "-f60" or "-f"" "Z[0-2] Convert X0208 alphabet to ASCII 1: Kankaku to space,2: 2 spaces" "X,x Assume X0201 kana in MS-Kanji, "-x" preserves X0201" "B[0-2] Broken input 0: missing ESC,1: any X on ESC-[($]-X,2: ASCII on NL" "d,c Delete in line feed, Add in line feed" "m0 No MIME decode." "M MIME encode. Header style. All ASCII code and control characters are intact." "MB MIME encode. Base64 stream. Kanji conversion is performed before encoding, so this cannot be used as a picture encoder." "l Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP." "L[wmu] new line mode" " -Lu unix (LF) " " -Lw windows (CRLF) " " -Lm mac (CR) " " --fj,--unix,--mac,--msdos, --windows convert for these system" " --jis,--euc,--sjis,--mime,--base64 convert for named code" " --jis-input,--euc-input,--sjis-input,--mime-input,--base64-input assume input system" " -- ignore rest of -option" " --help" " --version" AUTHOR
Network Kanji Filter Version 1.9 (2/0002/Shinji Kono) Copyright (C) 1987, FUJITSU LTD. (I.Ichikawa),1998 S. Kono, COW SEE ALSO
perl(1). nkf(1) perl v5.12.1 2005-04-09 NKF(3)

Check Out this Related Man Page

Encode::JP(3pm) 					 Perl Programmers Reference Guide					   Encode::JP(3pm)

NAME
Encode::JP - Japanese Encodings SYNOPSIS
use Encode qw/encode decode/; $euc_jp = encode("euc-jp", $utf8); # loads Encode::JP implicitly $utf8 = decode("euc-jp", $euc_jp); # ditto ABSTRACT
This module implements Japanese charset encodings. Encodings supported are as follows. Canonical Alias Description -------------------------------------------------------------------- euc-jp /euc.*jp$/i EUC (Extended Unix Character) /jp.*euc/i /ujis$/i shiftjis /shift.*jis$/i Shift JIS (aka MS Kanji) /sjis$/i 7bit-jis /jis$/i 7bit JIS iso-2022-jp ISO-2022-JP [RFC1468] = 7bit JIS with all Halfwidth Kana converted to Fullwidth iso-2022-jp-1 ISO-2022-JP-1 [RFC2237] = ISO-2022-JP with JIS X 0212-1990 support. See below MacJapanese Shift JIS + Apple vendor mappings cp932 /windows-31j$/i Code Page 932 = Shift JIS + MS/IBM vendor mappings jis0201-raw JIS0201, raw format jis0208-raw JIS0201, raw format jis0212-raw JIS0201, raw format -------------------------------------------------------------------- DESCRIPTION
To find out how to use this module in detail, see Encode. Note on ISO-2022-JP(-1)? ISO-2022-JP-1 (RFC2237) is a superset of ISO-2022-JP (RFC1468) which adds support for JIS X 0212-1990. That means you can use the same code to decode to utf8 but not vice versa. $utf8 = decode('iso-2022-jp-1', $stream); and $utf8 = decode('iso-2022-jp', $stream); yield the same result but $with_0212 = encode('iso-2022-jp-1', $utf8); is now different from $without_0212 = encode('iso-2022-jp', $utf8 ); In the latter case, characters that map to 0212 are first converted to U+3013 (0xA2AE in EUC-JP; a white square also known as 'Tofu' or 'geta mark') then fed to the decoding engine. U+FFFD is not used, in order to preserve text layout as much as possible. BUGS
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium. SEE ALSO
Encode perl v5.18.2 2013-11-04 Encode::JP(3pm)
Man Page