NKF(3) User Contributed Perl Documentation NKF(3)
NAME
NKF - Perl extension for Network Kanji Filter
SYNOPSIS
use NKF;
$output = nkf("-s",$input);
DESCRIPTION
This is a Perl Extension version of nkf (Netowrk Kanji Filter ) 1.7. It converts the last argument and return converted result. Conversion
details are specified by flags before the last argument.
Flags:
"b,u Output is bufferred (DEFAULT),Output is unbufferred"
"j,s,e Outout code is JIS 7 bit (DEFAULT), Shift JIS, AT&T JIS (EUC)"
"J,S,E Input assumption is JIS 7 bit , Shift JIS, AT&T JIS (EUC)"
"t no conversion"
"i_ Output sequence to designate JIS-kanji (DEFAULT B)"
"o_ Output sequence to designate ASCII (DEFAULT B)"
"r {de/en}crypt ROT13/47"
"m[BQ] MIME decode [B:base64 stream,Q:quoted stream]"
"l ISO8859-1 (Latin-1) support"
"f Folding: "-f60" or "-f""
"Z[0-2] Convert X0208 alphabet to ASCII 1: Kankaku to space,2: 2 spaces"
"X,x Assume X0201 kana in MS-Kanji, "-x" preserves X0201"
"B[0-2] Broken input 0: missing ESC,1: any X on ESC-[($]-X,2: ASCII on NL"
"d,c Delete
in line feed, Add
in line feed"
AUTHOR
Network Kanji Filter Version 1.7 (1/9811/Shinji Kono) Copyright (C) 1987, FUJITSU LTD. (I.Ichikawa),1997 S. Kono, COW
SEE ALSO
perl(1). nkf(1)
perl v5.8.0 1998-11-16 NKF(3)
Check Out this Related Man Page
NKF(2L) NKF(2L)
NAME
nkf - Network Kanji code conversion Filter v2.0.5
SYNOPSIS
nkf [ options ] [ file ]
DESCRIPTION
Nkf is a yet another kanji code converter among networks, hosts and terminals. It converts input kanji code to designated kanji code such
as 7-bit JIS, MS-kanji (shifted-JIS), utf-8 or EUC.
One of the most unique facicility of nkf is the guess of the input kanji code. It currently recognizes 7-bit JIS, MS-kanji (shifted-
JIS),utf-8 and EUC. So users needn't the input kanji code specification.
By default X0201 kana is converted into X0208 kana. For X0201 kana, SO/SI, SSO and ESC-(-I methods are supported. For automatic code detec-
tion, nkf assumes no X0201 kana in MS-Kanji. To accept X0201 in MS-Kanji, use -X, -x or -S.
Options are bellow:
-j output 7-bit JIS code. This is a default.
-s output MS-kanji (shifted-JIS) code.
-e output EUC (AT&T) code.
-w output UTF-8 (Unicode 8bit form).
-S Assume MS-Kanji and X0201 kana input. It also accpet JIS. AT&T EUC is recognized as X0201 kana. Without -x flag, X0201 kana is con-
verted into X0208.
-J Assume JIS input. It also accepts Japanese EUC. This is the default. This flag does not excludde MS-Kanji.
-E Assume AT&T EUC input. It also accpet JIS. Same as -J.
-B Assume broken JIS-Kanji, which lost ESC. Usefull when your site is using old B-News Nihongo patch. -B1 allows any char after ESC-(
or ESC-$. -B2 forces ASCII after NL.
-W Assume UTF-8 input.
-m MIME ISO-2022-JP/ISO8859-1 decode. (default) To see ISO8859-1 (Latin-1) -l is necessary. -mN does loose encoding. It allows line
break in the middle of the basr64 encoding.
-mB Decode MIME base64 encoded stream. Remove header or other part before conversion.
-mQ Decode MIME quoted stream. '_' in quoted is converted to space.
-m0 No MIME decode.
-M MIME encode. Header style. All ASCII code and control characters are intact.
-MB MIME encode. Base64 stream. Kanji conversion is performed before encoding, so this cannot be used as a picture encoder. perfome
quoted encoding.
-l Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP. -s, -e and -x are not compatible with this option.
-fn Folding on n length in a line. Default 60. -f40-0 forces 0 margin folding.
-X Allow X0201 kana in MS-Kanji. X0201 is converted into X0208 by default. This is default in MSDOS.
-x Try to preseve X0208 kana. Assume X0201 kana in MS-Kanji. And do not convert X0201 kana to X0208. In JIS output, ESC-(-I is used.
In EUC output, SSO is used.
-Z Convert X0208 alphabet to ASCII. -Z1 converts X0208 kankaku to one ASCII space. -Z2 converts X0208 kankaku to two ASCII spaces.
-Z3 Replacing "><&" into '>', '<', '"', '&' as in HTML.
-I Replacing Non iso-2022-jp char into the strage geta character.
-b bufferd output. This is a default.
-u unbufferd output.
-t no operations.
-O Output result to file. The first name in arguments becomes output. Please be careful. If there are no file arguments, nkf.out is
chosen. --overwrite does rewriting. Original listed files are replaced by filtered result.
-ic output ESC-$-c as sequence to designate JIS-kanji (Default is B.)
-oc output ESC-(-c as sequence to designate single-byte roman characters (Default is B.)
-r {de/en}crypt ROT13/47
-v display Version
-T Text mode output (MS-DOS)
-c add CR after NL.
-d delete CR after NL.
-L[wmu] new line mode
-Lu unix (LF)
-Lw windows (CRLF)
-Lm mac (CR)
default no conversion
-F
New line preserving line folding.
hiragana/katakana translation
-h1 --katakana
-h2 --hirakana
-h3 --hirakana-katakana
-- long options
--fj,--unix,--mac,--msdos, --windows
convert for these system
--jis,--euc,--sjis,--mime,--base64
convert for named code
--jis-input,--euc-input,--sjis-input,--mime-input,--base64-input
assume input system
-- ignore rest of -option
--help
--version
FILES
nkf - binary
AUTHOR
Itaru Ichikawa <ichikawa@flab.fujitsu.co.jp>, (was ichikawa@fujitsu.JUNET)
EDITOR
a_kuroe@hoffman.cc.sophia.ac.jp (Akihiko Kuroe), kono@ie.u-ryukyu.ac.jp (Shinji KONO), furukawa@tcp-ip.or.jp ( Rei FURUKAWA )`
BUGS
Nkf cannot handle some input that contains mixed kanji codes. Automatic code detection becomes very weak with -x, -X and -S. MIME encod-
ing is very loose.
ACKNOWLEDGE
Thanks for those people.
nkf 1.7,1.9,2.0
Akio Furukawa, OHARA Shigeki, Hiroaki Sengoku, Ikuhiro MORITA, Junn Ohta, KAWAMURA Masao, Kazuhiko Mori, Keitaro Isokawa, Ken-ichi Hirose,
Ki-ichiro SATO, Kiwamu Aoyama, Koichi Hirayama, Mitsuru Hase, OHARA Shigeki, Rei FURUKAWA, Satoru Takabayashi, Shigeyuki Takagi, Shin
MICHIMUKO, Tsutomu Sakai, YAMASHITA Junji, Yasuyuki Sato, Yoshiaki Yanagihara, Yoshiaki Yanagihara, hat@so-net, SHIOZAKI Takehiko, Koji
Arai, Eiichiro Itani, Masayuki Hatta, and many others.
URL:
www.ie.u-ryukyu.ac.jp/~kono/nkf/
07/Feb/2004 NKF(2L)