TSLMENDIAN(1) User Contributed Perl Documentation TSLMENDIAN(1)NAME
tslmendian - change the byte-order of sunpinyin's threaded back-off language model
SYNOPSIS
tslmendian [option]
DESCRIPTION
slmendian converts the binary language model files used by SunPinyin from big-endian to small-endian or vice versa.
OPTIONS -e endian
Specify the output-lm-file's endian-ness. It can be le or ge. If not given, tslmendian uses the host's endian-ness.
-v Prints out the endian-ness of input-lm-file.
-i input-lm-file
Identify the input file of convert. Generally, this file is generated by slmthread or tslmpack.
-o out-lm-file
Identify the output file of convert.
NOTES
The converted output file is equivalent to the input. But if you compare the output of tslminfo of both files, the ARPA file generated by
tslminfo may different. This is due to the different precision in different machines.
And due to uninitialized padding data in data structure, the checksum of the original data file may different from the converted-back file,
even though they are identical from SunPinyin's point of view.
AUTHOR
Originally written by Kov. Chai <tchaikov.gmail.com>. Currently maintained by Kov.Chai <tchaikov@gmail.com>.
SEE ALSO slmthread(1). tslminfo, tslmpack.
perl v5.14.2 2012-06-09 TSLMENDIAN(1)
Check Out this Related Man Page
XD(1) General Commands Manual XD(1)NAME
xd - hex, octal, decimal, or ASCII dump
SYNOPSIS
xd [ option ... ] [ -format ... ] [ file ... ]
DESCRIPTION
Xd concatenates and dumps the files (standard input by default) in one or more formats. Groups of 16 bytes are printed in each of the
named formats, one format per line. Each line of output is prefixed by its address (byte offset) in the input file. The first line of
output for each group is zero-padded; subsequent are blank-padded.
Formats other than -c are specified by pairs of characters telling size and style, by default. The sizes are
1 or b 1-byte units.
2 or w 2-byte big-endian units.
4 or l 4-byte big-endian units.
8 or v 8-byte big-endian units.
The styles are
o
Octal.
x
Hexadecimal.
d
Decimal.
Other options are
-c Format as 1x but print ASCII representations or C escape sequences where possible.
-astyle Print file addresses in the given style (and size 4).
-u (Unbuffered) Flush the output buffer after each 16-byte sequence.
-s Reverse (swab) the order of bytes in each group of 4 before printing.
-r Print repeating groups of identical 16-byte sequences as the first group followed by an asterisk.
SOURCE
/sys/src/cmd/xd.c
SEE ALSO db(1)BUGS
The various output formats don't line up properly in the output of xd.
XD(1)
Hi,
I haven't done this for awhile, and further, I've never done it in perl so I appreciate any help you can give me.
I have a file of lines, each with 5 data points that look like this:
AB,N,ALLIANCEBERNSTEIN HLDNG L.P,AB,N
ALD,N,ALLIED CAPITAL CORPORATION,ALD,N
AFC,N,ALLIED CAPITAL... (4 Replies)
Dear Users,
How do I account for endian-ness while reading a Binary File generated from Solaris 5.8 (Sparc) on a Windows XP (x86) system. I just know that its a Binary File. I have no knowledge about the native datatype of the Binary File. (20 Replies)
Hi ,
I have a memory file like this with two columns:
@C010 AA
@C011 AA
@C012 FE
@C013 FF
@C014 F7
@C015 FF
first is memory add, second is the data.
I wan to convert into a serial sequence starting from '00000' all the way to 'FFFFF' with those fields from the above file... (15 Replies)