04-08-2013
Thanks Yoda. I've tried both options but did not get the expected result.
When I use the 1st option within a function, it shows the error "typeset bad option". When I use the 2nd option with nawk, it works, but I get the same list of characters in the output, not the distinct characters.
Are there any other options?
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I was wondering if somebody could help me with something on UNIX. I have a file that looks like this -
"nelson,bill","bill","123 Main St","Mpls","MN",55444,8877,william
I want to replace all commas with pipes (|), except if the comma is within double quotes. (The first field is an example of... (8 Replies)
Discussion started by: nelson553011
2. UNIX for Advanced & Expert Users
Hey,
Is there anyway I anks,
Pocha (12 Replies)
Discussion started by: pochaman
3. Shell Programming and Scripting
Hi,
I have a fixed-width flat file with the following data:
12345aaaaaaaaaabbbbbbbbbb
12365sssssssssscccccccccc
12365sssss
12367ddddddddddvvvvvvvvvv
12367 vvvvv
Here the first column has length 5, the second has length 10, and the third has length 10.
if the second or third column exceeds... (3 Replies)
Discussion started by: Brado
4. AIX
Hello everybody,
I am looking for a shell script to cut a flat file (with a single long line) according to a certain number of characters and redirect every result to an output file.
Here is an example:
MyFile :
12 3 456 12 3 456 12 3 456 .....
and i took every 9-characters including BLANKS... (6 Replies)
Discussion started by: fastlane3000
5. Shell Programming and Scripting
Hi,
I have a similar problem.
Can anyone please help me with a shell script or a Perl script?
I have a flat file like this:
fruit country
apple germany
apple india
banana pakistan
banana saudi
mango india
I want to get output like this:
fruit country
apple ... (7 Replies)
Discussion started by: smalya
6. Shell Programming and Scripting
Hi, I have a series of files (upwards of 500). The filename format is as follows:
CC10-1234P1999.WGS84.p190
Each of these files is in a directory named for the file, excluding the extension.
Now the last three numeric characters, in this case 999, can be anything from 001 to 999, I need to... (3 Replies)
Discussion started by: roche.j.mike
7. UNIX for Dummies Questions & Answers
Hi,
Can anyone help me, please?
I have a flat file like this:
qwer123rt ass3242ccf jjk654
kjh838ppp nhdg453ok hdkk34
I want to remove the numeric characters from the flat file.
I want output like this:
qwerrt assccf jjk
kjhppp nhdgok hdkk
help me... (4 Replies)
Discussion started by: rafimd1985
8. Shell Programming and Scripting
Hi All,
Assume I have a file test.dat with the following contents:
Unix = abc def fgt jug
111 2222 3333
Linux = gggg pppp qqq
C# = ccc ffff llll
I would like to traverse the file, get the 1st occurrence of "=" and then get the string... (22 Replies)
Discussion started by: rtagarra
9. Shell Programming and Scripting
Hi,
I'm using KSH88.
I tried the following example to get the last 6 characters from a string:
echo 'abcdefghids' | sed 's/.*\(.\{6\}\)$/\1/'
What changes do I need to make to get the first 6 characters from the string?
My desired output is abcdef
Thank you (6 Replies)
Discussion started by: smile689
10. Shell Programming and Scripting
Hi All,
I have multiple pipe-delimited CSV files in a directory. I need to find the distinct count on a column in each of those files, and I also need the total distinct count across all files.
We can't merge all the files here, as the files are huge (millions of rows). I have tried in below way for each... (9 Replies)
Discussion started by: STCET22
LEARN ABOUT OSF1
telecode
telecode(5) File Formats Manual telecode(5)
NAME
telecode - A character encoding system (codeset) for Traditional Chinese
DESCRIPTION
The Telecode codeset (called Mitac Telex in early versions of the operating system) consists of 2 character planes. Each character plane
has 8836 character positions. In plane 1, standard characters occupy positions 0001 to 8045; the remaining 791 positions are for
user-defined characters. In plane 2, standard characters occupy positions 0001 to 8489; the remaining 346 positions are for user-defined
characters. Telecode uses 2-byte values to represent characters on both planes.
Plane 1 Character Encoding
To differentiate plane 1 code from plane 2 code, the most significant bit (MSB) is set on in both bytes of a plane 1 character code. The
following formula calculates the value of a plane 1 character from its position on the plane:
1st byte = M + 161
2nd byte = N + 161 - M x 94
In this formula, N is the position of the character on the plane and M = N / 94, using integer division (the remainder is discarded).
For example, if a character is at position 2502 on plane 1, its encoding value is BBDB, which is calculated as follows:
N = 2502
M = 2502 / 94 = 26 (remainder discarded)
1st byte = 26 + 161 = 187 (hex BB)
2nd byte = 2502 + 161 - 26 x 94 = 219 (hex DB)
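The plane 1 formula can be sketched in POSIX shell arithmetic. This is only an illustration: the function name telecode_plane1 is hypothetical and is not a command supplied by the operating system. It assumes the position is passed as a plain decimal number without leading zeros, because shell arithmetic treats a leading zero as octal.

```shell
#!/bin/sh
# Hypothetical helper, not part of the operating system.
# Encodes a plane 1 position N as two bytes, printed as four hex digits.
# Pass N without leading zeros (write 45, not 0045).
telecode_plane1() {
    n=$1
    m=$((n / 94))               # M = N / 94, integer division
    b1=$((m + 161))             # 1st byte = M + 161
    b2=$((n + 161 - m * 94))    # 2nd byte = N + 161 - M x 94
    printf '%02X%02X\n' "$b1" "$b2"
}

telecode_plane1 2502    # prints BBDB, matching the example above
```

Both bytes come out at 161 or higher, so the MSB is set in each, which is what marks the code as plane 1.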
Plane 2 Character Encoding
To differentiate plane 2 code from plane 1 code, the MSB of the first byte is set on and that of the second byte is set off for each plane
2 character code. The following formula calculates the value of a plane 2 character from its position:
1st byte = M + 161
2nd byte = N + 33 - M x 94
In this formula, N is the position of the character on the plane and M = N / 94, using integer division (the remainder is discarded).
For example, if a character is at position 2502 on plane 2, its encoding value is BB5B, which is calculated as follows:
N = 2502
M = 2502 / 94 = 26 (remainder discarded)
1st byte = 26 + 161 = 187 (hex BB)
2nd byte = 2502 + 33 - 26 x 94 = 91 (hex 5B)
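As a rough sketch, the plane 2 formula and its inverse can be written in POSIX shell. The names telecode_plane2 and telecode_decode are hypothetical helpers, not system commands, and the decoder is derived here by inverting the two formulas in this reference page; it is not taken from any documented interface. The decoder assumes a well-formed four-digit hex code and tells the planes apart by the MSB of the second byte.

```shell
#!/bin/sh
# Hypothetical helpers, not part of the operating system.

# Encode a plane 2 position N (decimal, no leading zeros) as four hex digits.
telecode_plane2() {
    n=$1
    m=$((n / 94))                                   # integer division
    printf '%02X%02X\n' $((m + 161)) $((n + 33 - m * 94))
}

# Decode four hex digits into "plane position".
telecode_decode() {
    b1=$((0x${1%??}))           # first byte (first two hex digits)
    b2=$((0x${1#??}))           # second byte (last two hex digits)
    m=$((b1 - 161))             # invert: 1st byte = M + 161
    if [ "$b2" -ge 128 ]; then
        plane=1; offset=161     # MSB of second byte set: plane 1
    else
        plane=2; offset=33      # MSB of second byte clear: plane 2
    fi
    echo "$plane $((b2 - offset + m * 94))"
}

telecode_plane2 2502    # prints BB5B
telecode_decode BB5B    # prints 2 2502
telecode_decode BBDB    # prints 1 2502
```

The second byte alone identifies the plane because the remainder N - M x 94 is always in the range 0 to 93, which puts the byte in 161 to 254 for plane 1 and 33 to 126 for plane 2.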
Codeset Conversion
The following codeset converter pairs are available for converting Traditional Chinese characters between telecode and other encoding
formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which
telecode is the input or output, see the reference page specified in the list item.
big5_telecode, telecode_big5
    Converting from and to the Big-5 codeset: big5(5).
    Note that Big-5 encoding is equivalent to the Microsoft code-page format used on PCs for Traditional Chinese. You can therefore use
    these converters to convert Traditional Chinese characters between PC code page format and Telecode encoding format. For more
    information on how the operating system supports PC code pages, see code_page(5).
dechanyu_telecode, telecode_dechanyu
    Converting from and to the DEC Hanyu codeset: dechanyu(5).
eucTW_telecode, telecode_eucTW
    Converting from and to Taiwanese Extended UNIX Code: eucTW(5).
Font Support for Telecode
The operating system supports Telecode only through conversion to another codeset.
SEE ALSO
Commands: locale(1)
Others: ascii(5), big5(5), Chinese(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5), GBK(5), i18n_intro(5), i18n_printing(5),
iconv_intro(5), l10n_intro(5), sbig5(5)