Identify File with ControlM Characters Post: 302370809

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Identify type of file

hi all, i have the next question: how can i identify the type of a file? . I'm working in Unix (Solaris 5.7) and i would like identify if a file is or not is a "flat file". I need have a program what separates the flat file in a directory, and the excel file in another directory. I must get...

2. UNIX for Advanced & Expert Users

sudo comand with ControlM

I'm trying to change the user Id that the script is running under. I tried the sudo comand but the job was submitted under ControlM and it seems that controlM is not allowing the user id to change. I have included the job output below. The sudo comand was suppose to set the user id to "DWSOLAP"...

3. Shell Programming and Scripting

Identify records having junk characters in unix

Hi Friends, I need to have a command in Unix which output all teh records havingg junk characters in a file.... I know a command cat -tv <Filename> which opens the file and we can check for any junk character in it. But my requirement is to fetch ONLY THOSE records having junk characters....

4. Shell Programming and Scripting

how do I identify files with characters beyond a certain range.

I have a directory with hundreds of files that can not have data pass column 80. I do not know of way to combine "grep" and "cut" command. I tried: cat * | cut -c 81-120 |pg but it only shows me the line, not the file name. Any help would be appreciated. Been on this all...

5. Shell Programming and Scripting

Compare large file and identify difference in separate file

I have a very large system generated file containing around 500K rows size 100MB like following HOME|ALICE STREET|3||NEW LISTING HOME|NEWPORT STREET|1||NEW LISTING HOME|KING STREET|5||NEW LISTING HOME|WINSOME AVENUE|4||MODIFICATION CAR|TOYOTA|4||NEW LISTING CAR|FORD|4||NEW...

6. Shell Programming and Scripting

Identify extended ascii characters in a file

Hi, Is there a way to identify the lines in a file having extended ascii characters and display the same? For instance I have a file abc.txt having below data aaa|bbb|111|This is first line aaa|bbb|222|This is sec�nd line aaa|bbb|333|This is third line aaa|bbb|444|This is fo�rth line...

7. Shell Programming and Scripting

Unable to identify the special characters beyond the range of "[\x80-\xFF]"

I want to filter out the special character whose ascii value doesn't fall within the range "" . Example:� or Ć. So in that case is there any defined range which will filter out this characters. I can filter those which falls withing "" . Need to filter those special chracter which doesn't...

8. Shell Programming and Scripting

Identify missing file

I am on linux and I am supposed to receive 3 files. If any of the files are not received I need to identify the missing file and throw it out in a variable. I have put in something like this if ] then echo "file $file1 was found" else echo "ERROR: file $file1 was not found!!!"...

9. Shell Programming and Scripting

Regex to identify illegal characters in a perso-arabic database

I am working on Sindhi: a perso-Arabic script and since it shares the Unicode-block with over 400 other languages, quite often the database contains characters which are not wanted: illegal characters. I have identified the character set of Sindhi which is given below: For clarity's sake, each...

10. UNIX for Beginners Questions & Answers

Exit1 not to report failure in controlM

HI Team, I running below script from controlM and job is reporting as failure everyday so i tried to change the if exitstatus=1 (send only email) but not to end as a job is failed. can you let me know where i have to change this script to make the script not to fail but instead send email and...

LEARN ABOUT FREEBSD

gb18030

GB18030(5)						      BSD File Formats Manual							GB18030(5)

NAME

     gb18030 -- GB 18030 encoding method for Chinese text

SYNOPSIS

     ENCODING "GB18030"

DESCRIPTION

     The GB18030 encoding implements GB 18030-2000, a PRC national standard for the encoding of Chinese characters.  It is a superset of the older
     GB 2312-1980 and GBK encodings, and incorporates Unicode's Unihan Extension A completely.	It also provides code space for all Unicode 3.0
     code points.

     Multibyte characters in the GB18030 encoding can be one byte, two bytes, or four bytes long.  There are a total of over 1.5 million code
     positions.

     GB 11383-1981 (ASCII) characters are represented by single bytes in the range 0x00 to 0x7F.

     Chinese characters are represented as either two bytes or four bytes.  Characters that are represented by two bytes begin with a byte in the
     range 0x81-0xFE and end with a byte either in the range 0x40-0x7E or 0x80-0xFE.

     Characters that are represented by four bytes begin with a byte in the range 0x81-0xFE, have a second byte in the range 0x30-0x39, a third
     byte in the range 0x81-0xFE and a fourth byte in the range 0x30-0x39.

SEE ALSO

     euc(5), gb2312(5), gbk(5), utf8(5)

     Chinese National Standard GB 18030-2000: Information Technology -- Chinese ideograms coded character set for information interchange --
     Extension for the basic set, March 2000.

     The Unicode Standard, Version 3.0, The Unicode Consortium, 2000.

STANDARDS

     The GB18030 encoding is believed to be compatible with GB 18030-2000.

BSD
								  August 10, 2003							       BSD

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Identify type of file

Discussion started by: DebianJ

2. UNIX for Advanced & Expert Users

sudo comand with ControlM

Discussion started by: u156531

3. Shell Programming and Scripting

Identify records having junk characters in unix

Discussion started by: sureshg_sampat

4. Shell Programming and Scripting

how do I identify files with characters beyond a certain range.

Discussion started by: kcsunsun01dev