Sponsored Content
Top Forums Shell Programming and Scripting Remove all JUNK character from file. Post 302282004 by Amit.Sagpariya on Friday 30th of January 2009 02:13:07 AM
Old 01-30-2009
Remove all JUNK character from file.

Hi Team,

I have a file having size greater than 1 GB. What i want to do is to check if it contains any JUNK character (ie any special charater thats not on the key board stroke). This file has 532 column & seperated with ^~^.

I have found some solution from the file, but it is for a specific junk character not for all.

Can someone help me...
 

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Get rid of junk character in a file

I have a file with one of the following lines, when opened with vi 33560010686GPT£120600GBPGBP10082007DS In the above line, I want to get rid of the junk character before the £ (pound sysmbol). When I tried copying £ from windows and copy in unix vi, it prints as £ and I tried pattern replace... (2 Replies)
Discussion started by: nskworld
2 Replies

2. UNIX for Dummies Questions & Answers

How to remove junk (^Ò) character while FTPing

Hi All, I have been trying to FTP some data files from Windows directory to a UNIX server. The txt file in the windows contails the following data: "111~XYZ~1~Contact person’s phone number~COMMENTS~~~~" but the same line is appearing as "111~XYZ~1~Contact person^Òs phone number~COMMENTS~~~~"... (8 Replies)
Discussion started by: vkumbhakarna
8 Replies

3. Shell Programming and Scripting

Junk Character in file

Hi set filename "./GopiRun.sh" if } err] { writeLog "error in exec " writeLog $a } else { writeLog $a } The above code will execute a file GopiRun.sh,and will log the output of the exec to a file. The problem is the file has lot of junk character in it,how to avoid it. The... (2 Replies)
Discussion started by: nathgopi214
2 Replies

4. Shell Programming and Scripting

Check Junk character in sql file

Hello, I have two .sql files which I transferred from Windows to Unix (Linux Enterprise Linux Server release 5.3).I want to ensure that these two files have no junk characters in them.How do I do it in the simplest possible way? Many thanks DJ (1 Reply)
Discussion started by: Digjoy83
1 Replies

5. Shell Programming and Scripting

removing of junk character

Dear ALL, How to remove junk charecter ^M from unix file i am using sun solaris unix. I already tried few commands :%s/^M//g :%s/r//g but it didnt helped me. Any help appriciated. Thanks Ripudaman Please view this code tag video for how to use code tags when posting code... (5 Replies)
Discussion started by: ripudaman.singh
5 Replies

6. Shell Programming and Scripting

Remove all junk characters from a text file

I am using flatfile, in that flat file we are getting the junk chars 1)I21001f<82>^Me<85>!h49 Service Charge 2) I21001f‚ e...!h49 Service Charge please tell me how to remove all junk chars in unix scripts. (1 Reply)
Discussion started by: Talari
1 Replies

7. Shell Programming and Scripting

How to see junk character in file in.?

Hi I want to know how to see junk character in a file. i am not able to see junk character using vi or cat command. below is the junk char . which i see in host file 10.178.14.67▒▒▒ ac01sp02-vip actually it should be like this 10.178.14.67 ac01sp02-vip i am using secure CRT... (11 Replies)
Discussion started by: scriptor
11 Replies

8. Shell Programming and Scripting

Junk character appearing after downloading the file from windows server

Hello, Im downloading the file from windows server through FTP, the downloaded file is containing some junk character at very start of the file as below and causing my whole script is to fail, how to download without junk or how to remove these before processing it? ▒▒"nmdbfnmdsfsdf" ... (19 Replies)
Discussion started by: Riverstone
19 Replies

9. Emergency UNIX and Linux Support

Help in viewing the junk character

Hello All, I have issues in unix file when I loaded that to database and do select * from table where description like '%'+char(13)+'%' on it I am able to get records. I tried to view the file in unix it is all having blank character which I think is all non ascii which I am not able view.... (11 Replies)
Discussion started by: arunkumar_mca
11 Replies

10. Shell Programming and Scripting

How to remove newline character if it is the only character in the entire file.?

I have a file which comes every day and the file data look's as below. Vi abc.txt a|b|c|d\n a|g|h|j\n Some times we receive the file with only a new line character in the file like vi abc.txt \n (8 Replies)
Discussion started by: rak Kundra
8 Replies
Character(3m17n)						 The m17n Library						  Character(3m17n)

NAME
Character - Character objects and API for them. Defines #define MCHAR_MAX Maximum character code. Functions MSymbol mchar_define_property (const char *name, MSymbol type) Define a character property. void * mchar_get_prop (int c, MSymbol key) Get the value of a character property. int mchar_put_prop (int c, MSymbol key, void *val) Set the value of a character property. MCharTable * mchar_get_prop_table (MSymbol key, MSymbol *type) Get the char-table for a character property. Variables: Keys of character properties These symbols are used as keys of character properties. MSymbol Mscript Key for script. MSymbol Mname Key for character name. MSymbol Mcategory Key for general category. MSymbol Mcombining_class Key for canonical combining class. MSymbol Mbidi_category Key for bidi category. MSymbol Msimple_case_folding Key for corresponding single lowercase character. MSymbol Mcomplicated_case_folding Key for corresponding multiple lowercase characters. MSymbol Mcased Key for values used in case operation. MSymbol Msoft_dotted Key for values used in case operation. MSymbol Mcase_mapping Key for values used in case operation. MSymbol Mblock Key for script block name. Detailed Description Character objects and API for them. The m17n library represents a character by a character code (an integer). The minimum character code is 0. The maximum character code is defined by the macro MCHAR_MAX. It is assured that MCHAR_MAX is not smaller than 0x3FFFFF (22 bits). Characters 0 to 0x10FFFF are equivalent to the Unicode characters of the same code values. A character can have zero or more properties called character properties. A character property consists of a key and a value, where key is a symbol and value is anything that can be cast to (void *). 'The character property that belongs to character C and whose key is K' may be shortened to 'the K property of C'. Define Documentation #define MCHAR_MAX Maximum character code. The macro MCHAR_MAX gives the maximum character code. Variable Documentation MSymbol Mscript Key for script. The symbol Mscript has the name 'script' and is used as the key of a character property. The value of such a property is a symbol representing the script to which the character belongs. Each symbol that represents a script has one of the names listed in the Unicode Technical Report #24. MSymbol Mname Key for character name. The symbol Mname has the name 'name' and is used as the key of a character property. The value of such a property is a C-string representing the name of the character. MSymbol Mcategory Key for general category. The symbol Mcategory has the name 'category' and is used as the key of a character property. The value of such a property is a symbol representing the general category of the character. Each symbol that represents a general category has one of the names listed as abbreviations for General Category in Unicode. MSymbol Mcombining_class Key for canonical combining class. The symbol Mcombining_class has the name 'combining-class' and is used as the key of a character property. The value of such a property is an integer that represents the canonical combining class of the character. The meaning of each integer that represents a canonical combining class is identical to the one defined in Unicode. MSymbol Mbidi_category Key for bidi category. The symbol Mbidi_category has the name 'bidi-category' and is used as the key of a character property. The value of such a property is a symbol that represents the bidirectional category of the character. Each symbol that represents a bidirectional category has one of the names listed as types of Bidirectional Category in Unicode. MSymbol Msimple_case_folding Key for corresponding single lowercase character. The symbol Msimple_case_folding has the name 'simple-case-folding' and is used as the key of a character property. The value of such a property is the corresponding single lowercase character that is used when comparing M-texts ignoring cases. If a character requires a complicated comparison (i.e. cannot be compared by simply mapping to another single character), the value of such a property is 0xFFFF. In this case, the character has another property whose key is Mcomplicated_case_folding. MSymbol Mcomplicated_case_folding Key for corresponding multiple lowercase characters. The symbol Mcomplicated_case_folding has the name 'complicated-case-folding' and is used as the key of a character property. The value of such a property is the corresponding M-text that contains a sequence of lowercase characters to be used for comparing M-texts ignoring case. MSymbol Mcased Key for values used in case operation. The symbol Mcased has the name 'cased' and is used as the key of charater property. The value of such a property is an integer value 1, 2, or 3 representing 'cased', 'case-ignorable', and both of them respective. See the Unicode Standard 5.0 (Section 3.13 Default Case Algorithm) for the detail. MSymbol Msoft_dotted Key for values used in case operation. The symbol Msoft_dotted has the name 'soft-dotted' and is used as the key of charater property. The value of such a property is Mt if a character has 'Soft_Dotted' property, and Mnil otherwise. See the Unicode Standard 5.0 (Section 3.13 Default Case Algorithm) for the detail. MSymbol Mcase_mapping Key for values used in case operation. The symbol Mcase_mapping has the name 'case-mapping' and is used as the key of charater property. The value of such a property is a plist of three M-Texts; lower, title, and upper of the corresponding character. See the Unicode Standard 5.0 (Section 5.18 Case Mappings) for the detail. MSymbol Mblock Key for script block name. The symbol Mblock the name 'block' and is used as the key of charater property. The value of such a property is a symbol representing a script block of the corresponding character. Author Generated automatically by Doxygen for The m17n Library from the source code. COPYRIGHT
Copyright (C) 2001 Information-technology Promotion Agency (IPA) Copyright (C) 2001-2011 National Institute of Advanced Industrial Science and Technology (AIST) Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License <http://www.gnu.org/licenses/fdl.html>. Version 1.6.2 12 Jan 2011 Character(3m17n)
All times are GMT -4. The time now is 06:34 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy