02-23-2011
how to grep junk characters in a file
hi guys,
I am generating a file from datastage (an etl tool).
Now the file is having some junk characters like ( Á,L´±,ñ and so on)..
I want to use the grep function to figure out all the junk characters and their location.
Can somebody help me out in finding it out.. if possible i just want to replace each of them with a ""(empty string).
Thanks for your help in advance..
10 More Discussions You Might Find Interesting
1. Solaris
Hello All,
I have a DOS file which I run a DOS 2 UNIX utility on. When run from Solaris, I can view the file perfectly. But, when run from linux, I see a bunch of junk(^@) at the beginning of every line in the file. Does anyone know the cause of this?
COMMAND TO CONVERT:
tr -d '\015\032'... (7 Replies)
Discussion started by: vada010
7 Replies
2. Shell Programming and Scripting
Can anyone tell me how to read a file in perl having junk characters . I have only one junk character which is repeated many times in the file. While i'm reading and printing the file , it is displaying till the 1st occurence of that junk character and rest of the file is not being read. (1 Reply)
Discussion started by: k_surya
1 Replies
3. Shell Programming and Scripting
Hi,
Is there anyway to find the junk characters in a file.Consider the file has data as given below:
123|abc^M|Doctor^C #record 1
234|def|Med #record 2
345|dfg^C|Wrong^V #record 3
The junk characters are highlighted and this is a pipe delimited file.
Is there anyway to... (20 Replies)
Discussion started by: ashwin3086
20 Replies
4. Shell Programming and Scripting
Hi,
I have a file with data as given below
$cat file1
123|abc|345
345|def|567
The first record is good record. The second record has an invisible junk character like \032.
I was replace all the occurences of that invisible character with #.
I want to do this for a set of... (16 Replies)
Discussion started by: ashwin3086
16 Replies
5. UNIX for Dummies Questions & Answers
Hello sir,
I have generated XML file from VS 2005. It works well in windows but it shows some junk characters in unix.
Can any help me with this problem.
Thank you in advance.
Hema (6 Replies)
Discussion started by: hemavenkatesh
6 Replies
6. Shell Programming and Scripting
Urgently ur help is needed.
Actually my req is i have an input file, that input file may have junk characters (^M, ^Z) etc...
eg:
cat file
name abc^Z addres
name2 msdmskd^Z address2
I want to validate the record and display where exactly this junk character resides.
I want to... (3 Replies)
Discussion started by: help_scr_seeker
3 Replies
7. Solaris
Hi,
I rebooted a Solaris 11 box and after that date stamp is coming in junk in almost all directories.
root@tstilp05 # ls -l
total 112
drwxrwxr-x 9 root sys 19 juin 1 03:10 adm
drwxr-xr-x 6 root sys 6 sept. 19 2012 ai
drwxr-xr-x 3 root bin ... (3 Replies)
Discussion started by: solaris_1977
3 Replies
8. Shell Programming and Scripting
I am using flatfile, in that flat file we are getting the junk chars
1)I21001f<82>^Me<85>!h49 Service Charge
2) I21001f‚
e...!h49 Service Charge
please tell me how to remove all junk chars in unix scripts. (1 Reply)
Discussion started by: Talari
1 Replies
9. Shell Programming and Scripting
Hi All
Need Help
I have a file with the below format (ABC.TXT) :
®¿¿ABCDHEJJSJJ|XCBJSKK01|M|7348974982790
HDFLJDKJSKJ|KJALKSD02|M|7378439274898
KJHSAJKHHJJ|LJDSAJKK03|F|9898982039999
(cont......)
I need to write a script where it will check for : blank lines (between rows,before... (6 Replies)
Discussion started by: chatwithsaurav
6 Replies
10. UNIX for Beginners Questions & Answers
Hi All,
I have a issue that we are getting Junk characters from source and i am not able to load that records to Database.
Line breakers
Junk Characters (Â and different every time)
Japanese Characters
Every time I am using grep command and awk -F "\007" to find them and delete that... (1 Reply)
Discussion started by: spradeep86
1 Replies
ACTIVE(5) File Formats Manual ACTIVE(5)
NAME
active, active.times - list of active Usenet newsgroups
DESCRIPTION
The file /var/lib/news/active lists the newsgroups that the local site receives. Each newsgroup should be listed only once. Each line
specifies one group; their order in the file does not matter. Within each newsgroup, articles are assigned unique names, which are mono-
tonically increasing numbers.
If an article is posted to newsgroups not mentioned in this file, those newsgroups are ignored. If no valid newsgroups are specified, the
article is filed into the newsgroup ``junk'' and only propagated to sites that receive the ``junk'' newsgroup.
Each line consists of four fields specified by a space:
name himark lomark flags
The first field is the name of the newsgroup. The second field is the highest article number that has been used in that newsgroup. The
third field is the lowest article number in the group; this number is not guaranteed to be accurate, and should only be taken to be a hint.
Note that because of article cancellations, there may be gaps in the numbering sequence. If the lowest article number is greater then the
highest article number, then there are no articles in the newsgroup. In order to make it possible to update an entry in-place without
rewriting the entire file, the second and third fields are padded out with leading zeros to make them a fixed width.
The fourth field can contain one of the following flags:
y Local postings are allowed
n No local postings are allowed, only remote ones
m The group is moderated and all postings must be approved
j Articles in this group are not kept, but only passed on
x Articles cannot be posted to this newsgroup
=foo.bar Articles are locally filed into the ``foo.bar'' group
If a newsgroup has the ``j'' flag, then no articles will be filed into that newsgroup and local postings to that group should not be gener-
ated. If an article for such a newsgroup is received from a remote site, it will be filed into the ``junk'' newsgroup if it is not cross-
posted. This is different from not having a newsgroup listed in the file because sites can subscribe to ``j'' newsgroups and the article
will be propagated to them.
If the fourth field of a newsgroup starts with an equal sign, then the newsgroup is an alias. Articles can be posted to the group, but
will be treated as if they were posted to the group named after the equal sign. The second and third fields are ignored. Note that the
Newsgroup header is not modified (Alias groups are typically used during a transition, and are typically created with ctlinnd(8)). An
alias newsgroup should not point to another alias.
The file /var/lib/news/active.times provides a chronological record of when newsgroups are created. This file is normally updated by
innd(8) whenever a ctlinnd ``newgroup'' command is done. Each line consist of three fields:
name time creator
The first field is the name of the newsgroup. The second field is the time it was created, expressed as the number of seconds since the
epoch -- i.e., a time_t; see gettimeofday(2). The third field is the electronic mail address of the person who created the group.
HISTORY
Written by Rich $alz <rsalz@uunet.uu.net> for InterNetNews. This is revision 1.13, dated 1996/10/29.
SEE ALSO
ctlinnd(8), innd(8).
ACTIVE(5)