Sed - remove special characters


 
# 1  
Old 09-13-2012

Hi,

I have a file whose first line always starts with these special characters: ´╗┐

I want to remove them.

file1
Code:
´╗┐\\bar\c$\test2\;3.348.118 Bytes;160 ;3
\\bar\c$\test\;35 Bytes;2 ;1

I want the file to contain only:
Code:
\\bar\c$\test2\;3.348.118 Bytes;160 ;3
\\bar\c$\test\;35 Bytes;2 ;1

I am not able to remove those characters.

# 2  
Old 09-13-2012
In principle not too difficult:
Code:
$ sed 's/^´╗┐//' file1

, but you should be aware that these graphics characters usually belong to a multibyte character set such as UTF-8, which may impose restrictions.
# 3  
Old 09-13-2012
Can you paste the output of head -1 file1 | od -c for us so we can see the exact byte codes you have in this file?
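For anyone following along, here is a minimal sketch of that kind of byte-level check, assuming a POSIX shell and `od`. It tests whether a file starts with the three bytes EF BB BF (octal 357 273 277), which is the UTF-8 byte order mark. The function name `has_bom` is made up for this example and is not from the thread.

```shell
# has_bom FILE: succeed if FILE starts with the bytes EF BB BF.
# (Hypothetical helper name, used only for illustration.)
has_bom() {
    # -An: no offset column, -tx1: one hex byte per field, -N3: first 3 bytes only
    [ "$(od -An -tx1 -N3 "$1" | tr -d ' \n')" = 'efbbbf' ]
}
```

It could be used as `has_bom file1 && echo "file1 starts with a BOM"`.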
# 4  
Old 09-13-2012
Or to remove multibyte characters, you could try:
Code:
LANG=C tr -d '[\200-\377]' < infile

# 5  
Old 09-14-2012
Quote:
Originally Posted by Chubler_XL
Can you paste the output of head -1 file1 | od -c for us so we can see the exact byte codes you have in this file?
This is what I got:
Code:
$ head -1 lista1.csv | od -c
0000000 357 273 277 \ \ n a k a m a \ c o m p
0000020 a r t \ ; 9 4 . 2 6 0 . 9 7 4
0000040 B y t e s ; 7 ; 1 \r \n
0000055
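
The octal bytes 357 273 277 at the start of this dump are hex EF BB BF, the UTF-8 byte order mark (BOM). The glyphs ´╗┐ are how those three bytes appear when displayed in a DOS code page such as CP850, which is probably why a sed pattern typed with the glyphs themselves does not match the bytes in the file. Below is a minimal sketch that removes only those three bytes, and only from the start of the first line, assuming a POSIX shell. The name `strip_bom` is invented for this example.

```shell
# strip_bom FILE: print FILE to stdout without a leading UTF-8 BOM.
# (Hypothetical helper name, used only for illustration.)
strip_bom() {
    # Build the three BOM bytes with printf so nothing has to be typed literally.
    bom=$(printf '\357\273\277')
    # LC_ALL=C makes sed treat the input as plain bytes; "1s" limits the edit to line 1.
    LC_ALL=C sed "1s/^$bom//" "$1"
}
```

For example, `strip_bom lista1.csv > lista1.nobom.csv`. The \r \n at the end of the dump shows the file also has CRLF line endings, which this sketch leaves untouched.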

---------- Post updated at 11:16 AM ---------- Previous update was at 11:16 AM ----------

Quote:
Originally Posted by RudiC
In principle not too difficult:
Code:
$ sed 's/^´╗┐//' file1

, but you should be aware that these graphics characters usually belong to a multibyte character set such as UTF-8, which may impose restrictions.
Thanks, but I'm afraid this didn't work.

---------- Post updated at 11:18 AM ---------- Previous update was at 11:16 AM ----------

Quote:
Originally Posted by Scrutinizer
Or to remove multibyte characters, you could try:
Code:
LANG=C tr -d '[\200-\377]' < infile

That worked perfectly.

Thank you very much.
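
One caveat about the tr approach, for later readers: it deletes every byte in the range 200-377 anywhere in the file, not just the BOM, so any legitimate non-ASCII text is lost as well. With a POSIX-style tr (GNU, BSD) the square brackets are ordinary characters, so `[` and `]` are deleted too; some older System V versions of tr need the brackets for ranges. The sketch below shows the difference on a made-up sample line. The function names are invented for this example, and it uses LC_ALL=C rather than LANG=C because LC_ALL takes precedence when both are set.

```shell
# Sample input: BOM, then text with brackets and a UTF-8 "ü" (bytes 303 274).
sample() { printf '\357\273\277[x] m\303\274ller\n'; }

# The command from this thread: removes the BOM, but with a POSIX tr it also
# removes the literal brackets and both bytes of the "ü".
strip_high_and_brackets() { LC_ALL=C tr -d '[\200-\377]'; }

# Without the brackets only bytes 200-377 are deleted (the "ü" is still lost).
strip_high() { LC_ALL=C tr -d '\200-\377'; }
```

Running `sample | strip_high_and_brackets` and `sample | strip_high` side by side shows what each variant keeps. If the file may contain other non-ASCII text, removing only the leading BOM bytes is the safer choice.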