sed non ascii value remove


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting sed non ascii value remove
# 1  
Old 01-28-2020
sed non ascii value remove

Hi All,


i am using the below perl command to remove the non ascii value,it is working fine.
we need to find the similar solution using the sed command. but i tried it is not working and getting the error.




Code:
perl -pe 's/[[:^ascii:]]//g'  test.txt


sed is not working.


Code:
sed -i 's/[^[:ascii:]]//g' test.txt


Error:
Code:
sed: -e expression #1, char 17: Invalid character class name


Last edited by Scrutinizer; 01-29-2020 at 12:46 AM..
# 2  
Old 01-28-2020
ascii is not found in the wctype character classes, nor mentioned in e.g. man regex.

What characters do you want to remove? Strange locales'? Be aware that control chars 0x00 - 0x1F (including e.g. <TAB> ) are in the ascii set as well...

Last edited by RudiC; 01-28-2020 at 10:01 PM..
# 3  
Old 01-29-2020
Classes like [:ascii:] and [:word:] are non-POSIX additional classes that can be found in perl, but are not recognised by standard sed.

Instead of [:ascii:], with GNU sed you could try a regex character range:
Code:
sed 's/[^\x00-\x7F]//g'  test.txt

To remove non-printable and non-"standard ascii" characters as RudiC suggests, you can try:
Code:
sed 's/[^\x20-\x7E]//g'  test.txt


--
Note:
I presume [[:^ascii:]] is a typo and you meant to write [^[:ascii:]]

Last edited by Scrutinizer; 01-29-2020 at 01:54 AM..
This User Gave Thanks to Scrutinizer For This Post:
# 4  
Old 01-29-2020
@Scrutinizer @RudiC thanks for quick respone.



i tried below one and getting the below error.



Code:
sed 's/[^\x00-\x7F]//g'


sed: -e expression #1, char 17: Invalid collation character
# 5  
Old 01-29-2020
What is your OS and version.
Can you show the exact command?
# 6  
Old 01-29-2020
FYI..Red Hat Enterprise Linux 7
VERSION_ID="7.7"
# 7  
Old 01-29-2020
And what is the command?
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Need to remove first and last character using sed

Hi I have file in below format. How i can remove the first and lost comma from this below file ,001E:001F,,,02EE,0FED:0FEF, I need output has below 001E:001F,,,02EE,0FED:0FEF (6 Replies)
Discussion started by: ranjancom2000
6 Replies

2. Shell Programming and Scripting

Convert Hex to Ascii in a Ascii file

Hi All, I have an ascii file in which few columns are having hex values which i need to convert into ascii. Kindly suggest me what command can be used in unix shell scripting? Thanks in Advance (2 Replies)
Discussion started by: HemaV
2 Replies

3. Shell Programming and Scripting

Grep to remove non-ASCII characters

I have been having an encoding problem that I need to solve. I have an 4-column tab-separated file: I need to remove all of the lines that contain the string 'vis-à-vis' achiever-n vis-à-vis+ns-j+vp oppose-v 1 achiever-n vis-à-vis+ns-the+vg assess-v 1 administrator-n ... (4 Replies)
Discussion started by: owwow14
4 Replies

4. Shell Programming and Scripting

Remove some special ascii character

Hello I have this special caracter after retreving rows from sql server: "....spasses: • Entrem al valort 6050108002811 • El donem..." I would like a sed command to remove it..or just know it's ascii code in order to replace it into my sql sentence.. Hope some one knows how to do that.... (7 Replies)
Discussion started by: ldiaz2106
7 Replies

5. Shell Programming and Scripting

remove pattern with sed

Hi, i want to remove a certain pattern when i type pwd. pwd will look like this: ..../....../....../Pat_logs/..../....../...../...... the dotted lines are just random directory names, i want it to remove the "Pat_logs/...../....../....../" part so for example: ... (5 Replies)
Discussion started by: a27wang
5 Replies

6. Shell Programming and Scripting

sed to remove

Hello I have a file with records...The records have several lines and have start and end born... This is a template: 000000001 LDR L ^^^^^nam^^2200325Iia^45e0 000000001 022 L $$a0081-3397 000000001 041 L $$aSPA 000000001 088 L $$aJ.E.N. 551 000000001 090 L $$aINFORMES JEN... (22 Replies)
Discussion started by: ldiaz2106
22 Replies

7. Shell Programming and Scripting

convert ascii values into ascii characters

Hi gurus, I have a file in unix with ascii values. I need to convert all the ascii values in the file to ascii characters. File contains nearly 20000 records with ascii values. (10 Replies)
Discussion started by: sandeeppvk
10 Replies

8. Shell Programming and Scripting

How to tell SED to emit output in 8-bit ASCII only?

I have to mangle some "plain ASCII" text file (i.e. 8 bits/characters where the text DOES contain characters like Umlauts and accented characters from the upper 7-bits range, i.e. with hex codes in ). For this I am trying to use SED which I downloaded as part of cygwin package (yes, I am doing... (0 Replies)
Discussion started by: mmo
0 Replies

9. Shell Programming and Scripting

sed over writes my original file (using sed to remove leading spaces)

Hello and thx for reading this I'm using sed to remove only the leading spaces in a file bash-280R# cat foofile some text some text some text some text some text bash-280R# bash-280R# sed 's/^ *//' foofile > foofile.use bash-280R# cat foofile.use some text some text some text... (6 Replies)
Discussion started by: laser
6 Replies

10. Shell Programming and Scripting

sed remove

anyone out there knows how to remove pattern <random string> use sed? (6 Replies)
Discussion started by: jamwong
6 Replies
Login or Register to Ask a Question