Pattern to replace ^M and ^Y in a 4.2 AIX text file

05-16-2009

I have files on my AIX 4.2 client system where I need to do the replacements below, but I have no clue how. The files contain control characters (linefeed, carriage return, ...).

First, replace "^M^Y^M" with ^char_for_end_of_line
Then replace "^M" with " "
Trim all left spaces

In vi, the file contents look like this:

aaaa zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzz^M
^M
^Y^M
aaaa zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzz^M
^M
^Y^M
aaaa zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzzzz^M
zzzzzzzzzzzzzzzzzzzz^M
^M
^Y^M
...

I want it to be:
aaaa zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz
aaaa zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz
aaaa zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz
aaaa zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz
aaaa zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz zzzzzzzzzzzzzzzzzzzzzz
...

The number of records is unknown.
'zzzzzz' can have any combinations of "(", ")", "'", """, ",", "[", "]", ".", ";" (in other words anything with printable characters)

Browser_ice

05-16-2009

What you could really do with is dos2unix(1) that comes with Solaris and Linux but not with AIX IIRC, so instead you can use sed:

dos2unix:

Code:

# -i and \r are GNU sed extensions; the stock AIX sed is not expected to accept them
sed -i 's/\r//' file

unix2dos:

Code:

# GNU sed only: append a carriage return at the end of each line (DOS lines end in \r\n)
sed -i 's/$/\r/' file

from: UNIX BASH scripting: Linux flip command - alternative of dos2unix,unix2dos

or

dos2unix:

Code:

$ sed 's/^M$//'  input.txt > output.txt

unix2dos:

Code:

$ sed 's/$'"/`echo \\\r`/"   input.txt > output.txt

from: Howto: UNIX or Linux convert DOS newlines CR-LF to Unix/Linux format

The dos2unix examples will get rid of the carriage returns for you. I will leave it to a scripting guru to work out the removal of the particular unwanted line feeds.

TonyFullerMalv

05-17-2009

In your example it looks like you have groups of 3 lines of text followed by 2 lines. You want to combine the three lines of text into a single line and remove the two separating lines completely.

If this is the case:

Code:

sed -n 'N;N;s/[^M^Y]//g;s/\n//gp;N;N' /some/file

This first reads two additional lines (on top of the line already read) and appends them to the pattern space. The first replacement then throws out the control characters (^M and ^Y, enter them via <CTRL-V> in vi), and the second replacement removes the newline characters, combining the three lines into one, and prints the result. Then two more lines (the separator lines) are read and discarded, since they are never printed, and the cycle repeats from the start.
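To try the command without typing control characters by hand, here is a minimal sketch. The names cr, em and join_fixed are only for illustration, and the sample data is made up to mimic the layout shown in the question (three text lines, then a ^M line and a ^Y^M line).

```shell
# Build the two control characters with printf instead of typing them.
cr=$(printf '\r')      # ^M, carriage return (octal 015)
em=$(printf '\031')    # ^Y (octal 031)

# Hypothetical helper: join each group of 3 text lines, drop the 2 separator lines.
join_fixed() {
    sed -n "N;N;s/[$cr$em]//g;s/\n//gp;N;N"
}

# Two made-up records in the same layout as the sample in the question.
printf 'aaaa z1\r\nz2\r\nz3\r\n\r\n\031\r\naaaa z4\r\nz5\r\nz6\r\n\r\n\031\r\n' | join_fixed
# prints:
# aaaa z1z2z3
# aaaa z4z5z6
```

Changing the second replacement to s/\n/ /gp should put a space between the joined lines instead of gluing them together.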

I hope this helps.

bakunin

05-19-2009

Quote:

Originally Posted by bakunin

Code:

sed -n 'N;N;s/[^M^Y]//g;s/\n//gp;N;N' /some/file

What if the number of lines per record is unknown?

In my example I gave 3 lines, but it can be anything between 1 and 20 lines. The file contains any number of multi-line records. Each record is totally independent of the previous one. One record could have 2 lines, the next 20, the next 5, ... There is no regular pattern in the number of lines. The file contains a list of system-generated alarms coming from 20 different servers, a large number of workstations, ...

Sorry I forgot to mention it.

Browser_ice

05-21-2009

I tried the combinations below, none of which change anything or are recognized:

\n
\^m
\^Y
Ctrl-V + Ctrl-M
Ctrl-V + Ctrl-Y => nothing is typed in the console, I have to do a Ctrl-C to get out
\x0D$
\xC1$
[^M^Y]
[^M]
[^Y]
\c[m => not recognized

sed 's/.$//' does remove the ^M at the end of each line, but the result is still multi-line. It's like removing the last character of each line but keeping the end-of-line linefeed.

[added comments]
Is there a way to find out in VI what is the ascii value of the character under the cursor ?
It would help me identify the right decimal value to use in a replacement string.

[added comments]
I found out that ^M is actually \015. So I can remove it with tr -d '\015'
But I still haven't found out what ^Y is.

Last edited by Browser_ice; 05-21-2009 at 03:34 PM..

Browser_ice

05-21-2009

Quote:

Originally Posted by Browser_ice

What if the number of lines of the original file is unknown ?

In my example I gave 3 lines but it can be anything between 1 and 20 lines.

In this case you will need some indication that a "record" is complete. Maybe you will need some record-start criterion too, which one could match against. Provide some data and I will provide a solution.
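If the line containing ^Y in the posted sample really does close every record (an assumption based only on that sample), a sketch along these lines might work for records of any length. The helper name join_records and the test data are made up; ^Y is octal 031 in ASCII.

```shell
# Hypothetical helper: read stdin, write one line per record.
# Assumption: every record is terminated by a line that contains ^Y (octal 031).
join_records() {
    tr -d '\015' |                        # drop every ^M (carriage return) first
    awk -v em="$(printf '\031')" '
        index($0, em) {                   # ^Y line: the current record is complete
            if (rec != "") print rec
            rec = ""
            next
        }
        /^ *$/ { next }                   # skip the empty line left over from a lone ^M
        {
            sub(/^ +/, "")                # trim leading spaces
            rec = (rec == "" ? $0 : rec " " $0)
        }
        END { if (rec != "") print rec }  # flush a final record without a ^Y line
    '
}

# Made-up sample: a 2-line record followed by a 1-line record.
printf 'aaaa z1\r\n  z2\r\n\r\n\031\r\nbbbb y1\r\n\r\n\031\r\n' | join_records
# prints:
# aaaa z1 z2
# bbbb y1
```

The ^Y character is passed in with -v rather than written as an escape inside the awk program, so the script does not depend on how a particular awk handles octal escapes in patterns.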

Quote:

Originally Posted by Browser_ice

I tried the combinations below which do not change anything or are not recognized

This is just a way to enter non-printing (control-) characters into vi: enter input mode, press "CTRL-V", then press CTRL-M (for example for "^M"). You should be still in input mode and see "^M" under the cursor.

Quote:

sed 's/.$//' does remove the ^M at the end of each line but then it is still a multi-line format.

It removes the last character in a line, regardless which character this is - this is the problem. You have to specifically match "^M" (CTRL-M) and throw that out. You can throw out linefeeds by searching for "\n". Try the following with some test file:

Code:

sed 'N;s/\n/@/' /some/file

to see the effect: each pair of lines is combined into one and the linefeed between them is replaced by an at sign (@).
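For a quick check without a test file, the same command can be fed from printf. This is only a sketch; pair_lines is a made-up name.

```shell
# Hypothetical helper wrapping the command above so it reads from stdin.
pair_lines() {
    sed 'N;s/\n/@/'
}

# Four input lines become two output lines.
printf 'line1\nline2\nline3\nline4\n' | pair_lines
# prints:
# line1@line2
# line3@line4
```

With an odd number of input lines, whether the last unpaired line is printed can differ between sed implementations, so it is safer to test with an even count.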

Quote:

Is there a way to find out in VI what is the ascii value of the character under the cursor ?

No, but you can use "od -ax <file> | more".
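As a rough illustration of the od approach, the sketch below uses od -c rather than od -ax (my substitution), because -c prints a carriage return as \r and other control characters as three-digit octal codes that can be passed straight to tr. The helper names and the sample line are made up; in ASCII, ^M is octal 015 and ^Y is octal 031.

```shell
# Hypothetical helper: dump stdin byte by byte.
show_bytes() {
    od -c
}

# Hypothetical helper: delete every ^M (octal 015) and ^Y (octal 031).
strip_ctrl() {
    tr -d '\015\031'
}

# Made-up sample line ending in ^Y ^M, shown before and after stripping.
printf 'abc\031\r\n' | show_bytes
printf 'abc\031\r\n' | strip_ctrl | show_bytes
```

In the first dump the ^Y should appear as 031 and the ^M as \r; in the second dump both should be gone.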

I hope this helps.

bakunin

05-22-2009

You may also use vi

vi file

:%s/^M//

Typed literally, this string will not work, because the ^M has to be entered as a control character. To get the same string, type the command like this:
:%s/(ctrl+v)(ctrl+m)//

I hope this helps you

ravager
