Newline between unequal record fields

03-14-2013

Registered User

2, 0

Join Date: Mar 2013

Last Activity: 14 March 2013, 3:02 PM EDT

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

Newline between unequal record fields

Assume the following 5 records (field separator is a space):

Code:

0903 0903 0910 0910 0910 0910 0910 0910 0917 0917 0917 0917 0924
1001 1001 1001 1001 1008 1008 1008 1008 1015 1015 1015 1015 1022
1029 1029 1029 1029 1105 1105 1105 1105 1112 1112 1112 1112 1119
1126 1126 1126 1126 1203 1203 1203 1203 1210 1210 1210 1210 1217
1224 1224 1224 1224 1224 1224 1224 1224 1231 1231 1231 1231

The output result needed:

Code:

0903 0903
0910 0910 0910 0910 0910 0910
0917 0917 0917 0917
0924
1001 1001 1001 1001
1008 1008 1008 1008
1015 1015 1015 1015
1022
1029 1029 1029 1029
1105 1105 1105 1105
1112 1112 1112 1112
1119
1126 1126 1126 1126
1203 1203 1203 1203
1210 1210 1210 1210
1217
1224 1224 1224 1224 1224 1224 1224 1224
1231 1231 1231 1231

Assume additional records will have different values. Without doing this by hand I've been unable solve it. I tried using a combination of sed, awk, and grep scripts with no success. Any help would be appreciated.

Last edited by Scrutinizer; 03-14-2013 at 02:29 PM.. Reason: code tags

tree

View Public Profile for tree

Find all posts by tree

03-14-2013

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Try:

Code:

awk '{for(i=1; i<NF; i++) $i=$i ($(i+1)==$i?FS:RS)}1' OFS=

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

03-14-2013

Registered User

2, 0

Join Date: Mar 2013

Last Activity: 14 March 2013, 3:02 PM EDT

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

Totally answered my problem. Been working on this for a week. Finished reading the O'Reilly book on bash scripting but could find the answer. It's good but doesn't go into too much detail on sed or awk. Thanks.

tree

View Public Profile for tree

Find all posts by tree

03-14-2013

Registered User

23,310, 4,623

Join Date: Aug 2005

Last Activity: 7 July 2020, 11:47 AM EDT

Location: Saskatchewan

Posts: 23,310

Thanks Given: 1,331

Thanked 4,623 Times in 4,217 Posts

awk is its own programming language, it's hard to go over it in detail without it becoming its own book. Not a difficult language mind you, but quite different.

Corona688

View Public Profile for Corona688

Visit Corona688's homepage!

Find all posts by Corona688

03-14-2013

Read Only

1,278, 486

Join Date: Sep 2012

Last Activity: 27 February 2020, 8:59 PM EST

Location: Houston, Texas, USA

Posts: 1,278

Thanks Given: 0

Thanked 486 Times in 451 Posts

try also (in case same values wrap on the next line):

Code:

awk '{w=(w)?w:$1;for(i=1; i<=NF; i++) {printf ($i==w)? $i" ":"\n"$i" "; w=$i}} END {print ""}' infile

rdrtx1

View Public Profile for rdrtx1

Find all posts by rdrtx1

03-14-2013

Moderator

3,689, 1,352

Join Date: Jan 2012

Last Activity: 22 August 2020, 11:29 PM EDT

Location: Galactic Empire

Posts: 3,689

Thanks Given: 268

Thanked 1,352 Times in 1,258 Posts

If you are interested, here is a solution using bash:

Code:

#!/bin/bash

while read line
do
        for c_num in $line
        do
                [[ "$c_num" == "$p_num" ]] && printf "%s " $c_num || printf "\n%s " $c_num
                p_num="$c_num"
        done
done < file
printf "\n"

Last edited by Yoda; 03-14-2013 at 04:44 PM.. Reason: correction

Yoda

View Public Profile for Yoda

Visit Yoda's homepage!

Find all posts by Yoda

03-14-2013

Registered User

4,673, 588

Join Date: Oct 2010

Last Activity: 1 February 2016, 3:35 PM EST

Location: Southern NJ, USA (Nord)

Posts: 4,673

Thanks Given: 8

Thanked 588 Times in 561 Posts

One approach is to make the fields all lines - homogenous if separated, but my standard sed looper is fine for merging lines:

Code:

tr ' ' '\12' < in_file | sed '
  :loop
  $q
  N
  s/\(....\)\n\1/\1 \1/
  t loop
  P
  s/.*\n//
  t loop
 ' > out_file

But this might mess up for two lines of the same number. In some apps, that might be great; you can put a "| sort" after the "tr" and merge far separated numbers, or a "| sort | uniq -c" and reduce them to a count.

Maybe pure sed is actually better yet:

Code:

sed '
  s/ /\
/g
  s/\(....\)\n\1/\1 \1/g
  s/\(....\)\n\1/\1 \1/g
 ' in_file > out_file

Cheap trick, making all the spaces line feeds and then making them back into spaces where equal. There's a lesson about negative cases there. Mostly, line feed was a certainly not in use substitute character. Once I swapped line feed and form feed so I could sed pages into insert statements (one page per row in one column) and then reversed the fomr feeds back to line feeds. Note that you have to sub twice, for the odd and even spaces. Also, if you know what a string is, you do not have to source the original bytes, any dup quad looks the same!

Last edited by DGPickett; 03-14-2013 at 05:53 PM..

DGPickett

View Public Profile for DGPickett

Find all posts by DGPickett

Shell Programming and Scripting

Newline between unequal record fields

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Removinf newline characters in first 62 fields

Discussion started by: sagarparadkar

2. Shell Programming and Scripting

awk - compare 1st 15 fields of record with 20 fields

Discussion started by: sljnk

3. Shell Programming and Scripting

Delete last 2 fields from every record in a file

Discussion started by: bigbuk

4. Shell Programming and Scripting

Newline characters in fields of a file

Discussion started by: lakshmi001

5. Shell Programming and Scripting

Remove newline character or join the broken record

Discussion started by: ratheeshjulk

6. Shell Programming and Scripting

awk puts newline between fields

Discussion started by: unclecameron

7. Shell Programming and Scripting

remove newline chars in each record of file

Discussion started by: srilaxmi

8. Shell Programming and Scripting

Making changes in the fields of a record

Discussion started by: kanu_pathak

9. Shell Programming and Scripting

Manipulating fields record wise

Discussion started by: rinku11

10. Shell Programming and Scripting

awk: record has too many fields

Discussion started by: chaandana