AWK Multi-Line Records Numbering Problem

10-24-2007

Registered User

41, 0

Join Date: Oct 2007

Last Activity: 18 March 2010, 8:35 AM EDT

Posts: 41

Thanks Given: 0

Thanked 0 Times in 0 Posts

AWK Multi-Line Records Numbering Problem

I have a set of files of multi-line records with the records separated by a blank line. I needed to add a record number to the front of each line followed by a colon and did the following:

Code:

awk 'BEGIN {FS = "\n"; RS = ""}{for (i=1; i<=NF; i++)print NR,":",$i}' ~/Desktop/data98-1-25.txt > ~/Desktop/numbered-data98-1-25.txt

so i would get something like:
1: XXX:CCCC:XYXYX
1: XTZ:CACC:XYXYX
1: XZZ:DDDD:XYXYX
2: XTZ:CACC:XYXYX
2: XZZ:DMMD:XYXYX
3: XZZ:DMMD:XYXYX
4: XZZ:DMMD:XYXYX
4: XVZ:DMHD:XYXYX
4: XVV:DLMD:XYXYX
4: XTZ:DCDD:XYXYX

Problem is my numbers are not coming out right. When i do a count like:
awk '{RS=""; print NR}' ~/Desktop/data98-1-25.txt > ~/Desktop/Count98-1-25.txt

I get the the number i am expecting for the last set of records in the file: 4959 but when i run the code up above for numbering each record the last set of records shows the end number as 4958. Is this one of those NR starts at zero and i started i at 1 or vice-versa type of problems; or is my code wrong to do what i was trying to do?

Another question i will have, is when i go to start processing the next file to start numbering it's records how do i get the count to start on 4960?

RacerX

View Public Profile for RacerX

Find all posts by RacerX

10-31-2007

Registered User

24, 0

Join Date: Jan 2007

Last Activity: 31 January 2017, 5:44 AM EST

Posts: 24

Thanks Given: 1

Thanked 0 Times in 0 Posts

Could you paste some sample lines from the input file as well ...

mvijayv

View Public Profile for mvijayv

Find all posts by mvijayv

11-01-2007

Registered User

1,714, 63

Join Date: Apr 2004

Last Activity: 15 May 2020, 11:27 AM EDT

Location: Bordeaux, France

Posts: 1,714

Thanks Given: 2

Thanked 63 Times in 59 Posts

IN your count program, the Record Separator must be define outside the action code.

Code:

awk '{print NR}' RS="" ~/Desktop/data98-1-25.txt 
awk -v RS="" '{print NR}' ~/Desktop/data98-1-25.txt 
awk 'BEGIN {RS=""} {print NR} ~/Desktop/data98-1-25.txt

Jean-Pierre.

aigles

View Public Profile for aigles

Find all posts by aigles

11-01-2007

Registered User

41, 0

Join Date: Oct 2007

Last Activity: 18 March 2010, 8:35 AM EDT

Posts: 41

Thanks Given: 0

Thanked 0 Times in 0 Posts

Thanks, i did discover that the missing BEGIN statement in my count program makes all the difference in arriving at a correct count to validate that my numbering program was working correctly.

GIVEN INPUT FILE WITH FOLLOWING RECORDS:

Code:

XXX:CCCC:XYXYX
XTZ:CACC:XYXYX
XZZ:DDDD:XYXYX

XTZ:CACC:XYXYX
XZZ:DMMD:XYXYX

XZZ:DMMD:XYXYX

XZZ:DMMD:XYXYX
XVZ:DMHD:XYXYX
XVV:DLMD:XYXYX
XTZ:DCDD:XYXYX

Using my bad count program: awk '{RS=""; print NR}' ~/Desktop/data_in.txt it will return:
1
2
3
4
5

Using your version: awk 'BEGIN {RS=""} {print NR}' ~/Desktop/data_in.txt it correctly returns:
1
2
3
4

This newbie learned a valuable lesson, the hard way.

As an aside, for others who may stumble across this thread; I solved the problem of how to get the count to start on 4960 at the beginning of the next file by doing this:

Code:

awk 'BEGIN {FS = "\n"; RS = ""}{for (i=1; i<=NF; i++)print NR+4959,":",$i}' ~/Desktop/data98-26-50.txt

I'm sure there were probably much better ways to do it, but it accomplished what i needed done to the records in the next file to be processed at the time.

Thanks again to all of you who have helped me along my way in using Awk to get some jobs done.

RacerX

View Public Profile for RacerX

Find all posts by RacerX

Shell Programming and Scripting

AWK Multi-Line Records Numbering Problem

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Help with reformat single-line multi-fasta into multi-line multi-fasta

Discussion started by: patrick87

2. Shell Programming and Scripting

awk use sequential line numbering in output

Discussion started by: cmccabe

3. Shell Programming and Scripting

awk - Multi-line data to be stored in variable

Discussion started by: chill3chee

4. Shell Programming and Scripting

Multi-line filtering based on multi-line pattern in a file

Discussion started by: Finja

5. Shell Programming and Scripting

Conditional Multi-Line Grep Problem

Discussion started by: redbluefish

6. Shell Programming and Scripting

Transpose multi-line records into a single row

Discussion started by: daveyabe

7. UNIX for Dummies Questions & Answers

Alphabetical sort for multi line records contains in a single file

Discussion started by: quee1763

8. Shell Programming and Scripting

Capturing multi-line records containing known value?

Discussion started by: cs03dmj

9. Shell Programming and Scripting

sed or awk help - line numbering w/ different start value

Discussion started by: camwheel

10. Shell Programming and Scripting

AWK Multi-Line Records Processing

Discussion started by: RacerX