Help on looping using awk


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Help on looping using awk
# 1  
Old 01-23-2014
Help on looping using awk

I have the data like this:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,83.1,USD
IR5555,Supplier1,1,3,40.4,USD
IR5555,Supplier1,1,6,54.1,USD
IR5555,Supplier1,1,8,75.1,USD

IR5556,Supplier2,1,1,41.1,USD
IR5556,Supplier2,1,3,43.1,USD

IR5557,Supplier3,1,1,41.1,USD
IR5557,Supplier3,2,3,43.1,USD

Basically what I needed to do was arrange the splitline field in the correct order if the lineitems are of the same value and on the same PONUMBER. for reference please see below:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,83.1,USD
IR5555,Supplier1,1,2,40.4,USD
IR5555,Supplier1,1,3,54.1,USD
IR5555,Supplier1,1,4,75.1,USD

IR5556,Supplier2,1,1,41.1,USD
IR5556,Supplier2,1,2,34.1,USD

IR5557,Supplier3,1,1,12.1,USD
IR5557,Supplier3,2,1,78.1,USD

I have tried to use this code:
Code:
awk -F',' 'NR==1{print;next}{a[$1","$2","$3]++;b[$1","$2","$3]=$5","$6};END
 {for(x in a) {for(i=1; i<=a[x]; i++) {print x","i","b[x]}}}' test.csv

which corrects the splitline numbering but I have overlooked the amount, now my main problem is the lineamount is not correct, it only gets one value and repeats it. please see below:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,75.1,USD
IR5555,Supplier1,1,2,75.1,USD
IR5555,Supplier1,1,3,75.1,USD
IR5555,Supplier1,1,4,75.1,USD

IR5557,Supplier3,1,1,12.1,USD
IR5557,Supplier3,2,1,78.1,USD

IR5556,Supplier2,1,1,34.1,USD
IR5556,Supplier2,1,2,34.1,USD

I would really appreciate guys if you could help me in the lineamount issue.
Thanks!

Last edited by jeffreybsu; 01-23-2014 at 03:25 AM.. Reason: corrected output data
# 2  
Old 01-23-2014
I would do it a bit differently. try:
Code:
awk '{n=$1 FS $2 FS $3} n!=p{c=1; p=n} NF{$4=c++}1' FS=, OFS=, file

# 3  
Old 01-23-2014
Quote:
Originally Posted by jeffreybsu
I have the data like this:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,83.1,USD
IR5555,Supplier1,1,3,40.4,USD
IR5555,Supplier1,1,6,54.1,USD
IR5555,Supplier1,1,8,75.1,USD

IR5556,Supplier2,1,1,41.1,USD
IR5556,Supplier2,1,3,43.1,USD

IR5557,Supplier3,1,1,41.1,USD
IR5557,Supplier3,2,3,43.1,USD

Basically what I needed to do was arrange the splitline field in the correct order if the lineitems are of the same value and on the same PONUMBER. for reference please see below:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,83.1,USD
IR5555,Supplier1,1,2,40.4,USD
IR5555,Supplier1,1,3,54.1,USD
IR5555,Supplier1,1,4,75.1,USD

IR5556,Supplier2,1,1,41.1,USD
IR5556,Supplier2,1,2,34.1,USD

IR5557,Supplier3,1,1,12.1,USD
IR5557,Supplier3,2,1,78.1,USD

I have tried to use this code:
Code:
awk -F',' 'NR==1{print;next}{a[$1","$2","$3]++;b[$1","$2","$3]=$5","$6};END
 {for(x in a) {for(i=1; i<=a[x]; i++) {print x","i","b[x]}}}' test.csv

which corrects the splitline numbering but I have overlooked the amount, now my main problem is the lineamount is not correct, it only gets one value and repeats it. please see below:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,75.1,USD
IR5555,Supplier1,1,2,75.1,USD
IR5555,Supplier1,1,3,75.1,USD
IR5555,Supplier1,1,4,75.1,USD

IR5557,Supplier3,1,1,12.1,USD
IR5557,Supplier3,2,1,78.1,USD

IR5556,Supplier2,1,1,34.1,USD
IR5556,Supplier2,1,2,34.1,USD

I would really appreciate guys if you could help me in the lineamount issue.
Thanks!
The following script fixes some of the LINEAMOUNT values, but I have no idea how you expect to get the values shown in red above from the input you provided:
Code:
awk '
BEGIN { FS = OFS = ","
}
NR == 1 || NF < 6 {
        print
        next
}
{       print $1, $2, $3, ++a[$1 FS $2 FS $3], $5, $6
}' test.csv

which produces the output:
Code:
PONUMBER,SUPPLIER,LINEITEM,SPLITLINE,LINEAMOUNT,CURRENCY
IR5555,Supplier1,1,1,83.1,USD
IR5555,Supplier1,1,2,40.4,USD
IR5555,Supplier1,1,3,54.1,USD
IR5555,Supplier1,1,4,75.1,USD

IR5556,Supplier2,1,1,41.1,USD
IR5556,Supplier2,1,2,43.1,USD

IR5557,Supplier3,1,1,41.1,USD
IR5557,Supplier3,2,1,43.1,USD

which seems to me to match the input data you provided changing only the values in the SPLITLINE field.

If all of the input lines with the same values for the 1st three fields are always adjacent, Scrutinizer's suggestion is a simpler way to perform this task.
# 4  
Old 01-23-2014
Code:
awk -F, '!$0||NR==1{print;next}{$4=++a[$1$3]}1' yourfile

Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk nested looping?

I am trying to parse a text file and send its output to another file but I am having trouble conceptualizing how I am supposed to do this in awk. The text file has a organization like so: Name Date Status Location (city, state, zip fields) Where each of these is on a separate line in... (1 Reply)
Discussion started by: kellyanneghj
1 Replies

2. Shell Programming and Scripting

Looping through pairs of files with awk

Hi all, please help me construct the command. i want to loop through all files named bam* and bed*. My awk works for a particular pair but there are too many pairs to do manually. I have generated multiple files in a folder in a given pattern. The files are named like bam_fixed1.bam... (2 Replies)
Discussion started by: newbie83
2 Replies

3. Shell Programming and Scripting

AWK looping over 2 variables

I would like to loop over variables i and j consecutively, { a = -6.7 b = 7.0 c =0.1 { for (i = 0; i<=(b-a)/c; i++) for (j = 1; j<=(b-a)/c; j++) '$1<=(a+j*c)&&$1>=(a+i*c)' FILENAME > output_j '{print $2}' output_j > output_j_f } I essentially want to print the range of $1... (9 Replies)
Discussion started by: chrisjorg
9 Replies

4. Shell Programming and Scripting

Looping within the elements of a file using awk

Hi all, I have a file containing 5000 rows and 4 columns. I need to do a loop within the rows based on the values of column 3. my sample data is formatted like the ones below: what i need to do is to make a loop that will allow me to plot the values of x,y,values corresponding to month 1 to month... (10 Replies)
Discussion started by: ida1215
10 Replies

5. Shell Programming and Scripting

looping in awk

How do I remove last comma? echo "xx yy zz" | awk 'BEGIN{FS=" "}{for (i=1; i<=NF; i++) printf "%s,", $i}'output: xx,yy,zz, required output: xx,yy,zz or (ideally!): xx, yy & zz many thanks in advance! (4 Replies)
Discussion started by: euval
4 Replies

6. UNIX for Dummies Questions & Answers

Help with AWK looping

I'm trying to parse a configuration text file using awk. The following is a sample from the file I'm searching. I can retrieve the formula and recipe names easily but now I want to take it one step farther. In addition to the formula name, I would like to also get the value of the attribute... (6 Replies)
Discussion started by: new2awk
6 Replies

7. Shell Programming and Scripting

Urgent - Looping using AWK

Hi I have a file which is having following text. The file is in a tabular form with 5 fields. i.e field1, field2 ..... field5 are its columns and there are many rows in it say COUNT is the number of rows Field 1 Field2 Field3 Field4 Field5 ------- ------- ... (8 Replies)
Discussion started by: skyineyes
8 Replies

8. Shell Programming and Scripting

looping and awk/sed help

I am pretty new to this, but imagine what I am trying to do is possible iI am trying to make an automated DB comparison tool that selects all columns in all tables and compares them to the same thing in another DB. anyway I have created 2 files to help with this the first file is a... (13 Replies)
Discussion started by: Zelp
13 Replies

9. Shell Programming and Scripting

Awk: looping problem!

I am having a problem with awk when I run it with a loop. It works perfectly when I echo a single line from the commandline. For example: echo 'MFG009 9153852832' | awk '$2 ~ /^0-9]$/{print $2}' The Awk command above will print field 2 if field 2 matches 10 digits, but when I run the loop... (5 Replies)
Discussion started by: cstovall
5 Replies

10. UNIX for Advanced & Expert Users

Looping in awk

Can somebody give me a cleaner way of writing the following script. I was thinking that I could use a loop in the awk statement. It works fine the way it is but I just want the script to be cleaner. #!/usr/bin/sh for r in 0 1 2 3 4 5 6 do DAY=`gdate --date="${r} days ago" +%m\/%d\/%y`... (3 Replies)
Discussion started by: keelba
3 Replies
Login or Register to Ask a Question