awk sed cut? to rearrange random number of fields into 3 fields

# 1  
Old 04-28-2009

I'm working on formatting some attendance data to meet a vendor's requirements for uploading to their system. With some help on the forums here, I have the data close, but they've since changed what they want.

The vendor wants me to submit three fields to them. Field 1 is the studentid, field 2 is the building number, and field 3 is a single field holding the periods, delimited by a comma and a space.

This is what I get out of our records system now: studentid, then the periods, then the building number last. Each student is listed only once, so field 1 is unique in the input file 'absentees'.

250,1,2,3,45
255,2,4,6,8,10,45
1005,1,50
2099,1,2,51

I'd like to move the last field to the second position, then enclose the remaining fields in double quotes as a single field. The quotes are needed only when there is more than one period (see the third record for an example with only 1 period). So basically I need to get it looking like this (header row just FYI):

sid,bldg,periods
250,45,"1, 2, 3"
255,45,"2, 4, 6, 8, 10"
1005,50,1
2099,51,"1, 2"

I've used sed and cut to rearrange the order of fields when the number of fields is fixed, but I'm not sure how to do it with a variable number of fields. I'm also not sure how to get the double quotes in there with a comma and a space between the periods.

I'm on HP-UX, so only awk is available to me (no gawk or nawk).

Thanks in advance for any suggestions.
# 2  
Old 04-28-2009
Quote:
Originally Posted by axo959
...
I'd like to move the last field to the second position. Then enclose the remaining fields into one field with double quotes. Only if more than one period though (see third record for example of a record with only 1 period). So basically I need to get it looking like this (header row just fyi):

sid,bldg,periods
250,45,"1, 2, 3"
255,45,"2, 4, 6, 8, 10"
1005,50,1
2099,51,"1, 2"

...
I'm on hp-ux, so just awk is avail to me. (no gawk or nawk)

thanks in advance for any suggestions

Assuming the given file format:

Code:
awk -F, '{ if(NF>3){ $1=$1 FS $NF; $2=q $2; $NF=q; sub(/,"$/,q) }
            else { $0=$1 FS $3 FS $2 } } 1'  q='"'  OFS=, filename


_____

HP-UX's awk chokes on huge files (if that happens, consider a different tool).
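
Note that the one-liner above joins the periods with plain commas (`"1,2,3"`), while the requested format has a comma and a space (`"1, 2, 3"`). If the space matters to the vendor, the following is a minimal single-pass sketch, not a tested HP-UX solution. The function name `rearrange_absentees` is hypothetical, and passing records with fewer than three fields through unchanged is an assumption. It uses only basic awk features (a `for` loop and string concatenation).

```shell
# Hypothetical helper: reads records from stdin or from the files given as arguments.
rearrange_absentees() {
  awk -F, '
    # Assumption: records with fewer than 3 fields are passed through unchanged.
    NF < 3 { print; next }
    {
      # Collect fields 2 .. NF-1 (the periods), joined by comma + space.
      periods = $2
      for (i = 3; i < NF; i++)
        periods = periods ", " $i

      # Quote the period list only when it holds more than one period.
      if (NF > 3)
        periods = "\"" periods "\""

      # studentid, building (last input field), periods
      print $1 "," $NF "," periods
    }' "$@"
}

# Example with two of the sample records from the question:
printf '%s\n' 250,1,2,3,45 1005,1,50 | rearrange_absentees
```

With a file, it would be called as `rearrange_absentees absentees`. Building the period list in a loop avoids reassigning fields, so the output does not depend on OFS.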
# 3  
Old 04-28-2009
Code:
awk -v OFS=, -F, '{
  $1 = $1 "," $NF
  $NF = ""
  printf "%s,", $1
  $1 = ""
  if ( NF > 3 )
     printf "\"%s\"\n", substr($0,2,length-2)
  else
     printf "%s\n", substr($0,2,length-2)
}' "$FILE" |
 awk -F\" -v OFS=\" '{ gsub(",",", ",$2); print}'

# 4  
Old 04-28-2009
Thank you, it's looking good.
# 5  
Old 04-29-2009
With Perl:

Code:
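perl -F, -lane'
# More than one period: quote the list; otherwise just swap building and period.
print @F > 3
  ? join ",", $F[0], $F[-1], q(") . ( join ", ", @F[ 1 .. @F - 2 ] ) . q(")
  : join ",", @F[ 0, 2, 1 ]
  ' infile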
