Field widths based on a row

02-14-2016

Registered User

102, 1

Join Date: Jan 2010

Last Activity: 16 December 2017, 4:57 PM EST

Posts: 102

Thanks Given: 38

Thanked 1 Time in 1 Post

Field widths based on a row

I want to specify field width based on the row with FTR.
I can acheive this if column width is constant with:

Code:

awk 'BEGIN  { FIELDWIDTHS = "20 7 14 30" }{print $1,$4}' file

file:

Code:

COL1                COL2   CL3           FTR
AA8                 S2     CAT2          your comments
CC7                 D1     CAT3          last comments
DD3                        CAT1          2nd to comment
BB5                        CATE4         comment

output:

Code:

COL1                 FTR
AA8                  your comments
CC7                  last comments
DD3                  2nd to comment
BB5                  comment

How can I automatically determine Field widths based on the row with FTR?

aydj

View Public Profile for aydj

Find all posts by aydj

02-14-2016

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Hi, try (using GNU awk):

Code:

awk '
  FNR==1 {
    s=$0
    while(match(s,/[^ ]+ */)) {   # use greedy match with field width and trailing spaces to find maximum width
      f=(f?f OFS:x) RLENGTH       # build string of field widths
      s=substr(s,RLENGTH+1)       # cut the line to to determine the next length
    }
    sub(/[0-9]*$/,1000,f)         # replace last field width with an arbitrary large number to capture the maximum width
    FIELDWIDTHS=f                 # set FIELDWIDTHS
    $0=$0                         # recalculate fields using FIELDWIDTHS
  }
  {
    print $1 $4
  }
' file

Last edited by Scrutinizer; 02-16-2016 at 12:27 PM.. Reason: Removed single excess space in output

This User Gave Thanks to Scrutinizer For This Post:

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

02-14-2016

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

For something that will work with any POSIX-conforming version of awk you might also want to consider:

Code:

#!/bin/ksh
if [ $# -lt 1 ]
then	printf 'Usage: %s field_number...\n%s %s\n' "${0##*/}" \
	    'If the last input field is to be printed,' \
	    'it must be the last output field.' >&2
	exit 1
fi
printf '%s\n' "$@" | awk -F '  +' '
FNR == NR {
	f[++nf] = $1
	next
}
FNR == 1 {
	s[1] = 1
	fc = NF
	for(i = 2; i <= fc; i++)
		w[i - 1] = (s[i] = index($0, $i)) - s[i - 1]
#	for(i=1; i<=NF; i++)
#		printf("s[%d]=%d,w[%d]=%d\n", i, s[i], i, w[i])
#	for(i = 1; i <= nf; i++)
#		printf("f[%d]=%d\n", i, f[i])
}
{	for(i = 1; i < nf; i++)
		printf("%*s", -w[f[i]], substr($0, s[f[i]], w[f[i]]))
	print (f[nf] == fc) ? substr($0, s[f[nf]]) : \
	    substr($0, s[f[nf]], w[f[nf]])
}' - file

Note that this will print an arbitrary number of input fields in any output order desired, except the last input field can only be printed as the last output field if it is printed at all. It also allows field headings to contain multiple words separated by single spaces. But to work properly, there can't be any tabs in the input and there must be at least two spaces in the heading line separating fields.

For example, if file contains:

Code:

COLUMN 1 HEADING    COL 2  COL #3        FTR
AA8                 S2     CAT2          your comments
CC7                 D1     CAT3          last comments
DD3                        CAT1          2nd to comment
BB5                        CATE4         comment

and the script above is saved in an executable file named tester in the current directory, then the command:

Code:

./tester 2 2 4

would produce the output:

Code:

COL 2  COL 2  FTR
S2     S2     your comments
D1     D1     last comments
              2nd to comment
              comment

Although written and tested using the Korn shell, this will work with any shell that accepts basic Bourne shell syntax. If you want to try this on a Solaris/SunOS system, change awk to /usr/xpg4/bin/awk or nawk.

If you want to see how the code is determining field starting positions and field widths and the fields to be printed gathered from the script command line, you can remove the # characters at the start of the four lines in the FNR == 1 clause to see the debugging output those loops print.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

02-15-2016

Registered User

102, 1

Join Date: Jan 2010

Last Activity: 16 December 2017, 4:57 PM EST

Posts: 102

Thanks Given: 38

Thanked 1 Time in 1 Post

Quote:

Originally Posted by Scrutinizer

Hi, try (using GNU awk):

Code:

awk '
  FNR==1 {
    s=$0
    while(match(s,/[^ ]+ */)) {   # use greedy match with field width and trailing spaces to find maximum width
      f=(f?f OFS:x) RLENGTH       # build string of field widths
      s=substr(s,RLENGTH+1)       # cut the line to to determine the next length
    }
    sub(/[0-9]*$/,1000,f)         # replace last field width with an arbitrary large number to capture the maximum width
    FIELDWIDTHS=f                 # set FIELDWIDTHS
    $0=$0                         # recalculate fields using FIELDWIDTHS
  }
  {
    print $1,$4
  }
' file

If I Have input with different column width (determined by Row with FTR), how can I achieve the output?
Input:

Code:

COL1                COL2   CL3           FTR
AA8                 S2     CAT2          your comments
CC7                 D1     CAT3          last comments
DD3                        CAT1          2nd to comment
BB5                        CATE4         comment
CL1                CL2   CL3           FTR
CC4                D1    CAT3          new comments
DD5                      CAT1          newcomment2

Output:

Code:

COL1                FTR
AA8                 your comments
CC7                 last comments
DD3                 2nd to comment
BB5                 comment
CL1                FTR
CC4                new comments
DD5                newcomment2

aydj

View Public Profile for aydj

Find all posts by aydj

02-15-2016

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

Indeed, it's not quite clear to me what you're after, if the hitherto posts didn't satisfy you. Shooting in the dark:

Code:

awk '/FTR/ {FMT = index($0, $2)} {printf "%-*s%s\n", FMT, $1, $NF} ' FS="  +" file1
COL1                 FTR
AA8                  your comments
CC7                  last comments
DD3                  2nd to comment
BB5                  comment
CL1                 FTR
CC4                 new comments
DD5                 newcomment2

RudiC

View Public Profile for RudiC

Find all posts by RudiC

02-15-2016

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

An adaptation of the GNU awk script in post #2:

Code:

awk '
  / FTR *$/ {
    s=$0
    f=""
    while(match(s,/[^ ]+ */)) {   # use greedy match with field width and trailing spaces to find maximum width
      f=(f?f OFS:x) RLENGTH       # build string of field widths
      s=substr(s,RLENGTH+1)       # cut the line to to determine the next length
    }
    sub(/[0-9]*$/,1000,f)         # replace last field width with an arbitrary large number to capture the maximum width
    FIELDWIDTHS=f                 # set FIELDWIDTHS
    $0=$0                         # recalculate fields using FIELDWIDTHS
  }
  {
    print $1 $4
  }
' file

Last edited by Scrutinizer; 02-16-2016 at 12:27 PM.. Reason: Removed single excess space in output

These 2 Users Gave Thanks to Scrutinizer For This Post:

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

Shell Programming and Scripting

Field widths based on a row

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Problem with getting awk to multiply a field by a value set based on condition of another field

Discussion started by: cotilloe

2. Shell Programming and Scripting

Analyzing last 2 fields of 1 row and 3rd field of next row

Discussion started by: ncwxpanther

3. Shell Programming and Scripting

Replacing field based on the value of other field

Discussion started by: weknowd

4. Shell Programming and Scripting

awk to adjust coordinates in field based on sequential numbers in another field

Discussion started by: cmccabe

5. Shell Programming and Scripting

awk to update value in field based on another field

Discussion started by: cmccabe

6. Shell Programming and Scripting

Splitting single row into multiple rows based on for every 10 digits of last field of the row

Discussion started by: kotra

7. Shell Programming and Scripting

Trying to remove duplicates based on field and row

Discussion started by: newbie2010

8. UNIX for Dummies Questions & Answers

awk - Summing a field based on another field

Discussion started by: treesloth

9. Shell Programming and Scripting

Find top N values for field X based on field Y's value

Discussion started by: FrancoisCN

10. Shell Programming and Scripting

How to insert data befor some field in a row of data depending up on values in row

Discussion started by: aemunathan