nawk is truncating output

10-13-2012

Registered User

168, 3

Join Date: Jul 2008

Last Activity: 17 April 2020, 4:42 PM EDT

Posts: 168

Thanks Given: 66

Thanked 3 Times in 3 Posts

nawk is truncating output

Legends,

I have 2 files f1 and f2. when i use nawk to compare the difference(subtraction) from 4th column of the file, it truncates the output.
can you please help to resolve this.

subtraction is (4th col of f1 - 4th col of f2). but it gives only below lines out of 116. I want to print all the lines of the file even if there is diff or no diff.

Code:

san:/tmp> wc -l f1 f2 | grep -v total
     116 f1
     116 f2

san:/tmp> head -3 f1 f2
==> f1 <==
TSCparser1 1irons1 EMEA_01 3
TSCparser12 1irons1 SPAIN_01 0
TSCparser13 1irons1 GERMANY_03 0

==> f2 <==
TSCparser1 1irons1 EMEA_01 3
TSCparser12 1irons1 SPAIN_01 0
TSCparser13 1irons1 GERMANY_03 0

san:/tmp> nawk 'FNR==NR{a[$1,$2,$3]=$4;next}{if(a[$1,$2,$3]){print $1,$2,$3,(a[$1,$2,$3]-$4)" times gapped in past 1 hr."}}' OFS="         " f1 f2
TSCparser1         1irons1         EMEA_01         0 times gapped in past 1 hr.
TSCparser94         1irons1         LSE_01         0 times gapped in past 1 hr.
TSCparser43         4irons1         STUTTGART_04         0 times gapped in past 1 hr.
TSCparser44         4irons1         STUTTGART_05         0 times gapped in past 1 hr.
TSCparser46         4irons1         STUTTGART_07         0 times gapped in past 1 hr.
TSCparser47         4irons1         STUTTGART_08         0 times gapped in past 1 hr.

sdosanjh

View Public Profile for sdosanjh

Find all posts by sdosanjh

10-13-2012

Registered User

1,650, 478

Join Date: Mar 2012

Last Activity: 11 September 2019, 8:06 AM EDT

Posts: 1,650

Thanks Given: 58

Thanked 478 Times in 474 Posts

try this..

Code:

nawk 'FNR==NR{a[$1,$2,$3]=$4;next}{if(a[$1,$2,$3] != ""){print $1,$2,$3,(a[$1,$2,$3]-$4)" times gapped in past 1 hr."}}' OFS="\t" f1 f2

This User Gave Thanks to pamu For This Post:

pamu

View Public Profile for pamu

Find all posts by pamu

10-13-2012

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

The "error" is that in two of the three cases in your example, a[$1,$2,$3] exists, but is equal to zero. That's why awk won't print your line, even though the difference might be non-zero. Test it with $4 != 0 in f1. I'm not sure how to test the sheer existence of an entity in awk, but I think pamu has shown you a way to correct your statement.

-------------------------- edit ---------------------------------

Reading man pages is educational. From the mawk man page:

Quote:

An expression, expr in array evaluates to 1 if array[expr] exists, else to 0.

so,

Code:

($1,$2,$3) in a {print $1,$2,$3,(a[$1,$2,$3]-$4)" tim..."}

will do the job.

Last edited by RudiC; 10-13-2012 at 01:26 PM.. Reason: test of sheer existence found

RudiC

View Public Profile for RudiC

Find all posts by RudiC

10-13-2012

Registered User

1,413, 498

Join Date: Mar 2012

Last Activity: 8 November 2019, 2:39 AM EST

Location: India

Posts: 1,413

Thanks Given: 101

Thanked 498 Times in 474 Posts

Quote:

Originally Posted by sdosanjh

I want to print all the lines of the file even if there is diff or no diff.

Then, why are you checking something in if before printing the data? Drop that if:

Code:

nawk 'FNR==NR{a[$1,$2,$3]=$4;next}
{print $1,$2,$3,((($1,$2,$3) in a)?(a[$1,$2,$3]-$4):" ") " times gapped in past 1 hr."}' OFS="         " f1 f2

This will output all lines from f2. If matching line is found in f1, the numerical difference will be shown. Otherwise, a space will be shown in place of the difference.

Last edited by elixir_sinari; 10-13-2012 at 10:57 AM..

This User Gave Thanks to elixir_sinari For This Post:

elixir_sinari

View Public Profile for elixir_sinari

Find all posts by elixir_sinari

10-13-2012

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

You should also note that the value of SUBSEP varies in different implementations of awk (and I don't remember what value nawk uses). Some systems (for example OS X) default SUBSEP to an empty string. (SUBSEP is used to separate strings in multi-dimensional array subscripts). If there are any cases in your input where concatenating $1, $2, and $3 could yield a string that is not unique, you should explicitly set SUBSEP to something that doesn't appear in any of those three fields. Since $1 in your input ends with one or more digits and $2 starts with at least one digit, it looks like this could be possible issue with your input. For your input I would suggest setting SUBSEP to "," or "|" (e.g., add SUBSEP="," in your nawk command line after setting OFS).

RudiC said he didn't know how to test for the sheer existence of an entity in an array. The way to do that in this case would be to use:

Code:

if($1 SUBSEP $2 SUBSEP $3 in a) {...}

which would have the same meaning as:

Code:

if(a[$1,$2,$3] != "") {...}

in pamu's correction to the nawk script. In this case the test for an empty string is shorter than the test for existence (and for many is easier to read/understand), so I wouldn't make any change here.

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

10-13-2012

Registered User

3,231, 978

Join Date: Dec 2009

Last Activity: 11 June 2014, 8:40 PM EDT

Posts: 3,231

Thanks Given: 179

Thanked 978 Times in 791 Posts

Quote:

Originally Posted by Don Cragun

That's incorrect. From opensource.apple.com :: awk-18 :: tran.c (OS X 10.8.2):

Code:

char	**SUBSEP;	/* subscript separator for a[i,j,k]; default \034 */
...
SUBSEP = &setsymtab("SUBSEP", "\034", 0.0, STR|DONTFREE, symtab)->sval;

That code is also present in opensource.apple.com :: awk-1.2 :: tran.c (10.0), so it's not a recent change.

Any implementor who chooses an empty string for the value of SUBSEP should be shunned by the AWK community

. Seriously, though, the chance for collisions would be too great.

OS X's awk is nawk (which is also used by the BSD systems). "\034" is also the value of SUBSEP in the mawk, GNU awk, and busybox awk implementations.

In light of this, fiddling with SUBSEP is usually unnecessary.

Regards,
Alister

Last edited by alister; 10-13-2012 at 12:02 PM..

These 3 Users Gave Thanks to alister For This Post:

alister

View Public Profile for alister

Find all posts by alister

10-13-2012

Registered User

168, 3

Join Date: Jul 2008

Last Activity: 17 April 2020, 4:42 PM EDT

Posts: 168

Thanks Given: 66

Thanked 3 Times in 3 Posts

Thanks Elixir and pamu, it is working now. the only thing i forgot to mention is f1 has higher numeric count than f2 always.

Code:

example: if in 1st run of script 4th col of f2 =123, and f1=125
then second run will be f2=125, f1=127, always greater than value in f2

Last edited by sdosanjh; 10-13-2012 at 02:03 PM..

sdosanjh

View Public Profile for sdosanjh

Find all posts by sdosanjh

Shell Programming and Scripting

nawk is truncating output

10 More Discussions You Might Find Interesting

1. Solaris

How to avoid truncating in ps output ?

Discussion started by: solaris_1977

2. UNIX for Dummies Questions & Answers

awk truncating first field output?

Discussion started by: trashmouth12

3. Shell Programming and Scripting

Nawk command to output in var

Discussion started by: bulleteyedk

4. Shell Programming and Scripting

Nawk Problem - nawk out of space in tostring on

Discussion started by: Abhiraj Singh

5. Shell Programming and Scripting

How to print and append output of nawk script in commandline and as well into a file?

Discussion started by: Optimus81

6. Shell Programming and Scripting

NAWK conversion of hexadecimal input to decimal output via printf, I am close I can feel it

Discussion started by: PCGameGuy

7. Shell Programming and Scripting

help me how to use nawk for required output

Discussion started by: dodasajan

8. Shell Programming and Scripting

Truncating a variable

Discussion started by: whdr02

9. Shell Programming and Scripting

assigning nawk output to shell variable

Discussion started by: user_prady

10. Shell Programming and Scripting

Assigning nawk output to variables

Discussion started by: steveje0711