Compare Values between column in the same file

09-28-2016

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

I agree with RudiC that your specification (in both post #1 [after 20 edits] and in post #12) saying:

Quote:

if Column 3 has value= P then check column 7,8,9,10,12,13 (values should be zero in each) and hence sum of these columns should be zero.
if Column 3 has value= N then check column 7,8,9,10,12,13 (values should be zero in each) and hence sum of these columns should be zero.

appears to be a mistake. And to get the output you said you want, it would seem that if field #3 is N, you really want to check 7, 8, 9, 10, 11, and 13 instead of checking exactly the same fields for both P and N.

Assuming that that is correct, here is another awk script using the same underlying logic as the code RudiC suggested. This script builds the array of fields to be skipped for each letter than appears in the string in field #3 manually. RudiC's script depends on an extension to the standards (using an empty ERE in split() to create an array from single characters in a string) that is available in GNU awk but is not often supported on versions of awk available on UNIX systems and BSD-based systems. The standards say that the behavior of awk is unspecified when split() is called with an empty string specified as the field separator.

Unlike RudiC's code, this adds a new column to the header line as shown in the output header you said you wanted. And, while RudiC's code adds up all of the fields being checked, this code looks at each field individually and breaks out of the loop immediately if a non-zero value is found in a field that is to be checked.

The following awk script:

Code:

awk '
BEGIN {	# Initialize skip array: S[fn] = char
	# If field #3 contains the character specified by char, do NOT check
	# the contents of field #fn.
	S[7] = "S"
	S[8] = "E"
	S[9] = "A"
	S[10] = "M"
	S[11] = "P"
	S[12] = "N"
	S[13] = "C"
}
NR < 2 {# Add the requested heading field to the header line...
	print $0, "Newcolumn"
	# and skip to the next input line.
	next
}
{	# For all other input lines, check fields 7-13 inclusive:
	for(i = 7; i <= 13; i++)
		# If the character corresponding to the field is not prsent in
		# field #3 AND the field contains a non-zero value...
		if($3 !~ S[i] && $i) 
			# break out of the loop.
			break
	# Print the input line followed by "WARNING" if a non-zero value was
	# found in a field to be checked; otherwise, print the input line
	# followed by "GOOD".
	print $0, (i <= 13) ? "WARNING" : "GOOD"
}' file

when file contains the sample input you provided, produces the output:

Code:

COLUMN1 COLUMN2 COLUMN3 COLUMN4 COLUMN5 COLUMN6 SMS Email AO Mail Post N Cell Newcolumn
VEGE Potato E W 396 12 0 384 0 0 0 0 0 GOOD
VEGE Onion S W 17 0 17 0 0 0 0 0 0 GOOD
FRUIT APPLE N W 549 61 0 0 0 0 0 488 0 GOOD
FRUIT APPLE SE W 291 14 239 38 0 10 0 0 0 WARNING
FRUIT APPLE EAMS W 397 32 309 56 309 309 0 0 0 GOOD
FRUIT APPLE SEA W 808 58 663 87 488 20 0 0 0 WARNING

which seems to match the output you said you wanted.

If you want to try this on a Solaris/SunOS system, change awk to /usr/xpg4/bin/awk or nawk.

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

09-29-2016

Registered User

21, 0

Join Date: Sep 2016

Last Activity: 6 October 2016, 10:52 AM EDT

Posts: 21

Thanks Given: 23

Thanked 0 Times in 0 Posts

@Don ..Yes you and Rudi were right about P and N . Your code is working fine only one issue when I am running it, it is giving me list of files in that directory and gives the output.

doing something like below

Code:

> }
> {# For all other input lines, check fields 7-13 inclusive:
> for(i = 7; i <= 13; i++)
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> # If the character corresponding to the field is not prsent in
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> # field #3 AND the field contains a non-zero value...
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> if($3 !~ S[i] && $i)
>

Nina2910

View Public Profile for Nina2910

Find all posts by Nina2910

09-29-2016

Moderator

3,105, 1,603

Join Date: May 2013

Last Activity: 31 August 2020, 1:46 AM EDT

Location: Chennai

Posts: 3,105

Thanks Given: 1,269

Thanked 1,603 Times in 1,369 Posts

Hello Nina2910,

Could you please create a script for example script.ksh, paste script there then give it executable permissions eg-->chmod 755 and then run Rudi's/Don's code, it should fly without showing anything else then.

Thanks,
R. Singh

Last edited by RavinderSingh13; 09-29-2016 at 12:19 PM..

This User Gave Thanks to RavinderSingh13 For This Post:

RavinderSingh13

View Public Profile for RavinderSingh13

Find all posts by RavinderSingh13

09-29-2016

Registered User

21, 0

Join Date: Sep 2016

Last Activity: 6 October 2016, 10:52 AM EDT

Posts: 21

Thanks Given: 23

Thanked 0 Times in 0 Posts

@Ravinder...Rudi's code is working without script that's was only for Don's code

Nina2910

View Public Profile for Nina2910

Find all posts by Nina2910

09-29-2016

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

Quote:

Originally Posted by Nina2910

@Rudi...yes you are right and code is working perfectly fine..Thank you.

would it be possible if you can explain the code ?

Yes:

Code:

awk '
BEGIN   {MX = split ("      SEAMPNC", CH, _)                    # create the CH array : CH[1] = " ",...,CH[7] = "S",...,CH[13] = "C" by splitting a string constant at the empty string ("_")
        }                                                       # MX variable could be skipped in this context; provided only for totally "dynamic data"
NR > 1  {SUM = 0                                                # initialize SUM for every input line
         for (i = 7; i<=13; i++) if ($3 !~ CH[i]) SUM += $i     # for the to be checked fields: test if the relevant char (CH[fieldNr]) is found in $3
                                                                # if yes, DON"T sum the field
         $(NF+1) = SUM?"WARNING":"GOOD"                         # add the respective info as the "last plus one" filed.
        }
1                                                               # default action: print
 ' file

Last edited by RudiC; 09-29-2016 at 01:34 PM..

This User Gave Thanks to RudiC For This Post:

RudiC

View Public Profile for RudiC

Find all posts by RudiC

09-29-2016

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Quote:

Originally Posted by Nina2910

Code:

> }
> {# For all other input lines, check fields 7-13 inclusive:
> for(i = 7; i <= 13; i++)
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> # If the character corresponding to the field is not prsent in
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> # field #3 AND the field contains a non-zero value...
>
.csv                          20160405_StarData/            BCSreplay/                    archive/                      neha.txt                      sync_query_results_26668.txt
20140327_ISP_AAID_migrations/ 25049_CT_DELIVERY2.txt        DSLOrderWTN/                  data_for_demos/               neha2.txt                     sync_query_results_29460.txt
20150818_AppleCreate/         32706_CT_DELIVERY2.txt        MIGRATION_DATA_DIRECTORIES/   dp332j/                       neha3.txt                     sync_query_results_6635.txt
20151008_OMSIssue_DeTitanize/ ActivateHSIA/                 OldProjectDirs/               igate_data/                   other_logs/                   text.txt
20151014_DetitanizeMobility/  AdHoc_Requests/               ProcessEDDFiles/              junk/                         replays/                      tmpUniqAsMob.out
20160314_FULCRUM/             AddOrder_data/                ProjectDirs/                  kr9850/                       sh2818/                       tmpUniqAsMob.sql
> if($3 !~ S[i] && $i)
>

Pasting a script that contains tabs into a shell that uses tabs to trigger command completion is not going to work. As Ravinder suggested, copy my script into a file and execute the file.

You haven't told us what operating system or shell you're using, but I test it out with the following in a file named tester:

Code:

#!/bin/ksh
awk '
BEGIN {	# Initialize skip array: S[fn] = char
	# If field #3 contains the character specified by char, do NOT check
	# the contents of field #fn.
	S[7] = "S"
	S[8] = "E"
	S[9] = "A"
	S[10] = "M"
	S[11] = "P"
	S[12] = "N"
	S[13] = "C"
}
NR < 2 {# Add the requested heading field to the header line...
	print $0, "Newcolumn"
	# and skip to the next input line.
	next
}
{	# For all other input lines, check fields 7-13 inclusive:
	for(i = 7; i <= 13; i++)
		# If the character corresponding to the field is not prsent in
		# field #3 AND the field contains a non-zero value...
		if($3 !~ S[i] && $i) 
			# break out of the loop.
			break
	# Print the input line followed by "WARNING" if a non-zero value was
	# found in a field to be checked; otherwise, print the input line
	# followed by "GOOD".
	print $0, (i <= 13) ? "WARNING" : "GOOD"
}' "${1:-file}"

make it executable with:

Code:

chmod +x tester

and then executing it with:

Code:

./tester

produces the output I showed you in post #15 in this thread. If you invoke it with an operand that is the pathname of a different file to be processed:

Code:

./tester otherfile

it will process the data in otherfile instead of the data in a file named file.

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

09-30-2016

Registered User

21, 0

Join Date: Sep 2016

Last Activity: 6 October 2016, 10:52 AM EDT

Posts: 21

Thanks Given: 23

Thanked 0 Times in 0 Posts

would it possible if code can add also "Bad" to value which is suppose to be zero but it is not.

Code:

 
 COLUMN1 COLUMN2 COLUMN3 COLUMN4 COLUMN5 COLUMN6 SMS Email AO Mail Post N Cell Newcolumn
VEGE Potato E W 396 12 0 384 0 0 0 0 0 GOOD
VEGE Onion S W 17 0 17 0 0 0 0 0 0 GOOD
FRUIT APPLE N W 549 61 0 0 0 0 0 488 0 GOOD
 FRUIT APPLE SE W 291 14 239 38 0 Bad10 0 0 0 WARNING
FRUIT APPLE EAMS W 397 32 309 56 309 309 0 0 0 GOOD
FRUIT APPLE SEA W 808 58 663 87 488 Bad20 0 0 0 WARNING

like I added bad in row number 5 and 7.

Nina2910

View Public Profile for Nina2910

Find all posts by Nina2910

UNIX for Beginners Questions & Answers

Compare Values between column in the same file

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Compare values in multiple rows in one column using awk

Discussion started by: jiam912

2. Shell Programming and Scripting

Compare two files column values using awk

Discussion started by: judi

3. Shell Programming and Scripting

How to compare the values of a column in a same file using awk?

Discussion started by: utritala

4. UNIX for Dummies Questions & Answers

Compare values of fields from same column with awk

Discussion started by: lucasvs

5. Shell Programming and Scripting

Compare values in two files. For matching rows print corresponding values from File 1 in File2.

Discussion started by: Santoshbn

6. Shell Programming and Scripting

Take values from a column and put it in a variable and compare

Discussion started by: arijitsaha

7. Shell Programming and Scripting

How to compare the values of a column in awk in a same file and consecutive lines..

Discussion started by: manuswami

8. UNIX for Dummies Questions & Answers

Compare two files using awk or sed, add values in a column if their previous fields are same

Discussion started by: yerruhari

9. UNIX for Advanced & Expert Users

Compare two files using awk or sed, add values in a column if their previous fields are same

Discussion started by: yerruhari

10. Shell Programming and Scripting

I need to extract last column of a file and compare the values

Discussion started by: vukkusila