Need help with filtering records in a file

12-19-2017

Registered User

14, 0

Join Date: Sep 2011

Last Activity: 28 October 2019, 7:24 AM EDT

Posts: 14

Thanks Given: 4

Thanked 0 Times in 0 Posts

Need help with filtering records in a file

Hi,

I have following records in a file

more file1.txt

Code:

[123] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction ABC for user [user1@ldap1]
[345] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction CDE for user [user2@ldap2]
[678] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction XXX for user [user3@msad]
[628] [AGENT] [abc] [cde] [fgh] [asd] [asd] logging applicaction XXX for user [user3@ldap1]

I need to filter out records which have strings " setting application " and records which don't have @msad user.(elilminate records with uses@msad)

Output:

I tried below code

Code:

while read row;
do
echo $row | egrep "setting application" | awk -F@ldap '{print $0}' >> tempfile.txt

done < file1.txt

output am getting:

Code:

[123] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction ABC for user [user1@ldap1]
[345] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction CDE for user [user2@ldap2]
[678] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction XXX for user [user3@msad]

Desired output:

Code:

[123] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction ABC for user [user1@ldap1]
[345] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting applicaction CDE for user [user2@ldap2]

But this is not working as expected, its printing all the records.

Last edited by manid; 12-19-2017 at 01:02 AM..

manid

View Public Profile for manid

Find all posts by manid

12-19-2017

Registered User

5,091, 1,931

Join Date: May 2012

Last Activity: 15 July 2020, 4:46 AM EDT

Location: Simplicity

Posts: 5,091

Thanks Given: 565

Thanked 1,931 Times in 1,668 Posts

Your awk command sets a field delimiter that accordingly splits the line into $1 and $2 but then you split the whole line $0.

Most simple is a combination of grep and grep -v

Code:

egrep " setting application " file1.txt | egrep -v "@msad" > tempfile.txt

Also doable with shell built-ins.

Code:

while IFS="" read row
do
  case $row in
  *" setting application "*)
    case $row in
    *"@msad"*)
    ;;
    *)
      printf "%s\n" "$row"
    ;;
    esac
  ;;
  esac
done < file1.txt > tempfile.txt

MadeInGermany

View Public Profile for MadeInGermany

Find all posts by MadeInGermany

12-19-2017

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

If you're going to use two greps, with the fixed strings needed for this project, grep -F (or fgrep) will be faster than egrep (or grep -E). I.e., try:

Code:

grep -F ' setting application ' file1.txt | grep -vF '@mead' > tempfile.txt

If you don't want to do this just with shell built-ins as MadeInGermany suggested, you could also use either of the following awk scripts (that just need one invocation of awk):

Code:

awk -F'@msad' 'NF == 1 && / setting application /' file1.txt > tempfile.txt
awk '/ setting application / && ! /@msad/' file1.txt > tempfile.txt

Note, however, that with the supplied sample data from post #1 in file1.txt, none of the above produce any output.

All of these (and the sample code shown in post #1 are looking for setting application , but the desired lines in that sample data have applicaction instead of application

To get the output requested in post #1 from the input provided in post #1, you would need to use one of the following:

Code:

grep -F ' setting applicaction ' file1.txt | grep -vF '@mead' > tempfile.txt
awk -F'@msad' 'NF == 1 && / setting applicaction /' file1.txt > tempfile.txt
awk '/ setting applicaction / && ! /@msad/' file1.txt > tempfile.txt

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

12-19-2017

Registered User

14, 0

Join Date: Sep 2011

Last Activity: 28 October 2019, 7:24 AM EDT

Posts: 14

Thanks Given: 4

Thanked 0 Times in 0 Posts

I just want to know , is it possible to add column to the end of each record with the filename like below?

Code:

echo $row | grep -F "setting application" | grep -vF "@msad" | awk -v var="$file" '{print $0 var}'  >> test.txt

Code:

[123] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting application ABC for user [user1@ldap1] file1.txt
[345] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting application CDE for user [user2@ldap2] file1.txt

Moderator's Comments:

Please use CODE tags as required by forum rules!

Last edited by RudiC; 12-19-2017 at 12:37 PM.. Reason: Added CODE tags.

manid

View Public Profile for manid

Find all posts by manid

12-19-2017

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

What happens if you try that (provided that the file shell variable is defined correctly)?

RudiC

View Public Profile for RudiC

Find all posts by RudiC

12-19-2017

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Quote:

Originally Posted by manid

I just want to know , is it possible to add column to the end of each record with the filename like below?

Code:

echo $row | grep -F "setting application" | grep -vF "@msad" | awk -v var="$file" '{print $0 var}'  >> test.txt

Code:

[123] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting application ABC for user [user1@ldap1] file1.txt
[345] [AGENT] [abc] [cde] [fgh] [asd] [asd] setting application CDE for user [user2@ldap2] file1.txt

If you are reading a file line-by-line as suggested by MadeInGermany in post #2, please never feed those lines through a three element pipeline. Doing so is GROSSLY inefficient!

If you are processing multiple files (which would be a logical reason for adding the name of the file each record came from in the output you produce), you can easily modify MadeInGermany's shell suggestion from post #2 with something like:

Code:

for file in file*.txt
do	while IFS="" read row
	do	case $row in
		(*" setting application "*)
			case $row in
			(*"@msad"*)
				;;
			(*)	printf "%s %s\n" "$row" "$file";;
			esac;;
		esac
	done < "$file"
done > tempfile.txt

(which just uses shell built-ins and doesn't need to invoke any external utilities) or extend my awk suggestion from post #3 to something like:

Code:

awk '/ setting application / && ! /@msad/ { print $0, FILENAME }' file*.txt > tempfile.txt

(which just invokes awk once). Note the comma between $0 and FILENAME in the print statement; without it, there won't be any separation between the input lines and the name of the file from which it was extracted.

The code that you showed us in post #4 invokes two copies of grep and one copy of awk for every line read from each of the files you process.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

Shell Programming and Scripting

Need help with filtering records in a file

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Filtering records of a csv file based on a value of a column

Discussion started by: sunilmudikonda

2. Shell Programming and Scripting

Separate records of a file on 2 types of records

Discussion started by: jamcogar

3. Shell Programming and Scripting

Deleting duplicate records from file 1 if records from file 2 match

Discussion started by: vestport

4. UNIX for Dummies Questions & Answers

Filtering records from 1 file based on some manipulation doen on second file

Discussion started by: mintu41

5. UNIX for Dummies Questions & Answers

Grep specific records from a file of records that are separated by an empty line

Discussion started by: Atrisa

6. Shell Programming and Scripting

filtering records based on numeric field value in 8th position

Discussion started by: indusri

7. Shell Programming and Scripting

Issues with filtering duplicate records using gawk script

Discussion started by: nmumbarkar

8. UNIX for Dummies Questions & Answers

Filtering records of a file based on a value of a column

Discussion started by: risk_sly

9. UNIX for Dummies Questions & Answers

Use records from one file to delete records in another file

Discussion started by: kenneth.mcbride

10. Shell Programming and Scripting

Count No of Records in File without counting Header and Trailer Records

Discussion started by: guiguy