Print rows, having pattern in specific column...


 
# 15  
Old 10-06-2009
As Scrutinizer says, what is assigned to A[$1] is unimportant. It just has to be a value. Why are you changing the A[$1] assignment? What are you trying to do?

BTW, the 'next' statement has a good side effect: it prevents the A[$3] clause from being executed for the pattern file. If a novice later modifies the script, it guards against some undesired behavior. And awk doesn't have to do useless work evaluating a clause that serves no purpose for the pattern file.
# 16  
Old 10-06-2009
Hi,

Thanks.
I fully understand all the code now :D
Can I ask one more thing?
When I use A[$x]=$y, must x and y be the same number?

Besides that, if I use A[$x]=1, the output contains some empty lines in addition to the lines that match the pattern file.
Code:
awk 'NR == FNR { A[$1]=1 } A[$3]' pattern_file input_file
x
x
bca cd002 cd003 cza
bac cd004 cd005 zac
acb cd006 cd007 caz
cab cd007 cd008 azc
x
x

x represents an empty line.
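
One possible explanation for the empty lines, offered as an assumption because the thread does not show the raw files: if pattern_file ends with blank lines, A[$1]=1 sets A[""]=1, and from then on every line whose third field is empty (including blank lines in either file) satisfies A[$3]. With A[$1]=$1 the same blank line stores an empty string, which tests false, so nothing extra is printed. The sketch below uses hypothetical sample files and wraps each printed line in brackets so that empty lines stay visible.

```shell
tmp=$(mktemp -d)

# Assumed inputs: both files end with one blank line (hypothetical data).
printf 'cd003\ncd005\n\n' > "$tmp/pattern_file"
printf 'bca cd002 cd003 cza\nabc cd000 cd001 zzz\n\n' > "$tmp/input_file"

# Value 1: the blank pattern line sets A[""]=1, so blank lines are printed too.
with_one=$(awk 'NR == FNR { A[$1] = 1 } A[$3] { print "[" $0 "]" }' \
    "$tmp/pattern_file" "$tmp/input_file")

# Value $1: the blank pattern line stores "", which is false, so no blank lines.
with_field=$(awk 'NR == FNR { A[$1] = $1 } A[$3] { print "[" $0 "]" }' \
    "$tmp/pattern_file" "$tmp/input_file")

# Guarded variant: skip blank pattern lines (NF is 0 there) and use next.
guarded=$(awk 'NR == FNR { if (NF) A[$1] = 1; next } A[$3] { print "[" $0 "]" }' \
    "$tmp/pattern_file" "$tmp/input_file")

printf 'A[$1]=1:\n%s\n\nA[$1]=$1:\n%s\n\nguarded:\n%s\n' \
    "$with_one" "$with_field" "$guarded"

rm -rf "$tmp"
```

If the real files contain no blank lines, this is not the cause and the inputs would need a closer look.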
Quote:
Originally Posted by Scrutinizer
A[$1] = $1 is used to set up the array of patterns. It is set when NR is equal to FNR, in other words while the first file (the pattern file) is being read. It means: set the element of the associative array A whose key is $1 to the value of $1, $1 being the first field of your pattern file. So with the OP's provided inputs it gets filled like so:
Code:
A[cd003]=cd003
A[cd005]=cd005
A[cd007]=cd007
A[cd008]=cd008

So once the pattern file is done, awk starts reading the input file and prints each line whose field 3 exists as a key in the array.

The script is only testing the existence of the array elements, not using their contents. So IMO the use of $1 as the value is a tiny bit superfluous. I think we could also just set it to 1 instead of $1:
Code:
awk 'NR == FNR { A[$1]=1; next } A[$3]' pattern_file input_file

or, since the pattern file does not have a third column:
Code:
awk 'NR == FNR { A[$1]=1 } A[$3]' pattern_file input_file



---------- Post updated at 01:31 AM ---------- Previous update was at 01:30 AM ----------

Thanks jp2542a,
I fully understand the code now :D
Thank you very much for all of your explanations :)

Quote:
Originally Posted by jp2542a
As Scrutinizer says, what is assigned to A[$1] is unimportant. It just has to be a value. Why are you changing the A[$1] assignment? What are you trying to do?

BTW, the 'next' statement has a good side effect: it prevents the A[$3] clause from being executed for the pattern file. If a novice later modifies the script, it guards against some undesired behavior. And awk doesn't have to do useless work evaluating a clause that serves no purpose for the pattern file.
# 17  
Old 10-06-2009
Quote:
Originally Posted by jp2542a
As Scrutinizer says, what is assigned to A[$1] is unimportant. It just has to be a value. [...]
It could be undefined as well; in that case one should check whether the key exists:

Code:
$1 in A

# 18  
Old 10-06-2009
Quote:
Originally Posted by radoulov
Code:
$1 in A

Hi Radoulov, you mean $3 in A, no?
# 19  
Old 10-06-2009
While the A[$3] construct works where I've tried it, I think ($3 in A) is much better.
# 20  
Old 10-06-2009
Quote:
Originally Posted by Scrutinizer
Hi Radoulov, you mean $3 in A, no?
Yes, sorry.
It should be the field of the second (non-empty) input file that you want to match:

Code:
awk 'NR == FNR { A[$1]; next } $3 in A' pattern_file input_file
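
A minimal sketch of why the two tests differ, using hypothetical sample files built from the thread's data (the first input line is an invented non-matching row). A[$3] tests the stored value and creates the element as a side effect, while ($3 in A) only tests whether the key exists. The variable names are illustrative only.

```shell
tmp=$(mktemp -d)
printf 'cd003\ncd005\ncd007\ncd008\n' > "$tmp/pattern_file"
printf '%s\n' 'abc cd000 cd001 zzz' 'bca cd002 cd003 cza' 'bac cd004 cd005 zac' \
    > "$tmp/input_file"

# Keys created by a bare reference hold empty values, so A[$3] is false
# for every line and nothing is printed.
by_value=$(awk 'NR == FNR { A[$1]; next } A[$3]' \
    "$tmp/pattern_file" "$tmp/input_file")

# ($3 in A) tests key existence and prints the matching lines.
by_key=$(awk 'NR == FNR { A[$1]; next } $3 in A' \
    "$tmp/pattern_file" "$tmp/input_file")

# Count the keys left at the end: every A[$3] lookup adds the key it looked up.
count_by_value=$(awk 'NR == FNR { A[$1]; next } { if (A[$3]) hits++ }
    END { n = 0; for (k in A) n++; print n }' "$tmp/pattern_file" "$tmp/input_file")
count_by_key=$(awk 'NR == FNR { A[$1]; next } { if ($3 in A) hits++ }
    END { n = 0; for (k in A) n++; print n }' "$tmp/pattern_file" "$tmp/input_file")

printf 'A[$3] printed:\n%s\n\n($3 in A) printed:\n%s\n\n' "$by_value" "$by_key"
printf 'keys after A[$3]: %s, keys after ($3 in A): %s\n' \
    "$count_by_value" "$count_by_key"

rm -rf "$tmp"
```

On a large input file the extra keys created by A[$3] also cost memory, which is another reason to prefer the `in` test.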

# 21  
Old 10-06-2009
Quote:
Originally Posted by jp2542a
BTW, the 'next' statement has a good side effect: it prevents the A[$3] clause from being executed for the pattern file. If a novice later modifies the script, it guards against some undesired behavior. And awk doesn't have to do useless work evaluating a clause that serves no purpose for the pattern file.
You are right: leaving out next only works in this case because the key is in column 1 of the pattern file but in column 3 of the input file, and the pattern file has no third column. But even then it causes unnecessary processing.
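
To illustrate, here is a small sketch of what happens when the key sits in the same column of both files. The files are hypothetical, with the key in column 1 of each.

```shell
tmp=$(mktemp -d)

# Hypothetical sample data: keys in column 1 of both files.
printf 'cd003\ncd005\n' > "$tmp/pattern_file"
printf 'cd003 bca\ncd004 bac\ncd005 acb\n' > "$tmp/input_file"

# Without next: each pattern-file line also satisfies A[$1] and is printed.
without_next=$(awk 'NR == FNR { A[$1] = 1 } A[$1]' \
    "$tmp/pattern_file" "$tmp/input_file")

# With next: only the matching lines of input_file are printed.
with_next=$(awk 'NR == FNR { A[$1] = 1; next } A[$1]' \
    "$tmp/pattern_file" "$tmp/input_file")

printf 'without next:\n%s\n\nwith next:\n%s\n' "$without_next" "$with_next"

rm -rf "$tmp"
```

Without next, the output starts with the two pattern lines themselves, which is the undesired behavior mentioned above.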