Match patterns from another file and tag Post: 302930529

Sponsored Content

Top Forums UNIX for Dummies Questions & Answers Match patterns from another file and tag Post 302930529 by senhia83 on Monday 5th of January 2015 09:55:18 AM

01-05-2015

Registered User

Match patterns from another file and tag

Hi all,

I have a file , which has 6 tab delimited fields, with $3 and $4 subfielded with spaces. I wamt to match cols $2,$3,$4 of tmp1 with tmp2, ..and then flag the 5th col if found.

tmp1

Code:

1756    Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
1763    Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
18078   Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
18115   Xerm    XermA XermB XermC XermD GG AA AA GG     B       2
18251   Xerm    XermA XermB XermC XermD GG AA AA GG     B       2
19352-005       Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
19352-006       Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
195A    Xerm    XermA XermB XermC XermD AA TT AA GG     A       1
A02     Xerm    XermA XermB XermC XermD GG TT GG GG     C       3
A04     Xerm    XermA XermB XermC XermD GG TT AA GG     D       4

tmp2

Code:

Xerm    XermA XermB XermC XermD AA TT AA GG

Expected output

Code:

1756    Xerm    XermA XermB XermC XermD AA TT AA GG     *A*       1
1763    Xerm    XermA XermB XermC XermD AA TT AA GG     *A*       1
18078   Xerm    XermA XermB XermC XermD AA TT AA GG     *A*       1
18115   Xerm    XermA XermB XermC XermD GG AA AA GG     B       2
18251   Xerm    XermA XermB XermC XermD GG AA AA GG     B       2
19352-005       Xerm    XermA XermB XermC XermD AA TT AA GG     *A*       1
19352-006       Xerm    XermA XermB XermC XermD AA TT AA GG     *A*      1
195A    Xerm    XermA XermB XermC XermD AA TT AA GG     *A*       1
A02     Xerm    XermA XermB XermC XermD GG TT GG GG     C       3
A04     Xerm    XermA XermB XermC XermD GG TT AA GG     D       4

Here is what I tried, not producing desired output

Code:

awk -F'\t' 'NR==FNR{a[$0]=$0;next} $2"\t"$3"\t"$4 in a {$5="*"$5"*"}1' OFS="\t"  tmp2 tmp1

senhia83

View Public Profile for senhia83

Find all posts by senhia83

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

sed/awk help to match list of patterns and remove from org file

Hi, From the pattern mentioned below remove lines based on pattern range. Conditions 1 Look For all lines starting with ALTER TABLE and Ending with ; and contains the word MOVE.I wanto to remove these lines from the file sample below. Note : The above pattern list could be found in...

2. Shell Programming and Scripting

Searching patterns in 1 file and deleting all lines with those patterns in 2nd file

Hi Gurus, I have a file say for ex. file1 which has 3500 lines in it which are different account numbers and another file (file2) which has 230000 lines in it. I want to read all the lines in file1 and delete all those lines from file2 which has that same pattern as in file1. I am not quite...

3. Shell Programming and Scripting

Removing file lines that each match to a different patterns

I have a very large file (10,000,000 lines), that contains a sample id and a property of that sample. I have another file that contains around 1,000,000 lines with sample ids that I want to remove from the original file (create a new file without these lines). I know how to do this in Perl, but it...

4. Shell Programming and Scripting

script to match patterns in 2 different files.

I am new to shell scripting and need some help. I googled, but couldn't find a similar scenario. Basically, I need to rename a datafile. This is the scenario - I have a file, readonly.txt that has 2 columns - file# and name. I have another file,missing_files.txt that has id and name. Both the...

5. Shell Programming and Scripting

Match multiple patterns in a file and then print their respective next line

Dear all, I need to search multiple patterns and then I need to print their respective next lines. For an example, in the below table, I will look for 3 different patterns : 1) # ATC_Codes: 2) # Generic_Name: 3) # Drug_Target_1_Gene_Name: #BEGIN_DRUGCARD DB00001 # AHFS_Codes:...

6. Shell Programming and Scripting

Match 2 different patterns and print the lines

Hi, i have been trying to extract multiple lines based on two different patterns as below:- file1 @jkm|kdo|aas012|192.2.3.1 blablbalablablkabblablabla sjfdsakfjladfjefhaghfagfkafagkjsghfalhfk fhajkhfadjkhfalhflaffajkgfajkghfajkhgfkf jahfjkhflkhalfdhfwearhahfl @jkm|sdf|wud08q|168.2.1.3...

7. Shell Programming and Scripting

Match 2 patterns together

How can I quickly print out lines in a datafile which has presence of both patterns in a row of another file. Maybe awk can do it much faster than bash. Patternfile ID1 PAT11 PAT12 ID1 PAT21 PAT22 ID2 PAT31 PAT32 datafile headerline...

8. Shell Programming and Scripting

Egrep patterns in a file and limit number of matches to print for each pattern match

Hi I need to egrep patterns in a file and limit number of matches to print for each matched pattern. -m10 option is not working out in my sun solaris 5.10 Please guide me the options to achieve. if i do head -10 , i wont be getting all pattern match results as output since for a...

9. Shell Programming and Scripting

awk to print match or non-match and select fields/patterns for non-matches

In the awk below I am trying to output those lines that Match between file1 and file2, those Missing in file1, and those missing in file2. Using each $1,$2,$4,$5 value as a key to match on, that is if those 4 fields are found in both files the match, but if those 4 fields are not found then missing...

10. Shell Programming and Scripting

awk to match file1 and extract specific tag values

File2 is tab-delimeted and I am trying to use $2 in file1 (space delimeted) as a search term in file2. If it is found then the AF= in and the FDP= values from file2 are extracted and printed next to the file1 line. I commented the awk before I added the lines in bold the current output resulted. I...

LEARN ABOUT MOJAVE

locale::codes::langvar

Locale::Codes::LangVar(3pm)				 Perl Programmers Reference Guide			       Locale::Codes::LangVar(3pm)

NAME

       Locale::Codes::LangVar - standard codes for language variation identification

SYNOPSIS

	  use Locale::Codes::LangVar;

	  $lvar = code2langvar('acm');		       # $lvar gets 'Mesopotamian Arabic'
	  $code = langvar2code('Mesopotamian Arabic'); # $code gets 'acm'

	  @codes   = all_langvar_codes();
	  @names   = all_langvar_names();

DESCRIPTION

       The "Locale::Codes::LangVar" module provides access to standard codes used for identifying language variations, such as those as defined in
       the IANA language registry.

       Most of the routines take an optional additional argument which specifies the code set to use. If not specified, the default IANA language
       registry codes will be used.

SUPPORTED CODE SETS

       There are several different code sets you can use for identifying language variations. A code set may be specified using either a name, or
       a constant that is automatically exported by this module.

       For example, the two are equivalent:

	  $lvar = code2langvar('en','alpha-2');
	  $lvar = code2langvar('en',LOCALE_CODE_ALPHA_2);

       The codesets currently supported are:

       alpha
	   This is the set of alphanumeric codes from the IANA language registry, such as 'arevela' for Eastern Armenian.

	   This code set is identified with the symbol "LOCALE_LANGVAR_ALPHA".

	   This is the default code set.

ROUTINES

       code2langvar ( CODE [,CODESET] )
       langvar2code ( NAME [,CODESET] )
       langvar_code2code ( CODE ,CODESET ,CODESET2 )
       all_langvar_codes ( [CODESET] )
       all_langvar_names ( [CODESET] )
       Locale::Codes::LangVar::rename_langvar  ( CODE ,NEW_NAME [,CODESET] )
       Locale::Codes::LangVar::add_langvar  ( CODE ,NAME [,CODESET] )
       Locale::Codes::LangVar::delete_langvar  ( CODE [,CODESET] )
       Locale::Codes::LangVar::add_langvar_alias  ( NAME ,NEW_NAME )
       Locale::Codes::LangVar::delete_langvar_alias  ( NAME )
       Locale::Codes::LangVar::rename_langvar_code  ( CODE ,NEW_CODE [,CODESET] )
       Locale::Codes::LangVar::add_langvar_code_alias  ( CODE ,NEW_CODE [,CODESET] )
       Locale::Codes::LangVar::delete_langvar_code_alias  ( CODE [,CODESET] )
	   These routines are all documented in the Locale::Codes::API man page.

SEE ALSO

       Locale::Codes
	   The Locale-Codes distribution.

       Locale::Codes::API
	   The list of functions supported by this module.

       http://www.iana.org/assignments/language-subtag-registry
	   The IANA language subtag registry.

AUTHOR

       See Locale::Codes for full author history.

       Currently maintained by Sullivan Beck (sbeck@cpan.org).

COPYRIGHT

	  Copyright (c) 2011-2013 Sullivan Beck

       This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

perl v5.18.2							    2014-01-06					       Locale::Codes::LangVar(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

sed/awk help to match list of patterns and remove from org file

Discussion started by: rajan_san

2. Shell Programming and Scripting

Searching patterns in 1 file and deleting all lines with those patterns in 2nd file

Discussion started by: toms

3. Shell Programming and Scripting

Removing file lines that each match to a different patterns

Discussion started by: Jo_puzzled

4. Shell Programming and Scripting

script to match patterns in 2 different files.

Discussion started by: mathews

5. Shell Programming and Scripting

Match multiple patterns in a file and then print their respective next line

Discussion started by: AshwaniSharma09

6. Shell Programming and Scripting

Match 2 different patterns and print the lines

Discussion started by: redse171

7. Shell Programming and Scripting

Match 2 patterns together

Discussion started by: abh.kumar

8. Shell Programming and Scripting

Egrep patterns in a file and limit number of matches to print for each pattern match

Discussion started by: ananan

9. Shell Programming and Scripting

awk to print match or non-match and select fields/patterns for non-matches

Discussion started by: cmccabe

10. Shell Programming and Scripting

awk to match file1 and extract specific tag values

Discussion started by: cmccabe

LEARN ABOUT MOJAVE

locale::codes::langvar