Match part of string in file2 based on column in file1
I have a file containing texts and indexes. I need the text between (and including ) INDEX and number "1" alone in line. I have managed this:
It works for all indexes.
And then I have second file with years and indexes per year, one per line
I need all indexes for a given year (and ONLY for a given year). I have managed this, too:
What I'd like to do is join the two- so that I would get all the texts from file1 that match indexes printed by second command.
any ideas?
Last edited by Franklin52; 05-28-2013 at 08:56 AM..
Reason: Please use code tags
Hi, i've two files (file1, file2) i want to take value (in column1) and search in file2 if the they match print the value from file2.
this is what i have so far.
awk 'FILENAME=="file1"{ arr=$1 }
FILENAME=="file2"
{print $0}
' file1 file2 (2 Replies)
Match column 3 in file1 to column 1 in file 2 and replace with column 2 from file2
file 1 sample
SNDK 80004C101 AT
XLNX 983919101 BB
NETL 64118B100 BS
AMD 007903107 CC
KLAC 482480100 DC
TER 880770102 KATS
ATHR 04743P108 KATS... (7 Replies)
hi;
i am looking for simple search script that find string from file1 in file 2
file 1 contain a loot of string like:
204080111111111
204080222222222
204080333333333
in each row
and i would like to take the first row for example 204080111111111 from file1 and find it in file2 when it... (1 Reply)
I have very limited coding skills but I'm wondering if someone could help me with this. There are many threads about matching strings in two files, but I have no idea how to add a column from one file to another based on a matching string.
I'm looking to match column1 in file1 to the number... (3 Replies)
file1:
file2:
I need to find matches for any lines in file1 that appear in file2. Desired output is '>' plus the file1 term, followed by the line after the match in file2 (so the title is a little misleading):
This is honestly beyond what I can do without spending the whole night on it, so I'm... (2 Replies)
I have two files.
File 1 is a two-column index file, e.g.
comp11084_c0_seq6:130-468(-) comp12746_c0_seq3:140-478(+)
comp11084_c0_seq3:201-539(-) comp12746_c0_seq2:191-529(+)
File 2 is a sequence file with headers named with the same terms that populate file 1. ... (1 Reply)
I have a list of IDs in file1 and a list of sequences in file2. I can print sequences from file2, but I'm asking for help in printing the sequences in the same order as the IDs appear in file1.
file1:
EN_comp12952_c0_seq3:367-1668
ES_comp17168_c1_seq6:1-864
EN_comp13395_c3_seq14:231-1088... (5 Replies)
I am trying to use awk to find all the $2 values in file2 which is ~30MB and tab-delimited, that are between $2 and $3 in file1 which is ~2GB and tab-delimited.
I have just found out that I need to use $1 and $2 and $3 from file1 and $1 and $2of file2 must match $1 of file1 and be in the range... (6 Replies)
Discussion started by: cmccabe
6 Replies
LEARN ABOUT HPUX
comm
comm(1) General Commands Manual comm(1)NAME
comm - select or reject lines common to two sorted files
SYNOPSIS
file1 file2
DESCRIPTION
comm reads file1 and file2, which should be ordered in increasing collating sequence (see sort(1) and Environment Variables below), and
produces a three-column output:
Column 1: Lines that appear only in file1,
Column 2: Lines that appear only in file2,
Column 3: Lines that appear in both files.
If is used for file1 or file2, the standard input is used.
Options 1, 2, or 3 suppress printing of the corresponding column. Thus prints only the lines common to the two files; prints only lines in
the first file but not in the second; does nothing useful.
EXTERNAL INFLUENCES
Environment Variables
determines the collating sequence expects from the input files.
determines the language in which messages are displayed.
If is not specified in the environment or is set to the empty string, the value of determines the language in which messages are displayed.
If is not specified in the environment or is set to the empty string, the value of is used as a default. If is not specified or is set to
the empty string, a default of ``C'' (see lang(5)) is used instead of If any internationalization variable contains an invalid setting,
behaves as if all internationalization variables are set to ``C''. See environ(5).
International Code Set Support
Single- and multi-byte character code sets are supported.
EXAMPLES
The following examples assume that and have been ordered in the collating sequence defined by the or environment variable.
Print all lines common to and (in other words, print column 3):
Print all lines that appear in but not in (in other words, print column 1):
Print all lines that appear in but not in (in other words, print column 2):
SEE ALSO cmp(1), diff(1), sdiff(1), sort(1), uniq(1).
STANDARDS CONFORMANCE comm(1)