Compare columns in two different files using awk Post: 302555705

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to compare 2 files & get only few columns based on a condition related to both files?

Hiiiii friends I have 2 files which contains huge data & few lines of it are as shown below File1: b.dat(which has 21 columns) SSR 1976 8 12 13 10 44.00 39.0700 70.7800 7.0 0 0.00 0 2.78 0.00 0.00 0 0.00 2.78 0 NULL ISC 1976 8 12 22 32 37.39 36.2942 70.7338...

2. Shell Programming and Scripting

awk compare 2 columns, 2 files, output whole line

Hello, I have not been able to find what I'm looking for via searching the forum. I could use some help with an awk script or one-liner to solve this simple problem. I have two files. If $1 and $2 from file1 match $1 and $2 from file2, print the whole line from file2. Example file1 ...

3. Shell Programming and Scripting

awk compare specific columns from 2 files, print new file

Hello. I have two files. FILE1 was extracted from FILE2 and modified thanks to help from this post. Now I need to replace the extracted, modified lines into the original file (FILE2) to produce the FILE3. FILE1 1466 55.27433 14.72050 -2.52E+03 3.00E-01 1.05E+04 2.57E+04 1467 55.27433...

4. Shell Programming and Scripting

Compare Columns of two files

Hi I have file 1 like this and file 2 like this I need to compare column 3 of both files and delete lines in file1 with same column 3 values in two files. So the output is I tried with perl but didnt work. A perl code will be good as I am learning the language, but any other code would...

5. Shell Programming and Scripting

Compare intervals (columns) from two files (awk, grep, Perl?)

Hi dear users, I need to compare numeric columns in two files. These files have the following structure. K.txt (4 columns) A001 chr21 9805831 9846011 A002 chr21 9806202 9846263 A003 chr21 9887188 9988593 A003 chr21 9887188 ...

6. Shell Programming and Scripting

Compare columns in different files

Hi, I have two files like this: 8 1.3 10 1.3 12 1.3 15 1.3 21 1.3 and 1 2 3 4 10 11 15 16 21 22

7. Shell Programming and Scripting

[Solved] awk compare two different columns of two files and print all from both file

Hi, I want to compare two columns from file1 with another two column of file2 and print matched and unmatched column like this File1 1 rs1 abc 3 rs4 xyz 1 rs3 stu File2 1 kkk rs1 AA 10 1 aaa rs2 DD 20 1 ccc ...

8. Shell Programming and Scripting

Compare 2 csv files by columns, then extract certain columns of matcing rows

Hi all, I'm pretty much a newbie to UNIX. I would appreciate any help with UNIX coding on comparing two large csv files (greater than 10 GB in size), and output a file with matching columns. I want to compare file1 and file2 by 'id' and 'chain' columns, then extract exact matching rows'...

9. UNIX for Dummies Questions & Answers

Help need to compare columns in files

Hi, Below is my requirement file1 id|cnt 1|1 2|2 3|3 file2 id_1|cnt_1 1|1 2|1 3|1 I want to compare cnt and cnt_1 columns, if they are differ then give the details Am using below awk command, but the output is not as expected.

10. Shell Programming and Scripting

Compare 2 columns of files awk

hello everybody I have 2 files the file1 has 10 columns and the form: ... 110103 0802 1.16 38 20.16 22 1.21 8.77 0.00 20 120103 0832 23.40 38 22.10 21 46.35 10.17 0.00 28 120103 1413 45.00 38 24.50 21 48.85 7.89 0.00 38 130103 1112 23.40 38 22.10 21 48.85 ...

LEARN ABOUT SUNOS

join

join(1) 							   User Commands							   join(1)

NAME

       join - relational database operator

SYNOPSIS

       join [-a filenumber | -v filenumber]  [-1 fieldnumber] [-2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

       join [-a filenumber] [-j fieldnumber] [-j1 fieldnumber] [-j2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

DESCRIPTION

       The join command forms, on the standard output, a join of the two relations specified by the lines of file1 and file2.

       There  is  one  line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally con-
       sists of the common field, then the rest of the line from file1, then the rest of the line from file2. This format can be changed by  using
       the  -o	option	(see  below).  The  -a	option	can be used to add unmatched lines to the output. The -v option can be used to output only
       unmatched lines.

       The default input field separators are blank, tab, or new-line. In this case, multiple separators count as one field separator, and leading
       separators are ignored. The default output field separator is a blank.

       If the input files are not in the appropriate collating sequence, the results are unspecified.

OPTIONS

       Some of the options below use the argument filenumber. This argument should be a 1 or a 2 referring to either file1 or file2, respectively.

       -a filenumber	       In addition to the normal output, produce a line for each unpairable line in file filenumber, where filenumber is 1
			       or 2. If both -a 1 and -a 2 are specified, all unpairable lines will be output.

       -e string	       Replace empty output fields in the list selected by option -o with the string string.

       -j fieldnumber	       Equivalent to -1fieldnumber -2fieldnumber.

       -j1 fieldnumber	       Equivalent to -1fieldnumber.

       -j2 fieldnumber	       Equivalent to -2fieldnumber. Fields are numbered starting with 1.

       -o list		       Each output line includes the fields specified in list. Fields selected by list that do not  appear  in	the  input
			       will be treated as empty output fields. (See the -e option.) Each element of which has the either the form filenum-
			       ber.fieldnumber, or 0, which represents the join field.	The  common  field  is	not  printed  unless  specifically
			       requested.

       -t char		       Use character char as a separator. Every appearance of char in a line is significant. The character char is used as
			       the field separator for both input and output. With this option specified, the collating term should be the same as
			       sort without the -b option.

       -v filenumber	       Instead of the default output, produce a line only for each unpairable line in filenumber, where filenumber is 1 or
			       2. If both -v 1 and -v 2 are specified, all unpairable lines will be output.

       -1 fieldnumber	       Join on the fieldnumberth field of file 1. Fields are decimal integers starting with 1.

       -2fieldnumber	       Join on the fieldnumberth field of file 2. Fields are decimal integers starting with 1.

OPERANDS

       The following operands are supported:

       file1

       file2	A path name of a file to be joined. If either of the file1 or file2 operands is -, the standard input is used in its place.

       file1 and file2 must be sorted in increasing collating sequence as determined by LC_COLLATE on the fields on which they are to  be  joined,
       normally the first in each line (see sort(1)).

USAGE

       See largefile(5) for the description of the behavior of join when encountering files greater than or equal to 2 Gbyte (2**31 bytes).

EXAMPLES

       Example 1: Joining the password file and group file

       The following command line will join the password file and the group file, matching on the numeric group ID, and outputting the login name,
       the group name and the login directory. It is assumed that the files have been sorted in ASCII collating sequence on the group ID fields.

       example% join -j1 4-j2 3 -o 1.1 2.1 1.6 -t:/etc/passwd /etc/group

       Example 2: Using the -o option

       The -o 0 field essentially selects the union of the join fields. For example, given file phone:

       !Name	       Phone Number
       Don	       +1 123-456-7890
       Hal	       +1 234-567-8901
       Yasushi	       +2 345-678-9012

       and file fax:

       !Name	       Fax Number

       Don	       +1 123-456-7899

       Keith	       +1 456-789-0122

       Yasushi	       +2 345-678-9011

       where the large expanses of white space are meant to each represent a single tab character), the command:

       example% join -t"tab" -a 1 -a 2 -e '(unknown)' -o 0,1.2,2.2 phone fax

       would produce

       !Name	       Phone Number	      Fax Number
       Don	       +1 123-456-7890	       +1 123-456-7899
       Hal	       +1 234-567-8901	       (unknown
       Keith	       (unknown)	       +1 456-789-012
       Yasushi	       +2 345-678-9012	       +2 345-678-9011

ENVIRONMENT VARIABLES

       See environ(5) for descriptions of the following environment variables that affect the execution of join: LANG, LC_ALL,	LC_CTYPE,  LC_MES-
       SAGES, LC_COLLATE, and NLSPATH.

EXIT STATUS

       The following exit values are returned:

       0	All input files were output successfully.

       >0	An error occurred.

ATTRIBUTES

       See attributes(5) for descriptions of the following attributes:

       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWcsu			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Enabled			   |
       +-----------------------------+-----------------------------+
       |Interface Stability	     |Standard			   |
       +-----------------------------+-----------------------------+

SEE ALSO

       awk(1), comm(1), sort(1), uniq(1), attributes(5), environ(5), largefile(5), standards(5)

NOTES

       With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.

       The conventions of the join, sort, comm, uniq, and awk commands are wildly incongruous.

SunOS 5.10							    8 Feb 2000								   join(1)