AWK Matching Fields and Combining Files Post: 302301980

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Matching and combining two files

Hi, How can I match the first two fields of file2 against the first two fields of file1 and where they match combine the two lines. If the name (example-Aidan Rielly) is in file1 but not in file2 then just write the info from file1 to the combined output file. If the name (example-Silvia...

2. Shell Programming and Scripting

combining fields in awk

I am using: ps -A -o command,%cpu to get process and cpu usage figures. I want to use awk to split up the columns it returns. If I use: awk '{print "Process: "$1"\nCPU Usage: "$NF"\n"}' the $NF will get me the value in the last column, but if there is more than one word in the...

3. Shell Programming and Scripting

Matching and combining two files

Hi there, I have two files. What I want to do is search for the values in second field of file1 in the 6th field of the file2 and of they match to add the fields 1-5 of the file2 at the end of the line of file1 with a comma before. E.g File1 FWB,CHUAGT87HUMAS/BUD01,REUAIR08KLM...

4. Shell Programming and Scripting

To get an output by combining fields from two different files

Hi guys, I couldn't find solution to this problem. If anyone knows please help me out. your guidance is highly appretiated. I have two files - FILE1 has the following 7 columns ( - has been added to make columns visible enough else columns are separated by single space) 155.34 - leg - 1...

5. Shell Programming and Scripting

AWK- delimiting the strings and matching the fields

Hello, I am newbie in awk. I have just started learning it. 1) I have input file which looks like: {4812 4009 1602 2756 306} {4814 4010 1603 2757 309} {8116 9362 10779 } {10779 10121 9193 10963 10908} {1602 2756 306 957 1025} {1603 2757 307} and so on..... 2) In output: a)...

6. Shell Programming and Scripting

Awk: adding fields after matching $1

Dear AWK-experts! I did get stuck in the task of combining files after matching fields, so I'm still awkward with learning AWK. There are 2 files: one containing 3 columns with ID, coding status, and score for long noncoding RNAs: file1 (1.txt) (>5000 lines) ...

7. UNIX for Beginners Questions & Answers

Awk: matching multiple fields between 2 files

Hi, I have 2 tab-delimited input files as follows. file1.tab: green A apple red B apple file2.tab: apple - A;Z Objective: Return $1 of file1 if, . $1 of file2 matches $3 of file1 and, . any single element (separated by ";") in $3 of file2 is present in $2 of file1 In order to...

8. Shell Programming and Scripting

awk to print fields that match using conditions and a default value for non-matching in two files

Trying to use awk to match the contents of each line in file1 with $5 in file2. Both files are tab-delimited and there may be a space or special character in the name being matched in file2, for example in file1 the name is BRCA1 but in file2 the name is BRCA 1 or in file1 name is BCR but in file2...

9. UNIX for Beginners Questions & Answers

Continued trouble matching fields in different files and selective field printing ([g]awk)

I apologize in advance, but I continue to have trouble searching for matches between two files and then printing portions of each to output in awk and would very much appreciate some help. I have data as follows: File1 PS012,002 PRQ 0 1 1 17 1 0 -1 3 2 1 2 -1 ...

10. UNIX for Beginners Questions & Answers

awk for matching fields between files with repeated records

Hello all, I am having trouble with what should be an easy task, but seem to be missing something fundamental. I have two files, with File 1 consisting of a single field of many thousands of records. I also have File 2 with two fields and many thousands of records. My goal is that when $1 of...

LEARN ABOUT OPENSOLARIS

join

join(1) 							   User Commands							   join(1)

NAME

       join - relational database operator

SYNOPSIS

       join [-a filenumber | -v filenumber] [-1 fieldnumber]
	    [-2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

       join [-a filenumber] [-j fieldnumber] [-j1 fieldnumber]
	    [-j2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

DESCRIPTION

       The join command forms, on the standard output, a join of the two relations specified by the lines of file1 and file2.

       There  is  one  line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally con-
       sists of the common field, then the rest of the line from file1, then the rest of the line from file2. This format can be changed by  using
       the  -o	option	(see  below).  The  -a	option	can be used to add unmatched lines to the output. The -v option can be used to output only
       unmatched lines.

       The default input field separators are blank, tab, or new-line. In this case, multiple separators count as one field separator, and leading
       separators are ignored. The default output field separator is a blank.

       If the input files are not in the appropriate collating sequence, the results are unspecified.

OPTIONS

       Some of the options below use the argument filenumber. This argument should be a 1 or a 2 referring to either file1 or file2, respectively.

       -a filenumber	   In  addition to the normal output, produce a line for each unpairable line in file filenumber, where filenumber is 1 or
			   2. If both -a 1 and -a 2 are specified, all unpairable lines will be output.

       -e string	   Replace empty output fields in the list selected by option -o with the string string.

       -j fieldnumber	   Equivalent to -1fieldnumber -2fieldnumber.

       -j1 fieldnumber	   Equivalent to -1fieldnumber.

       -j2 fieldnumber	   Equivalent to -2fieldnumber. Fields are numbered starting with 1.

       -o list		   Each output line includes the fields specified in list. Fields selected by list that do not appear in the input will be
			   treated as empty output fields. (See the -e option.) Each element of which has the either the form filenumber.fieldnum-
			   ber, or 0, which represents the join field. The common field is not printed unless specifically requested.

       -t char		   Use character char as a separator. Every appearance of char in a line is significant. The character char is used as the
			   field  separator  for  both input and output. With this option specified, the collating term should be the same as sort
			   without the -b option.

       -v filenumber	   Instead of the default output, produce a line only for each unpairable line in filenumber, where filenumber is 1 or	2.
			   If both -v 1 and -v 2 are specified, all unpairable lines will be output.

       -1 fieldnumber	   Join on the fieldnumberth field of file 1. Fields are decimal integers starting with 1.

       -2fieldnumber	   Join on the fieldnumberth field of file 2. Fields are decimal integers starting with 1.

OPERANDS

       The following operands are supported:

       file1

       file2	 A path name of a file to be joined. If either of the file1 or file2 operands is -, the standard input is used in its place.

       file1  and  file2 must be sorted in increasing collating sequence as determined by LC_COLLATE on the fields on which they are to be joined,
       normally the first in each line (see sort(1)).

USAGE

       See largefile(5) for the description of the behavior of join when encountering files greater than or equal to 2 Gbyte (2^31 bytes).

EXAMPLES

       Example 1 Joining the password file and group file

       The following command line will join the password file and the group file, matching on the numeric group ID, and outputting the login name,
       the group name and the login directory. It is assumed that the files have been sorted in ASCII collating sequence on the group ID fields.

	 example% join -j1 4-j2 3 -o 1.1 2.1 1.6 -t:/etc/passwd /etc/group

       Example 2 Using the -o option

       The -o 0 field essentially selects the union of the join fields. For example, given file phone:

	 !Name		 Phone Number
	 Don		 +1 123-456-7890
	 Hal		 +1 234-567-8901
	 Yasushi	 +2 345-678-9012

       and file fax:

	 !Name		 Fax Number

	 Don		 +1 123-456-7899

	 Keith		 +1 456-789-0122

	 Yasushi	 +2 345-678-9011

       where the large expanses of white space are meant to each represent a single tab character), the command:

	 example% join -t"tab" -a 1 -a 2 -e '(unknown)' -o 0,1.2,2.2 phone fax

       would produce

	 !Name		 Phone Number		Fax Number
	 Don		 +1 123-456-7890	 +1 123-456-7899
	 Hal		 +1 234-567-8901	 (unknown
	 Keith		 (unknown)		 +1 456-789-012
	 Yasushi	 +2 345-678-9012	 +2 345-678-9011

ENVIRONMENT VARIABLES

       See  environ(5)	for descriptions of the following environment variables that affect the execution of join: LANG, LC_ALL, LC_CTYPE, LC_MES-
       SAGES, LC_COLLATE, and NLSPATH.

EXIT STATUS

       The following exit values are returned:

       0      All input files were output successfully.

       >0     An error occurred.

ATTRIBUTES

       See attributes(5) for descriptions of the following attributes:

       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWcsu			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Enabled			   |
       +-----------------------------+-----------------------------+
       |Interface Stability	     |Standard			   |
       +-----------------------------+-----------------------------+

SEE ALSO

       awk(1), comm(1), sort(1), uniq(1), attributes(5), environ(5), largefile(5), standards(5)

NOTES

       With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.

       The conventions of the join, sort, comm, uniq, and awk commands are wildly incongruous.

SunOS 5.11							    8 Feb 2000								   join(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Matching and combining two files

Discussion started by: p3t3r

2. Shell Programming and Scripting

combining fields in awk

Discussion started by: json4639

3. Shell Programming and Scripting

Matching and combining two files

Discussion started by: sickboy

4. Shell Programming and Scripting

To get an output by combining fields from two different files

Discussion started by: smriti_shridhar

5. Shell Programming and Scripting

AWK- delimiting the strings and matching the fields

Discussion started by: kajolo

6. Shell Programming and Scripting

Awk: adding fields after matching $1

Discussion started by: kben

7. UNIX for Beginners Questions & Answers

Awk: matching multiple fields between 2 files

Discussion started by: beca123456

8. Shell Programming and Scripting

awk to print fields that match using conditions and a default value for non-matching in two files

Discussion started by: cmccabe

9. UNIX for Beginners Questions & Answers

Continued trouble matching fields in different files and selective field printing ([g]awk)

Discussion started by: jvoot

10. UNIX for Beginners Questions & Answers

awk for matching fields between files with repeated records

Discussion started by: jvoot

LEARN ABOUT OPENSOLARIS

join