Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Compare 2 files print the lines of file 2 that contain a string from file 1 Post 302537724 by KevinRidley on Saturday 9th of July 2011 02:33:47 PM
Old 07-09-2011
Question Compare 2 files print the lines of file 2 that contain a string from file 1

Hello I am a new unix user, and I have a work related task to compare 2 files and print all of the lines in file 2 that contain a string from file 1 Note: the fields are in different columns in the files. I suspect the is a good use for awk? Thanks for your time & help

File 1

123 232 W343
321 323 W454
999 666 W777

File 2

333 W343 989
577 W323 979
888 W777 555

Result =

333 W343 989
888 W777 555
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Need help to print lines contains particular string format in a file

Hi, I want to print the lines in a file that matches particular string format using shell scripting. (4 Replies)
Discussion started by: sudhakaryadav
4 Replies

2. Shell Programming and Scripting

compare two columns of different files and print the matching second file..

Hi, I have two tab separated files; file1: S.No ddi fi cu o/l t+ t- 1 0.5 0.6 o 0.1 0.2 2 0.2 0.3 l 0.3 0.4 3 0.5 0.8 l 0.1 0.6 ... (5 Replies)
Discussion started by: vasanth.vadalur
5 Replies

3. Shell Programming and Scripting

print string at the end of lines in text file

hello, I go text file like this E:/DDD/Dyndede/wwww E:/DDD/sss.com/ffffg/fff E:/DDD/vvvvvv/dd E:/DDD/sss.com/bbbbbb E:/DDD/sss.com/nnnn/xxI want to print /alpha.jpg at the end of every lines like that E:/DDD/Dyndede/wwww/alpha.jpg E:/DDD/sss.com/ffffg/fff/alpha.jpg... (8 Replies)
Discussion started by: davidkhan
8 Replies

4. Shell Programming and Scripting

awk to compare flat files and print output to another file

Hello, I am strugling from quite a some time to compare flat files with over 1 million records could anyone please help me. I want to compare two pipe delimited flat files, file1 with file2 and output the unmatched rows from file2 in file3 Sample File1: ... (9 Replies)
Discussion started by: suhaeb
9 Replies

5. Shell Programming and Scripting

Print #of lines after search string in a big file

I have a command which prints #lines after and before the search string in the huge file nawk 'c-->0;$0~s{if(b)for(c=b+1;c>1;c--)print r;print;c=a}b{r=$0}' b=0 a=10 s="STRING1" FILE The file is 5 gig big. It works great and prints 10 lines after the lines which contains search string in... (8 Replies)
Discussion started by: prash184u
8 Replies

6. Shell Programming and Scripting

awk compare specific columns from 2 files, print new file

Hello. I have two files. FILE1 was extracted from FILE2 and modified thanks to help from this post. Now I need to replace the extracted, modified lines into the original file (FILE2) to produce the FILE3. FILE1 1466 55.27433 14.72050 -2.52E+03 3.00E-01 1.05E+04 2.57E+04 1467 55.27433... (1 Reply)
Discussion started by: jm4smtddd
1 Replies

7. Shell Programming and Scripting

How to print the lines in a file for the given string?

Hi, I have a file with contents test id text day test sah dh dhs yeay fg jsh jsjk my need: I give a string as a input, it check the file and display the lines with the given string e.g input : test output: test id text day test sah dh dhs (1 Reply)
Discussion started by: nanthagopal
1 Replies

8. Shell Programming and Scripting

Compare columns of multiple files and print those unique string from File1 in an output file.

Hi, I have multiple files that each contain one column of strings: File1: 123abc 456def 789ghi File2: 123abc 456def 891jkl File3: 234mno 123abc 456def In total I have 25 of these type of file. (5 Replies)
Discussion started by: owwow14
5 Replies

9. Shell Programming and Scripting

[Solved] awk compare two different columns of two files and print all from both file

Hi, I want to compare two columns from file1 with another two column of file2 and print matched and unmatched column like this File1 1 rs1 abc 3 rs4 xyz 1 rs3 stu File2 1 kkk rs1 AA 10 1 aaa rs2 DD 20 1 ccc ... (2 Replies)
Discussion started by: justinjj
2 Replies

10. Shell Programming and Scripting

How to compare 2 files and create a result file with unmatched lines from first file.?

HI, I have 2 text files. file1 and file2. file1.txt (There are no duplicates in this file) 1234 3232 4343 3435 6564 6767 1213 file2.txt 1234,wq,wewe,qwqw 1234,as,dfdf,dfdf 4343,asas,sdds,dsds 6767,asas,fdfd,fdffd I need to search each number in file1.txt in file2.txt's 1st... (6 Replies)
Discussion started by: Little
6 Replies
join(1) 						      General Commands Manual							   join(1)

NAME
join - Joins the lines of two files SYNOPSIS
Current syntax join [-a file_number | -v file_number] [-e string] [-o number.field,...] [-t character] [-1 field] [-2 field] file1 file2 Obsolescent syntax [join] [-a number] [-e string] [-j number | field | number field] [-o number.field,...] [-t character] file1 file2 The join command reads file1 and file2 and joins lines in the files that contain common fields, or otherwise according to the options, and writes the results to standard output. STANDARDS
Interfaces documented on this reference page conform to industry standards as follows: join: XCU5.0 Refer to the standards(5) reference page for more information about industry standards and associated tags. OPTIONS
Joins on the fieldth field of file1. Fields are decimal integers starting with 1. Joins on the fieldth field of file2. Fields are decimal integers starting with 1. Produces an output line for each unpairable line found in file1 if number is 1, or file2 if number is 2. With- out -a, join produces output only for lines containing a common field. If both -a 1 and -a 2 are used, all unpairable lines will be out- put. Replaces empty output fields with string. Joins the two files on field of file number, where number is 1 for file1 or 2 for file2. If you do not specify number, join uses field in each file. Without -j, join uses the first field in each file. The default value for both number and field is 1. (Obsolescent) If you enter only a 1 or a 2 as an argument to -j, join interprets this argument as the file number (number); integers greater than 2 are interpreted as the field number (field). Therefore, if you want to specify a field number of 2, you must precede this speci- fication with a number argument; otherwise, the join program interprets the 2 as the file number (number). Produces output lines consisting of the fields specified in one or more number.field arguments, where number is 1 for file1 or 2 for file2, and field is a field number. Multiple -o arguments should be separated with commas. Uses character (a single character) as the field separator character in the input and the output. Every appearance of character in a line is significant. The default separator is a space. If you do not specify -t, join also recognizes the tab and newline characters as separators. With default field separation, the collating sequence is that of sort -b. If you specify -t, the sequence is that of a plain sort. To specify a tab character, enclose it in '' (single quotes). Produces an output line for each unpairable line in file_number (where file_number is 1 or 2), instead of the default output. If both -v 1 and -v 2 are specified, produces output lines for all unpairable lines. OPERANDS
The pathnames of files to be used as input. If - (hyphen) is specified for either file, standard input is read. DESCRIPTION
The join field is the field in the input files that join looks at to determine what will be included in the output. One line appears in the output for each identical join field appearing in both file1 and file2. The output line consists of the join field, the rest of the line from file1, then the rest of the line from file2. Both input files must be sorted according to the collating sequence specified by the LC_COLLATE environment variable, if set, for the fields where they are to be joined (usually the first field in each line). Fields are normally separated by a space, a tab character, or a newline character. In this case, join treats consecutive separators as one, and discards leading separators. Use the -t option to specify another field separator. EXIT STATUS
The following exit values are returned: Successful completion. An error occurred. EXAMPLES
Note that the vertical alignment shown in these examples may not be consistent with your output. To perform a simple join operation on two files, phonedir and names, whose first fields are the same, enter: join phonedir names If phonedir contains the following telephone directory: Binst 555-6235 Dickerson 555-1842 Eisner 555-1234 Green 555-2240 Hrarii 555-0256 Janatha 555-7358 Lewis 555-3237 Takata 555-5341 Wozni 555-1234 and names is this listing of names and department numbers: Eisner Dept. 389 Frost Dept. 217 Green Dept. 311 Takata Dept. 454 Wozni Dept. 520 then join phonedir names displays: Eisner 555-1234 Dept. 389 Green 555-2240 Dept. 311 Takata 555-5341 Dept. 454 Wozni 555-1234 Dept. 520 Each line consists of the join field (the last name), followed by the rest of the line found in phonedir and the rest of the line in names. To display unmatched lines as well as matched lines, enter: join -a 2 phonedir names If phonedir contains: Binst 555-6235 Dickerson 555-1842 Eisner 555-1234 Green 555-2240 Hrarii 555-0256 Janatha 555-7358 Lewis 555-3237 Takata 555-5341 Wozni 555-1234 and names contains: Eisner Dept. 389 Frost Dept. 217 Green Dept. 311 Takata Dept. 454 Wozni Dept. 520 then join -a 2 phonedir names displays: Eisner 555-1234 Dept. 389 Frost Dept. 217 Green 555-2240 Dept. 311 Takata 555-5341 Dept. 454 Wozni 555-1234 Dept. 520 This performs the same join operation as in the first example, and also lists the lines of names that have no match in phonedir. It includes Frost's name and department number in the listing, although there is no entry for Frost in phonedir. To display selected fields, enter: join -o 2.3,2.1,1.2 phonedir names This displays the following fields: Field 3 of names (Department Number) Field 1 of names (Last Name) Field 2 of phonedir (Telephone Number) If phonedir contains: Binst 555-6235 Dickerson 555-1842 Eisner 555-1234 Green 555-2240 Hrarii 555-0256 Janatha 555-7358 Lewis 555-3237 Takata 555-5341 Wozni 555-1234 and names contains: Eisner Dept. 389 Frost Dept. 217 Green Dept. 311 Takata Dept. 454 Wozni Dept. 520 then join -o 2.3,2.1,1.2 phonedir names displays: 389 Eisner 555-1234 311 Green 555-2240 454 Takata 555-5341 520 Wozni 555-1234 To perform the join operation on a field other than the first, enter: sort -b -k 2,3 phonedir | join -1 2 - numbers This combines the lines in phonedir and names, comparing the second field of phonedir to the first field of numbers. First, this sorts phonedir by the second field because both files must be sorted by their join fields. The output of sort is then piped to join. The - (dash) by itself causes the join command to use this output as its first file. The -1 2 defines the second field of the sorted phonedir as the join field. This is compared to the first field of numbers because its join field is not speci- fied with a -2 option. If phonedir contains: Binst 555-6235 Dickerson 555-1842 Eisner 555-1234 Green 555-2240 Hrarii 555-0256 Janatha 555-7358 Lewis 555-3237 Takata 555-5341 Wozni 555-1234 and numbers contains: 555-0256 555-1234 555-5555 555-7358 then sort ... | join ... displays: 555-0256 Hrarii 555-1234 Eisner 555-1234 Wozni 555-7358 Janatha Each number in numbers is listed with the name listed in phonedir for that number. Note that join lists all the matches for a given field. In this case, join lists both Eisner and Wozni as having the telephone number 555-1234. The number 555-5555 is not listed because it does not appear in phonedir. ENVIRONMENT VARIABLES
The following environment variables affect the execution of join: Provides a default value for the internationalization variables that are unset or null. If LANG is unset or null, the corresponding value from the default locale is used. If any of the internationalization vari- ables contain an invalid setting, the utility behaves as if none of the variables had been defined. If set to a non-empty string value, overrides the values of all the other internationalization variables. Determines the locale for the interpretation of sequences of bytes of text data as characters (for example, single-byte as opposed to multi-byte characters in arguments and input files). Determines the locale for the format and contents of diagnostic messages written to standard error. Determines the location of message catalogues for the processing of LC_MESSAGES. SEE ALSO
Commands: awk(1), cmp(1), comm(1), cut(1), diff(1), grep(1), paste(1), sdiff(1), sed(1), sort(1), uniq(1) Standards: standards(5) join(1)
All times are GMT -4. The time now is 06:35 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy