Sponsored Content
Top Forums Shell Programming and Scripting Match strings in two files and compare columns of both Post 302393400 by basso on Monday 8th of February 2010 04:25:17 PM
Old 02-08-2010
I'm working on a similar problem. Perhaps it's okay to piggyback on the thread?

I've two files:

First file:
data1
data2
...
string1:string2:string3

and the second:

string2:string3:n1:n2

I'd like each input line from file1 to appear in the output, and if string2:string3 appears in file2, append :n1:n2 to the matching
line in file1.

(My awk is really rusty...)

Thanks for any pointers you can provide!
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to compare 2 files & get only few columns based on a condition related to both files?

Hiiiii friends I have 2 files which contains huge data & few lines of it are as shown below File1: b.dat(which has 21 columns) SSR 1976 8 12 13 10 44.00 39.0700 70.7800 7.0 0 0.00 0 2.78 0.00 0.00 0 0.00 2.78 0 NULL ISC 1976 8 12 22 32 37.39 36.2942 70.7338... (6 Replies)
Discussion started by: reva
6 Replies

2. Shell Programming and Scripting

Compare one files with strings from another + remove lines

Have two files and want to compare the content of file1 with file2. When matched remove the line. awk 'NR==FNR {b; next} !(b in $0)' file1 file2file1 1. if match 2. removefile2 1. this line has to be removed if match 2. this line has a match, remove 3. this line has no match, no removingThe... (3 Replies)
Discussion started by: sdf
3 Replies

3. Shell Programming and Scripting

Match list of strings in File A and compare with File B, C and write to a output file in CSV format

Hi Friends, I'm a great fan of this forum... it has helped me tone my skills in shell scripting. I have a challenge here, which I'm sure you guys would help me in achieving... File A has a list of job ids and I need to compare this with the File B (*.log) and File C (extend *.log) and copy... (6 Replies)
Discussion started by: asnandhakumar
6 Replies

4. Shell Programming and Scripting

Match columns several files

Hey fellas! Here come my problem. I appreciate if you have a look at it. I have several files with following structure: file_1:1 21 4 45 file_2:2 31 4 153 6 341 and so on... and I have a 'reference' file look like this: File_ref:A 1 B 2 C 3 (5 Replies)
Discussion started by: @man
5 Replies

5. Shell Programming and Scripting

Match the columns between 2 files

I have two files I want to match ids in the 5th column of the file 1 with the first column of the file 2 and get the description for the matched ids as shown in the output sno nm no nm2 ID 1 cc 574372 yyyi |6810|51234| 2 bb 119721 nmjk |6810|51234|51179| ... (4 Replies)
Discussion started by: raj_k
4 Replies

6. Shell Programming and Scripting

Match strings in 2 different files

Hi, i am trying to match strings from 2 different files based on position like below:- file1 (tab delimited) f07270 lololol fff u12730 gggddd dddkkrr mmm file2 (not tab delimited) %f07270 APSLH bl%alalalalallaadsdsfdfdfdgsgfss %g13450 GDIDFLRIP%ILITEAPPRKgsfgsgsf %d08880... (11 Replies)
Discussion started by: redse171
11 Replies

7. Shell Programming and Scripting

Compare 2 csv files by columns, then extract certain columns of matcing rows

Hi all, I'm pretty much a newbie to UNIX. I would appreciate any help with UNIX coding on comparing two large csv files (greater than 10 GB in size), and output a file with matching columns. I want to compare file1 and file2 by 'id' and 'chain' columns, then extract exact matching rows'... (5 Replies)
Discussion started by: bkane3
5 Replies

8. UNIX for Dummies Questions & Answers

Match the columns between two files and output

Hi Help, I have two files namely a.txt and b.txt a.txt looks like a.txt 1 2 2 1 3 3 2 4 4 4 5 6 6 7 7 b.txt looks like, b.txt 1 2 1 1 3 2 2 4 3 3 4 4 4 5 5 (2 Replies)
Discussion started by: Indra2011
2 Replies

9. UNIX for Beginners Questions & Answers

Data match 2 files based on first 2 columns matching only and join if match

Hi, i have 2 files , the data i need to match is in masterfile and i need to pull out column 3 from master if column 1 and 2 match and output entire row to new file I have tried with join and awk and i keep getting blank outputs or same file is there an easier way than what i am... (4 Replies)
Discussion started by: axis88
4 Replies

10. UNIX for Beginners Questions & Answers

Match patterns between two files and extract certain range of strings

Hi, I need help to match patterns from between two different files and extract region of strings. inputfile1.fa >l-WR24-1:1 GCCGGCGTCGCGGTTGCTCGCGCTCTGGGCGCTGGCGGCTGTGGCTCTACCCGGCTCCGG GGCGGAGGGCGACGGCGGGTGGTGAGCGGCCCGGGAGGGGCCGGGCGGTGGGGTCACGTG... (4 Replies)
Discussion started by: bunny_merah19
4 Replies
tr(1)							      General Commands Manual							     tr(1)

NAME
tr - translate characters SYNOPSIS
string1 string2 string1 string1 string1 string1 DESCRIPTION
copies the standard input to the standard output with substitution or deletion of selected characters. Input characters from string1 are replaced with the corresponding characters in string2. If necessary, string1 and string2 can be quoted to avoid pattern matching by the shell. recognizes the following command line options: Translates on a byte-by-byte basis. When this flag is specified does not support extended characters. Complements the set of characters in string1, which is the set of all characters in the current character set, as defined by the current setting of except for those actually specified in the string1 argument. These characters are placed in the array in ascending collation sequence, as defined by the current setting of Deletes all occurrences of input characters or collating elements found in the array specified in string1. If and are both specified, all characters except those specified by string1 are deleted. The contents of string2 are ignored, unless is also specified. Note, however, that the same string cannot be used for both the and the flags; when both flags are specified, both string1 (used for deletion) and string2 (used for squeezing) are required. If is not specified, each input character or collating element found in the array specified by string1 is replaced by the character or collating element in the same relative position specified by string2. Replaces any character specified in string1 that occurs as a string of two or more repeating characters as a single instance of the character in string2. If the string2 contains a character class, the argument's array contains all of the characters in that character class. For example: In a case conversion, however, the string2 array contains only those characters defined as the second characters in each of the or character pairs, as appropriate. For example: The following abbreviation conventions can be used to introduce ranges of characters, repeated characters or single-character collating elements into the strings: c1-c2 or Stands for the range of collating elements c1 through c2, inclusive, as defined by the current setting of the locale category. Stands for all the characters belonging to the defined character class, as defined by the current setting of locale category. The following character class names will be accepted when spec- ified in string1: or Character classes are expanded in collation order. When the and flags are specified together, any of the character class names are accepted in string2; otherwise, only character class names or are accepted in string2 and then only if the corresponding character class and respectively) is specified in the same relative position in string1. Such a specification is interpreted as a request for case conversion. When appears in string1 and appears in string2, the arrays contain the characters from the mapping in the category of the current locale. When appears in string1 and appears in string2, the arrays contain the characters from the map- ping in the category of the current locale. Stands for all the characters or collating elements belonging to the same equivalence class as c, as defined by the current setting of locale category. An equivalence class expression is allowed only in string1, or in string2 when it is being used by the combined and options. Stands for n repetitions of a. If the first digit of n is n is considered octal; otherwise, n is treated as a decimal value. A zero or missing n is interpreted as large enough to extend string2-based sequence to the length of the string1-based sequence. The escape character can be used as in the shell to remove special meaning from any character in a string. In addition, followed by 1, 2, or 3 octal digits represents the character whose ASCII code is given by those digits. An ASCII NULL character in string1 or string2 can be represented only as an escaped character; i.e. as but is treated like other characters and translated correctly if so specified. NULL characters in the input are not stripped out unless the option is given. EXTERNAL INFLUENCES
Environment Variables provides a default value for the internationalization variables that are unset or null. If is unset or null, the default value of "C" (see lang(5)) is used. If any of the internationalization variables contains an invalid setting, will behave as if all internationalization variables are set to "C". See environ(5). If set to a non-empty string value, overrides the values of all the other internationalization variables. determines the interpretation of text as single and/or multi-byte characters, the classification of characters as printable, and the char- acters matched by character class expressions in regular expressions. determines the locale that should be used to affect the format and contents of diagnostic messages written to standard error and informa- tive messages written to standard output. determines the location of message catalogues for the processing of RETURN VALUE
exits with one of the following values: All input was processed successfully. An error occurred. EXAMPLES
For the ASCII character set and default collation sequence, create a list of all the words in file1, one per line in file2, where a word is taken to be a maximal string of alphabetics. Quote the strings to protect the special characters from interpretation by the shell (012 is the ASCII code for a new-line (line feed) character): Same as above, but for all character sets and collation sequences: Translate all lower case characters in file1 to upper case and write the result to standard output. Use an equivalence class to identify accented variants of the base character in file1, strip them of diacritical marks and write the result to file2: Translate each digit in file1 to a (number sign), and write the result to file2. The (asterisk) tells to repeat the (number sign) enough times to make the second string as long as the first one. AUTHOR
was developed by OSF and HP. SEE ALSO
ed(1), sh(1), ascii(5), environ(5), lang(5), regexp(5). STANDARDS CONFORMANCE
tr(1)
All times are GMT -4. The time now is 12:16 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy