Using AWK to match CSV files with duplicate patterns Post: 302597415

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

sed/awk help to match list of patterns and remove from org file

Hi, I need to remove lines from a file based on a pattern range. Condition 1: look for all lines that start with ALTER TABLE, end with ;, and contain the word MOVE. I want to remove these lines from the sample file below. Note: the above pattern list could be found in...

2. Shell Programming and Scripting

script to match patterns in 2 different files.

I am new to shell scripting and need some help. I googled, but couldn't find a similar scenario. Basically, I need to rename a datafile. This is the scenario: I have a file, readonly.txt, that has 2 columns, file# and name. I have another file, missing_files.txt, that has id and name. Both the...

3. Shell Programming and Scripting

Find files that do not match specific patterns

Hi all, I have been searching online to find the answer for getting a list of files that do not match certain criteria but have been unsuccessful. I have a directory that has many jpg files. What I need to do is get a list of the files that do not match both of the following patterns (I have...

4. Shell Programming and Scripting

Match paragraph between two patterns, delete the duplicate paragraphs

Hello all, I have a file on my DNS server where there are duplicate paragraphs like below. How can I remove the duplicate paragraphs so that only one paragraph remains? BEGIN; replace into domains (name,type) values ('225.168.192.in-addr.arpa','MASTER'); replace into records (domain_id,...

5. Shell Programming and Scripting

Match columns from two csv files and update field in one of the csv file

Hi, I have a file of csv data, which looks like this: file1: 1AA,LGV_PONCEY_LES_ATHEE,1,\N,1,00020460E1,0,\N,\N,\N,\N,2,00.22335321,0.00466628 2BB,LES_POUGES_ASF,\N,200,200,00006298G1,0,\N,\N,\N,\N,1,00.30887539,0.00050312...

6. Shell Programming and Scripting

Match multiple patterns sequentially in order - grep or awk

Hello. grep v2.21 Debian 8 I wish to search for and output these patterns in order; "From " "To: " "Subject: " "Message-Id: " "Date: " "To: " grep works, but not in strict order... $ grep -a -E "^From |^Subject:|^From: |^Message-Id: |^Date: |^To: " InboxResult; From - Wed Feb 18...

7. UNIX for Beginners Questions & Answers

Match duplicate ids in two files

I have two text files. File 1 has 150 ids but all the ids exist in duplicate, so it has 300 ids in total. File 2 has 1500 ids but all exist in duplicate, so file 2 has 300 ids in total. I want to match the first occurrence of every id in file 1 with the first occurrence of that id in file 2 and 2nd...

8. Shell Programming and Scripting

awk pattern match by looping through search patterns

Hi, I am using Solaris 5.10 and ksh. I want to loop through a pattern file, reading each pattern and passing it to awk to check whether that value is present in column 1 of rawdata.txt; if so, print columns 1 and 2 into Avlblpatterns.txt. I am using the following code, but it seems to have some mistakes and it is running for...

9. Shell Programming and Scripting

awk to print match or non-match and select fields/patterns for non-matches

In the awk below I am trying to output those lines that match between file1 and file2, those missing in file1, and those missing in file2, using each $1,$2,$4,$5 value as a key to match on. That is, if those 4 fields are found in both files they match, but if those 4 fields are not found then missing...

10. UNIX for Beginners Questions & Answers

Match patterns between two files and extract certain range of strings

Hi, I need help to match patterns from between two different files and extract region of strings. inputfile1.fa >l-WR24-1:1 GCCGGCGTCGCGGTTGCTCGCGCTCTGGGCGCTGGCGGCTGTGGCTCTACCCGGCTCCGG GGCGGAGGGCGACGGCGGGTGGTGAGCGGCCCGGGAGGGGCCGGGCGGTGGGGTCACGTG...

LEARN ABOUT BSD

egrep

GREP(1) 						      General Commands Manual							   GREP(1)

NAME

       grep, egrep, fgrep - search a file for a pattern

SYNOPSIS

       grep [ option ] ...  expression [ file ] ...

       egrep [ option ] ...  [ expression ] [ file ] ...

       fgrep [ option ] ...  [ strings ] [ file ]

DESCRIPTION

       Commands  of  the  grep	family search the input files (standard input default) for lines matching a pattern.  Normally, each line found is
       copied to the standard output.  Grep patterns are limited regular expressions in the style of ex(1); it	uses  a  compact  nondeterministic
       algorithm.   Egrep  patterns  are  full regular expressions; it uses a fast deterministic algorithm that sometimes needs exponential space.
       Fgrep patterns are fixed strings; it is fast and compact.  The following options are recognized.

       -v     All lines but those matching are printed.

       -x     (Exact) only lines matched in their entirety are printed (fgrep only).

       -c     Only a count of matching lines is printed.

       -l     The names of files with matching lines are listed (once) separated by newlines.

       -n     Each line is preceded by its relative line number in the file.

       -b     Each line is preceded by the block number on which it was found.	This is sometimes useful in locating disk block  numbers  by
	      context.

       -i     The  case  of  letters  is ignored in making comparisons -- that is, upper and lower case are considered identical.  This applies to
	      grep and fgrep only.

       -s     Silent mode.  Nothing is printed (except error messages).  This is useful for checking the error status.

       -w     The expression is searched for as a word (as if surrounded by `\<' and `\>', see ex(1).)	(grep only)

       -e expression
	      Same as a simple expression argument, but useful when the expression begins with a -.

       -f file
	      The regular expression (egrep) or string list (fgrep) is taken from the file.

       In all cases the file name is shown if there is more than one input file.  Care should be taken when using the characters $ * [ ^ | ( ) and
       \ in the expression as they are also meaningful to the Shell.  It is safest to enclose the entire expression argument in single quotes ' '.
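
The quoting advice can be seen in a short sketch. The directory and the file name `ab` are hypothetical, chosen so that the file name happens to fit the shell glob `[ab]*`; a POSIX shell and grep are assumed.

```shell
# Hypothetical scratch directory holding one file named "ab".
dir=$(mktemp -d)
printf '%s\n' 'abba' 'xyz' > "$dir/ab"

# Unquoted, the shell expands [ab]* to the matching file name
# before the command ever sees it.
unquoted=$(printf '%s' "$dir"/[ab]*)

# Single-quoted, the expression reaches the command unchanged.
quoted=$(printf '%s' '[ab]*')

# With single quotes, grep receives the regular expression itself.
matches=$(grep -c '^[ab]*$' "$dir/ab")

printf '%s\n' "$unquoted" "$quoted" "$matches"
rm -rf "$dir"
```

The first value is a path ending in `/ab`, the second is the literal text `[ab]*`, and the count is 1 because only the line `abba` consists entirely of the letters a and b.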

       Fgrep searches for lines that contain one of the (newline-separated) strings.
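
A small sketch of fixed-string matching from a newline-separated list follows. It assumes a modern POSIX grep, where `grep -F` is the equivalent of fgrep; the file contents are hypothetical.

```shell
# Hypothetical input and string list.
data=$(mktemp)
strings=$(mktemp)
printf '%s\n' 'a.c' 'abc' 'x*y' 'plain' > "$data"
printf '%s\n' 'a.c' 'x*y' > "$strings"

# -F with -f: each line of the strings file is a fixed string,
# so "." and "*" are matched literally.
fixed=$(grep -F -f "$strings" "$data")

# For contrast, as a regular expression "." matches any character,
# so "abc" is also selected.
regex=$(grep 'a.c' "$data")

printf '%s\n' "$fixed" '--' "$regex"
rm -f "$data" "$strings"
```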

       Egrep accepts extended regular expressions.  In the following description `character' excludes newline:

	      A \ followed by a single character other than newline matches that character.

	      The character ^ matches the beginning of a line.

	      The character $ matches the end of a line.

	      A .  (period) matches any character.

	      A single character not otherwise endowed with special meaning matches that character.

	      A  string  enclosed in brackets [] matches any single character from the string.	Ranges of ASCII character codes may be abbreviated
	      as in `a-z0-9'.  A ] may occur only as the first character of the string.  A literal - must be placed where it can't be mistaken	as
	      a range indicator.

	      A  regular  expression  followed	by  an	* (asterisk) matches a sequence of 0 or more matches of the regular expression.  A regular
	      expression followed by a + (plus) matches a sequence of 1 or more matches of the regular expression.  A regular expression  followed
	      by a ? (question mark) matches a sequence of 0 or 1 matches of the regular expression.

	      Two regular expressions concatenated match a match of the first followed by a match of the second.

	      Two regular expressions separated by | or newline match either a match for the first or a match for the second.

	      A regular expression enclosed in parentheses matches a match for the regular expression.

       The order of precedence of operators at the same parenthesis level is [] then *+? then concatenation then | and newline.
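
The precedence rules can be checked with a brief sketch. It assumes a modern POSIX grep, where `grep -E` takes the place of egrep; the sample lines are hypothetical.

```shell
# Hypothetical sample lines.
tmp=$(mktemp)
printf '%s\n' 'ab' 'cd' 'acd' 'abd' 'ad' 'abbd' 'abab' > "$tmp"

# Concatenation binds tighter than |, so this reads as (ab) or (cd).
alt=$(grep -E -c 'ab|cd' "$tmp")

# Parentheses make the alternation apply to the middle character only.
grouped=$(grep -E 'a(b|c)d' "$tmp")

# * binds tighter than concatenation, so it repeats only the b.
star=$(grep -E '^ab*d$' "$tmp")

# Parentheses let + repeat the whole sequence ab.
plus=$(grep -E '^(ab)+$' "$tmp")

printf '%s\n' "$alt" '--' "$grouped" '--' "$star" '--' "$plus"
rm -f "$tmp"
```

The first pattern selects six of the seven lines (all except `ad`), while the grouped form selects only `acd` and `abd`.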

       Ideally there should be only one grep, but we don't know a single algorithm that spans a wide enough range of space-time tradeoffs.

SEE ALSO

       ex(1), sed(1), sh(1)

DIAGNOSTICS

       Exit status is 0 if any matches are found, 1 if none, 2 for syntax errors or inaccessible files.
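
A script can branch on these exit codes. The sketch below assumes a POSIX shell and grep; the file names are hypothetical, and output is discarded by redirection rather than with -s, whose meaning differs in newer greps.

```shell
# Hypothetical one-line input file.
tmp=$(mktemp)
printf '%s\n' 'hello' > "$tmp"

# Record each exit status; "|| var=$?" also works under "set -e".
found=0;   grep 'hello'  "$tmp" >/dev/null 2>&1 || found=$?
missing=0; grep 'absent' "$tmp" >/dev/null 2>&1 || missing=$?
broken=0;  grep 'hello'  "$tmp.does-not-exist" >/dev/null 2>&1 || broken=$?

# Typical use: branch directly on the status.
if grep 'hello' "$tmp" >/dev/null 2>&1; then
    result='match'
else
    result='no match'
fi

printf '%s\n' "$found" "$missing" "$broken" "$result"
rm -f "$tmp"
```

Here `found` is 0 and `missing` is 1; `broken` is greater than 1 because the file cannot be opened.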

BUGS

       Lines are limited to 256 characters; longer lines are truncated.

4th Berkeley Distribution					  April 29, 1985							   GREP(1)
