Remove duplicate occurrences of text pattern

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove comments like pattern from text

Hi , We need to remove comment like pattern from a code text. The possible comment expressions are as follows. Input BizComment : Special/*@ Name:bzt_53_3aea640a_51783afa_5d64_0 BizHidden:true @*/ /* lookup Disease Category Therapuetic Class */ a=b;...

2. Shell Programming and Scripting

How to remove duplicate text blocks from a file?

Hi All I have a list of files which will have duplicate list of blocks of text. Following is a sample of the file, I have removed the sensitive information from the file. All the code samples starts from <TR BGCOLOR="white"> and Ends with IP address and two html tags like this. 10.14.22.22...

3. Windows & DOS: Issues & Discussions

Remove duplicate lines from text files.

So, I have text files, one "fail.txt" And one "color.txt" I now want to use a command line (DOS) to remove ANY line that is PRESENT IN BOTH from each text file. Afterwards there shall be no duplicate lines.

4. Shell Programming and Scripting

Remove duplicate line starting with a pattern

HI, I have the below input file /* ----------------- cmdsDlyStartFWJ -----------------*/ UNIX_JOB CMDS065J RUN ANY CMDNAME sleep 5 AGENT CMDSHP USER proddata RUN MON,TUE,WED,THU,FRI DELAYSUB 02:00 /* "Triggers daily file watcher jobs" */ ENVAR...

5. Shell Programming and Scripting

Filter or remove duplicate block of text without distinguishing marks or fields

Hello, Although I have found similar questions, I could not find advice that could help with our problem. The issue: We have several hundreds text files containing repeated blocks of text (I guess back at the time they were prepared like that to optmize printing). The block of texts...

6. Shell Programming and Scripting

Help with remove last text of a file that have specific pattern

Input file matrix-remodelling_associated_8_ aurora_interacting_1_ L20 von_factor_A_domain_1 ATP_containing_3B_ . . Output file matrix-remodelling_associated_8 aurora_interacting_1 L20 von_factor_A_domain_1 ATP_containing_3B . .

7. Shell Programming and Scripting

How to remove all text except pattern

i have nasty html file with 2000+ simbols in 1 row...i need to remove whole the code except title="Some title..." and store those into file with titles (the whole text is in variable text) i've tried something like this: echo $text | sed 's/.*$title=\".+\"$.*/\1/' > titles.html BUT it does...

8. Shell Programming and Scripting

Count the number of occurrences of a pattern between each occurrence of a different pattern

I need to count the number of occurrences of a pattern, say 'key', between each occurrence of a different pattern, say 'lu'. Here's a portion of the text I'm trying to parse: lu S1234L_149_m1_vg.6, part-att 1, vdp-att 1 p-reserver IID 0xdb registrations: key 4156 4353 0000 0000 ...

9. Shell Programming and Scripting

Remove duplicate files based on text string?

Hi I have been struggling with a script for removing duplicate messages from a shared mailbox. I would like to search for duplicate messages based on the “Message-ID” string within the messages files. I have managed to find the duplicate “Message-ID” strings and (if I would like) delete...

10. Shell Programming and Scripting

Remove duplicate text

Hello, I have a log file which is generated by a script which looks like this: userid: 7 starttime: Sat May 24 23:24:13 CEST 2008 endtime: Sat May 24 23:26:57 CEST 2008 total time spent: 2.73072 minutes / 163.843 seconds date: Sat Jun 7 16:09:03 CEST 2008 userid: 8 starttime: Sun May...

LEARN ABOUT SUNOS

egrep

egrep(1)																  egrep(1)

NAME

       egrep - search a file for a pattern using full regular expressions

SYNOPSIS

       /usr/bin/egrep [-bchilnsv] [-e pattern_list] [-f file] [strings] [file...]

       /usr/xpg4/bin/egrep [-bchilnsvx] [-e pattern_list] [-f file] [strings] [file...]

       The  egrep  (expression grep) utility searches files for a pattern of characters and prints all lines that contain that pattern. egrep uses
       full regular expressions (expressions that have string values that use the full set of alphanumeric and special characters)  to	match  the
       patterns.  It uses a fast deterministic algorithm that sometimes needs exponential space.

       If  no  files  are  specified,  egrep  assumes standard input. Normally, each line found is copied to the standard output. The file name is
       printed before each line found if there is more than one input file.

   /usr/bin/egrep
       The /usr/bin/egrep utility accepts full regular expressions as described on the regexp(5) manual page, except for ( and ), ( and ),	{
       and }, < and >, and 
, and with the addition of:

       1.  A full regular expression followed by + that matches one or more occurrences of the full regular expression.

       2.  A full regular expression followed by ? that matches 0 or 1 occurrences of the full regular expression.

       3.  Full regular expressions separated by | or by a NEWLINE that match strings that are matched by any of the expressions.

       4.  A full regular expression that can be enclosed in parentheses ()for grouping.

       Be careful using the characters $, *, [, ^, |, (, ), and  in full regular expression, because they are also meaningful to the shell. It is
       safest to enclose the entire full regular expression in single quotes  '... '.

       The order of precedence of operators is [], then *?+, then concatenation, then | and NEWLINE.

   /usr/xpg4/bin/egrep
       The /usr/xpg4/bin/egrep utility uses the regular expressions described in the EXTENDED REGULAR EXPRESSIONS  section of the regex(5)  manual
       page.

       The following options are supported for both /usr/bin/egrep and /usr/xpg4/bin/egrep:

       -b	       Precede each line by the block number on which it was found. This can be useful in locating block numbers by context (first
		       block is 0).

       -c	       Print only a count of the lines that contain the pattern.

       -e pattern_list Search for a pattern_list (full regular expression that begins with a -).

       -f file	       Take the list of full regular expressions from file.

       -h	       Suppress printing of filenames when searching multiple files.

       -i	       Ignore upper/lower case distinction during comparisons.

       -l	       Print the names of files with matching lines once, separated by NEWLINEs. Does not repeat the names of files when the  pat-
		       tern is found more than once.

       -n	       Precede each line by its line number in the file (first line is 1).

       -s	       Work silently, that is, display nothing except error messages. This is useful for checking the error status.

       -v	       Print all lines except those that contain the pattern.

   /usr/xpg4/bin/egrep
       The following option is supported for /usr/xpg4/bin/egrep only:

       -x	Consider only input lines that use all characters in the line to match an entire fixed string or regular expression to be matching
		lines.

       The following operands are supported:

       file	       A path name of a file to be searched for the patterns. If no file operands are specified, the standard input is used.

   /usr/bin/egrep
       pattern	       Specify a pattern to be used during the search for input.

   /usr/xpg4/bin/egrep
       pattern	       Specify one or more patterns to be used during the search for input. This operand is treated as if  it  were  specified	as
		       -epattern_list.

USAGE

       See largefile(5) for the description of the behavior of egrep when encountering files greater than or equal to 2 Gbyte ( 2**31 bytes).

       See  environ(5)	for  descriptions of the following environment variables that affect the execution of egrep: LC_COLLATE, LC_CTYPE, LC_MES-
       SAGES, and NLSPATH.

       The following exit values are returned:

       0	If any matches are found.

       1	If no matches are found.

       2	For syntax errors or inaccessible files (even if matches were found).

       See attributes(5) for descriptions of the following attributes:

   /usr/bin/egrep
       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWcsu			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Not Enabled		   |
       +-----------------------------+-----------------------------+

   /usr/xpg4/bin/egrep
       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWxcu4			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Enabled			   |
       +-----------------------------+-----------------------------+

       fgrep(1), grep(1), sed(1), sh(1), attributes(5), environ(5), largefile(5), regex(5), regexp(5), XPG4(5)

       Ideally there should be only one grep command, but there is not a single algorithm that spans a wide enough range of space-time tradeoffs.

       Lines are limited only by the size of the available virtual memory.

   /usr/xpg4/bin/egrep
       The /usr/xpg4/bin/egrep utility is identical to /usr/xpg4/bin/grep -E (see grep(1)). Portable applications  should  use	/usr/xpg4/bin/grep
       -E.

								    23 May 2005 							  egrep(1)

Shell Programming and Scripting