Egrep find word that occurs twice in a row Post: 303005570

Sponsored Content

Top Forums UNIX for Beginners Questions & Answers Egrep find word that occurs twice in a row Post 303005570 by Don Cragun on Thursday 19th of October 2017 06:33:11 PM

10-19-2017

Registered User

According to the standards, extended regular expressions do not have back-references; only basic regular expressions have back-references. Therefore, with a standards conforming version of egrep (which the standards specify as grep -E (not egrep), it is almost impossible to find a variable string that appears twice on a line.

If you use grep instead of egrep (as Scrutinizer suggested in post #3), you can use it to print lines that have a string matching the basic regular expression (AKA BRE) pattern followed by a second occurrence of the same string.

The command:

Code:

grep '/.*|.*/'

and the command:

Code:

grep '/.*\|.*/'

will both print lines that contain a / immediately following by any string of 0 or more characters followed by a | followed by any string of 0 or more characters followed by a / (which does not seem to in any way match what you said you're looking for).

If you're looking for a string of one or more lower-case alphabetic characters (in a locale where the underlying codeset is a superset of ASCII) immediately followed by a by a duplicate of that same string (with nothing between them), you could get that using the grep command:

Code:

grep '\([a-z]+\)\1'

and if you wanted to find two adjacent words that appear at the start of a line or immediately follow a space and are followed by a space or the end of a line that occur next to each other separated by a single space, that would be something like:

Code:

grep -e '^\([a-z\) \1$' -e '^\([a-z\) \1 ' -e ' \([a-z\) \1 ' -e ' \([a-z\) \1$'

As noted by Scrutinizer in post #7, the above BREs are incorrect. The corrected form (assuming there is a single space character between words) is:

Code:

grep -e '^\([a-z][a-z]*\) \1$' -e '^\([a-z][a-z]*\) \1 ' -e ' \([a-z][a-z]*\) \1 ' -e ' \([a-z][a-z]*\) \1$'

In the above command the first BRE looks for two identical lower-case words alone on a line, the 2nd BRE looks for two identical words at the start of a line followed by one or more other words, the 3rd BRE looks for two identical words following one or more other words an followed by one or more other words, and the last BRE looks for two identical lower-case words at the end of a line following one or more other words.

Some versions of grep do not conform to the standards unless additional parameters are specified to force standards conformance. Without knowing what operating system you're using, we have no way of knowing if this problem might affect you.

Last edited by Don Cragun; 10-19-2017 at 11:00 PM.. Reason: Fix typos in BREs noted by Scrutinizer.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

TO find the word which occurs maximum number of times

Hi Folks !!!!!!!!!!!!!!!!!!! My Requirement is............. i have a input file: 501,501.chan 502,502.anand 503,503.biji 504,504.raja 505,505.chan 506,506.anand 507,507.chan and my o/p should be chan->3 i.e. the word which occurs maximum number of times in a file should be...

2. Shell Programming and Scripting

find a word in a file, and change a word beneath it ??

Hi all, I have a file with lines written somewhat like this. aaaa ccc aa linux browse = no xssxw cdcedc dcsdcd csdw police dwed dwd browse = no cdecec

3. Shell Programming and Scripting

Looking for a single line to count how many times one character occurs in a word...

I've been looking on the internet, and haven't found anything simple enough to use in my code. All I want to do is count how many times "-" occurs in a string of characters (as a package name). It seems it should be very simple, and shouldn't require more than one line to accomplish. And this is...

4. Shell Programming and Scripting

Need to replace the first word of a line if it occurs again in the next line(shell)

Hi folks, have a look into the attachment, i am not familiar with unix, can you please help me in this regard. thanks in advance, :) regards, Geeko

5. Shell Programming and Scripting

Find and replace a word in all the files (that contain the word) under a directory

Hi Everyone, I am looking for a simple way for replacing all the files under a directory that use the server "xsgd1234dap" with "xsdr3423pap". For Example: In the Directory, $pwd /home/nick $ grep -l "xsgd1234dap" *.sh | wc -l 119 I have "119" files that are still using...

6. UNIX for Dummies Questions & Answers

Find EXACT word in files, just the word: no prefix, no suffix, no 'similar', just the word

I have a file that has the words I want to find in other files (but lets say I just want to find my words in a single file). Those words are IDs, so if my word is ZZZ4, outputs like aaZZZ4, ZZZ4bb, aaZZZ4bb, ZZ4, ZZZ, ZyZ4, ZZZ4.8 (or anything like that) WON'T BE USEFUL. I need the whole word...

7. Shell Programming and Scripting

perl lwp find word and print next word :)

hi all, I'm new there, I'm just playing with perl and lwp and I just successfully created a script for log in to a web site with post. I have a response but I would like to have something like this: I have in my response lines like: <div class="sender">mimi020</div> <some html code.....>...

8. Shell Programming and Scripting

How to find a phrase and pull all lines that follow until the phrase occurs again?

I want to burst a report by using the page number value in the report header. Each section starts with *PAGE NO:* 1 Each section might have several pages, but the next section always starts back at 1. So I want to find the "*PAGE NO:* 1" value and pull all lines that follow until "*PAGE NO:* 1"...

9. Shell Programming and Scripting

Find word in a line and output in which line the word occurs / no. of times it occurred

I have a file: file.txt, which contains the following data in it. This is a file, my name is Karl, what is this process, karl is karl junior, file is a test file, file's name is file.txt My name is not Karl, my name is Karl Joey What is your name? Do you know your name and...

10. Shell Programming and Scripting

Find a word and increment the number in the word & save into new files

Hi All, I am looking for a perl/awk/sed command to auto-increment the numbers line in file, P1.tcl: run_build_model sparc_ifu_dec run_drc set_faults -model path_delay -atpg_effectiveness -fault_coverage add_delay_paths P1 set_atpg -abort_limit 1000 run_atpg -ndetects 1000 I would like...

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

TO find the word which occurs maximum number of times

Discussion started by: aajan

2. Shell Programming and Scripting

find a word in a file, and change a word beneath it ??

Discussion started by: vikas027

3. Shell Programming and Scripting

Looking for a single line to count how many times one character occurs in a word...

Discussion started by: Shingoshi

4. Shell Programming and Scripting

Need to replace the first word of a line if it occurs again in the next line(shell)

Discussion started by: geeko

5. Shell Programming and Scripting

Find and replace a word in all the files (that contain the word) under a directory

Discussion started by: filter

6. UNIX for Dummies Questions & Answers

Find EXACT word in files, just the word: no prefix, no suffix, no 'similar', just the word

Discussion started by: chicchan

7. Shell Programming and Scripting

perl lwp find word and print next word :)

Discussion started by: vogueestylee

8. Shell Programming and Scripting

How to find a phrase and pull all lines that follow until the phrase occurs again?

Discussion started by: Scottie1954

9. Shell Programming and Scripting

Find word in a line and output in which line the word occurs / no. of times it occurred

Discussion started by: anuragpgtgerman

10. Shell Programming and Scripting

Find a word and increment the number in the word & save into new files

Discussion started by: jypark22