06-23-2009
Regular Expression to match repeated characters
Hello All
I have file which contain sample data like below -
test.txt
----------------------------------------------
jambesh aaa india
trxxx
sdasd
mentor
asss
light
train
bbblah
---------------------------------------------
I want to write a regX which would print only those line which contain patteren like xxx or aaa bbb etc.
When ever the same character repeated atleast 3 times that line should print
Here the output should :
------------------------
jambesh aaa india
trxxx
asss
bbblah
etc.
Any suggestion ?
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi
I have a file with data arranged into columns. The first column is the chromosome name.
When I use grep to subset only rows with chr1, I get chr1 but also chr10, chr11,..
How do I get only rows with chr1?
grep chr1 filein > fileout
head fileout
chr1 59757841
chr11 108258691 ... (2 Replies)
Discussion started by: jdhahbi
2 Replies
2. UNIX for Dummies Questions & Answers
Hi,
I'm struggling with a regex that would match a 'b' that follows an 'a' and is at the end of a string of non-white characters. For example:
Line 1: aba abab b abb aab bab baa
I can find the right strings but I'm lacking knowledge of how to "discard" the bits that precede bs.... (2 Replies)
Discussion started by: machinogodzilla
2 Replies
3. Shell Programming and Scripting
Hi all,
any idea how to match the following:
char*<no or any string or space> buf and
char *<no or any string or space> buf
i need to capture the buf characters too.
currently i need two checks to cover this:
#search char* <any string> buf or char *<any string> buf
@noarray =... (2 Replies)
Discussion started by: ChaMeN
2 Replies
4. Shell Programming and Scripting
Hi all,
I am looking for a regex syntax to match repeated appearance. Likes,
']+]+' matches for string '65A SOME MORE AND 78B'
Now, this gets messy if I need to extract all such repeated appearance. I don't want to write ] four or five times for matching repeated appearance.
Thanks in... (2 Replies)
Discussion started by: guruparan18
2 Replies
5. Shell Programming and Scripting
I am trying to match a similar line using grep with regular expression
the line is
/remote/mac/pbbbb/abc/def/hij/hop/include/abc/tif/element/test/testfiles/Office.cpp:57: const OfficeType& getType().get() const;
I just need to extract the bold characters using grep with regular expression.... (5 Replies)
Discussion started by: prasbala
5 Replies
6. Shell Programming and Scripting
hi everyone
suppose we have two scenario
echo ABCD | grep \{4\}
DATE
echo SYSDATE | grep \{4\}
SYSDATE
i want to match the string of four length only please help (5 Replies)
Discussion started by: aishsimplesweet
5 Replies
7. Shell Programming and Scripting
Hi,
I read the book of <<unix shell programming>>. The regular expression ^\(.\)\1 matches the first character on the line and stores it in register 1. Then the expression matches whatever is stored in the register 1, as specified by the \1. The net effect of this regular expression is to match... (2 Replies)
Discussion started by: jianma
2 Replies
8. Homework & Coursework Questions
Hi Everybody!
I need some help with a regular expression in Perl that will match files named messages, but also files named message.1, message.2 and so on. So really I need one that will find messages and messages that might be followed by a period and a digit without matching other files like... (2 Replies)
Discussion started by: Hax0rc1ph3r
2 Replies
9. Shell Programming and Scripting
I cannot seem to get this to work correct:
my ($k, $v) = split(/F/, $fc{$DIR}{symbolic}, 2);
Below is the input (the $fc{$DIR}{symbolic} variable):
QMH2562 FW:v5.06.03 DVR:v8.03.07.15.05.09-kbut i also need it to break on FV:
Emulex NC553i FV4.2.401.6 DV8.3.5.86.2pthe code above... (2 Replies)
Discussion started by: rusted_planet
2 Replies
10. Shell Programming and Scripting
echo 20110101 | awk '{ print match($0,/^((17||18||19||20)|)-*(|0|1)-*(|0||3)$/))
I am getting a match for the above, where as it shouldn't, as there is no hyphen in the echoed date.
Another question is what is the difference between || and | in the above statement (4 Replies)
Discussion started by: tostay2003
4 Replies
LEARN ABOUT DEBIAN
g2p-sk
g2p-sk(1) USER COMMANDS g2p-sk(1)
NAME
g2p-sk - phonetic transcription for Slovak
SYNOPSIS
g2p-sk [--color] [--dl debug level] [--help] [--stats] [--ofile <file_name>] [<input file>]
DESCRIPTION
The phonetic transcription is essential for some linguistic or speech recognition applications. Depending on the language either rule based
or statistical approach is being used. g2p-sk implements the rule based approach but in the future it may be replaced by statistical one.
Each input word consisting of the sequence of graphemes is transcribed in to the sequence of phones in the SAMPA coding. If no input file
is specified, the standard input is expected. If input file is used then the output is written in to the file as well. The filename is
input filename with the extension "_trans.txt".
The input output code page is ISO 8859-2. To use it with different CP use some CP converter and pipes. For example to have input and output
in UTF-8 use (for interactive use): filterm UTF8-iso2 iso2-UTF8 g2p-sk or (for batch processing) iconv -f UTF-8 -t ISO_8859-2 | g2p-sk |
iconv -f ISO_8859-2 -t UTF-8
Performance of the phonetic transcription depend on the morphematic segmentation. To improve the quality of the morphematic segmentation is
possible to replace the small version of the simple morphematic dictionary in the /usr/share/g2p_sk/Exceptions/morfemy.ddat with the better
one. The syllabic segmentation is as important as morphematic one. The syllabic segmentation is provided by sylseg-sk package.
The design of the g2p-sk is language dependent. To use it for another language the all rules need to be rewritten.
OPTIONS
--color
Enable color output.
--dl 1..5
Set the debug level. Control the amount of displayed information The debug level 0 displays nothing. The maximum level 5 displays
full debugging report. The default debug level is 1.
--help Display a short help text
--ofile <file_name>
Write output also in to given file.
--stats
Count and display statistic for each phone
EXAMPLES
Use standard input and debug level 3:
g2p-sk --dl 3
Process all the from file aaa.txt:
g2p-sk aaa.txt
EXIT STATUS
g2p-sk returns a zero if it succeeds to process all the input words
AUTHOR
Jozef Ivanecky (dodo (at) kanoistika.sk)
SEE ALSO
sylseg-sk(1), filterm(1), iconv(1), konwert(1)
version 0.4 May 17, 2009 g2p-sk(1)