06-23-2015
Extract all the sentences that matched two patterns
Hi
I have two lists of patterns named A and B consisting of around 200 entries in each and I want to extract all the sentences from a big text file which match atleast one pattern from both A and B.
For example, pattern list A consists of :
ama
ani
ahum
mari
...
...
and pattern list B consists of :
kok
sam
lai
mit
....
....
The sample input text file looks like below:
HTML Code:
This is first sentence with only one pattern ama.
This is second sentence with ahum and kok patterns.
This is the third sentence with only one pattern mit.
This is the fourth sentence consisting of samson and marigold.
This is the fifth sentence with more patterns such as ama, ani, lai and mit.
This is the sixth sentence with no patterns.
..
..
..
The sample output of the script for the given input text file is given below
HTML Code:
This is second sentence with ahum and kok patterns.
This is the fourth sentence consisting of samson and marigold.
This is the fifth sentence with more patterns such as ama, ani, lai and mit.
..
..
..
I need help to write a script to extract all the sentences that matches atleast one pattern from each A and B. That is, the output contains atleast one GREEN pattern and one RED pattern for every sentence. Thanks in advance.
This User Gave Thanks to my_Perl For This Post:
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi guys,i got this problem which is..i need to find those sentences with date inside and extract them out,the input is somehow like this
eg:
$DATA42.GANTRY2.GA161147 DISKFILE 2007-10-16 11:56:45 SUPER.OPR \NETS.$Y4CB.#IN
... (4 Replies)
Discussion started by: cyberray
4 Replies
2. Programming
Hi ,
i have a text file that contain a story
How do i extract the out all the sentences that contain the word Mon. in C++
I only want to show those sentences that contain the word mon
eg.
Monkey on a tree.
Rabbit jumping around the tree.
I am very rich, I have lots of money.
Today... (1 Reply)
Discussion started by: xiaojesus
1 Replies
3. Shell Programming and Scripting
Hi,
I have an array with 3 words in it and i have to match all the array contents and display the exact matched sentence i.e all 3 words should match with the sentence.
Here are sentences.
$arr1="Our data suggests that epithelial shape and growth control are unequally affected depending... (5 Replies)
Discussion started by: vanitham
5 Replies
4. Shell Programming and Scripting
Hi,
I have a master file that i need to split into multiple files based on matched patterns. sample of my data as follows:-
scaff_1 a e 123 130 c_scaff_100
scaff_1 a e 132 138 c_scaff_101
scaff_1 a e 140 150 ... (2 Replies)
Discussion started by: redse171
2 Replies
5. Shell Programming and Scripting
I am wanting to fetch the content of the table within a file
the table begins with data label like
N Batch Mn(I) RMSdev I/rms Rmerge Number Nrej Cm%poss AnoCmp MaxRes CMlplc SmRmerge SmMaxRes $$ $$
. #columns of data
.
.
.
.
.
$$
I tried the command
awk... (18 Replies)
Discussion started by: piynik
18 Replies
6. Shell Programming and Scripting
Hi,
i need help to delete all the lines between 2 matched patterns and the first pattern must be deleted too. sample as follows:
inputfile.txt
>kump_1
...........................
...........................
>start_0124
dgfhghgfh
fgfdgfh
fdgfdh
>kump_2
............................. (7 Replies)
Discussion started by: redse171
7 Replies
7. Shell Programming and Scripting
Hi,
I need help to match pattern started with "RW" in file 1 and with pattern in $1 in file 2 as follows:-
File 1
BH /TOTAL=466(423); /POSITIVE=300(257); /UNKNOWN=25(25);
BH /F_P=141(141); /F_N=136; /P=4;
CC /TAX=!?; /MAX-R=2;
CC /VER=2;
RW P9610, AR_BSU , T; PAE25, AE_E57... (10 Replies)
Discussion started by: redse171
10 Replies
8. Shell Programming and Scripting
Hi,
I am trying to extract some patterns from a line. The input file is space delimited and i could not use column to get value after "IN" or "OUT" patterns as there could be multiple white spaces before the next digits that i need to print in the output file . I need to print 3 patterns in a... (3 Replies)
Discussion started by: redse171
3 Replies
9. Shell Programming and Scripting
Hi
I have a big text file. I want to extract all the sentences that matches at least 70% (seventy percent) of the words from each sentence based on a word list called A.
Say the format of the text file is as given below:
This is the first sentence which consists of fifteen words... (4 Replies)
Discussion started by: my_Perl
4 Replies
10. Shell Programming and Scripting
My input looks like this.
# Lot Of CODE Before
AppType_somethinglese=$(cat << EOF
AppType_test1='test-tool/blatest-tool-ear'
AppType_test2='test/blabla-ear'
# Lot Of CODE After
I want to print text betwen 1) _ and = and 2)/ and ' from each line
and exclude lines with "EOF".
Output... (2 Replies)
Discussion started by: kchinnam
2 Replies
nmea(n) NMEA protocol implementation nmea(n)
__________________________________________________________________________________________________________________________________________________
NAME
nmea - Process NMEA data
SYNOPSIS
package require Tcl 8.2
package require nmea ?0.1.1?
::nmea::open_port port ?speed?
::nmea::open_file file rate
::nmea::input sentence
::nmea::configure_port settings
::nmea::close_port
::nmea::close_file
::nmea::do_line
::nmea::log file
::nmea::checksum data
::nmea::write sentence data
_________________________________________________________________
DESCRIPTION
This package provides a standard interface for writing software which recieves NMEA standard input data. It allows for reading data from
COM ports, files, or programmatic input. It also supports the checksumming and logging of incoming data. After parsing, input is dis-
patched to user defined handler commands for processing. To define a handler, create a proc with the NMEA sentence name in the ::nmea
namespace. For example, to process GPS fix data use "proc ::nmea::GPGSA". The proc must take one argument, which is a list of the data val-
ues.
COMMANDS
::nmea::open_port port ?speed?
Open the specified COM port and read NMEA sentences when available. Port speed is set to 4800bps by default or to speed.
::nmea::open_file file rate
Open file file and read NMEA sentences, one per line, at the rate by rate in milliseconds. The file format may omit the leading $
and/or the checksum. If rate is <= 0 then lines will only be processed when a call to do_line is made. The rate may be adjusted by
setting ::nmea::nmea(rate).
::nmea::input sentence
Processes and dispatches the supplied sentence. If sentence contains no commas it is treated as a Tcl list, otherwise it must be
standard comma delimited NMEA data, with an optional checksum and leading $.
::nmea::configure_port settings
Changes the current port settings. settings has the same format as fconfigure -mode.
::nmea::close_port
Close the open port
::nmea::close_file
Close the open file
::nmea::do_line
If there is a currently open file, this command will read and process a single line from it. Returns the number of lines read.
::nmea::log file
Starts or stops file logging. If a file name is specified then all NMEA output will be logged to the file in append mode. If file is
an empty string then any logging will be stopped.
::nmea::checksum data
Returns the checksum of the supplied data
::nmea::write sentence data
If there is a currently open port, this command will write the specified sentence and data in proper NMEA checksummed format.
VARIABLES
::nmea::checksum
A boolean value which determines whether incoming sentences are validated or not.
::nmea::rate
When reading from a file this sets the rate that lines are processed in milliseconds.
BUGS, IDEAS, FEEDBACK
This document, and the package it describes, will undoubtedly contain bugs and other problems. Please report such in the category nmea of
the Tcllib SF Trackers [http://sourceforge.net/tracker/?group_id=12883]. Please also report any ideas for enhancements you may have for
either package and/or documentation.
KEYWORDS
gps, nmea
COPYRIGHT
Copyright (c) 2006-2007, Aaron Faupell <afaupell@users.sourceforge.net>
nmea 0.1 nmea(n)