use strict;
use warnings;

my @arr1 = ("epithelial", "downregulation", "growth");
my @arr2 = ("Our data suggests that epithelial shape and growth control are unequally affected depending on how wt p53 function is impaired and whether partial or full tumor suppressor activity is lost", "The growth of epithelial tissue is downregulation");

# Print a marker for every (word, sentence) pair that matches.
foreach my $word (@arr1) {
    foreach my $sentence (@arr2) {
        # \Q...\E quotes any regex metacharacters in the word
        if ($sentence =~ /\b\Q$word\E\b/i) {
            print "<br>matched<br>";
        }
    }
}
In this case, how can I get the sentence with the most matches?
In the desired output below, the first sentence matches all three words, so it has the highest priority and is printed first.
The next sentence matches only two words, so it has the next priority.
Desired output:
The growth of epithelial tissue is downregulation
Our data suggests that epithelial shape and growth control are unequally affected depending on how wt p53 function is impaired and whether partial or full tumor suppressor activity is lost
How can I print the sentences in this order, with the best match first?
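One possible approach, as a minimal sketch: count how many of the words in @arr1 each sentence contains, then sort the sentences by that count, highest first. The helper name rank_sentences is made up for this example and is not part of the original post. Each word is counted at most once per sentence, which is an assumption about what "matched" should mean here.

```perl
use strict;
use warnings;

# Hypothetical helper: returns the sentences ordered by how many of the
# given words each one contains (most matches first).
sub rank_sentences {
    my ($words, $sentences) = @_;
    my %score;
    for my $i (0 .. $#$sentences) {
        # grep assigned to a scalar yields the number of matching words
        $score{$i} = grep { $sentences->[$i] =~ /\b\Q$_\E\b/i } @$words;
    }
    # Sort indexes by descending score; ties keep their original order
    my @order = sort { $score{$b} <=> $score{$a} || $a <=> $b } keys %score;
    return map { $sentences->[$_] } @order;
}

my @arr1 = ("epithelial", "downregulation", "growth");
my @arr2 = (
    "Our data suggests that epithelial shape and growth control are unequally affected depending on how wt p53 function is impaired and whether partial or full tumor suppressor activity is lost",
    "The growth of epithelial tissue is downregulation",
);

print "$_<br>\n" for rank_sentences(\@arr1, \@arr2);
```

If you only want the single best sentence, take the first element of the returned list.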
Hi,
I have sentences like this:
$sent=
Protein modeling studies reveal that the RG-rich region is part of a three to four strand antiparallel beta-sheet, which in other RNA binding protein functions as a platform for nucleic acid interactions.
Heterogeneous nuclear ribonucleoparticle... (19 Replies)
Just wondering if there's a better way to get these complete sentences into an array and keep the quotes intact? All the quotes make it look ugly to me but it works. I want to be able to refer to the full sentences by index. I've tried a few qw and qq/ approaches but what I have below seems about... (4 Replies)
Hi, I have just registered.
I am at university studying forensic computing, and we have to learn C++.
I have never done anything with C++ before and I am a bit stuck.
I need to create a program to display the contents of an array of characters forwards and in reverse.
Can anyone help me... (1 Reply)
Basically, to illustrate, I want to take a file with multiple lines,
C:\searching4theseletters.txt
a
b
c
Read this into an array
@ARRAY
and then use this to compare against another file
C:\inputletters.txt
b
o
a
c
n
a (9 Replies)
I'm currently working with dozens of FASTA files, and I'm tired of having to manually change the filename in my Perl script.
I'm trying to write a simple Perl script that'll create a 2-dimensional array containing the name of the folders and its contents.
For example, I would like the output... (6 Replies)
Hi there
I have a counter called
my $counter = 0;
I am trying to build an array that will have a name that is for example
my @array0 = ("some", "stuff");
but instead of hard-coding the "0" in the array name, I want to use whatever value the aforementioned $counter has in it...so
... (1 Reply)
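The question above is cut off, so the exact goal is an assumption, but the usual Perl answer to "a variable in a variable name" is a hash of arrays keyed by the counter rather than arrays named @array0, @array1, and so on. A minimal sketch follows; add_list and %arrays are names invented for this example.

```perl
use strict;
use warnings;

# Hypothetical helper: store a list under a numeric key instead of
# creating separately named arrays such as @array0, @array1, ...
sub add_list {
    my ($store, $counter, @items) = @_;
    $store->{$counter} = [@items];    # copy the items into an anonymous array
    return $store;
}

my %arrays;
my $counter = 0;
add_list(\%arrays, $counter, "some", "stuff");
$counter++;
add_list(\%arrays, $counter, "more", "stuff");

# Walk the stored lists in numeric key order
for my $key (sort { $a <=> $b } keys %arrays) {
    printf "array%d: %s\n", $key, join(", ", @{ $arrays{$key} });
}
```

This keeps everything under "use strict" and avoids symbolic references, which are the other way to build a variable name at run time.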
Hi ,
I have been trying to write a Perl script to do this job, but I am not able to achieve the desired result. Below is my code.
my $current_value=12345;
my @users=("bob","ben","tom","harry");
open DBLIST,"<","/var/tmp/DBinfo";
my @input = <DBLIST>;
foreach (@users)
{
my... (11 Replies)
ksh
eg
arrayname=(1 2 3 4 5)
I'm trying to display the individual contents of an array on a new line without using a loop, using one line of code.
output
1
2
3
4
5 (3 Replies)
Hi
I have two lists of patterns named A and B, with around 200 entries in each, and I want to extract all the sentences from a big text file which match at least one pattern from both A and B.
For example, pattern list A consists of :
ama
ani
ahum
mari
...
...
and pattern... (1 Reply)
Discussion started by: my_Perl
1 Replies
ucto(1) General Commands Manual ucto(1)
NAME
ucto - Unicode Tokenizer
SYNOPSIS
ucto [[options]] [input-file] [[output-file]]
DESCRIPTION
ucto tokenizes text files: it separates words from punctuation, splits sentences (and optionally paragraphs), and finds paired quotes.
Ucto is preconfigured with tokenisation rules for several languages.
OPTIONS
-c configfile
read settings from a file
-d value
set debug mode to 'value'
-e value
set input encoding. (default UTF8)
-f
disable filtering of special characters
-L language
Automatically selects a configuration file by language code. e.g. 'fr' will select the file tokconfig-fr from the installation
directory
-l
Convert to all lowercase
-u
Convert to all uppercase
-n
Assume one sentence per line on input
-m
Emit one sentence per line on output
--passthru
Don't tokenize, but perform input decoding and simple token role detection
-P
Disable Paragraph Detection
-Q
Enable Quote Detection. (this is experimental and may lead to unexpected results)
-S
Disable Sentence Detection
-s <string>
Set End-of-sentence marker. (Default <utt>)
-V
Show version information
-v
set Verbose mode
-x <DocId>
Output FoLiA XML, use the specified Document ID. (this disables usage of most other options: -nulPQvsS)
-F
Read a FoLiA XML document, tokenize it, and output the modified doc. (this disables usage of most other options: -nulPQvsS)
BUGS
likely
AUTHORS
Maarten van Gompel proycon@anaproy.nl
Ko van der Sloot Timbl@uvt.nl
2011 november 28 ucto(1)