Sponsored Content
Full Discussion: comparing sentences
Top Forums Shell Programming and Scripting comparing sentences Post 302248531 by vanitham on Saturday 18th of October 2008 06:20:49 AM
Old 10-18-2008
comparing sentences

Hi,

I have a file and that file contains the following sentences.

Code:
Here we show that a virus-encoded transcription factor, viral mRNA, cellular RNA-binding protein heterodimer G3BP/Caprin-1 (p137), translation initiation factors eIF4E and eIF4G, and ribosomal proteins are concentrated in the same subdomains of cytoplasmic DNA factories
The single-stranded DNA- and RNA-binding protein, Puralpha, has been implicated in many biological processes, including control of transcription of multiple genes, initiation of DNA replication, and RNA transport and translation
RNA-binding proteins are involved in processes such as protection of RNAs from RNase degradation, prevention of ribosome binding to mRNA, control of formation of secondary structures of the mRNA that permit or prevent translation initiation, and termination/antitermination of transcription in response to external signals
d)The La autoantigen is an RNA-binding protein that is involved in initiation and termination of RNA polymerase III transcription

Here is the code to match the sentence.

Code:
//Suppose if this is my input string

$str="Here we show that a virus-encoded transcription factor, viral mRNA, cellular RNA-binding protein heterodimer G3BP/Caprin-1 (p137), translation initiation factors eIF4E and eIF4G, and ribosomal proteins are concentrated in the same subdomains of cytoplasmic DNA factories";

open(FH,"sample.txt");
while(<FH>)
{
    if($_=~/$str/)
   {
      print "matched\n";
   }
  else
  {
     print "not match\n";
  }
}
close FH;

If the above string is matching it should print matched but even though that sentences is there still it is not matcing.

How should i match the above sentence?

How can i match sentences like these in perl?

With regards
Vanitha

Last edited by vanitham; 10-20-2008 at 12:47 AM..
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

spliting up sentences

hello, i'm looking to split up text into a list of words but can't figure it out, any help would be great. thanks steven (2 Replies)
Discussion started by: stevox
2 Replies

2. Shell Programming and Scripting

How to identify sentences from a text?

Hi, I have to identify sentences from this text. If i split these statements by this way: @sentence= split(/\.\W*/,$text); I will get these following things also in the output along with proper sentences. Biol Reprod. 2002 Mar;66(3):785-95. Egydio de Carvalho C, Tanaka H,... (2 Replies)
Discussion started by: vanitham
2 Replies

3. Shell Programming and Scripting

How to get exact match sentences?

Hi, I have sentences like this: $sent= Protein modeling studies reveal that the RG-rich region is part of a three to four strand antiparallel beta-sheet, which in other RNA binding protein functions as a platform for nucleic acid interactions. Heterogeneous nuclear ribonucleoparticle... (19 Replies)
Discussion started by: vanitham
19 Replies

4. Shell Programming and Scripting

Separate sentences from two texts, combine them

I havet two books, one in Swedish, the other in English. Two text files. I want to combine them into one, with each sentence having it's translation next to it. ------------ Text file one. Example sentence in English. Example 2 sentence 2 in English 2. -------------- Text file two. ... (2 Replies)
Discussion started by: LaraMej
2 Replies

5. UNIX for Dummies Questions & Answers

How to filter sentences??

Hi, I have few sentences here. $a1="Division of Hematology-Oncology, and Stem cell transplantation, Schneider Childrens Hospital, Albert Einstein College of Medicine, New Hyde Park, New York. "; $a2="Department of Cell Biology and Anatomy, College of Medicine, National Cheng Kung... (3 Replies)
Discussion started by: vanitham
3 Replies

6. Programming

compile a program in C with teradata sentences using cc

hi, I want to compile a program in C. It have a multiple calls to teradata. I have no idea how to compile in Aix. The compiler that I should be use is cc. I tried cc -G -KPIC tdsfbd0358.c this generates a tdsfbd0358.i and after I have no Idea what I have to make, a link? how?... (3 Replies)
Discussion started by: kajum
3 Replies

7. UNIX for Dummies Questions & Answers

extracting sentences that only contain a word

Hi guys Need your help how do I extract sentences with only a word i.e. today is hot hot very humid humid2 Sample output hot very (0 Replies)
Discussion started by: jamestan
0 Replies

8. Shell Programming and Scripting

extracting sentences that only contain a word

Hi guys Need your help how do I extract sentences with only a word i.e. today is hot hot very humid humid2 Sample output hot (6 Replies)
Discussion started by: jamestan
6 Replies

9. Shell Programming and Scripting

Using awk to find sentences.

I am trying to print out sentences that meets a regular expression in awk (I’m open to using other tools, too). I got the regular expression I want to use, "(\+ \{4\})" from user ripat in a grep forum. Unfortunately with grep I couldn't print only the sentence. While searching for awk... (8 Replies)
Discussion started by: danbroz
8 Replies

10. Shell Programming and Scripting

How to use $variable in conditional sentences?

Hello all I am doing a Makefile but I can't return the value of $var to use it in conditional sentences: #!/bin/sh GO=$(shell) go GOPATH=$(GO) env GOPATH make: @$(GOPATH) @if ; then mkdir -p "$(GOPATH)/bin" ; fi When I type "make", @$GOPATH returns /home/icvallejo/go... (5 Replies)
Discussion started by: icvallejo
5 Replies
VOIKKOGC(1)						      General Commands Manual						       VOIKKOGC(1)

NAME
voikkogc - test program for Voikko grammar checker SYNOPSIS
voikkogc [options] DESCRIPTION
voikkogc is a test program for grammar checking functionality in libvoikko, library of Finnish language tools. It reads sentences or para- graphs from stdin (one per line) and print the results to stdout. The results are structures containing information about grammar errors found in the input paragraph. OPTIONS
--tokenize Instead of looking for grammar errors, split input into tokens. The tokens are prefixed by type: "W" is a word, "P" is punctuation, "S" is whitespace, "U" is unknown and "E" is a prefix for error messages. --split-sentences Instead of looking for grammar errors, split input into sentences. The sentences are prefixed by type: "B" means that end of sen- tence is a probably correct, "P" means that end of sentence is a possibly correct (but probably this and the next identified sen- tence should be joined) and "E" means that sentence ends at the end of input. -n Prefix all grammar checker messages with line number of input data. accept_titles=n accept_unfinished_paragraphs=n accept_bulleted_lists=n Set the value of the specified boolean option. explanation_language=langcode Print human readable error explanation in the specified language. BUGS
Human readable error explanations are printed in UTF-8 regardless of current locale settings. SEE ALSO
voikkospell for common options of different Voikko test tools. AUTHOR
voikkogc and this manual page were written by Harri Pitkanen (hatapitk@iki.fi). 2010-05-06 VOIKKOGC(1)
All times are GMT -4. The time now is 04:48 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy