Copying Text between two unique text patterns Post: 302119185

Post 302119185 by ennstate on Monday 28th of May 2007 03:03:36 PM, in UNIX for Dummies Questions & Answers

Hi Simon,
There may well be a smarter solution, but I used the following approach to solve this problem.

Assume the file /tmp/MyNewArticleFile.rtf has the following contents:

cat /tmp/MyNewArticleFile.rtf

HTML Code:

Times of India
Edition-1
Date:27 th May

Document 1 of 20

All blah blah goes here
Ad Page
Blah

================================

Document 2 of 20

All blah blah goes here
Ad Page
Blah

================================

Document 3 of 20

All blah blah goes here
Ad Page
Blah

================================
Document 4 of 20

All blah blah goes here
Ad Page
Blah

================================
End of the Edition
Thanks
Editor

I wrote the following script, which processes the file above to generate the output.
It assumes the document has 20 pages.

Code:

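#!/bin/ksh
# Write each "Document N of 20" section to its own file.
let page=1
while [[ $page -le 20 ]] ; do
    # Print from the "Document N of" header through the next line of '=' signs.
    # The trailing " of " keeps "Document 1" from also matching "Document 10".
    sed -n "/Document $page of /,/==========/p" /tmp/MyNewArticleFile.rtf > "/tmp/ArticleSplitPage-$page"
    ((page=page+1))
done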
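
The loop above hard-codes the page count. As a minimal sketch, assuming the first "Document N of M" header carries the correct total, the count could be read from the file instead. The temporary directory and the three-document sample below are made up for illustration; only the header and separator layout come from the file shown earlier.

```shell
#!/bin/sh
# Sketch: read the total from the first "Document N of M" header, then run
# a sed range once per page. Paths and sample data are hypothetical.
workdir=$(mktemp -d)
input="$workdir/MyNewArticleFile.rtf"

# Small sample in the same layout as the real file.
cat > "$input" <<'EOF'
Times of India

Document 1 of 3

First
================================

Document 2 of 3

Second
================================
Document 3 of 3

Third
================================
Thanks
EOF

# Extract M from every header line and keep only the first one.
total=$(sed -n 's/^Document [0-9][0-9]* of \([0-9][0-9]*\).*/\1/p' "$input" | head -n 1)

page=1
while [ "$page" -le "$total" ]; do
    # " of " after the number stops page 1 from also matching page 10, 11, ...
    sed -n "/Document $page of /,/==========/p" "$input" > "$workdir/ArticleSplitPage-$page"
    page=$((page + 1))
done

echo "split into $total files under $workdir"
```

If the file has no header line at all, total comes back empty and the loop does not run, so a real script would want to check for that first.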

Running the script above gives me 20 files, split according to the document number.

cat /tmp/ArticleSplitPage-1

HTML Code:

Document 1 of 20

All blah blah goes here
Ad Page
Blah

================================
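
The same split can probably be done in one pass over the file rather than one sed run per page. The following is only a sketch of that idea using awk, not the approach used above. It assumes every section starts with a line beginning "Document N of M" and ends at a line of '=' signs; the temporary directory and sample data are hypothetical.

```shell
#!/bin/sh
# Sketch: split on "Document N of M" headers in a single awk pass, without
# knowing the page count in advance. Paths and sample data are hypothetical.
workdir=$(mktemp -d)
input="$workdir/MyNewArticleFile.rtf"

cat > "$input" <<'EOF'
Times of India
Edition-1

Document 1 of 2

First article
================================

Document 2 of 2

Second article
================================
End of the Edition
EOF

# On a header line, point the output at a file named after the document
# number ($2). Copy lines while a file is selected, and stop once the
# separator line has been copied.
awk -v prefix="$workdir/ArticleSplitPage-" '
    /^Document [0-9]+ of [0-9]+/ { out = prefix $2 }
    out != ""                    { print > out }
    /^==========/ && out != ""   { close(out); out = "" }
' "$input"

ls "$workdir"
```

Lines outside any section, such as the edition header and the closing "End of the Edition", are skipped because no output file is selected at that point.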

Thanks,
Nagarajan Ganesan.
