Hi Simon,
There may well be a smarter solution, but I have used the following approach to solve this problem.
Assume the file /tmp/MyNewArticleFile.rtf has the following contents:
cat /tmp/MyNewArticleFile.rtf
Times of India
Edition-1
Date:27 th May
Document 1 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 2 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 3 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 4 of 20
All blah blah goes here
Ad Page
Blah
================================
End of the Edition
Thanks
Editor
I have written a script that processes the above file to generate the output.
The assumption here is that the document has 20 pages.
When I run the script, I get 20 pages, split according to the document number.
cat /tmp/ArticleSpiltPage-1
Document 1 of 20
All blah blah goes here
Ad Page
Blah
================================
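The splitting script itself does not appear in this post, so here is a minimal sketch of one way to get the same result with awk. It is not necessarily the script I used. It assumes every article starts with a line of the form "Document N of M" and ends with a line made up only of "=" characters. The function name split_articles and its two arguments (input file, output prefix) are made up for illustration.

```shell
#!/bin/sh
# split_articles INPUT PREFIX  (hypothetical helper, not the original script)
# Writes each "Document N of M" block of INPUT to the file PREFIX followed by N.
split_articles() {
    awk -v prefix="$2" '
        # A header line such as "Document 3 of 20" starts a new output file.
        /^Document [0-9]+ of [0-9]+/ {
            if (writing) close(out)   # previous block had no separator line
            out = prefix $2           # $2 is the document number
            writing = 1
        }
        # Copy every line while we are inside a document block.
        writing { print > out }
        # A line of only "=" ends the block; close the file to free the descriptor.
        writing && /^=+$/ { close(out); writing = 0 }
    ' "$1"
}
```

With the sample file above it could be called as `split_articles /tmp/MyNewArticleFile.rtf /tmp/ArticleSpiltPage-`. Lines outside a document block (the edition header and the closing "Thanks / Editor" lines) are skipped, and the number of pages does not have to be hard-coded to 20.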