Hi Simon,
Though there could some other smarter solution,I have used the following approach to solve this problem.
Assuming we have the contents of the file /tmp/MyNewArticleFile.rtf as ,
cat /tmp/MyNewArticleFile.rtf
HTML Code:
Times of India
Edition-1
Date:27 th May
Document 1 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 2 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 3 of 20
All blah blah goes here
Ad Page
Blah
================================
Document 4 of 20
All blah blah goes here
Ad Page
Blah
================================
End of the Edition
Thanks
Editor
I have written the following script that process the above file to generate the output.
Here the assumption is the Document has 20 Pages.
Upon execution of the above script i get 20 pages spilt according to the Document no.
cat /tmp/ArticleSpiltPage-1
HTML Code:
Document 1 of 20
All blah blah goes here
Ad Page
Blah
================================
I have a file with 14million lines and I would like to extract all the unique lines from the file into another text file.
For example:
Contents of file1
happy
sad
smile
happy
funny
sad
I want to run a command against file one that only returns the unique lines (ie 1 line for happy... (3 Replies)
Hi all! Im trying to extract a portion of text from a file and put it into a new file. I need all the lines between <Placement> and </Placement> including the Placemark lines themselves. Is there a way to extract all instances of these and not just the first one found? I've tried using sed and... (4 Replies)
I'm attempting to write a script to identify users who have sudo access on a server. I only want to extract the ID's of the sudo users after a unique line of text. The list of sudo users goes to the EOF so I only need the script to start after the unique line of text. I already have a script to... (1 Reply)
Is there a trick for copying from something like a powerpoint into vi? Every time I try to copy text from something like a powerpoint to vi my spacing gets messed up. I think it has something to do with my .vimrc file. When I renamed it was able to copy it in just fine so can someone please... (2 Replies)
Hi All,
I'm hoping this is an easy question, but I'm having a weird problem trying to simply copy and paste text from MS Windows (XP) Notepad and then pasting into vi or vim in AIX. When I type "oslevel" I get "5.3.0.0". The problem is that once the text is pasted, there are sections of text... (2 Replies)
I would like to replace ], with ]]], between /* SECTION2-BEGIN */ and /* SECTION2-END */ in my file. My file contains the following information:
/* SECTION1-BEGIN */
,
/* SECTION1-END */
/* SECTION2-BEGIN */
,
/* SECTION2-END */
/*... (5 Replies)
Hi All,
I want to extract the text between some pattern which occurs repeatedly in a file. For example my input is like,
/home/.....
..........java:25: cannot find symbol
............
/home/......
/home/.......
I want to display... (2 Replies)
I need to filter the text in between two patterns and output that to a different file. Please help me how to do it.
Ex:
.............
<some random text>
.............
Pattern_1
<Few lines that need to be output to different file>
Pattern_2
................
...............
<more text in... (4 Replies)
awk -F "" '/<TestName>|<testname>|<Offerer>|<offerer>|<Line1>|<line1>|<City>|<city>|<State>|<state>/ {print $2, $3}' OFS='\t' UBE3A.xml > UBE3A.txt
Is it possible to use the code above to search for a pattern that is non-unique?
For example, if I wanted to capture the<MethodList>|<string>... (6 Replies)
I am trying to use awk to print the unique entries in $2
So in the example below there are 3 lines but 2 of the lines match in $2 so only one is used in the output.
File.txt
chr17:29667512-29667673 NF1:exon.1;NF1:exon.2;NF1:exon.38;NF1:exon.4;NF1:exon.46;NF1:exon.47 703.807... (5 Replies)
Discussion started by: cmccabe
5 Replies
LEARN ABOUT DEBIAN
qboobmsg
QBOOBMSG(1) General Commands Manual QBOOBMSG(1)NAME
qboobmsg - Qt application to read an reply to messages on various websites
SYNOPSIS
qboobmsg [-h] [-dqv] [-b backends] ...
qboobmsg [--help] [--version]
DESCRIPTION
Qt application allowing to read messages on various websites and reply to them.
Supported websites:
* aum ("Adopte un Mec" French dating website)
* bnporc (BNP Paribas French bank website)
* bouygues (Bouygues Telecom French mobile phone provider)
* dlfp (Da Linux French Page news website)
* ecrans (Ecrans French news website)
* fourchan (4chan image board)
* hds (Histoires de Sexe French erotic novels)
* inrocks (Les Inrocks French news website)
* lefigaro (Le Figaro French newspaper website)
* minutes20 (2 Minutes French newspaper website)
* newsfeed (Loads RSS and Atom feeds from any website)
* okc (OkCupid dating website)
* orange (Orange French mobile phone provider)
* phpbb (phpBB forum)
* presseurop (Presseurop website)
* sfr (SFR French mobile phone provider)
* taz (Taz newspaper website)
OPTIONS --version
show program's version number and exit
-h, --help
show this help message and exit
-b BACKENDS, --backends=BACKENDS
what backend(s) to enable (comma separated)
LOGGING OPTIONS -d, --debug
display debug messages
-q, --quiet
display only error messages
-v, --verbose
display info messages
--logging-file=LOGGING_FILE
file to save logs
-a, --save-responses
save every response
COPYRIGHT
Copyright(C) 2010-2011 Romain Bignon
For full COPYRIGHT see COPYING file with weboob package.
FILES
"~/.config/weboob/backends"
SEE ALSO
Home page: http://weboob.org/applications/qboobmsg
qboobmsg 0.c 04 May 2012 QBOOBMSG(1)