08-20-2010
Hello Guys,
Thanks for your nice suggestion.......
Its working out ......
superbbbbbbbbbbbbbb.....................
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I need to look at a log file every half hour or so to make sure that some activity is going on there. I thought I would write a quick PERL script that would copy the current log file to the temp directory, compare it to the previous log file (from 30 minutes ago) and exit with an error status if... (1 Reply)
Discussion started by: Cbish68
1 Replies
2. Shell Programming and Scripting
Hi!
I have a need to do this in Perl.
script.pl -config file
The script would be doing a wget/LWP on a URL which is defined in the config file.
So when I run the script it should return either one of these conditions -
1) OK with exit status 0.
Should also print "wget URL"
2)... (6 Replies)
Discussion started by: jacki
6 Replies
3. Shell Programming and Scripting
hi guys
i have this very messy script, that looks in /var/log/messages.all for an error and reports if it finds the key works
how can i get it to look at more then one file, i.e /var/log/message.all *
so it looks in old logs as well
thanks
exit 0 if (isRenderNode(hostname));
my... (4 Replies)
Discussion started by: ab52
4 Replies
4. Shell Programming and Scripting
Hello everyone, I need to write a shell script for a file consisting of 3 columns, first column is frequency the second one is power and the last one is number of occurence. I basically need to get the power and the frequency corresponding to the highest number of occurrence number. Below is the... (6 Replies)
Discussion started by: johankor
6 Replies
5. Shell Programming and Scripting
Hi All,
I ahve requirement where I want to put the text file in into proper format. I am wondering how can i achieve that:-
Host/Alias Name IP Address Resolved
sinuiy01.infra.go2uti.com 10.240.8.158 N
sinuid20.devtst.go2uti.com 10.240.8.230 N
sinuid21.devtst.go2uti.com... (6 Replies)
Discussion started by: sharsour
6 Replies
6. Shell Programming and Scripting
Hi All,
I have the below text file from which I have to cut particular section starting from PTR_Security_Rpeorting.cpf to PTR_Security_Reporting_Env93_export.
Report Model............: "D:\Cognos_Publishing\tmp.a2R94KLQec"\PTR_Security_Reporting.cpf
Report Output Script....:... (4 Replies)
Discussion started by: Vikram_Tanwar12
4 Replies
7. Shell Programming and Scripting
Hi,
I need to compare 2 text files with around 60000 rows and 1 column. I need to compare these and write the mismatch data to 3rd file.
File1 - file2 = file3
wc -l file1.txt
58112
wc -l file2.txt
55260
head -5 file1.txt
101214200123
101214700300
101250030067
101214100500... (10 Replies)
Discussion started by: Divya Nochiyil
10 Replies
8. Shell Programming and Scripting
Is it possible to replace a line of text within a file while it's closed with a single command or a script? Please show me an example or point me to a webpage that shows an example. The file has this line of text:
LoginGraceTime 100
I want to replace it with the following:
... (2 Replies)
Discussion started by: wdg74
2 Replies
9. Shell Programming and Scripting
Hi,
I want to compare two columns from file1 with another two column of file2 and print matched and unmatched column like this
File1
1 rs1 abc
3 rs4 xyz
1 rs3 stu
File2
1 kkk rs1 AA 10
1 aaa rs2 DD 20
1 ccc ... (2 Replies)
Discussion started by: justinjj
2 Replies
10. Shell Programming and Scripting
Hello again,
I have put together a shell script using sed and some shell commands, and it runs pretty well when I am in terminal, but when I save it as a text file and invoke it through the terminal by typing its path, all I get are errors.
Can some one give me some hints as to what I am doing... (13 Replies)
Discussion started by: Paul Walker
13 Replies
LEARN ABOUT DEBIAN
pdf2txt
PDF2TXT(1) PDFMiner Manual PDF2TXT(1)
NAME
pdf2txt - extracts text contents of PDF files
SYNOPSIS
pdf2txt [option...] file...
DESCRIPTION
pdf2txt extracts text contents from a PDF file. It extracts all the text that is to be rendered programmatically, i.e. text represented as
ASCII or Unicode strings. It cannot recognize text drawn as images that would require optical character recognition. It also extracts the
corresponding locations, font names, font sizes, writing direction (horizontal or vertical) for each text portion. You need to provide a
password for protected PDF documents when its access is restricted. You cannot extract any text from a PDF document which does not have
extraction permission.
OPTIONS
-o file
Specifies the output file name. The default is to print the extracted contents to standand output in text format.
-p pageno[,pageno,...]
Specifies the comma-separated list of the page numbers to be extracted. Page numbers start at one. By default, it extracts text from
all the pages.
-c codec
Specifies the output codec.
-t type
Specifies the output format. The following formats are currently supported:
text
Text format. This is the default.
html
HTML format. It is not recommended.
xml
XML format. It provides the most information.
tag
"Tagged PDF" format. A tagged PDF has its own contents annotated with HTML-like tags. pdf2txt tries to extract its content streams
rather than inferring its text locations. Tags used here are defined in the PDF Reference, Sixth Edition[1] (S10.7 "Tagged PDF").
-D writing-mode
Specifies the writing mode of text outputs:
lr-tb
Left-to-right, top-to-bottom.
tb-rl
Top-to-bottom, right-to-left.
auto
Determine writing mode automatically
-M char-margin, -L line-margin, -W word-margin
These are the parameters used for layout analysis. In an actual PDF file, text portions might be split into several chunks in the
middle of its running, depending on the authoring software. Therefore, text extraction needs to splice text chunks. In the figure
below, two text chunks whose distance is closer than the char-margin is considered continuous and get grouped into one. Also, two lines
whose distance is closer than the line-margin is grouped as a text box, which is a rectangular area that contains a "cluster" of text
portions. Furthermore, it may be required to insert blank characters (spaces) as necessary if the distance between two words is greater
than the word-margin, as a blank between words might not be represented as a space, but indicated by the positioning of each word.
Each value is specified not as an actual length, but as a proportion of the length to the size of each character in question. The
default values are char-margin = 1.0, line-margin = 0.3, and W = 0.2, respectively.
-n
Suppress layout analysis.
-A
Force layout analysis for all the text strings, including text contained in figures.
-V
Enable detection of vertical writing.
-s scale
Specifies the output scale. This option can be used in HTML format only.
-m n
Specifies the maximum number of pages to extract. By default, all the pages in a document are extracted.
-P password
Provides the user password to access PDF contents.
-d
Increase the debug level.
EXAMPLES
Extract text as an HTML file whose filename is output.html:
$ pdf2txt -o output.html samples/naacl06-shinyama.pdf
Extract a Japanese HTML file in vertical writing:
$ pdf2txt -c euc-jp -D tb-rl -o output.html samples/jo.pdf
Extract text from an encrypted PDF file:
$ pdf2txt -P mypassword -o output.txt secret.pdf
SEE ALSO
dumppdf(1)
AUTHORS
Jakub Wilk <jwilk@debian.org>
Wrote this manual page for the Debian system.
Yusuke Shinyama <yusuke@cs.nyu.edu>
Author of PDFMiner and its original HTML documentation.
NOTES
1. PDF Reference, Sixth Edition
http://www.adobe.com/devnet/acrobat/pdfs/pdf_reference_1-7.pdf
pdf2txt 08/24/2011 PDF2TXT(1)