02-24-2009
hi,
I'm using SunOS. I just ftp'ed pdf file to unix box from windows... Need to give this file as input to a shell script... Is there any possible way to convert this PDF file to readable text format in the unix box?
Thanks,
Geetha
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hello,
I'd like to view ps and pds file under Unix(Xwindow)
who could tell me the which software/command can work?
Thanks!
Vicky (2 Replies)
Discussion started by: vicky20000
2 Replies
2. UNIX for Dummies Questions & Answers
How can I open the page that I want to read when I used ghostview to read the pdf files?
Thanks. (0 Replies)
Discussion started by: new_hand
0 Replies
3. UNIX for Dummies Questions & Answers
Sometimes the gv does work well.But sometimes it doesn't work.
The error message:
...
error:/undefined in /GBpc-EUC-H
...
Can anybody help me?
Thanks. (2 Replies)
Discussion started by: new_hand
2 Replies
4. Shell Programming and Scripting
hi
I unload the table results from oracle to csv file foramt.
i need increse the width of each column using unix commands
could you pl tell me how to increase the width of each column to spefic width uisng sed unix command or na other unix commands
i have file name called report.csv
inside... (38 Replies)
Discussion started by: raosurya
38 Replies
5. Programming
Hi,
I need to uncompress a gzip and bzip file using java on unix solaris environment. I also need to retreive the header information of the file inorder to differentiate between gzip and bzip file. Please help
Pooja (0 Replies)
Discussion started by: wadhwa.pooja
0 Replies
6. Shell Programming and Scripting
I cannot get the following substitution ($ORACLE_SID) to work:
The variable ORACLE_SID is set to wardin my environment. It has been exported.
I have a text file called test.dat:
/u07/oradata/${ORACLE_SID}/extab/finmart/summit/ps_voucher_line_crnt_ex.dbf... (2 Replies)
Discussion started by: bradyd
2 Replies
7. Shell Programming and Scripting
Hi Experts,
I have a requirement where i need to setup a batch job which runs everymonth and move the pdf files from unix server to windows servers.
Could some body provide the inputs for this.
and also please provide the inputs on how to map the network dirve in the unix like that... (1 Reply)
Discussion started by: ger199901
1 Replies
8. UNIX for Dummies Questions & Answers
on a PROGRESS environment, i create an invoice which at printing it must generate both the .dat for the invoice that was sent to the printer and the .dat for the PDF version. we have never printed PDF files in our lp printer until recently. i've done a bit of googling and it comes down to that i... (2 Replies)
Discussion started by: pdf2ps
2 Replies
9. Shell Programming and Scripting
Hi I have created the following shell script file with the following content.
#!/bin/csh
set VAR1="abcxyz" << EOF
EOF
echo "---------------------"
echo "VAR1 = $VAR1"
echo "---------------------"
i am not able to echo the previously set VAR1.
Can any one suggested what could be wrong?... (5 Replies)
Discussion started by: srinu_b
5 Replies
10. HP-UX
I have a very strange issue. Now that we have a lot of our users using iPads to read statements, this is becoming more of an issue.
We have some financial statements that are generated into PDF format by an application that runs in HP-UX, and then we use uuencode to attach the statements to the... (4 Replies)
Discussion started by: lawadm1
4 Replies
LEARN ABOUT MINIX
pdftotext
pdftotext(1) General Commands Manual pdftotext(1)
NAME
pdftotext - Portable Document Format (PDF) to text converter (version 3.00)
SYNOPSIS
pdftotext [options] [PDF-file [text-file]]
DESCRIPTION
Pdftotext converts Portable Document Format (PDF) files to plain text.
Pdftotext reads the PDF file, PDF-file, and writes a text file, text-file. If text-file is not specified, pdftotext converts file.pdf to
file.txt. If text-file is '-', the text is sent to stdout.
OPTIONS
-f number
Specifies the first page to convert.
-l number
Specifies the last page to convert.
-r number
Specifies the resolution, in DPI. The default is 72 DPI.
-x number
Specifies the x-coordinate of the crop area top left corner
-y number
Specifies the y-coordinate of the crop area top left corner
-W number
Specifies the width of crop area in pixels (default is 0)
-H number
Specifies the height of crop area in pixels (default is 0)
-layout
Maintain (as best as possible) the original physical layout of the text. The default is to 'undo' physical layout (columns, hyphen-
ation, etc.) and output the text in reading order.
-raw Keep the text in content stream order. This is a hack which often "undoes" column formatting, etc. Use of raw mode is no longer
recommended.
-htmlmeta
Generate a simple HTML file, including the meta information. This simply wraps the text in <pre> and </pre> and prepends the meta
headers.
-bbox Generate an XHTML file containing bounding box information for each word in the file.
-enc encoding-name
Sets the encoding to use for text output. This defaults to "UTF-8".
-listenc
Lits the available encodings
-eol unix | dos | mac
Sets the end-of-line convention to use for text output.
-nopgbrk
Don't insert page breaks (form feed characters) between pages.
-opw password
Specify the owner password for the PDF file. Providing this will bypass all security restrictions.
-upw password
Specify the user password for the PDF file.
-q Don't print any messages or errors.
-v Print copyright and version information.
-h Print usage information. (-help and --help are equivalent.)
BUGS
Some PDF files contain fonts whose encodings have been mangled beyond recognition. There is no way (short of OCR) to extract text from
these files.
EXIT CODES
The Xpdf tools use the following exit codes:
0 No error.
1 Error opening a PDF file.
2 Error opening an output file.
3 Error related to PDF permissions.
99 Other error.
AUTHOR
The pdftotext software and documentation are copyright 1996-2004 Glyph & Cog, LLC. pdffonts(1), pdfimages(1), pdfinfo(1), pdftocairo(1),
pdftohtml(1), pdftoppm(1), pdftops(1)
22 January 2004 pdftotext(1)