How to read pdf file in UNIX environment?
Post 302290897 by iamgeethuj on Tuesday 24th of February 2009 08:34:00 AM
Hi,
I'm using SunOS. I just FTP'ed a PDF file from Windows to a UNIX box, and I need to give this file as input to a shell script. Is there any way to convert this PDF file to a readable text format on the UNIX box?
Thanks,
Geetha
 

pdftotext(1)                General Commands Manual                pdftotext(1)

NAME
       pdftotext - Portable Document Format (PDF) to text converter (version 3.00)

SYNOPSIS
       pdftotext [options] [PDF-file [text-file]]

DESCRIPTION
       Pdftotext converts Portable Document Format (PDF) files to plain text.

       Pdftotext reads the PDF file, PDF-file, and writes a text file,
       text-file. If text-file is not specified, pdftotext converts file.pdf
       to file.txt. If text-file is '-', the text is sent to stdout.

OPTIONS
       -f number
              Specifies the first page to convert.

       -l number
              Specifies the last page to convert.

       -r number
              Specifies the resolution, in DPI. The default is 72 DPI.

       -x number
              Specifies the x-coordinate of the crop area top left corner.

       -y number
              Specifies the y-coordinate of the crop area top left corner.

       -W number
              Specifies the width of the crop area in pixels (default is 0).

       -H number
              Specifies the height of the crop area in pixels (default is 0).

       -layout
              Maintain (as best as possible) the original physical layout of
              the text. The default is to 'undo' physical layout (columns,
              hyphenation, etc.) and output the text in reading order.

       -raw   Keep the text in content stream order. This is a hack which
              often "undoes" column formatting, etc. Use of raw mode is no
              longer recommended.

       -htmlmeta
              Generate a simple HTML file, including the meta information.
              This simply wraps the text in <pre> and </pre> and prepends the
              meta headers.

       -bbox  Generate an XHTML file containing bounding box information for
              each word in the file.

       -enc encoding-name
              Sets the encoding to use for text output. This defaults to
              "UTF-8".

       -listenc
              Lists the available encodings.

       -eol unix | dos | mac
              Sets the end-of-line convention to use for text output.

       -nopgbrk
              Don't insert page breaks (form feed characters) between pages.

       -opw password
              Specify the owner password for the PDF file. Providing this
              will bypass all security restrictions.

       -upw password
              Specify the user password for the PDF file.

       -q     Don't print any messages or errors.

       -v     Print copyright and version information.

       -h     Print usage information. (-help and --help are equivalent.)

BUGS
       Some PDF files contain fonts whose encodings have been mangled beyond
       recognition. There is no way (short of OCR) to extract text from these
       files.

EXIT CODES
       The Xpdf tools use the following exit codes:

       0      No error.
       1      Error opening a PDF file.
       2      Error opening an output file.
       3      Error related to PDF permissions.
       99     Other error.

AUTHOR
       The pdftotext software and documentation are copyright 1996-2004
       Glyph & Cog, LLC.

SEE ALSO
       pdffonts(1), pdfimages(1), pdfinfo(1), pdftocairo(1), pdftohtml(1),
       pdftoppm(1), pdftops(1)

22 January 2004                                                    pdftotext(1)
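
For the original question (turning a PDF into text that a shell script can read), the man page above suggests something like the following. This is only a rough sketch: it assumes pdftotext is installed and on the PATH of the SunOS box, and the function names pdf_to_text and describe_exit are made up for illustration. The exit-code messages are taken from the EXIT CODES section above.

```shell
#!/bin/sh
# Sketch only: assumes pdftotext (from Xpdf) is installed and on PATH.

# describe_exit is a hypothetical helper that maps the exit codes
# listed in the man page to short messages.
describe_exit() {
    case "$1" in
        0) echo "no error" ;;
        1) echo "error opening the PDF file" ;;
        2) echo "error opening the output file" ;;
        3) echo "error related to PDF permissions" ;;
        *) echo "other error" ;;
    esac
}

# pdf_to_text is a hypothetical wrapper: convert PDF file $1 to text
# file $2, keeping the physical layout, and report failures on stderr.
pdf_to_text() {
    pdftotext -layout "$1" "$2"
    rc=$?
    if [ "$rc" -ne 0 ]; then
        msg=`describe_exit "$rc"`
        echo "pdftotext failed (exit $rc): $msg" >&2
    fi
    return "$rc"
}
```

With these functions loaded, a call such as `pdf_to_text report.pdf report.txt && ./myscript.sh report.txt` would convert the file and pass the text on only if the conversion succeeded (report.pdf and myscript.sh are placeholder names). If the script reads standard input instead, `pdftotext report.pdf - | ./myscript.sh` should also work, since a text-file of '-' sends the text to stdout. The -layout option is a choice, not a requirement; dropping it gives the default reading-order output.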