Sponsored Content
Full Discussion: PDF to Text Conversion
Top Forums UNIX for Advanced & Expert Users PDF to Text Conversion Post 302611717 by drl on Friday 23rd of March 2012 10:54:46 AM
Old 03-23-2012
Hi.

In the Debian repositories are many items that deal with PDFs. Among them is:
Code:
NAME
       pdftotext - Portable Document Format (PDF) to text converter (version
       3.02)

SYNOPSIS
       pdftotext [options] [PDF-file [text-file]]

DESCRIPTION
       Pdftotext converts Portable Document Format (PDF) files to plain text.

I found very little in the standard SuSE repositories. Perhaps you can search for better repositories for SuSE.

The package that includes pdftotext is:
Code:
 xpdf is a suite of tools for Portable Document Format (PDF) files. (These are
 sometimes called 'Acrobat' files after the name of Adobe's PDF software.)
 .
 The tools include xpdf, a PDF viewer (in the package xpdf-reader),
 and PDF converters (including to/from PostScript) (in the package
 xpdf-utils).

And a link that may be of use is: Xpdf At the site several links are provided for pre-compiled instances for numerous *nix including BSDs, Solaris, and HP-UX.

I just tried a test run of pdftotext on the documentation pdf for msort. It produced text just as promised -- pdftotext version 3.02

Best wishes ...cheers, drl
 

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

pdf to text

Hi, Can anyone help me in converting a PDF file into a text file? Thanks, sskb (2 Replies)
Discussion started by: sskb
2 Replies

2. AIX

PDF to postscript conversion poroblems

Hi Gurus, We are currently using a JAVA program that enables us to use some forms and convert them from PDF to print them postscript. It works great except that everytime it runs JAVA sucks 100% of the CPU. What can I do to reduce the impact? I am looking at: 1. Nice the java... (1 Reply)
Discussion started by: Luck
1 Replies

3. HP-UX

pdftotext / PDF conversion to .txt binaries

Good day, I've been trying to look for a way to compile the Xpdf sources in our HP-UX server, but have been failing to do so because there is no GCC installed, and I don't have privileges to install GCC. I was looking for a functionality to convert PDF files to .txt, which is exactly like the... (2 Replies)
Discussion started by: mike_s_6
2 Replies

4. Shell Programming and Scripting

Pdf to text conversion and vice versa

Hi, I have a pdf file. i want to convert it to text file and do some work on it and later want to convert it back to pdf. Can this be done via unix? or Is there a way unix can directly work on PDF file? (2 Replies)
Discussion started by: saltysumi
2 Replies

5. Shell Programming and Scripting

conversion from ps to pdf

My folder contains many files with .ps extensions. How to convert all of these to .pdf at a time using a shell script /command. Help me. (1 Reply)
Discussion started by: hravisankar
1 Replies

6. UNIX and Linux Applications

Word doc to pdf conversion?

I am trying to find a way to convert a Word doc saved in the XML format to a PDF file (that looks like the original Word doc). Yes, there are all kinds of ways to do this interactively. What I need is something that can be run as a background process so that when XML files are placed in a... (0 Replies)
Discussion started by: garzon
0 Replies

7. Shell Programming and Scripting

Pdf to text

Is there a way using the pdf to text utility to convert all the pdf in a given directory? So instead of one at a time: pdftotext pdftotext hp-manual.pdf hp-manual.txt a directory of 50 pdf files would be converted: pdftotext /home/dnascopev/Desktop/PDF.pdf /home/dnascopev/Desktop/PDF.txt ... (6 Replies)
Discussion started by: cmccabe
6 Replies

8. Shell Programming and Scripting

Text to pdf

Is there a way to covert te attached text file to a pdf? I have tried: enscript-p output.ps article.txt PRE.cjk { font-family: "WenQuanYi Micro Hei",monospace; }PRE.ctl { font-family: "Lohit Hindi",monospace; }P { margin-bottom: 0.08in; } awk '{ A=$2; next} END { for (i in A) print... (7 Replies)
Discussion started by: cmccabe
7 Replies

9. HP-UX

Best text to pdf converter for Hp/UX 11.31

Hi, Can anyone tell me the best converter I can use to convert text to pdf for HP/UX 11.31? Thanks, Linda (0 Replies)
Discussion started by: lnemitz
0 Replies
PDF::API2::Basic::PDF::Dict(3pm)			User Contributed Perl Documentation			  PDF::API2::Basic::PDF::Dict(3pm)

NAME
PDF::API2::Basic::PDF::Dict - PDF Dictionaries and Streams. Inherits from PDF::Objind INSTANCE VARIABLES
There are various special instance variables which are used to look after, particularly, streams. Each begins with a space: stream Holds the stream contents for output streamfile Holds the stream contents in an external file rather than in memory. This is not the same as a PDF file stream. The data is stored in its unfiltered form. streamloc If both ' stream' and ' streamfile' are empty, this indicates where in the source PDF the stream starts. METHODS
$d->outobjdeep($fh) Outputs the contents of the dictionary to a PDF file. This is a recursive call. It also outputs a stream if the dictionary has a stream element. If this occurs then this method will calculate the length of the stream and insert it into the stream's dictionary. $d->read_stream($force_memory) Reads in a stream from a PDF file. If the stream is greater than "PDF::Dict::mincache" (defaults to 32768) bytes to be stored, then the default action is to create a file for it somewhere and to use that file as a data cache. If $force_memory is set, this caching will not occur and the data will all be stored in the $self->{' stream'} variable. $d->val Returns the dictionary, which is itself. perl v5.14.2 2014-03-09 PDF::API2::Basic::PDF::Dict(3pm)
All times are GMT -4. The time now is 02:53 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy