Sponsored Content
Full Discussion: PDF to Text Conversion
Top Forums UNIX for Advanced & Expert Users PDF to Text Conversion Post 302611717 by drl on Friday 23rd of March 2012 10:54:46 AM
Old 03-23-2012
Hi.

In the Debian repositories are many items that deal with PDFs. Among them is:
Code:
NAME
       pdftotext - Portable Document Format (PDF) to text converter (version
       3.02)

SYNOPSIS
       pdftotext [options] [PDF-file [text-file]]

DESCRIPTION
       Pdftotext converts Portable Document Format (PDF) files to plain text.

I found very little in the standard SuSE repositories. Perhaps you can search for better repositories for SuSE.

The package that includes pdftotext is:
Code:
 xpdf is a suite of tools for Portable Document Format (PDF) files. (These are
 sometimes called 'Acrobat' files after the name of Adobe's PDF software.)
 .
 The tools include xpdf, a PDF viewer (in the package xpdf-reader),
 and PDF converters (including to/from PostScript) (in the package
 xpdf-utils).

And a link that may be of use is: Xpdf At the site several links are provided for pre-compiled instances for numerous *nix including BSDs, Solaris, and HP-UX.

I just tried a test run of pdftotext on the documentation pdf for msort. It produced text just as promised -- pdftotext version 3.02

Best wishes ...cheers, drl
 

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

pdf to text

Hi, Can anyone help me in converting a PDF file into a text file? Thanks, sskb (2 Replies)
Discussion started by: sskb
2 Replies

2. AIX

PDF to postscript conversion poroblems

Hi Gurus, We are currently using a JAVA program that enables us to use some forms and convert them from PDF to print them postscript. It works great except that everytime it runs JAVA sucks 100% of the CPU. What can I do to reduce the impact? I am looking at: 1. Nice the java... (1 Reply)
Discussion started by: Luck
1 Replies

3. HP-UX

pdftotext / PDF conversion to .txt binaries

Good day, I've been trying to look for a way to compile the Xpdf sources in our HP-UX server, but have been failing to do so because there is no GCC installed, and I don't have privileges to install GCC. I was looking for a functionality to convert PDF files to .txt, which is exactly like the... (2 Replies)
Discussion started by: mike_s_6
2 Replies

4. Shell Programming and Scripting

Pdf to text conversion and vice versa

Hi, I have a pdf file. i want to convert it to text file and do some work on it and later want to convert it back to pdf. Can this be done via unix? or Is there a way unix can directly work on PDF file? (2 Replies)
Discussion started by: saltysumi
2 Replies

5. Shell Programming and Scripting

conversion from ps to pdf

My folder contains many files with .ps extensions. How to convert all of these to .pdf at a time using a shell script /command. Help me. (1 Reply)
Discussion started by: hravisankar
1 Replies

6. UNIX and Linux Applications

Word doc to pdf conversion?

I am trying to find a way to convert a Word doc saved in the XML format to a PDF file (that looks like the original Word doc). Yes, there are all kinds of ways to do this interactively. What I need is something that can be run as a background process so that when XML files are placed in a... (0 Replies)
Discussion started by: garzon
0 Replies

7. Shell Programming and Scripting

Pdf to text

Is there a way using the pdf to text utility to convert all the pdf in a given directory? So instead of one at a time: pdftotext pdftotext hp-manual.pdf hp-manual.txt a directory of 50 pdf files would be converted: pdftotext /home/dnascopev/Desktop/PDF.pdf /home/dnascopev/Desktop/PDF.txt ... (6 Replies)
Discussion started by: cmccabe
6 Replies

8. Shell Programming and Scripting

Text to pdf

Is there a way to covert te attached text file to a pdf? I have tried: enscript-p output.ps article.txt PRE.cjk { font-family: "WenQuanYi Micro Hei",monospace; }PRE.ctl { font-family: "Lohit Hindi",monospace; }P { margin-bottom: 0.08in; } awk '{ A=$2; next} END { for (i in A) print... (7 Replies)
Discussion started by: cmccabe
7 Replies

9. HP-UX

Best text to pdf converter for Hp/UX 11.31

Hi, Can anyone tell me the best converter I can use to convert text to pdf for HP/UX 11.31? Thanks, Linda (0 Replies)
Discussion started by: lnemitz
0 Replies
PODOFOXMP(1)							     podofoxmp							      PODOFOXMP(1)

NAME
podofoxmp - Modify or extract XMP information from a PDF file SYNOPSIS
podofoxmp [inputfile] [xmpfile outputfile] DESCRIPTION
podofoxmp is one of the command line tools from the PoDoFo library that provide several useful operations to work with PDF files. It can extract or modify XMP information in a PDF file. OPTIONS
[inputfile] Input PDF file. This is the only option needed to extract the XMP information from a PDF file. [xmpfile] Optional file that provides PDF XMP structure. It must be used in conjuction with an [outputfile]. [outputfile] Output PDF file. It is only used in conjuction with an [xmpfile]. SEE ALSO
podofobox(1), podofocountpages(1), podofocrop(1), podofoencrypt(1), podofoimg2pdf(1), podofoimgextract(1), podofoimpose(1), podofoincremen- talupdates(1), podofomerge(1), podofopages(1), podofopdfinfo(1), podofotxt2pdf(1), podofotxtextract(1), podofouncompress(1), AUTHORS
PoDoFo is written by Dominik Seichter <domseichter@web.de> and others. This manual page was written by Oleksandr Moskalenko <malex@debian.org> for the Debian Project (but may be used by others). PoDoFo 2010-12-09 PODOFOXMP(1)
All times are GMT -4. The time now is 07:56 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy