Sponsored Content
Full Discussion: Word doc to pdf conversion?
Special Forums UNIX and Linux Applications Word doc to pdf conversion? Post 302608595 by garzon on Sunday 18th of March 2012 02:46:55 AM
Old 03-18-2012
Word doc to pdf conversion?

I am trying to find a way to convert a Word doc saved in the XML format to a PDF file (that looks like the original Word doc). Yes, there are all kinds of ways to do this interactively. What I need is something that can be run as a background process so that when XML files are placed in a certain directory a cron job can periodically convert them to a PDF file. I would assume there are some Windows libraries that have something like this, but I am trying to do this on UNIX. Without 1,000 hours of programming.
 

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

converting PDF to text, rtf doc format

Hi all Is there any program which can convert PDF to word processor file ? If the PDF has smart quotes, bullet icons, copyright and trademark symbols, etc. what happens to them intext format? So ideally would like to conver into rtf or doc. Thanks SS (1 Reply)
Discussion started by: saurya_s
1 Replies

2. AIX

PDF to postscript conversion poroblems

Hi Gurus, We are currently using a JAVA program that enables us to use some forms and convert them from PDF to print them postscript. It works great except that everytime it runs JAVA sucks 100% of the CPU. What can I do to reduce the impact? I am looking at: 1. Nice the java... (1 Reply)
Discussion started by: Luck
1 Replies

3. Windows & DOS: Issues & Discussions

convert pdf's to word

Does anyone know any good tools to convert a pdf to word, I can usually cut & paste without a hitch but I have a pdf that doesn't like that and surrounds text with lines as though a table or prints tables in miniscule text. It's only one pdf file so I'm reluctant to buy a product. (3 Replies)
Discussion started by: gefa
3 Replies

4. HP-UX

pdftotext / PDF conversion to .txt binaries

Good day, I've been trying to look for a way to compile the Xpdf sources in our HP-UX server, but have been failing to do so because there is no GCC installed, and I don't have privileges to install GCC. I was looking for a functionality to convert PDF files to .txt, which is exactly like the... (2 Replies)
Discussion started by: mike_s_6
2 Replies

5. Shell Programming and Scripting

Pdf to text conversion and vice versa

Hi, I have a pdf file. i want to convert it to text file and do some work on it and later want to convert it back to pdf. Can this be done via unix? or Is there a way unix can directly work on PDF file? (2 Replies)
Discussion started by: saltysumi
2 Replies

6. Shell Programming and Scripting

conversion from ps to pdf

My folder contains many files with .ps extensions. How to convert all of these to .pdf at a time using a shell script /command. Help me. (1 Reply)
Discussion started by: hravisankar
1 Replies

7. Shell Programming and Scripting

Creating word document (.doc) with attachment in unix solaris

Hi All, Is it possible to creation a word document (.doc) in unix solaris which includes an attachment (i.e similar to insert -> object add attachment in windows) Requirement is to add files into .doc and the files is present in unix servers. Is it possible to do this within unix instead... (5 Replies)
Discussion started by: ajay547
5 Replies

8. UNIX for Advanced & Expert Users

PDF to Text Conversion

Hi Guys, My OS is Suse Linux.. Is there a Command to convert PDF file to Text?? Cheers!!!!! (2 Replies)
Discussion started by: mac4rfree
2 Replies

9. Programming

Is it possible to change search inside .pdf or .doc files?

the titele was wrong ... the true one is: Is it possible to search words inside .pdf or .doc files? is it possible if i changed the word into binary combination:eek:? and this way is super too hyper huge of greatest codes i ever seen:D to read only 1 word so is there any other ways:confused:? ... (1 Reply)
Discussion started by: fwrlfo
1 Replies

10. UNIX for Beginners Questions & Answers

Print word doc using lpr command

Hi , We are trying to automate Printing the word document directly from the Server level to a Printer. We have printer registered and command I use is lp -d printername 1234.doc Even though the program completes fine, a print does not reach to the printer for word document. In... (4 Replies)
Discussion started by: PrashantTalele
4 Replies
PDFTOHTML(1)						      General Commands Manual						      PDFTOHTML(1)

NAME
       pdftohtml - program to convert PDF files into HTML, XML and PNG images

SYNOPSIS
       pdftohtml [options] <PDF-file> [<HTML-file> <XML-file>]

DESCRIPTION
       This  manual  page documents briefly the pdftohtml command.  This manual page was written for the Debian GNU/Linux distribution because the
       original program does not have a manual page.

       pdftohtml is a program that converts PDF documents into HTML. It generates its output in the current working directory.

OPTIONS
       A summary of options are included below.

       -h, -help
	      Show summary of options.

       -f <int>
	      first page to print

       -l <int>
	      last page to print

       -q     do not print any messages or errors

       -v     print copyright and version info

       -p     exchange .pdf links with .html

       -c     generate complex output

       -s     generate single HTML that includes all pages

       -i     ignore images

       -noframes
	      generate no frames. Not supported in complex output mode.

       -stdout
	      use standard output

       -zoom <fp>
	      zoom the PDF document (default 1.5)

       -xml   output for XML post-processing

       -enc <string>
	      output text encoding name

       -opw <string>
	      owner password (for encrypted files)

       -upw <string>
	      user password (for encrypted files)

       -hidden
	      force hidden text extraction

       -dev   output device name for Ghostscript (png16m, jpeg etc).  Unless this option is specified, Splash will be used

       -fmt   image file format for Splash output (png or jpg).  If complex is selected, but neither -fmt or -dev are specified, -fmt png will	be
	      assumed

       -nomerge
	      do not merge paragraphs

       -nodrm override document DRM settings

AUTHOR
       Pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is based and benefits a lot from Derek Noonburg's xpdf package.

       This manual page was written by Soren Boll Overgaard <boll@debian.org>, for the Debian GNU/Linux system (but may be used by others).

SEE ALSO
       pdffonts(1), pdfimages(1), pdfinfo(1), pdftocairo(1), pdftoppm(1), pdftops(1), pdftotext(1)

																      PDFTOHTML(1)
All times are GMT -4. The time now is 08:36 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy