11-20-2019
Limitations of 'pdftotext' in Linux...
Guys: I have a customer using the 'pdftotext' utility under Linux. PDFs are received via email, converted to text, etc. and it has worked nicely for years. They received a PDF from a customer and the utility will not read it. The text file is created but it's either empty or has 1-2 bytes of garbage in it. Acrobat renders the document correctly. I did a FILE -> SAVE AS TEXT inside of Acrobat and the same thing happened... an empty file. I tried it on another PDF and it worked. So why would pdftotext have an issue with a certain PDF? Could it be encrypted? Most of these files are 17 to 20kb in size. The one that will not process is a whopping 426kb. Is there a size limitation? I have used pdftotext but I have only 'used it'... I am not an expert on all its abilities. Thanks for reading and thanks for the help.
EDIT: I can upload the PDF here if anyone is interested. Thx.
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
What characters can't be used with a mkdir? Any limits on length of name?
Thank you,
Randy M. Zeitman
http://www.StoneRoseDesign.com (12 Replies)
Discussion started by: flignar
12 Replies
2. UNIX for Dummies Questions & Answers
I am trying to use the csplit file on a file that contains records that have more than 2048 characters on a line. The resultant split file seems to ignore the rest of the line and I lose the data.
Is there any way that csplit can handle record lengths greater than 2048?
Thanks (0 Replies)
Discussion started by: ravagga
0 Replies
3. HP-UX
Good day,
I've been trying to look for a way to compile the Xpdf sources in our HP-UX server, but have been failing to do so because there is no GCC installed, and I don't have privileges to install GCC. I was looking for a functionality to convert PDF files to .txt, which is exactly like the... (2 Replies)
Discussion started by: mike_s_6
2 Replies
4. UNIX and Linux Applications
I'm running a simulation (programmed in C) which makes calls to gnuplot periodically to plot data I have stored.
First I open a pipe to gnuplot and set it to multiplot:
FILE * pipe = popen("gnuplot", "w");
fprintf(pipe, "set multiplot\n");
fflush(pipe);
(this pipe stays open until the... (0 Replies)
Discussion started by: sedavidw
0 Replies
5. Red Hat
Hi,
I need a documentation about limitations on the linux partition. On how many primary and extended I could create. And also on different type of storage, how many big capacity I can create.
Thanks. (3 Replies)
Discussion started by: itik
3 Replies
6. UNIX for Dummies Questions & Answers
Hi,
I have used pdftotext with good results in the past, but today for some reason I keep getting the same error message.
My command is as follows:
And the error message is
I am using Vmware player with Ubuntu server, but I don't think that is causing this issue as I have been using... (2 Replies)
Discussion started by: Joq
2 Replies
7. Red Hat
Hi,
I have noticed some performance issues on my RHEL5 server but the memory and CPU utilization on the box is fine.
I have a 1G full duplexed eth0 card and I am suspicious that this may be causing the problem. My eth0 settings are as follows:
Settings for eth0:
Supported ports: ... (12 Replies)
Discussion started by: Duffs22
12 Replies
8. Solaris
Hi,
I recently started working with Solaris, and what I noticed is that a lot of commands I used to regularly use don't work, like sed -i and grep -r. I have found work arounds for these problems though but it's a pain in the ass.
I'm just wondering why they decided not to include these handy... (4 Replies)
Discussion started by: Subbeh
4 Replies
9. Linux
In recently reading an article on linux basics before I embark and my personal installation project I came across this passage -
IDE drives have three types of partition: primary, logical, and extended. The partition table is located in the master boot record (MBR) of a disk. The MBR is the... (12 Replies)
Discussion started by: Synchlavier
12 Replies
10. UNIX for Dummies Questions & Answers
I have a directory having a number of pdf files.
I want to convert all the files to text, stored in a single text file
The following creates multiple text files
ls *.pdf | xargs -n1 pdftotext (1 Reply)
Discussion started by: kristinu
1 Replies