Sponsored Content
Top Forums Shell Programming and Scripting Scanning a pdf file in Linux shell Post 302954948 by drl on Sunday 13th of September 2015 02:52:03 PM
Old 09-13-2015
Hi.

In reviewing this, I'm wondering if the OP was interested in the PDF meta-information. The fist and last lines of one rendition of a PFD looks like:
Code:
%PDF-1.1
1 0 obj
<<
/CreationDate (D:20150913125458)
/Producer (text2pdf v1.1 (\251 Phil Smith, 1996))
/Title (pdfgrep.txt)
   ---
/Root 2 0 R
/Info 1 0 R
>>
startxref
6452
%%EOF

In which case, a simple grep would probably suffice:
Code:
$ egrep 'Producer|Title' pdf-from-text2pdf.pdf
/Producer (text2pdf v1.1 (\251 Phil Smith, 1996))
/Title (pdfgrep.txt)

as has been posted by several responders here. I don't know enough about PDFs to say that Producer is/might be the same as Author. However, some PDFs seem to have binary data, so grep might not work as desired on those.

Best wishes ... cheers, drl
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

scanning for '0' value in .txt file

Hello I am a novice shell scripting programmer, so please bare with me. I have embedded a simple SQL statement into a shell script, which simply returns an integer (its a count (*) statement). The result of the statement is then oputput to .txt file. So, the number could be 0, 1,2, 10,... (4 Replies)
Discussion started by: man80
4 Replies

2. UNIX for Advanced & Expert Users

Scanning file backwards

Is there any way to look for a directory path that is listed any number of lines *before* a keyword in an error message? I have a script that is trying to process different files that are always down a certain portion of a path, and if there is an error, then says there is an error, contact... (2 Replies)
Discussion started by: tekster757
2 Replies

3. Programming

Linux C - how to open a pdf file with default reader

sorry if i repost this... hi.. i want to ask how to open pdf files using C in Linux in Windows, i just use this code: ShellExecute(GetDesktopWindow(), "open", "D:\\Folder\\File.pdf", NULL, NULL, SW_SHOWNORMAL); thanks for advance... (3 Replies)
Discussion started by: sunardo
3 Replies

4. UNIX for Dummies Questions & Answers

scanning the file for a particular column

I have a file containing 4 columns. need to scan that file, if all the rows in the column4 have a value ZERO, it should print "everything is fine". And if all are not ZERO , at the first encounter of non ZERO value of 4th column it should print "some problem " may be a silly question, but at... (11 Replies)
Discussion started by: gotam
11 Replies

5. Red Hat

Setting Password For PDF File--Linux

Hi, I am in need of help. My requirements are : 1) To convert the existing files (irrespective of their format) in a directory to PDF format 2) To make the converted files password protected. I did the attempt to do the same. Though the existing files (irrespective of their format) are... (1 Reply)
Discussion started by: MKR
1 Replies

6. Shell Programming and Scripting

Shell Script to Dynamically Extract file content based on Parameters from a pdf file

Hi Guru's, I am new to shell scripting. I have a unique requirement: The system generates a single pdf(/tmp/ABC.pdf) file with Invoices for Multiple Customers, the format is something like this: Page1 >> Customer 1 >>Invoice1 + invoice 2 >> Page1 end Page2 >> Customer 2 >>Invoice 3 + Invoice 4... (3 Replies)
Discussion started by: DIps
3 Replies

7. Shell Programming and Scripting

Convert excel file to PDF file using shell script

Hi All, Is it possible to convert the excel file to PDF file(Without loosing any format) using unix shell scripting ??? If yes Kindly help me on the code Thanks in advance!!! (5 Replies)
Discussion started by: Balasankar
5 Replies

8. Shell Programming and Scripting

Reg scanning time based log file

Hi, I have a requirement to scan Oracle's alert log file. This file logs all event for Oracle database and each line will have timestamp followed by messages (which might be one or more lines). Example. Thu Aug 15 17:35:59 2013 VKTM detected a time drift. Please check trace file for more... (1 Reply)
Discussion started by: manickaraja
1 Replies

9. Shell Programming and Scripting

Retrieving a paragraph from a pdf file using shell commands

In the reference section of a research paper(in pdf form), many other paper names are cited which have been used inside the pdf at different places. If I give an input, the name of a paper which has been cited in the reference section and want to display the section (the paragraph) inside the pdf... (1 Reply)
Discussion started by: SK33
1 Replies

10. Shell Programming and Scripting

Create a text file and a pdf file from Linux command results.

Hello. The task : Using multiple commands like : gdisk -l $SOME_DISK >> $SOME_FILEI generate some text file. For readiness I must insert page break. When the program is finished I want to convert the final text file to a pdf file. When finished, I got two files : One text file and One pdf... (1 Reply)
Discussion started by: jcdole
1 Replies
PODOFOINCREMENTALUPDATES(1)				     podofoincrementalupdates				       PODOFOINCREMENTALUPDATES(1)

NAME
podofoincrementalupdates - Provides information about incremental updates in PDF files SYNOPSIS
podofoincrementalupdates [-e N out.pdf] file.pdf DESCRIPTION
podofoincrementalupdates is one of the command line tools from the PoDoFo library that provide several useful operations to work with PDF files. It can print information of incremental updates to file.pdf. By default the number of incremental updates will be printed. OPTIONS
-e N Extract the Nth update out.pdf Output PDF file. file.pdf Input PDF file. SEE ALSO
podofobox(1), podofocountpages(1), podofocrop(1), podofoencrypt(1), podofoimg2pdf(1), podofoimgextract(1), podofoimpose(1), podofomerge(1), podofopages(1), podofopdfinfo(1), podofotxt2pdf(1), podofotxtextract(1), podofouncompress(1), podofoxmp(1) AUTHORS
PoDoFo is written by Dominik Seichter <domseichter@web.de> and others. This manual page was written by Oleksandr Moskalenko <malex@debian.org> for the Debian Project (but may be used by others). PoDoFo 2010-12-09 PODOFOINCREMENTALUPDATES(1)
All times are GMT -4. The time now is 07:02 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy