Sponsored Content
Top Forums Shell Programming and Scripting Shell Script to Dynamically Extract file content based on Parameters from a pdf file Post 302804017 by Chubler_XL on Wednesday 8th of May 2013 12:38:41 AM
Old 05-08-2013
If you have python on your system you could try the PyPDF2 Library


Assumption is last page of invoice contains some text you can match to like "Total Due:"

Code:
#!/usr/bin/env python
from PyPDF2.pdf import PdfFileReader, PdfFileWriter
import sys

filenum = 1
Pageadded = False

output_pdf = PdfFileWriter()
input_pdf = PdfFileReader(open(sys.argv[1], "rb"))

for i in range(0, input_pdf.getNumPages()):
    Pageadded = True
    output_pdf.addPage(input_pdf.getPage(i))

    if input_pdf.getPage(i).extractText().find("Total Due:") != -1:
        outputStream = file("Cust_" + str(filenum) + ".pdf", "wb")
        output_pdf.write(outputStream)
        filenum = filenum + 1
        output_pdf = PdfFileWriter()
        Pageadded = False
if Pageadded:
    outputStream = file("Cust_" + str(filenum) + ".pdf", "wb")
    output_pdf.write(outputStream)

Note: indentation is a part of the python syntax so ensure you keep the indent levels correct. Call the script like this:

Code:
$ ./split_invoice.py Invoice_file.pdf

 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Delete content of file 1 in file 2 with shell script

OK, best is I explain what the operating enviroment is. Linux, but Motomagx. It is a Linux operated mobile phone, Motorola V8. I am writting a shell script, but got stuck. I have to delete the complete content of file 1 in file 2. I have attached the 2 files. You can see that the content of... (2 Replies)
Discussion started by: rasputin007
2 Replies

2. Shell Programming and Scripting

Calling sql file from shell script with parameters.

Hi, I am calling a sql file script.sql from shell script and passing few parameters also as shown below: sqlplus -S id/password @script.sql $param1 $param2 Now,In sql file I have to create a extract text file after querying oracle tables based on the parameters passed(param1,param2) as... (7 Replies)
Discussion started by: anil029
7 Replies

3. Shell Programming and Scripting

Create shell script to extract unique information from one file to a new file.

Hi to all, I got this content/pattern from file http.log.20110808.gz mail1 httpd: Account Notice: close igchung@abc.com 2011/8/7 7:37:36 0:00:03 0 0 1 mail1 httpd: Account Information: login sastria9@abc.com proxy sid=gFp4DLm5HnU mail1 httpd: Account Notice: close sastria9@abc.com... (16 Replies)
Discussion started by: Mr_47
16 Replies

4. Shell Programming and Scripting

HELP: Shell Script to read a Log file line by line and extract Info based on KEYWORDS matching

I have a LOG file which looks like this Import started at: Mon Jul 23 02:13:01 EDT 2012 Initialization completed in 2.146 seconds. -------------------------------------------------------------------------------- -- Import summary for Import item: PolicyInformation... (8 Replies)
Discussion started by: biztank
8 Replies

5. Shell Programming and Scripting

Shell script to monitor new file in a directory and mail the file content

Hi I am looking for a help in designing a bash script on linux which can do below:- 1) Look in a specific directory for any new files 2) Mail the content of the new file Appreciate any help Regards Neha (5 Replies)
Discussion started by: neha0785
5 Replies

6. Shell Programming and Scripting

Convert excel file to PDF file using shell script

Hi All, Is it possible to convert the excel file to PDF file(Without loosing any format) using unix shell scripting ??? If yes Kindly help me on the code Thanks in advance!!! (5 Replies)
Discussion started by: Balasankar
5 Replies

7. UNIX for Dummies Questions & Answers

Shell script to extract data from csv file based on certain conditions

Hi Guys, I am new to shell script.I need your help to write a shell script. I need to write a shell script to extract data from a .csv file where columns are ',' separated. The file has 5 columns having values say column 1,column 2.....column 5 as below along with their valuesm.... (1 Reply)
Discussion started by: Vivekit82
1 Replies

8. Shell Programming and Scripting

PDF Script to extract PDF Links MOD in Need

In here we have a script to extract all pdf links from a single page.. any idea's in how make this read instead of a page a list of pages.. and extract all pdf links ? #!/bin/bash # NAME: pdflinkextractor # AUTHOR: Glutanimate (http://askubuntu.com/users/81372/), 2013 #... (1 Reply)
Discussion started by: danielldf
1 Replies

9. Shell Programming and Scripting

Shell script to create runtime variables based on the number of parameters passed in the script

Hi All, I have a script which intends to create as many variables at runtime, as the number of parameters passed to it. The script needs to save these parameter values in the variables created and print them abc.sh ---------- export Numbr_Parms=$# export a=1 while do export... (3 Replies)
Discussion started by: dev.devil.1983
3 Replies

10. Shell Programming and Scripting

Parameterizing to dynamically generate the extract file from Oracle table using Shell Script

I have below 2 requirements for parameterize the generate the extract file from Oracle table using Shell Script. Could you please help me by modifying the script and show me how to execute it. First Requirement: I have a requirement where I need to parameterize to generate one... (0 Replies)
Discussion started by: hareshvikram
0 Replies
XFIG-PDF-VIEWER(1)						Debian Users Manual						XFIG-PDF-VIEWER(1)

NAME
xfig-pdf-viewer - view a PDF document using a PDF browser under X11 SYNOPSIS
xfig-pdf-viewer file.pdf DESCRIPTION
xfig-pdf-viewer is a little shell script, which tries to find out which PDF viewers you have installed on your system and then starts them. xfig-pdf-viewer tries the following PDF viewers with descending priority: - xpdf(1) - kpdf(1) - evince(1) - acroread(1) - gpdf(1) - gv(1) - gnome-gv(1) - kghostview(1) - ghostview(1) If the environment variable PDFVIEWER is set, this is used with highest priority. ENVIRONMENT
PDFVIEWER you can define your favorite browser with this variable, it overrides the priority of the above mentioned viewers. AUTHOR
Roland Rosenfeld <roland@spinnaker.de> SEE ALSO
xpdf(1), kpdf(1), evince(1), acroread(1), gpdf(1), gv(1), gnome-gv(1), kghostview(1), ghostview(1) Debian Project JULY 2006 XFIG-PDF-VIEWER(1)
All times are GMT -4. The time now is 11:54 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy