Shell Script to Dynamically Extract file content based on Parameters from a pdf file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Shell Script to Dynamically Extract file content based on Parameters from a pdf file
# 1  
Old 05-02-2013
Shell Script to Dynamically Extract file content based on Parameters from a pdf file

Hi Guru's,
I am new to shell scripting. I have a unique requirement:
The system generates a single pdf(/tmp/ABC.pdf) file with Invoices for Multiple Customers, the format is something like this:
Page1 >> Customer 1 >>Invoice1 + invoice 2 >> Page1 end
Page2 >> Customer 2 >>Invoice 3 + Invoice 4 >> Continue to Page3 >> page3 End

I have to email Individual Statements to these customers individually based on the customer number/Email Address coming in the file.
Using Shell script how can i achieve this?

Thanks in Advance.
Regards,
Dips
# 2  
Old 05-07-2013
Start with something like pdf2txt so shell tools can see the pdf strings.
# 3  
Old 05-08-2013
If you have python on your system you could try the PyPDF2 Library


Assumption is last page of invoice contains some text you can match to like "Total Due:"

Code:
#!/usr/bin/env python
from PyPDF2.pdf import PdfFileReader, PdfFileWriter
import sys

filenum = 1
Pageadded = False

output_pdf = PdfFileWriter()
input_pdf = PdfFileReader(open(sys.argv[1], "rb"))

for i in range(0, input_pdf.getNumPages()):
    Pageadded = True
    output_pdf.addPage(input_pdf.getPage(i))

    if input_pdf.getPage(i).extractText().find("Total Due:") != -1:
        outputStream = file("Cust_" + str(filenum) + ".pdf", "wb")
        output_pdf.write(outputStream)
        filenum = filenum + 1
        output_pdf = PdfFileWriter()
        Pageadded = False
if Pageadded:
    outputStream = file("Cust_" + str(filenum) + ".pdf", "wb")
    output_pdf.write(outputStream)

Note: indentation is a part of the python syntax so ensure you keep the indent levels correct. Call the script like this:

Code:
$ ./split_invoice.py Invoice_file.pdf

# 4  
Old 05-08-2013
See what the structure is after you dump it to text, or have python tell you what it finds. Then we can figure how to reprocess that.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Parameterizing to dynamically generate the extract file from Oracle table using Shell Script

I have below 2 requirements for parameterize the generate the extract file from Oracle table using Shell Script. Could you please help me by modifying the script and show me how to execute it. First Requirement: I have a requirement where I need to parameterize to generate one... (0 Replies)
Discussion started by: hareshvikram
0 Replies

2. Shell Programming and Scripting

Shell script to create runtime variables based on the number of parameters passed in the script

Hi All, I have a script which intends to create as many variables at runtime, as the number of parameters passed to it. The script needs to save these parameter values in the variables created and print them abc.sh ---------- export Numbr_Parms=$# export a=1 while do export... (3 Replies)
Discussion started by: dev.devil.1983
3 Replies

3. Shell Programming and Scripting

PDF Script to extract PDF Links MOD in Need

In here we have a script to extract all pdf links from a single page.. any idea's in how make this read instead of a page a list of pages.. and extract all pdf links ? #!/bin/bash # NAME: pdflinkextractor # AUTHOR: Glutanimate (http://askubuntu.com/users/81372/), 2013 #... (1 Reply)
Discussion started by: danielldf
1 Replies

4. UNIX for Dummies Questions & Answers

Shell script to extract data from csv file based on certain conditions

Hi Guys, I am new to shell script.I need your help to write a shell script. I need to write a shell script to extract data from a .csv file where columns are ',' separated. The file has 5 columns having values say column 1,column 2.....column 5 as below along with their valuesm.... (1 Reply)
Discussion started by: Vivekit82
1 Replies

5. Shell Programming and Scripting

Convert excel file to PDF file using shell script

Hi All, Is it possible to convert the excel file to PDF file(Without loosing any format) using unix shell scripting ??? If yes Kindly help me on the code Thanks in advance!!! (5 Replies)
Discussion started by: Balasankar
5 Replies

6. Shell Programming and Scripting

Shell script to monitor new file in a directory and mail the file content

Hi I am looking for a help in designing a bash script on linux which can do below:- 1) Look in a specific directory for any new files 2) Mail the content of the new file Appreciate any help Regards Neha (5 Replies)
Discussion started by: neha0785
5 Replies

7. Shell Programming and Scripting

HELP: Shell Script to read a Log file line by line and extract Info based on KEYWORDS matching

I have a LOG file which looks like this Import started at: Mon Jul 23 02:13:01 EDT 2012 Initialization completed in 2.146 seconds. -------------------------------------------------------------------------------- -- Import summary for Import item: PolicyInformation... (8 Replies)
Discussion started by: biztank
8 Replies

8. Shell Programming and Scripting

Create shell script to extract unique information from one file to a new file.

Hi to all, I got this content/pattern from file http.log.20110808.gz mail1 httpd: Account Notice: close igchung@abc.com 2011/8/7 7:37:36 0:00:03 0 0 1 mail1 httpd: Account Information: login sastria9@abc.com proxy sid=gFp4DLm5HnU mail1 httpd: Account Notice: close sastria9@abc.com... (16 Replies)
Discussion started by: Mr_47
16 Replies

9. Shell Programming and Scripting

Calling sql file from shell script with parameters.

Hi, I am calling a sql file script.sql from shell script and passing few parameters also as shown below: sqlplus -S id/password @script.sql $param1 $param2 Now,In sql file I have to create a extract text file after querying oracle tables based on the parameters passed(param1,param2) as... (7 Replies)
Discussion started by: anil029
7 Replies

10. Shell Programming and Scripting

Delete content of file 1 in file 2 with shell script

OK, best is I explain what the operating enviroment is. Linux, but Motomagx. It is a Linux operated mobile phone, Motorola V8. I am writting a shell script, but got stuck. I have to delete the complete content of file 1 in file 2. I have attached the 2 files. You can see that the content of... (2 Replies)
Discussion started by: rasputin007
2 Replies
Login or Register to Ask a Question