Sponsored Content
Full Discussion: [python] merge pdf files.
Top Forums Programming [python] merge pdf files. Post 303039514 by SaltCityScripts on Tuesday 8th of October 2019 04:47:23 PM
Old 10-08-2019
[python] merge pdf files.

First off I am very new to python but not to scripting I have done a lot of bash scripting.

I need to create a python script for work that will combine multiple pdf files into one pdf file and archive both the combined file and the original pdf files.

So we receive zip files from a client (the file name will either begin with a number #########.zip or MU3-#######-#.zip) and I need to unzip the zip file and use the file name as the name of the combined file.

example
MU3-6493489-1.zip

when I unzip this file I have the following pdf files.:
MU3-6493489-1_006493489-001_ARINV.pdf
MU3-6493489-1_3461.pdf
MU3-6493489-1_7501.pdf
MU3-6493489-1_CI_2.pdf
MU3-6493489-1_CI_3.pdf
MU3-6493489-1_CI_4.pdf
MU3-6493489-1_CI_5.pdf
MU3-6493489-1_CI.pdf

I need to combine all of these pdf file into a new file called MU3-6493489-1_combined.pdf

I did find a script online that has the basic stuff but it will not work for my needs.

Code:
#pdf_merger.py

import glob
from PyPDF2 import PdfFileMerger

def merger(output_path, imput_paths):
    pdf_merger = PdfFileMerger()
    file_handles = []

    for path in input_paths:
        pdf_merger.append(path)

    with open(ouput_path, 'wb') as fileobj:
        pdf_merger.write(fileobj)

if _name_ == '_main_':
    paths = glob.glob('MU3_*.pdf')
    paths.sort()
    merger('pdf_merger2.pdf', paths)


This is not my coding and I am not attached to it in anyway. I know I will have to make a lot of changes to get that to work.

If anyone has any thoughts on how to do this I would be forever thankful and so would my work.

--- Post updated at 08:47 PM ---

on a side note: this can be in either perl or python. I just thought python would be better to learn.

I was not able to edit my original post.
 

2 More Discussions You Might Find Interesting

1. Programming

Scipy Python/ get pdf

Hi all I'm just starting usign scipy with python, really powerfull. It comes with some many functions but I'm not able to solve my problem. I have an array x and i need to find a probability density function (pdf) from data. does anyone know how to do that? (i've done it evaluating first... (1 Reply)
Discussion started by: Dedalus
1 Replies

2. Shell Programming and Scripting

Merge PDF

Hi I am trying to find a software to merge several pdfs into one pdf on our solaris machine. After merging the document, I have to direct it to a printer. Does anyone have a recommendation? I try to contact a couple of vendor but no replies. Thanks. (5 Replies)
Discussion started by: lillyt2006
5 Replies
PS_ADD_PDFLINK(3)							 1							 PS_ADD_PDFLINK(3)

ps_add_pdflink - Adds link to a page in a second pdf document

SYNOPSIS
bool ps_add_pdflink (resource $psdoc, float $llx, float $lly, float $urx, float $ury, string $filename, int $page, string $dest) DESCRIPTION
Places a hyperlink at the given position pointing to a second pdf document. Clicking on the link will branch to the document at the given page. The first page in a document has number 1. The hyperlink's source position is a rectangle with its lower left corner at ($llx, $lly) and its upper right corner at ($urx, $ury). The rectangle has by default a thin blue border. The note will not be visible if the document is printed or viewed but it will show up if the document is converted to pdf by either Acrobat Distillertm or Ghostview. PARAMETERS
o $psdoc - Resource identifier of the postscript file as returned by ps_new(3). o $llx - The x-coordinate of the lower left corner. o $lly - The y-coordinate of the lower left corner. o $urx - The x-coordinate of the upper right corner. o $ury - The y-coordinate of the upper right corner. o $filename - The name of the pdf document to be opened when clicking on this link. o $page - The page number of the destination pdf document o $dest - The parameter $dest determines how the document is being viewed. It can be fitpage, fitwidth, fitheight, or fitbbox. RETURN VALUES
Returns TRUE on success or FALSE on failure. SEE ALSO
ps_add_launchlink(3), ps_add_locallink(3), ps_add_weblink(3). PHP Documentation Group PS_ADD_PDFLINK(3)
All times are GMT -4. The time now is 04:50 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy