Perl script to copy contents of a web page

# 8  
Old 09-08-2009
Originally Posted by pludi
Erm, why? Open the page in any web browser, select File->Print, and off you go.
Or if you cannot print from the machine accessing the Internet but have CUPS available, there's CUPS-PDF
Hi pludi,

So you want me to print each page manually? Is there no way to have a script make a local copy of all the pages as a single document?
# 9  
Old 09-08-2009
Sorry, didn't see that RH went with putting each section on a page of its own. But by clicking around I found a link here that points to a PDF of it.
# 10  
Old 09-08-2009
Originally Posted by pludi
Sorry, didn't see that RH went with putting each section on a page of its own. But by clicking around I found a link here that points to a PDF of it.
OK, that's fine, pludi. But consider sites that don't provide PDFs. What should we do in those cases?
# 11  
Old 09-08-2009
I can't think of an easy way to do this, since HTML doesn't have any definite way of telling where the actual content of a page starts and where it ends. The RH pages, for example, include multiple links at the top and the bottom, which might be easy to filter out here, but the same filter might break on another page.

The hard way would be to use a parser, parse the HTML, filter out anything above or below certain elements (which have to be unique), and write it out again. But this, again, will fail as soon as the beginning and the end aren't unique, or if the page itself isn't HTML but XML with XSLT.
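
As a rough illustration of the parser approach described above, here is a minimal sketch. It is written in Python with only the standard library rather than Perl, and it assumes the real content sits inside a single element with a unique id. The id "content", the sample page and the names ContentExtractor and extract_content are all made up for the example; a real site will need its own marker.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ContentExtractor(HTMLParser):
    """Collect the text inside the first element whose id is content_id."""

    def __init__(self, content_id):
        super().__init__(convert_charrefs=True)
        self.content_id = content_id  # hypothetical marker, differs per site
        self.depth = 0                # 0 means "outside the content element"
        self.done = False             # set once the content element is closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if self.done or tag in VOID_TAGS:
            return
        if self.depth > 0:
            self.depth += 1           # nested element inside the content
        elif dict(attrs).get("id") == self.content_id:
            self.depth = 1            # entering the content element

    def handle_endtag(self, tag):
        if self.depth > 0 and tag not in VOID_TAGS:
            self.depth -= 1
            if self.depth == 0:
                self.done = True      # ignore everything after the content

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)


def extract_content(html, content_id):
    """Return the whitespace-normalised text of the marked element."""
    parser = ContentExtractor(content_id)
    parser.feed(html)
    parser.close()
    return " ".join("".join(parser.chunks).split())


# Made-up page with navigation links above and below the content.
page = """<html><body>
<div class="nav"><a href="/">Home</a> <a href="/next">Next</a></div>
<div id="content">
<h1>Chapter 1</h1>
<p>First paragraph.<br>Second line.</p>
</div>
<div class="footer"><a href="/prev">Previous</a></div>
</body></html>"""

text = extract_content(page, "content")
print(text)
```

To get a single document, the same function could be run over every page and the results joined. The limits are the ones already mentioned: if no element carries the expected id, the result is simply empty, and pages that leave tags such as <p> or <li> unclosed will throw off the depth counting.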

In short: if you encounter such a site, it's always possible that there's a printable version on the site itself, or that the author might have one.