There is no easy way to do what you want to do using wget. Looking at the source for that page would have shown you what is going on.
For example, consider the document entitled "Drought-tolerant plant growth promoting Bacillus ... ". The corresponding PDF file is "930332435.pdf" To retrieve that document you would have to parse this HTML code
to the extract the content tag, i.e. a930332435, and build a new URL which wget could then use to retrieve the document.
I've been having problems downloading Red Hat 7.2 from their FTP site. It downloads rather slowly(between 2-3k/sec, I'm on broadband) and after about 10 minutes stops downloading altogether. Am I doing something wrong? (2 Replies)
Hi,
I builded the linux kernel 2.6 with the following tool chain
binutils:2.16
gcc:3.4.4
glibc:2.3.5
kernel:2.6.10
and applied the corresponding patches to it.I got the kernel Image.I downloaded the Image on to the AT91RM9200 board.But when i am booting the image it is showing the... (1 Reply)
Hi All,
I have a requirement of dowloading the dynamic form content displayed in a webpage as a pdf file. The form content is not too complex but intermediate - it has textboxes, images, textarea, radiobuttons,dropdowns etc.
Can anyone suggest how i can achieve this? Your... (0 Replies)
Hello,
I am getting a HTTP error while downloading solaris patches using wget.
'Downloading unsigned patch 113096-03.
--2010-06-18 03:51:15-- http://sunsolve.sun.com/pdownload.pl?target=113096-03&method=h
Resolving sunsolve.sun.com (sunsolve.sun.com)... 192.18.108.40
Connecting to... (5 Replies)
Hi there,
I've got my own domain, ftp etc.. I'm using cPanel and I want to download a file periodically, every say 24 hours.
I've used this command:
wget -t inf http : / / www . somesite . com / webcam.jpg
ftp : / / i @ MyDomain . net : Password @ ftp . MyDomain . net^no spaces... (24 Replies)
Hello everyone. I'm new both to the forum and to unix scripting, and this website has been very useful in putting together a script I am working on. However, I have run into a bit of a snag, which is why I have come here seeking help. First I will say what I am trying to do, and then what I have... (2 Replies)
Hi,
I would like to download a file from a https website. I don't have the file name as it changes every day.
I am using the following command:
wget --no-check-certificate -r -np --user=ABC --password=DEF -O temp.txt https://<website/directory>
I am getting followin error in my... (9 Replies)
wget -i genedx.txt
The code above will download multiple pdf files from a site, but how can i download and convert these to .txt?
I have attached the master list (genedx.txt - which contains the url and file names)
as well as the two PDF's that are downloaded. I am trying to have those... (7 Replies)
I need a hint for using wget for getting a free content from a TV station that is streaming its material for a while until it appears on any video platform, that means no use of illegal methods, because it is on air, recently published and available. But reading the manual for wget I tried the... (5 Replies)
Discussion started by: 1in10
5 Replies
LEARN ABOUT SUSE
pdf2dsc
PDF2DSC(1) Ghostscript Tools PDF2DSC(1)NAME
pdf2dsc - generate a PostScript page list of a PDF document
SYNOPSIS
pdf2dsc input.pdf [ output.dsc ]
DESCRIPTION
pdf2dsc uses gs(1) to read an Adobe Portable Document Format (PDF) document "input.pdf" and create a PostScript(tm) document "output.dsc"
that conforms to Adobe's Document Structuring Conventions (DSC) requirements.
This new document simply tells Ghostscript to read the PDF file and to display pages one at a time. The generated document can then be
viewed with any PostScript viewer based on Ghostscript, like ghostview(1) on Unix or GSview on Windows, with which the user can browse
through the pages of the PDF document in any order.
If no output file is named on the command line, the name of the output file is that of the input file with any extension removed, followed
by the extension ".dsc".
CAVEATS
The DSC document uses Ghostscript-specific procedures. In addition, the original PDF document must be accessible when the DSC document is
processed.
You need the file "pdf2dsc.ps" (originally by Russell Lang) supplied with Ghostscript since release 3.53.
SEE ALSO gs(1), ghostview(1)VERSION
This document was last revised for Ghostscript version 8.70.
AUTHOR
Yves Arrouye <yves.arrouye@usa.net> and Russell Lang gsview at ghostgum.com.au
8.70 31 July 2009 PDF2DSC(1)