01-02-2004
Quote:
Originally posted by fundidor
Unfortunately my Unix does not include the wget command.
You can download wget from the net and install it:
http://wget.sunsite.dk/index.html
Quote:
GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP, the most widely-used Internet protocols. It is a non-interactive command-line tool, so it may easily be called from scripts, cron jobs, terminals without X support, etc.
Wget has many features to make retrieving large files or mirroring entire web or FTP sites easy, including:
* Can resume aborted downloads, using REST and RANGE
* Can use filename wild cards and recursively mirror directories
* NLS-based message files for many different languages
* Optionally converts absolute links in downloaded documents to relative, so that downloaded documents may link to each other locally
* Runs on most UNIX-like operating systems as well as Microsoft Windows
* Supports HTTP and SOCKS proxies
* Supports HTTP cookies
* Supports persistent HTTP connections
* Unattended / background operation
* Uses local file timestamps to determine whether documents need to be re-downloaded when mirroring
GNU Wget is distributed under the GNU General Public License.
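Once wget is installed, a few of the features listed above can be tried roughly like this. This is only a sketch: the example.com URLs and the function names are placeholders made up for illustration, and the script defaults to a dry run that prints each command instead of executing it.

```shell
#!/bin/sh
# Sketch only: example.com URLs are placeholders, not real download targets.
# With DRY_RUN=1 (the default here) each command is printed, not executed.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Resume an aborted download (-c, --continue).
resume_download() {
    run wget -c "$1"
}

# Mirror a site for local browsing:
#   -m   mirror (recursion plus timestamp checks)
#   -k   convert links so the local copy links to itself
#   -np  do not ascend to the parent directory
mirror_site() {
    run wget -m -k -np "$1"
}

# Fetch unattended in the background, logging to a file (-b, -o).
fetch_in_background() {
    run wget -b -o "$2" "$1"
}

resume_download http://example.com/big-file.iso
mirror_site http://example.com/docs/
fetch_in_background http://example.com/big-file.iso wget.log
```

Run it with DRY_RUN=0 to actually execute the commands after replacing the placeholder URLs with real ones.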
LEARN ABOUT DEBIAN
apt-mirror
APT-MIRROR(1) User Contributed Perl Documentation APT-MIRROR(1)
NAME
apt-mirror - apt sources mirroring tool
SYNOPSIS
apt-mirror [configfile]
DESCRIPTION
A small and efficient tool that lets you mirror a part of or the whole Debian GNU/Linux distribution or any other apt sources.
Main features:
* It uses a config similar to apt's sources.list
* It is fully pool compliant
* It supports multithreaded downloading
* It supports multiple architectures at the same time
* It can automatically remove unneeded files
* It works well over an overloaded channel to the internet
* It never produces an inconsistent mirror, even while mirroring is in progress
* It works on all POSIX-compliant systems with Perl and wget
COMMENTS
apt-mirror uses /etc/apt/mirror.list as a configuration file. By default it is tuned to official Debian or Ubuntu mirrors. Change it to suit your needs.
After you set up the configuration file, you may run as root:
# su - apt-mirror -c apt-mirror
Or uncomment the line in /etc/cron.d/apt-mirror to enable daily mirror updates.
FILES
/etc/apt/mirror.list
Main configuration file
/etc/cron.d/apt-mirror
Cron configuration template
/var/spool/apt-mirror/mirror
The mirror is placed here
/var/spool/apt-mirror/skel
Place for temporarily downloaded indexes
/var/spool/apt-mirror/var
Log files are placed here, along with URLs and MD5 sums.
CONFIGURATION EXAMPLES
The mirror.list configuration supports many options; the file is well commented, explaining each option. Here are some sample mirror configuration lines showing the various supported forms:
Normal:
deb http://example.com/debian stable main contrib non-free
Arch Specific (many other architectures are supported):
deb-powerpc http://example.com/debian stable main contrib non-free
HTTP and FTP Auth or non-standard port:
deb http://user:pass@example.com:8080/debian stable main contrib non-free
Source Mirroring:
deb-src http://example.com/debian stable main contrib non-free
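Put together, a small mirror.list might look like the sketch below, which writes a sample file to a scratch directory rather than touching /etc/apt. The host example.com is the placeholder used above. The "set base_path", "set nthreads" and "clean" directives are not described in this man page; they are assumptions based on the commented default file, so check them against the mirror.list shipped with your package.

```shell
#!/bin/sh
# Sketch only: writes a sample mirror.list to a scratch directory, not /etc/apt.
# The "set" and "clean" lines are assumptions taken from the stock config file.
conf_dir=$(mktemp -d)
conf="$conf_dir/mirror.list"

cat > "$conf" <<'EOF'
set base_path /var/spool/apt-mirror
set nthreads  4

deb http://example.com/debian stable main contrib non-free
deb-src http://example.com/debian stable main contrib non-free

clean http://example.com/debian
EOF

# Show which repositories the file asks apt-mirror to fetch.
grep -E '^deb' "$conf"

# To mirror with this file (per the SYNOPSIS: apt-mirror [configfile]):
#   apt-mirror "$conf"
```

Keeping the test file outside /etc/apt means a mistake in it cannot disturb the mirror that the cron job maintains.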
ORIGINAL AUTHOR
Dmitry N. Hramtsov <hdn@nsu.ru>
CURRENT AUTHORS
Dmitry N. Hramtsov <hdn@nsu.ru>
Brandon Holtsclaw <me@brandonholtsclaw.com>
perl v5.14.2 2012-01-28 APT-MIRROR(1)