Wget -i URLs.txt problem Post: 302732621

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sorting problem "sort -k 16,29 sample.txt > output.txt"

Hi all, Iam trying to sort the contents of the file based on the position of the file. Example: $cat sample.txt 0101020060731 ## Header record 1c1 Berger Awc ANP20070201301 4000.50 1c2 Bose W G ANP20070201609 6000.70 1c2 Andy CK ANP20070201230 28000.00...

2. UNIX for Advanced & Expert Users

Wget FTP problem!

Hi, I've tried to download from ftp sites by wget but it failed and says "Service unavailable" but when I use sftp in binary mode and use "get" command it works perfectly. What's the problem? BTW: I tried both passive and active mode in wget. thnx for ur help

3. Shell Programming and Scripting

Problem with wget

Hi, I want to download some patches from SUN by using a script and I am using "wget" as the utillity for this. The website for downloading has a "https:" in its name as below https://sunsolve.sun.com/private-cgi/pdownload.pl?target=${line}&method=h and on running wget as below wget...

4. Shell Programming and Scripting

Extract urls from index.html downloaded using wget

Hi, I need to basically get a list of all the tarballs located at uri I am currently doing a wget on urito get the index.html page Now this index page contains the list of uris that I want to use in my bash script. can someone please guide me ,. I am new to Linux and shell scripting. ...

5. UNIX for Dummies Questions & Answers

Problem with wget no check certificate.

Hi, I'm trying to install some libraries, when running the makefile I get an error from the "wget --no check certificate option". I had a look help and the option wasn't listed. Anyone know what I'm missing.

6. UNIX for Dummies Questions & Answers

find lines in file1.txt not found in file2.txt memory problem

I have a diff command that does what I want but when comparing large text/log files, it uses up all the memory I have (sometimes over 8gig of memory) diff file1.txt file2.txt | grep '^<'| awk '{$1="";print $0}' | sed 's/^ *//' Is there a better more efficient way to find the lines in one file...

7. Shell Programming and Scripting

Problem with wget and cookie

Dear people, I got a problem with an scrip using wget to download pdf-files from an website which uses session-cookies. Background: for university its quite nasty to look up weekly which new homeworks, papers etc. are available on the different sites of the universites chairs. So I wanted a...

8. Shell Programming and Scripting

Download pdf's using wget convert to txt

wget -i genedx.txt The code above will download multiple pdf files from a site, but how can i download and convert these to .txt? I have attached the master list (genedx.txt - which contains the url and file names) as well as the two PDF's that are downloaded. I am trying to have those...

9. Proxy Server

Problem with wget

I cannot download anything using wget in centos 6.5 and 7. But I can update yum etc. # wget https://wordpress.org/latest.tar.gz --2014-10-23 13:50:23-- https://wordpress.org/latest.tar.gz Resolving wordpress.org... 66.155.40.249, 66.155.40.250 Connecting to wordpress.org|66.155.40.249|:443......

LEARN ABOUT DEBIAN

httpindex

httpindex(1)						      General Commands Manual						      httpindex(1)

NAME

       httpindex - HTTP front-end for SWISH++ indexer

SYNOPSIS

       wget [ options ] URL...	2>&1 | httpindex [ options ]

DESCRIPTION

       httpindex is a front-end for index++(1) to index files copied from remote servers using wget(1).  The files (in a copy of the remote direc-
       tory structure) can be kept, deleted, or replaced with their descriptions after indexing.

OPTIONS

   wget Options
       The wget(1) options that are required are: -A, -nv, -r, and -x; the ones that are highly recommended are: -l, -nh, -t, and  -w.	 (See  the
       EXAMPLE.)

   httpindex Options
       httpindex accepts the same short options as index++(1) except for -H, -I, -l, -r, -S, and -V.

       The following options are unique to httpindex:

       -d     Replace the text of local copies of retrieved files with their descriptions after they have been indexed.  This is useful to display
	      file descriptions in search results without having to have complete copies of the remote files thus saving filesystem  space.   (See
	      the extract_description() function in WWW(3) for details about how descriptions are extracted.)

       -D     Delete  the  local copies of retrieved files after they have been indexed.  This prevents your local filesystem from filling up with
	      copies of remote files.

EXAMPLE

       To index all HTML and text files on a remote web server keeping descriptions locally:

	    wget -A html,txt -linf -t2 -rxnv -nh -w2 http://www.foo.com 2>&1 |
	    httpindex -d -e'html:*.html,text:*.txt'

       Note that you need to redirect wget(1)'s output from standard error to standard output in order to pipe it to httpindex.

EXIT STATUS

       Exits with a value of zero only if indexing completed sucessfully; non-zero otherwise.

CAVEATS

       In addition to those for index++(1), httpindex does not correctly handle the use of multiple -e, -E, -m, or -M options  (because  the  Perl
       script uses the standard GetOpt::Std package for processing command-line options that doesn't).	The last of any of those options ``wins.''

       The work-around is to use multiple values for those options seperated by commas to a single one of those options.  For example, if you want
       to do:

	    httpindex -e'html:*.html' -e'text:*.txt'

       do this instead:

	    httpindex -e'html:*.html,text:*.txt'

SEE ALSO

       index++(1), wget(1), WWW(3)

AUTHOR

       Paul J. Lucas <pauljlucas@mac.com>

SWISH++ 							  August 2, 2005						      httpindex(1)

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sorting problem "sort -k 16,29 sample.txt > output.txt"

Discussion started by: ganapati

2. UNIX for Advanced & Expert Users

Wget FTP problem!

Discussion started by: mjdousti

3. Shell Programming and Scripting

Problem with wget

Discussion started by: max29583

4. Shell Programming and Scripting

Extract urls from index.html downloaded using wget

Discussion started by: mnanavati

5. UNIX for Dummies Questions & Answers

Problem with wget no check certificate.

Discussion started by: davcra

6. UNIX for Dummies Questions & Answers

find lines in file1.txt not found in file2.txt memory problem

Discussion started by: raptor25

7. Shell Programming and Scripting

Problem with wget and cookie

Discussion started by: jackomo

8. Shell Programming and Scripting

Download pdf's using wget convert to txt

Discussion started by: cmccabe

9. Proxy Server

Problem with wget

Discussion started by: nirosha

LEARN ABOUT DEBIAN

httpindex