Hi folks!
I am using MacOsX that runs freeBSD. Could you tell me what comand to type on the Unix Terminal to display on the terminal the source code of a certain web page?
I think something like
#<comand> http://www.apple.com
will display on the terminal's window the html source code... (11 Replies)
is there a command that allows you to take a url and grab the source code from the page and output it to stdout?
i want to know because i want to grab a page and pass it thru another program to analyze the page.
any help would be appreciated
thanks (3 Replies)
I'm new to PERL, but I want to take the page source and write it to a file or standard output. I used perl.org as a test website. Here is the script:
use strict;
use warnings;
use LWP::Simple;
getprint('http://www.perl.org') or die 'Unable to get page';
exit 0;
... (1 Reply)
is it possible to pass webpages to remove all tag style information, but leave the tag...
say I have
<h1 style='font-size: xxx; color: xxxxxx'>headline 1</h1>
i want to get
<h1>headline 1</h1>
BTW, i got an oneliner here to remove all tags:
sed -n '/^$/!{s/<*>//g;p;
Thanks a... (4 Replies)
I have downloaded a web source page to a file. I then egrep a single word to extract a line containing it to another file.
I then cat the second file and remove everything before a word and after a second word to capture the phrase desired.
This did not work. I used vi to validate that the 2... (1 Reply)
I need to get the source code of a webpage. I have tried to use wget and curl, but it doesn't show the necessary javascript part of the source. I don't have to execute it, only to view the source.
How do I do that? (1 Reply)
Hi,
I want to remove the following code from Source files (or replace the code with empty.) from all the source files in given directory.
finally {
if (null != hibernateSession && hibernateSession.isOpen()) {
//hibernateSession.close();
}
}
It would be great if the script has... (2 Replies)
Hi Friends,
I have a bunch of URLs.
Each URL will open up an abstract page.
But, the source contains a link to the main PDF article.
I am looking for a script to do the following task
1. Read input file with URLs.
2. Parse the source and grab all the lines that has the word 'PDF'.... (1 Reply)
Hi guys|
I need to retrieve a specific .m3u8 link from a web page, which makes use of iframes and JavaScript
I tried to get the full source with "wget", "lynx", "w3m" and "phantomjs", but they can't dump all the source, with the part containing the link that i need, which seems to be inside... (0 Replies)
Discussion started by: Marmz
0 Replies
LEARN ABOUT DEBIAN
jigdo-lite
JIGDO-LITE(1)JIGDO-LITE(1)NAME
jigdo-lite - Download jigdo files using wget
SYNOPSIS
jigdo-lite [ URL ]
DESCRIPTION
See jigdo-file(1) for an introduction to Jigsaw Download.
Given the URL of a `.jigdo' file, jigdo-lite downloads the large file (e.g. a CD image) that has been made available through that URL.
wget(1) is used to download the necessary pieces of administrative data (contained in the `.jigdo' file and a corresponding `.template'
file) as well as the many pieces that the large file is made from. The jigdo-file(1) utility is used to reconstruct the large file from the
pieces.
`.jigdo' files that contain references to Debian mirrors are treated specially: When such a file is recognized, you are asked to select one
mirror out of a list of all Debian mirrors.
If URL is not given on the command line, the script prompts for a location to download the `.jigdo' file from. The following command line
options are recognized:
-h --help
Output short summary of command syntax.
-v --version
Output version number.
--scan FILES
Do not ask for "Files to scan", use this path.
--noask
Do not ask any questions, instead behave as if the user had pressed Return at all prompts. This can be useful when running jigdo-
lite from cron jobs or in other non-interactive environments.
SEE ALSO jigdo-file(1), jigdo-mirror(1), wget(1) (or `info wget')
CD images for Debian Linux can be downloaded with jigdo <URL:http://www.debian.org/CD/jigdo-cd/>.
AUTHOR
Jigsaw Download <URL:http://atterer.net/jigdo/> was written by Richard Atterer <jigdo atterer.net>, to make downloading of CD ROM images
for the Debian Linux distribution more convenient.
19 May 2006 JIGDO-LITE(1)