HXWLS(1) HTML-XML-utils HXWLS(1)
NAME
hxwls - list links in an HTML file
SYNOPSIS
hxwls [ -l ] [ -t ] [ -r ] [ -h ] [ -b base ] [ file ]
DESCRIPTION
The hxwls command reads an HTML file (standard input by default) and prints out all links it finds. The output is written to stdout.
OPTIONS
The following options are supported:
-l Produce a long listing. Instead of just the URI, hxwls prints three columns: the element name, the value of the REL attribute,
and the target URI.
-t Produce a tuple listing. hxwls prints four columns: the URI of the document itself, the element name, the value of the REL
attribute, and the target URI.
-r Print relative URLs as they are, without converting them to absolute URLs.
-b base Use base as the initial base URL. If there is a <base> element in the document, it will override the -b option.
-h Output as HTML. The output will be listed in the form of <a> elements.
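The behaviour behind -l, -r and -b can be illustrated with a short Python sketch. This is not how hxwls is implemented; it only mimics the description above using the standard library. The names LinkLister, list_links and LINK_ATTRS, the small set of link-bearing attributes, and the tab-separated output are all assumptions made for the illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumption: a small subset of link-bearing attributes; the set hxwls
# actually recognises may be larger.
LINK_ATTRS = {"a": "href", "link": "href", "img": "src", "script": "src"}


class LinkLister(HTMLParser):
    """Collect (element, rel, uri) tuples, loosely mirroring a long listing."""

    def __init__(self, base="", keep_relative=False):
        super().__init__()
        self.base = base                    # plays the role of -b base
        self.keep_relative = keep_relative  # plays the role of -r
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "base" and attrs.get("href"):
            # A <base> element in the document overrides the initial base.
            self.base = urljoin(self.base, attrs["href"])
            return
        attr = LINK_ATTRS.get(tag)
        uri = attrs.get(attr) if attr else None
        if uri is None:
            return
        if not self.keep_relative:
            # Convert a relative URL to an absolute one against the base.
            uri = urljoin(self.base, uri)
        self.links.append((tag, attrs.get("rel") or "", uri))


def list_links(html, base="", keep_relative=False):
    """Return the links found in an HTML string, in document order."""
    parser = LinkLister(base, keep_relative)
    parser.feed(html)
    parser.close()
    return parser.links


SAMPLE = """<html><head>
<base href="http://example.org/docs/">
<link rel="stylesheet" href="style.css">
</head><body><a href="page.html">next</a></body></html>"""

# Three columns: element name, REL value, target URI.
for element, rel, uri in list_links(SAMPLE, base="http://ignored.example/"):
    print(element, rel, uri, sep="\t")
```

In this sketch the base passed to list_links is replaced as soon as the <base> element is seen, so both links resolve against http://example.org/docs/. Passing keep_relative=True leaves style.css and page.html untouched.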
OPERANDS
The following operand is supported:
file The name or the URL of an HTML file. If absent, standard input is read instead.
DIAGNOSTICS
The following exit values are returned:
0 Successful completion.
> 0 An error occurred in the parsing of the HTML file. hxwls will try to correct the error and produce output anyway.
SEE ALSO
asc2xml(1), hxnormalize(1), hxnum(1), xml2asc(1)
6.x 10 Jul 2011 HXWLS(1)
HXCOUNT(1) HTML-XML-utils HXCOUNT(1)
NAME
hxcount - count elements and attributes in HTML or XML files
SYNOPSIS
hxcount [ file-or-URL ]
DESCRIPTION
The hxcount command counts how many elements and attributes of each type appear in the input and prints a report on stdout.
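As a rough illustration of this kind of tally, the following Python sketch counts element and attribute names with the standard library. It is not hxcount's implementation, and the report layout (count, tab, name, with attributes prefixed by @) is an assumption, as are the names TagCounter and count_markup.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Tally element names and attribute names seen in start tags."""

    def __init__(self):
        super().__init__()
        self.elements = Counter()
        self.attributes = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases names, which suits HTML but would merge
        # case-distinct names in XML input.
        self.elements[tag] += 1
        for name, _value in attrs:
            self.attributes[name] += 1


def count_markup(text):
    """Return (element counts, attribute counts) for a markup string."""
    parser = TagCounter()
    parser.feed(text)
    parser.close()
    return parser.elements, parser.attributes


SAMPLE = (
    '<ul class="nav"><li><a href="/">Home</a></li>'
    '<li><a href="/faq" class="x">FAQ</a></li></ul>'
)

elements, attributes = count_markup(SAMPLE)
# Hypothetical report layout: count, tab, name.
for name, n in sorted(elements.items()):
    print(f"{n:6d}\t{name}")
for name, n in sorted(attributes.items()):
    print(f"{n:6d}\t@{name}")
```

Empty-element tags such as <br/> are counted as well, because HTMLParser routes them through handle_starttag by default.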
OPERANDS
The following operand is supported:
file-or-URL
The name or URL of an HTML or XML file. If absent, standard input is read instead.
EXIT STATUS
The following exit values are returned:
0 Successful completion.
> 0 An error occurred in the parsing of the HTML or XML file. hxcount will try to recover from the error and produce output anyway.
ENVIRONMENT
To use a proxy to retrieve remote files, set the environment variables http_proxy and ftp_proxy. E.g., http_proxy="http://localhost:8080/"
BUGS
Don't trust the output if there were errors in the input.
Remote files (specified with a URL) are currently only supported for HTTP. Password-protected files or files that depend on HTTP "cookies"
are not handled. (You can use tools such as curl(1) or wget(1) to retrieve such files.)
SEE ALSO
asc2xml(1), hxprune(1), hxnormalize(1), hxnum(1), hxtoc(1), hxunent(1), xml2asc(1), UTF-8 (RFC 2279)
6.x 10 Jul 2011 HXCOUNT(1)