I have a html file called myfile. If I simply put "cat myfile.html" in UNIX, it shows all the html tags like <a href=r/26><img src="http://www>. But I want to extract only text part.
Same problem happens in "type" command in MS-DOS.
I know you can do it by opening it in Internet Explorer,... (4 Replies)
hi
i need to use unix to extract data from several rows of a table coded in html. I know that rows within a table have the tags <tr> </tr> and so i thought that my first step should be to to delete all of the other html code which is not contained within these tags. i could then use this method... (8 Replies)
Hiya,
I am trying to extract a news article from a web page. The sed I have written brings back a lot of Javascript code and sometimes advertisments too. Can anyone please help with this one ??? I need to fix this sed so it picks up the article ONLY (don't worry about the title or date .. i got... (2 Replies)
Hello,
i try to extract urls from google-search-results, but i have problem with sed filtering of html-code.
what i wont is just list of urls thay apears between ........<p><a href=" and next following " in html code.
here is my code, i use wget and pipelines to filtering. wget works, but... (13 Replies)
Hello everyone, I'm new to this forum and i am new as a shell scripter.
my problem is to have html files in a directory and I would like to extract from these some data that lies between two different lines
Here's my situation
<td align="default"> oxidizability (mg / l):
data_to_extract... (6 Replies)
Hi
I've searched for it for few hours now and i can't seem to find anything working like i want. I've got webpage, saved in file par with form like this:
<html><body><form name='sendme' action='http://example.com/' method='POST'>
<textarea name='1st'>abc123def678</textarea>
<textarea... (9 Replies)
I have bash, awk, and sed available on my portable device. I need to extract 10 fields from each table row from a web page that looks like this:
</tr>
<tr>
<td>28 Apr</td>
<td><a... (6 Replies)
Hi, I'm trying to get some data from an html file, but the problem is before it can extract the information I have multiple patterns that need to be passed through.
https://www.unix.com/shell-programming-scripting/150711-extract-data-awk-html-files.html
Is a similar problem. The only... (5 Replies)
I'm extracting text between table tags in HTML
<th><a href="/wiki/Buick_LeSabre" title="Buick LeSabre">Buick LeSabre</a></th>
using this:
awk -F "</*th>" '/<\/*th>/ {print $2}' auto2 > auto3
then this (text between a href):
sed -e 's/\(<*>\)//g' auto3 > auto4
How to shorten this into one... (8 Replies)
I am trying to extract text after keywords fron an html file. The keywords are reportLink":, "barcodedSamples": {", "barcodedSamples": {". Both the perl and awk run but the output is just the entire index.html not the desired output. Also for the reportLink": only the text after the second / until... (5 Replies)
Discussion started by: cmccabe
5 Replies
LEARN ABOUT OSF1
iptos
iptos(4) Kernel Interfaces Manual iptos(4)NAME
iptos - Defines the IP Type Of Service (TOS) for FTP and Telnet
SYNOPSIS
/etc/iptos
DESCRIPTION
The /etc/iptos file configures the Type Of Service (TOS) of the Internet Protocol (IP) used by FTP and Telnet.
The TOS field in the Internet datagram is to specify how the datagram should be handled. It is a mechanism to allow control information to
have precedence over data.
Generally, protocols that are involved in direct interaction with a human should select low delay, while data transfers that involve large
blocks of data need high throughput. Finally, high reliability is most important for datagram-based Internet management functions.
In the Tru64 UNIX operating system, the ftp and telnet applications and the ftpd and telnetd daemons allow the configuring of TOS values.
These applications check to see if the /etc/iptos file exists; if the file exists, the applications obtain the TOS value from the file and
use that value to set the TOS field. If the /etc/iptos file does not exist, the applications default to the following TOS values recom-
mended by RFC1060: Low delay High throughput Low delay
Users who want to configure their own TOS values for the TOS field should provide the /etc/iptos file.
Note
Most IP routers do not differentiate based on TOS, and therefore providing values other than the default would have no affect. You
should not change the default values for FTP and Telnet.
Each entry should consist of a single line of the form:
Application Proto TOS-bits aliases
The entry fields contain the following information: The name of an application TOS entry. The protocol name for which the entry is appro-
priate. The TOS value to be set for the entry. A list of aliases that exist for the entry.
Items on an entry line are separated by any number of blanks, tabs, or combination of blanks and tabs. A number sign (#) indicates that
the rest of the line is a comment and is not interpreted by routines that search the file. Blank lines in the file are ignored.
Valid TOS entry names are ftp-control and ftp-data for FTP and telnet for Telnet.
The TOS value for the entry should be one of the following hexadecimal numbers, corresponding to TOS bits: Low delay High throughput High
reliability
If you need to disable the use of TOS bits, because you are having troubling communicating with a TCP/IP host that doe not conform entirely
with the IP specification, you can disable the TOS bits by using the the following settings in the /etc/iptos file:
# # Format of this file: # Application Proto TOS-bits aliases #
ftp-control tcp 0x0 ftp-data tcp 0x0 telnet tcp 0x0
EXAMPLES
The following example shows typical entries in the /etc/iptos file:
# # Format of this file: # Application Proto TOS-bits aliases #
ftp-control tcp 0x10 ftp-data tcp 0x08 telnet tcp 0x10
RELATED INFORMATION
RFC1060, ftp(1), telnet(1), ftpd(8), telnetd(8) delim off
iptos(4)