I need to extract the title (text between <title> and </title>) of a set of HTML documents.
I've found a command that makes the work of extracting the text, but it does not always work.
It works with the next example:
However, it does not works with a real example:
Hello folks,
I am facing a problem with the following korn shell script snippet:
ftp -n -i -v <<EOF
print -p open $CURR_HOST
print -p user $USER $PASSWD
print -p binary
print -p cd /mydir/subdir/datadir
print -p get $FILENAME
print -p bye
EOF
exit
It gives me the following... (3 Replies)
Code for the tweak (not my fave 'running process' but the more popular 'working directory') :
case "$TERM" in
xterm*|rxvt*|rxvt-unicode*)
PROMPT_COMMAND='echo -e "\033]0;$TERM: ${PWD}\007"'
;;
*)
;;
esac
Where it works: rxvt (the one I run 'rootless' outside of ... (0 Replies)
Hi All,
I am not able to read my HTML form inputs properly in my script.
I have a textarea in my form where user needs to enter sql query... but when user enter query like below :
select * from order_queue where NUM_OF_PICKUP >=3 and TRANSACTION_TYPE=4 ;
its coming like :
select 171_arc... (3 Replies)
Hi Team,
I am not able to extract string between parenthesis.I need to extract string between first parenthesis only.
Please find the sample data and code.
But the below my code is returning "DW_EFD_TXN_ID", "PRCS_DTE" & INITIAL 52428800 NEXT 1048576 MINEXTENTS 1 MAXEXTENTS 2147483645... (12 Replies)
Hi All,
I have some HTML files and my requirement is to extract all the anchor text words from the HTML files along with their URLs and store the result in a separate text file separated by space. For example, <a href="/kid/stay_healthy/">Staying Healthy</a>
which has /kid/stay_healthy/ as... (3 Replies)
Hi everyone:
I want to extract string which is in between certain html tag.
e.g.
I tried with grep,cut, awk but could not find exact syntax for this one. :wall:
PS>Sorry about bad english. (8 Replies)
My file looks like this and i need to only extract those with PDT_AP21_B and output it to another file. Can anyone help? Thanks.
PDT_AP21_R,,, 11 TYS,,,,T17D1207230742TYO***T17DS,,C
PDT_AP21_L,,,9631166650001 ,,,,T17D1207230903TYOTYST17DS ,,C... (3 Replies)
Hi
I am new to string extractions in shell script... I am trying to extract a string such as #1753 from html tag looks like below.
<a class="model-link tl-tr" href="lastSuccessfulBuild/">Last successful build (#1753), 40 min ago</a>
and want the value as
1753
Could someone help me to... (3 Replies)
I have a script which converts a .csv file to html nicely. Trying to add 3 colors, green, yellow and red to the output depending upon the values in the cells. Tried some printf command but just can't seem to get any where. Any ideas would be appreciated. nawk 'BEGIN{
FS=","
print ... (7 Replies)
Discussion started by: jimmyf
7 Replies
LEARN ABOUT OPENSOLARIS
pdbtxt2html
pdbtxt2html(1) General Commands Manual pdbtxt2html(1)NAME
pdbtxt2html - Doc Text to HTML converter for Palm Pilots
SYNOPSIS
pdbtxt2html [ -t ] file.txt [ file.html ]
pdbtxt2html -v
DESCRIPTION
pdbtxt2html converts text converted from a Doc(4) file via txt2pdbdoc(1) to HTML. If no HTML filename is given, the generated HTML is sent
to standard output.
Document Title
The first line of the file is used for the HTML document title.
Bookmarks
The last line of the file is examined and, if it contains a string enclosed between < and >, that is taken to be the bookmark marker. The
entire file is then scanned looking for lines beginning with it (ignoring leading whitespace). These lines are converted to HTML headings.
The number of whitespace characters after the first bookmark marker is used for heading level 1. The level of subsequent headings is set
to the number of whitespace characters between the bookmark marker and the bookmark text minus the number for the first bookmark plus one.
Embedded URLs
Valid URLs (according to RFC 1630) embedded in the text are turned into hyperlinks. The ftp, gopher, http, https, mailto, news, telnet,
and wais URLs are recognized.
OPTIONS -t Compile a table of contents and insert it between the first heading and the body.
-v Print the version number to standard output and exit.
EXAMPLE
To convert a Doc file to HTML:
txt2pdbdoc alice.pdb alice.txt
pdbtxt2html alice.txt alice.html
SEE ALSO html2pdbtxt(1), txt2pdbdoc(1), doc(4), pdb(4)
Tim Berners Lee. Universal Resource Identifiers in WWW, Network Working Group of the Internet Engineering Task Force, June 1994.
http://info.internet.isi.edu/in-notes/rfc/files/rfc1630.txt
AUTHOR
Paul J. Lucas <pauljlucas@mac.com>
txt2pdbdoc January 21, 2005 pdbtxt2html(1)