Itinerate throught HTML table


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Itinerate throught HTML table
# 8  
Old 03-22-2010
Quote:
Originally Posted by drewk
OK -- itinerate Smilie

Try Tyler's perl script (either) with wget or curl:

Code:
curl "http://www.yururl.com" | perl -lne 'BEGIN{undef $/}while (/.*<tr.*?"orderLink">(\d+)<.*?>Complete<.*?\/tr>.*/msg){print $1}'

That will download and itinerate
Thanks, i will have a look.
# 9  
Old 03-25-2010
There is a new recuriments. I was ask not to search for Status = Completed but all the others differents thatn Rejected.

Can this be done using the current awk ?

$ awk -F"[><]" ' /orderLink/ { f=1; _ord=$5; } f && /Rejected/ {
_sta=$5; f=0; print _ord ","}' f1 | tail -1

Thanks in advance
# 10  
Old 03-25-2010
Quote:
Originally Posted by valigula
...I was ask not to search for Status = Completed but all the others differents thatn Rejected.
...
I don't quite understand this statement. Do you want to fetch orderLinks -
(a) with "Rejected" status ?
(b) with statuses other than "Complete" and "Rejected" ?
(c) with statuses other than "Rejected" ?

I shall assume that you want (a).

Quote:
Can this be done using the current awk ?

$ awk -F"[><]" ' /orderLink/ { f=1; _ord=$5; } f && /Rejected/ {
_sta=$5; f=0; print _ord ","}' f1 | tail -1

...
Just try it on your HTML and see for yourself !
You have your HTML file, you have your awk script; what's stopping you from testing it out ?

Here's what I see when I run it on the HTML file you supplied in your first post -

Code:
$ 
$ cat -n f5
     1  <html>
     2  <body>
     3    <div>
     4    <table id="orderList">
     5      <thead>
     6        <tr>
     7          <th>order number</th>
     8          <th>order type</th>
     9          <th>product type</th>
    10          <th>status</th>
    11          <th>status date</th>
    12        </tr>
    13      </thead>
    14      <tbody>
    15        <tr class="odd">
    16          <td><span id="orderLink">24978900</a></td>
    17          <td><span id="orderType">Provide</span></td>
    18          <td><span id="productType">Prod1</span></td>
    19          <td><span id="status">Complete</span></td>
    20          <td><span id="statusDate">18/12/09</span></td>
    21          <td><span id="bucket"></span></td>
    22        </tr><tr class="even">
    23          <td><span id="orderLink">27004805</a></td>
    24          <td><span id="orderType">Cease</span></td>
    25          <td><span id="productType"></span></td>
    26          <td><span id="status">Rejected</span></td>
    27          <td><span id="statusDate">17/02/10</span></td>
    28        </tr>
    29      </tbody>
    30    </table>
    31
    32  </div>
    33  </body>
    34  </html>
$ 
$ awk -F"[><]" ' /orderLink/ { f=1; _ord=$5; } f && /Rejected/ { _sta=$5; f=0; print _ord ","}' f5
27004805,
$

Is this what you wanted ?

In any case, you could probably simplify the script thusly -

Code:
awk -F"[><]" '/orderLink/ {f=1; ord=$5} f && /Rejected/ {print ord}' f5

tyler_durden
# 11  
Old 03-26-2010
Sorry for my terrible writting.

First i was asked to search for orderLinks with status = Completed. But it was too many exception (other statuses to be consider) , so know i rather do a "different than" Rejected instead.

In my first example i need to retrive the:
24978900
i added a grep at the end of the awk

awk -v telf="$1" -F"[><]" ' /orderLink/ { f=1; _ord=$5; } f && /productType/ {_pro=$5; f=1 ;} f && /status/ { $5; f=0; print telf","_ord", "_pro"," $5}' /tmp/$1.htm | grep -v Rejected

That returns all NOT Rejected,
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Export HTML table

HI , I have a HTML tables as below. It has 2 tables ,I want to extract the second table . Please help me in doing it. <html> <body> <b><br>Running Date: </b>11-JAN-2019 03:07</br> <h2> Schema mapping and info </h2> <BR><TABLE width="100%" class="x1h" cellpadding="1"... (3 Replies)
Discussion started by: deepti01
3 Replies

2. UNIX for Dummies Questions & Answers

Extract table from an HTML file

I want to extract a table from an HTML file. the table starts with <table class="tableinfo" and ends with next closing table tag </table> how can I do this with awk/sed... ---------- Post updated at 04:34 PM ---------- Previous update was at 04:28 PM ---------- also I want to... (4 Replies)
Discussion started by: koutroul
4 Replies

3. Shell Programming and Scripting

Creating html table from data in file

Hi. I need to create html table from file which contains data. No awk please :) In example, ->cat file num1 num2 num3 23 3 5 2 3 4 (between numbers and words single TAB). after running mycode i need to get (heading is the first line): <table>... (2 Replies)
Discussion started by: Manu1234567
2 Replies

4. Web Development

Help on filtering the table in HTML

1. how to get the filter option on table so that user can enter the fields which ever they want to print only according to the need ? 2.how to print the full fledge table if there is no value in the rows of the table but it should print the whole rows and column in proper tabular form? (2 Replies)
Discussion started by: sidhi
2 Replies

5. Shell Programming and Scripting

Get HTML table

Hi all, I have a html that contains several tables in it. Need to extract the data from one of them named "orderList". Is it any easy way without using loops. Thanks (4 Replies)
Discussion started by: valigula
4 Replies

6. Shell Programming and Scripting

help with a bash script to create a html table

Hi guys as the title says i need a little help i have partisally written a bash script to create a table in html so if i use ./test 3,3 i get the following output for the third arguement in the script i wish to include content that will be replace the A characters in the... (2 Replies)
Discussion started by: dunryc
2 Replies

7. Shell Programming and Scripting

Format txt file as html table

I have a short time to solve a problem, so I need some help. I've searched the forum, but I couldn't find a solution to my problem. I made a script to filter some text and now I have a new requirement to make it available as html table. Problem is that I more than one files with different set... (2 Replies)
Discussion started by: tetreb
2 Replies

8. Shell Programming and Scripting

PHP: Sorting HTML table

Hi All, I have an html table which looks like this: <table align="center" border="1"> <CAPTION><EM>Heading for Table</EM></CAPTION> <tr><td><b>1</b></TD><TD><b>2</b></TD><TD><b>3</b></TD><TD><b>4</b></TD><TD><b>TOTAL</b></TD><TD><b>DATE</b></td></tr> <tr><td>88088283</TD> <TD>87613101</TD>... (1 Reply)
Discussion started by: pondlife
1 Replies

9. Shell Programming and Scripting

Export a HTML table to Xcel

Hello All, I have a perl script that prints a HMTL table. I want to convert this data into a report and this want to export this information into Excel. How can I do this? Regards, garric (3 Replies)
Discussion started by: garric
3 Replies

10. Shell Programming and Scripting

HTML table to CSV

Hi !! I have HTML Tables through which i want to generate graphs, but for creating graphs i need the file in CSV format so can anyone can please help me in how can i convert my HTML table file to CSV format. Thanks in Advance (2 Replies)
Discussion started by: i_priyank
2 Replies
Login or Register to Ask a Question