Regular Expression


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Regular Expression
# 1  
Old 07-08-2012
Regular Expression

Hello,

I want to extract text between <td class="di_resultscolumnheader"> and </td>.
I wrote the below code to extract text. But I am able to extract the text for the first match only.
Can some one help me in this?

Thanks in advance.

Code:

Code:
if ($line =~ /<td class="di_resultscolumnheader">(.*?)<\/td>/g)
{
	print $1,"\n";
}

Sample Input:

Code:
<td class="di_resultscolumnheader">Object</td><td class="di_resultscolumnheader">Precipitant</td><td class="di_resultscolumnheader">Therapeutic&nbsp;Class</td>

Sample Output:

Code:
Object
Precipitant
Therapeutic&nbsp;Class


Last edited by Scrutinizer; 07-09-2012 at 06:38 AM.. Reason: code tags
# 2  
Old 07-08-2012
You could try something like this:

Code:
sed 's!<td class="di_resultscolumnheader">.*</td>!&!g; s!</td>!\n!g; s!<td class="di_resultscolumnheader">!!g;s/\n$//; /^$/d; ' input-file >output-file

This doesn't handle the case where there might be text between </td> and <td...>. A simple, but slightly longer, sed would be needed to deal with that.
# 3  
Old 07-09-2012
Code:
$ nawk -F"[<>]" '{for(i=1;i<=NF;i++)if($i~/resultscolumnheader/){print $(i+1)}}' input.txt
Object
Precipitant
Therapeutic&nbsp;Class

# 4  
Old 07-09-2012
Code:
# sed -n '/^<td class="di_resultscolumnheader">/,/<\/td>/{s///;s/<td class="[^ ]*">//g;s//\n/g;:a;s/<\/td>/\n/;/td.*td/ta;s/<\/td>$//;p}' infile
Object
Precipitant
Therapeutic&nbsp;Class

# 5  
Old 07-09-2012
Code:
awk '/^td class="di_resultscolumnheader"/{print $2}' RS=\< FS=\> infile

# 6  
Old 07-09-2012
Perl:
Code:
while (/<td class="di_resultscolumnheader">(.*?)<\/td>/g)
{
    print "$1\n";
}

Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Regular expression

I would like to extract "JDBC Thin Client" from (PROGRAM=JDBC Thin Client); and "C:\Python26\python.exe" from (PROGRAM=C:\Python26\python.exe); "txm_v_awsrf_db.pr" from (PROGRAM=txm_v_awsrf_db.pr). if input line like "(PROGRAM=)", it return unknown. Would you help suggest a regular... (4 Replies)
Discussion started by: cstsang
4 Replies

2. Shell Programming and Scripting

Regular expression

Can someone please explain me what does this mean? ^{1,50}$ (1 Reply)
Discussion started by: Anupam_Halder
1 Replies

3. UNIX for Dummies Questions & Answers

Regular expression help

Hi, I am quite knew to scripting and I am trying to get a regular expression to work to check that a user enters a valid version number such as 1 or 1.1 or 12.3 etc. I dont seem to be able to get it to work as it picks up versions such as 1.......2. I only want it to work with a single dot.... (12 Replies)
Discussion started by: frodo61
12 Replies

4. UNIX for Advanced & Expert Users

sed: -e expression #1, char 0: no previous regular expression

Hello All, I'm trying to extract the lines between two consecutive elements of an array from a file. My array looks like: problem_arr=(PRS111 PRS213 PRS234) j=0 while } ] do k=`expr $j + 1` sed -n "/${problem_arr}/,/${problem_arr}/p" problemid.txt ---some operation goes... (11 Replies)
Discussion started by: InduInduIndu
11 Replies

5. Programming

Perl: How to read from a file, do regular expression and then replace the found regular expression

Hi all, How am I read a file, find the match regular expression and overwrite to the same files. open DESTINATION_FILE, "<tmptravl.dat" or die "tmptravl.dat"; open NEW_DESTINATION_FILE, ">new_tmptravl.dat" or die "new_tmptravl.dat"; while (<DESTINATION_FILE>) { # print... (1 Reply)
Discussion started by: jessy83
1 Replies

6. Shell Programming and Scripting

Integer expression expected: with regular expression

CA_RELEASE has a value of 6. I need to check if that this is a numeric value. if not error. source $CA_VERSION_DATA if * ] then echo "CA_RELESE $CA_RELEASE is invalid" exit -1 fi + source /etc/ncgl/ca_version_data ++ CA_PRODUCT_ID=samxts ++ CA_RELEASE=6 ++ CA_WEEK_NO=7 ++... (3 Replies)
Discussion started by: ketkee1985
3 Replies

7. UNIX for Dummies Questions & Answers

ls with regular expression

I currently list and sort all the files in a directory which begin with an Upper Case C and end with the extension '#finished#'. Here is the command I use: ls -tr $currentDir/*.#finished# what i need to do now is list all the files in a directory that begin with upper case C and end... (3 Replies)
Discussion started by: rkgudde
3 Replies

8. Linux

Regular expression to extract "y" from "abc/x.y.z" .... i need regular expression

Regular expression to extract "y" from "abc/x.y.z" (2 Replies)
Discussion started by: rag84dec
2 Replies

9. Programming

What does the regular expression ['(^[^~]+~).*'] mean?

What does the regular expression +~).*'] mean while using it with regexec.When the string "RCHNUSNT35C~rs07/ASM-RS07" is used with the regular expression +~).*'] regexec gives an error. I know what regexec does,but i do not understand what this expression means wrt to this string... any help... (2 Replies)
Discussion started by: anupamar
2 Replies

10. Shell Programming and Scripting

Regular Expression + Aritmetical Expression

Is it possible to combine a regular expression with a aritmetical expression? For example, taking a 8-numbers caracter sequece and casting each output of a grep, comparing to a constant. THX! (2 Replies)
Discussion started by: Z0mby
2 Replies
Login or Register to Ask a Question