Sed and Awk Help


 
# 1  
Old 06-17-2008

I am trying to grab some text from an HTML file.

I have something similar to this:

Code:
<tr>
<td><a  href="somepath.html">Some Text<HR></HR></a></td>
</tr>
<tr>
<td><a  href="somepath.html" >Some More Text<HR></HR></a></td>
</tr>

I just want the text inside the links (here, "Some Text" and "Some More Text"). Any suggestions?
# 2  
Old 06-17-2008
You can try this.

awk -F">" '$3 != "" {print $3}' a.txt | awk -F"<" '{print $1}'
# 3  
Old 06-17-2008
Code:
awk  'NF!=0'  RS="<[^<>]+>" filename


Last edited by rubin; 06-18-2008 at 02:20 AM.. Reason: code improvement
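
For anyone reading along, here is a short sketch of why the one-liner above works. Setting RS to the regular expression `<[^<>]+>` makes every HTML tag a record separator, so the text between tags becomes the records, and `NF!=0` prints only the records that contain at least one field. A regular-expression RS is supported by gawk and mawk but is not guaranteed by POSIX awk. The `sample.html` file name and the `extract_text` wrapper are made up for illustration.

```shell
# Recreate the sample from post #1 (the file name is hypothetical).
cat > sample.html <<'EOF'
<tr>
<td><a  href="somepath.html">Some Text<HR></HR></a></td>
</tr>
<tr>
<td><a  href="somepath.html" >Some More Text<HR></HR></a></td>
</tr>
EOF

# extract_text is an illustrative wrapper, not part of the original post.
# Each tag acts as a record separator; records with no fields are skipped.
# Requires an awk that accepts a regex RS (for example gawk or mawk).
extract_text() {
    awk 'NF!=0' RS='<[^<>]+>' "$1"
}

extract_text sample.html
```

With gawk or mawk this should print "Some Text" and "Some More Text" on separate lines.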
# 4  
Old 06-18-2008
Thanks Rubin, that worked great! Now I need to figure out how to store each result on its own line. When I run the script and look at the output at the command prompt, it looks fine. If I redirect it to a text file, it's all mashed together.

Any suggestions on how to separate it?
# 5  
Old 06-18-2008
If you have lynx:

Code:
lynx --force-html --nolist --dump file

Or if you have html2text:
Code:
html2text file

# 6  
Old 06-18-2008
When I try and run this through sed to remove all numbers that are in my file, I lose my line breaks. Any idea on why or how to keep them?

sed 's/[0-9]*//g'

is the command I am using.
# 7  
Old 06-19-2008
Quote:
Originally Posted by ryanewing
When I try and run this through sed to remove all numbers that are in my file, I lose my line breaks. Any idea on why or how to keep them?

sed 's/[0-9]*//g'

is the command I am using.

Can you post the input file (the processed output file) as it is before running the sed command, and your expected final output?
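
In the meantime, a quick way to check whether sed is really dropping line breaks is to compare line counts before and after and to look at the raw bytes. This is only a sketch: the file names and the `strip_digits` helper are hypothetical, and the sample data is invented. If the counts match, the line breaks are still in the file, and the viewer or the line-ending style (Unix `\n` versus Windows `\r\n`) may be the real issue.

```shell
# File names and the strip_digits helper are hypothetical.
printf 'Item 12\nItem 345\n' > before.txt

# Same substitution as above, but [0-9][0-9]* requires at least one digit,
# so sed does not also match the empty string between other characters.
strip_digits() {
    sed 's/[0-9][0-9]*//g' "$1"
}

strip_digits before.txt > after.txt

# The two line counts should be equal if no line breaks were lost.
wc -l < before.txt
wc -l < after.txt

# Show the bytes: \n marks Unix line endings, \r \n marks Windows ones.
od -c after.txt
```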

I tested the command with a bigger sample file, redirected the output to another file, and saved it as a text file. It opened fine in both Linux and Windows with their respective editors, with no line-break problems. Note, though, that the command gives one output line per record only when a record has a single part to extract, as in your sample. If a record has more than one, each extract will be on its own line. The command can be changed to handle that too; post a more complete sample if needed.
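
One possible way to keep several extracts from the same input line together, offered as a sketch rather than as rubin's own solution: process the file line by line, replace each tag with a space, and let awk re-join the remaining words. This assumes that "handle that too" means one output line per input line; the `text_per_line` name and the sample input are made up.

```shell
# text_per_line is a hypothetical helper name.
# Replace every tag with a space; for lines that still contain text,
# rebuild the record ($1 = $1) to squeeze the leftover whitespace, then print.
text_per_line() {
    awk '{ gsub(/<[^<>]+>/, " ") } NF { $1 = $1; print }' "$@"
}

printf '%s\n' '<td><a href="a.html">First</a> <a href="b.html">Second Link</a></td>' '<tr>' |
    text_per_line
```

Because this version uses only gsub and the default record separator, it does not depend on a regex RS. The trade-off is that the boundary between extracts on the same line is reduced to a single space.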
 