Getting only the text from xml file then to lower case.


 
# 1  
Old 08-26-2011

I have a file containing XML and want to remove everything except the text, using awk.

So I need to remove anything between "<" and ">", and also the "_".

Then I want to change the text to lower case, except for the first letter after a full stop ".".

How can I do it?



Code:
−
<p begin="00:41:16.994" style="1">
GODDARD, ARE YOU STILL WITH US?
<br/>
OH, OKAY.
</p>
<p begin="00:41:22.860" style="1">SO GODDARD HAS SHUT DOWN FOR US FOR THE DAY.</p>
−
<p begin="00:41:24.160" style="1">
BUT WE DO HAVE A COMMENT, I'M HEARING, FROM OUR SPACE 
</p>
<p begin="00:41:26.860" style="1">WEATHER ACTION TEAM.</p>
<p begin="00:41:28.294" style="1">THIS SHOULD BE VERY INTERESTING.</p>
<p begin="00:41:29.560" style="1">PAUL, WHAT'S HAPPENING OVER THERE?</p>
−
<p begin="00:41:32.960" style="1">
WELL, OUR OWN TEAM ALSO HAS SOME QUESTIONS TO POSE TO 
</p>
−
<p begin="00:41:33.227" style="1">
OUR EXPERT SCIENTISTS, SO HERE'S ANOTHER QUESTION FOR
<br/>
YOU. 
</p>
−
<p begin="00:41:37.794" style="1">
WELL, I KNOW THAT THE SUN IS QUIET RIGHT NOW, SO I 
</p>
<p begin="00:41:39.760" style="1">WOULD LIKE TO KNOW WHEN WILL IT BE ACTIVE AGAIN?</p>
<p begin="00:41:41.460" style="1">THAT'S A VERY GOOD QUESTION.</p>
−


Last edited by kristinu; 08-26-2011 at 10:25 PM.
# 2  
Old 08-26-2011
Please post what you have tried so far.
# 3  
Old 08-26-2011
Code:
awk '/^<$/,/^>$/ { next }; { print }' nasa-content.xml

---------- Post updated at 09:06 PM ---------- Previous update was at 09:02 PM ----------

This gets me the text:

Code:
awk '/</,/>/ {next}; {print}' nasa-content.xml

---------- Post updated at 09:08 PM ---------- Previous update was at 09:06 PM ----------

This removes everything between "<" and ">" and changes the rest to lower case:

Code:
awk '/</,/>/ {next}; {print tolower($0)}' nasa-content.txt

---------- Post updated at 09:16 PM ---------- Previous update was at 09:08 PM ----------

I'm getting there:

Code:
awk '/</,/>/ {next}; {if (NF > 1) {print tolower($0)}}' nasa-content.xml
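
A note on the attempts above: the range pattern /</,/>/ skips whole lines, so any line that has text on the same line as its tags (for example the "SO GODDARD HAS SHUT DOWN..." line) is dropped completely, and the NF > 1 test also drops one-word lines such as "YOU.". A minimal sketch of an alternative is to delete the tags in place with gsub instead. The function name strip_tags is made up for illustration, and the sketch assumes that no tag is split across lines and that the "−" lines in the sample are viewer markers that should be skipped.

```shell
# Hypothetical helper: reads XML on stdin, prints the remaining text in lower case.
strip_tags() {
    awk '{
        gsub(/<[^>]*>/, "")               # delete each <...> tag, keep the text around it
        gsub(/^[ \t]+|[ \t]+$/, "")       # trim leading and trailing blanks
        if ($0 == "" || $0 == "−") next   # skip empty lines and the viewer marker lines
        print tolower($0)
    }'
}
# usage: strip_tags < nasa-content.xml
```

This keeps the text of lines where the tags and the text share a line, and it does not lose short lines, because it filters on empty content rather than on the field count.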

# 4  
Old 08-27-2011
If you have Ruby (1.9+):
Code:
$ ruby -ne 'print $_.gsub(/<.*?>/,"").downcase.capitalize.gsub(/[.?]\s*[a-z]/){|x|x.upcase} if not /^−/' file
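
Since the question asked for awk, here is a rough awk sketch of the same idea: strip the tags, lower-case the text, then upper-case the first letter after ".", "?" or "!". Unlike a per-line capitalize, it carries a flag from one line to the next, so a sentence that continues on the following line is not capitalised in the middle. The function name sentence_case is made up for illustration; the sketch assumes tags never span lines and that the "−" lines are viewer markers to skip. It knows nothing about proper nouns or the pronoun "I", so "GODDARD" comes out as "goddard" unless it starts a sentence.

```shell
# Hypothetical helper: reads XML on stdin, prints the text in sentence case.
sentence_case() {
    awk '
    BEGIN { cap = 1 }                     # capitalise the first letter of the input
    {
        gsub(/<[^>]*>/, "")               # delete tags in place
        gsub(/^[ \t]+|[ \t]+$/, "")       # trim leading and trailing blanks
        if ($0 == "" || $0 == "−") next   # skip empty lines and the viewer marker lines
        line = tolower($0)
        out = ""
        n = length(line)
        for (i = 1; i <= n; i++) {
            c = substr(line, i, 1)
            if (cap && c ~ /[a-z]/) { c = toupper(c); cap = 0 } else if (c ~ /[.?!]/) cap = 1
            out = out c
        }
        print out
    }'
}
# usage: sentence_case < nasa-content.xml
```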
