--Parsing out strings for repeating delimiters for everyline


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting --Parsing out strings for repeating delimiters for everyline
# 1  
Old 10-23-2019
--Parsing out strings for repeating delimiters for everyline

Hello:

I have some text output, on SunOS 5.11 platform using KSH:

Quote:
QREMOTE(BOS.FOS.T.CDG.MEDGTOC.01) CLUSTER(CLENTT1) DEFBIND(NOTFIXED) DEFPSIST(YES) DESCR(Cargo dangerous goods queue) RQMNAME(CLCRGT1) RNAME(BOS.FOS.T.CDG.MEDGTOC.01) XMITQ( )
QREMOTE(CLFOST1) CLUSTER(CLENTT1) DEFBIND(NOTFIXED) DEFPSIST(YES) DESCR(Qmgr Alias for CLFOST1 Cluster) XMITQ( )
QREMOTE(RPT.PSS.T.VILS.ODY.01) CLUSTER(CLENTT1) DEFPSIST(YES) DESCR(PSS MSGS TO VILS ODYSSEY) RQMNAME(CLITAT1) RNAME(RPT.PSS.T.VILS.ODY.01) XMITQ( )
I am trying to parse out each string within the () for each line.

I tried, as example:
Code:
perl -lanF"[()']" -e 'print "$F[1] $F[2] $F[3] $F[4] $F[5] $F[6]"'

But for some reason, the output gets all garbled after the the first fields.
Guess I can try the following but it is very messy, as I would have to do that for each Descriptor before the first (.
The number of fields can change dynamically..

Another example:

Code:
cat $FILE | |nawk -FDESCR '{print $2}'| perl -lanF"[()]" -e 'print $F[1]'

So the desired output woul be:

Quote:
BOS.FOS.T.CDG.MEDGTOC.01 CLENTT1 NOTFIXED) YES Cargo dangerous goods queue CLCRGT1 BOS.FOS.T.CDG.MEDGTOC.01
CLFOST1 CLENTT1 NOTFIXED YES Qmgr Alias for CLFOST1 Cluster
RPT.PSS.T.VILS.ODY.01 CLENTT1 YES PSS MSGS TO VILS ODYSSEY CLITAT1 RPT.PSS.T.VILS.ODY.01
Not sure what else I can try.

Thanking you for any advice !!
# 2  
Old 10-23-2019
Here is an approach using nawk by checking each character:-
Code:
nawk '
        {
                for ( i = 1; i <= length; i++ )
                {
                        c = substr($0,i,1)

                        if ( c == "(" )
                                flag = 1

                        if ( flag && c != "(" && c != ")" )
                                printf c

                        if ( c == ")" )
                                flag = 0
                }
                printf "\n"
        }
' file

This User Gave Thanks to Yoda For This Post:
# 3  
Old 10-23-2019
Why switch between perl and awk? Stick to one:
Code:
awk -F"[()]" '{for (i=2; i<=NF; i+=2) printf "%s ", $i; printf RS}' file
BOS.FOS.T.CDG.MEDGTOC.01 CLENTT1 NOTFIXED YES Cargo dangerous goods queue CLCRGT1 BOS.FOS.T.CDG.MEDGTOC.01   
CLFOST1 CLENTT1 NOTFIXED YES Qmgr Alias for CLFOST1 Cluster   
RPT.PSS.T.VILS.ODY.01 CLENTT1 YES PSS MSGS TO VILS ODYSSEY CLITAT1 RPT.PSS.T.VILS.ODY.01

AND:

Quote:
Originally Posted by Don Cragun
If you are using a Solaris/SunOS system, use /usr/xpg4/bin/awk or nawk instead of awk.



EDIT: or
Code:
sed 's/^[^(]*(\|)[^(]*(\|[^)]*) *$/ /g' file
 BOS.FOS.T.CDG.MEDGTOC.01 CLENTT1 NOTFIXED YES Cargo dangerous goods queue CLCRGT1 BOS.FOS.T.CDG.MEDGTOC.01  
 CLFOST1 CLENTT1 NOTFIXED YES Qmgr Alias for CLFOST1 Cluster  
 RPT.PSS.T.VILS.ODY.01 CLENTT1 YES PSS MSGS TO VILS ODYSSEY CLITAT1 RPT.PSS.T.VILS.ODY.01


Last edited by RudiC; 10-23-2019 at 07:08 PM..
This User Gave Thanks to RudiC For This Post:
# 4  
Old 10-24-2019
Thats working ..

Thank you !!
# 5  
Old 10-24-2019
one more question:

How to get only the header strings:

From:
Quote:
QREMOTE(BOS.FOS.T.CDG.MEDGTOC.01) CLUSTER(CLENTT1) DEFBIND(NOTFIXED) DEFPSIST(YES) DESCR(Cargo dangerous goods queue) RQMNAME(CLCRGT1) RNAME(BOS.FOS.T.CDG.MEDGTOC.01) XMITQ( )
to:
Quote:
QREMOTE CLUSTER DEFBIND DEFPSIST DESCR RQMNAME RNAME XMITQ
Tried this but not working::
Code:
nawk -F"[)(]" '{for (i=2; i<=NF; i+=2) printf "%s ", $i; printf RS}' file

Thnx again !!
# 6  
Old 10-24-2019
Quote:
Originally Posted by gilgamesh
...
Tried this but not working::
Of course not - that was meant to extract the strings within parentheses, and you shouldn't expect it to do the opposite. You need to modify it slightly:
Code:
awk -F"[()]" '{for (i=1; i<=NF; i+=2) printf "%s ", $i; printf RS}' file
QREMOTE  CLUSTER  DEFBIND  DEFPSIST  DESCR  RQMNAME  RNAME  XMITQ

# 7  
Old 10-24-2019
Gonna try to understand the syntax better.

Thank you ..
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Programming

Segfault When Parsing Delimiters In C

Another project, another bump in the road and another chance to learn. I've been trying to open gzipped files and parse data from them and hit a snag. I have data in gzips with a place followed by an ip or ip range sort of like this: Some place:x.x.x.x-x.x.x.x I was able to modify some code... (6 Replies)
Discussion started by: Azrael
6 Replies

2. Shell Programming and Scripting

Script to rename the repeating strings

All, I have a sample text like below. Key (Header) Key1 ABC Key2 ABC Key3 ABC ABC Key4 ABC Key5 ABC ABC ABC Required Output Key (Header) Key1 (2 Replies)
Discussion started by: ks_reddy
2 Replies

3. Shell Programming and Scripting

How to append server name to everyline?

I am executing df -mP to see the disk utilization. I would like to append servername also to each and every line. df -mP | awk '{ print $1","$2","$3","$4","$5","$6 }' trying to add something like this df -mP | awk '{ print $1","$2","$3","$4","$5","$6","$hostname }' ... (1 Reply)
Discussion started by: lazydev
1 Replies

4. UNIX for Dummies Questions & Answers

Adding variables to repeating strings

Hello, I want to add a letter to the end of a string if it repeats in a column. so if I have a file like this: DOG001 DOG0023 DOG004 DOG001 DOG0023 DOG001 the output should look like this: DOG001-a DOG0023-a DOG004 DOG001-b (15 Replies)
Discussion started by: verse123
15 Replies

5. Shell Programming and Scripting

Extract strings within XML file between different delimiters

Good afternoon! I have an XML file from which I want to extract only certain elements contained within each line. The problem is that the format of each line is not exactly the same (though similiar). For example, oa_var will be in each line, however, there may be no value or other... (3 Replies)
Discussion started by: bab@faa
3 Replies

6. Shell Programming and Scripting

Parsing Strings

Hello All, I am new to shell scripting and programming. I am looking for a guide on how I can parse specific information from a plain text file with thousands of lines. Specifically I need to parse an email address from each line. The line looks something like this:... (9 Replies)
Discussion started by: solvdsystems
9 Replies

7. Shell Programming and Scripting

Awk new datetime everyline

Hi, I'm using awk in HP-UX machine which does not support systime(), strftime(). So to get the date time I was using : seq 1 100000 | awk ' "date +%Y%m%d%H%M%s" | getline curtime; print curtime }' However the above code gets the date only once, next time it is not updated. For... (2 Replies)
Discussion started by: Random_Net
2 Replies

8. Shell Programming and Scripting

Parsing file to match strings

I have a file with the following format 12g data/datasets/cct 8g data/dataset/cct 10 g data/two 5g data/something_different 10g something_different 5g data/two is there a way to loop through this... (1 Reply)
Discussion started by: yawalias
1 Replies

9. Shell Programming and Scripting

cut columns in everyline

Is there a betterway to cut certain columns in everyline based on positions. Basically, I have a largefile and eachline is of 1000 characters and I need to cut the characters 17-30, 750-775, 776-779, 780-805 while do fptr=`cat $tempfile | head -$i | tail -1` ... (4 Replies)
Discussion started by: gunaah
4 Replies

10. UNIX for Dummies Questions & Answers

parsing with multible delimiters

I have data that looks like this aaa!bbb!ccc/ddd/eee It is not fixed format. I need to parse ddd into a var in order to decide if I want to process that row. If I do I need to put ccc and bbb into vars to process it. I need to do this during a while loop one record at a time. Any... (11 Replies)
Discussion started by: gillbates
11 Replies
Login or Register to Ask a Question