awk extract strings matching multiple patterns Post: 302863219

Sponsored Content

Top Forums Shell Programming and Scripting awk extract strings matching multiple patterns Post 302863219 by chrissycc on Sunday 13th of October 2013 05:13:18 AM

10-13-2013

Registered User

awk extract strings matching multiple patterns

Hi,

I wasn't quite sure how to title this one! Here goes:

I have some already partially parsed log files, which I now need to extract info from. Because of the way they are originally and the fact they have been partially processed already, I can't make any assumptions on the number of fields and the exact format etc. All I know is I can look for certain patterns. An extract of the original source is:

Code:

Job <1>, Job Name <BLAH>, Queue-- MEMLIMIT 10 G Fri Oct 11 09:55:48: Started on <cn035>, -- The CPU time is 12 seconds. MEM: 1 Gbytes; 
Job <2>, Job Name <BLAH>, Queue-- MEMLIMIT 10 G Fri Oct 11 09:55:48: Started on <cn069>, -- The CPU time is 10 seconds. MEM: 1 Gbytes; 
Job <3>, Job Name <BLAH>,  MEMLIMIT 10 G Fri Oct 11 09:55:48: Started on <cn049>, ;-- The CPU time is 13 seconds. MEM: 2 Gbytes; 
Job <4>, Job Name <BLAH>,  Status <RUN>,  Command <-- The CPU time is 76 seconds. MEM: 3 Gbytes; 
Job <7>, Job Name <BLAH>,  Stat us <RUN>,  Command <-- The CPU time is 49 seconds. MEM: 1014 Mbytes; 
Job <8>, Job Name <BLAH> , Status <RUN>, -- MEMLIMIT 10 G Fri Oct 11 22:13:19: Started on <cn014>;-- The CPU time is 12 seconds. MEM: 391 Mbytes; 
Job <9>, Job Name <BLAH>,  Status <RUN >,  Command <: Started on <cn026>,-- The CPU time is 71 seconds. MEM: 13 Mbytes; 
Job <10>, Job Name <BLAH>,  Sta tus <RUN>,  Command <#!/bi-- MEMLIMIT 22 G  Started on <cn064>, -- The CPU time is 25 seconds. MEM: 12 Gbytes;

I want to extract based on:

Code:

Started on <____>,
MEMLIMIT __ G
MEM: ___ bytes;

The first line example being:

Code:

MEMLIMIT 10 G Fri Oct 11 09:55:48: Started on <cn035>, -- The CPU time is 12 seconds. MEM: 1 Gbytes;

Each line may contain all, some or none of the above. My ideal output based on the above would be something like:

Code:

Started: cn035 MEMLIMIT: 10 G MEM: 1 G
Started: cn069 MEMLIMIT: 10 G MEM: 1 G 
etc
etc

(ideally, if there is no MEMLIMIT found on a line for example):
Started: cn026 MEMLIMIT: 0 G MEM: 13 M

I've messed around with gsub in awk to extract a single instance but couldn't work out how to select on multiple patterns...

Any help as always would be appreciated!

Last edited by Scrutinizer; 10-13-2013 at 06:38 AM.. Reason: additional code tags

chrissycc

View Public Profile for chrissycc

Find all posts by chrissycc

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

AWK: matching patterns in 2 different files

In a directory, there are two different file extensions (*.txt and *.xyz) having similar names of numerical strings (*). The (*.txt) contains 5000 multiple files and the (*.xyz) also contains 5000 multiple files. Each of the files has around 4000 rows and 8 columns, with several unique string...

2. Shell Programming and Scripting

matching patterns inside a condition in awk

I have the following in an awk script. I want to do them on condition that: fext == "xt" FNR == NR { />/ && idx = ++i $2 || val = $1 next } FNR in idx { v = val] } { !/>/ && srdist = abs($1 - v) } />/ || NF == 2 && srdist < dsrmx {...

3. UNIX for Dummies Questions & Answers

Search and extract matching patterns

%%%%%

4. Shell Programming and Scripting

awk? extract quoted "" strings from multiple lines.

I am trying to extract multiple strings from snmp-mib files like below. ----- $ cat IF-MIB.mib <snip> linkDown NOTIFICATION-TYPE OBJECTS { ifIndex, ifAdminStatus, ifOperStatus } STATUS current DESCRIPTION "A linkDown trap signifies that the SNMP entity, acting in...

5. UNIX for Dummies Questions & Answers

[SOLVED] awk: matching degenerate patterns

Hi Folks, I have two arrays a: aaa bbb ccc ddd ddd aaa bbb ccc ddd ccc aaa bbb b: aaa bbb ccc aaa ccc bbb bbb aaa ccc ccc bbb aaa I want to compare row by row a(c1:c4) to b(c1:c3). If elements of 'b' match...

6. Shell Programming and Scripting

Extract multiple occurance of strings between 2 patterns

I need to extract multiple occurance strings between 2 different patterns in given line. For e.g. in below as input ------------------------------------------------------------------------------------- mike(hussey) AND mike(donald) AND mike(ryan) AND mike(johnson)...

7. Shell Programming and Scripting

Extract multiple strings from line

Hello I have an output that has a string between quotes and another between square brackets on the same line. I need to extract these 2 strings Example line Device "nrst3a" attributes=(0x4) RAW SERIAL_NUMBER=SNL2 Output should look like nrst3a VD073AV1443BVW00083 I was trying with sed...

8. Shell Programming and Scripting

Find files not matching multiple patterns and then delete anything older than 10 days

Hi, I have multiple files in my log folder. e.g: a_m1.log b_1.log c_1.log d_1.log b_2.log c_2.log d_2.log e_m1.log a_m2.log e_m2.log I need to keep latest 10 instances of each file. I can write multiple find commands but looking if it is possible in one line. m file are monthly...

9. UNIX for Beginners Questions & Answers

How to extract the partial matching strings among two files?

I have a two file as shown below, file:1 >Contig_152_415 (REVERSE SENSE) >Contig_152_420 (REVERSE SENSE) >Contig_152_472 (REVERSE SENSE) >Contig_152_484 (REVERSE SENSE) File:2 >Contig_152:49081-49929 ATCGAGCAGCGCCGCGTGCGGTGCACCCTTGTGCAGATCGGGAGTAACCACGCGCACGGC...

10. UNIX for Beginners Questions & Answers

Match patterns between two files and extract certain range of strings

Hi, I need help to match patterns from between two different files and extract region of strings. inputfile1.fa >l-WR24-1:1 GCCGGCGTCGCGGTTGCTCGCGCTCTGGGCGCTGGCGGCTGTGGCTCTACCCGGCTCCGG GGCGGAGGGCGACGGCGGGTGGTGAGCGGCCCGGGAGGGGCCGGGCGGTGGGGTCACGTG...

LEARN ABOUT PHP

qstat

QSTAT(1)						User Contributed Perl Documentation						  QSTAT(1)

NAME

       qstat - display job/partition information in a familiar pbs format

SYNOPSIS

       qstat [-f] [-a|-i|-r] [-n [-1]] [-G|-M] [-u user_list] [-? | --help] [--man] [job_id...]

       qstat -Q [-f]

       qstat -q

DESCRIPTION

       The qstat command displays information about jobs.

OPTIONS

       -a  Displays all jobs in a single-line format. See the STANDARD OUTPUT section for format details.

       -i  Displays information about idle jobs. This includes jobs which are queued or held.

       -f  Displays the full information for each selected job in a multi-line format. See the STANDARD OUTPUT section for format details.

       -G  Display size information in gigabytes.

       -M  Show size information, disk or memory in mega-words.  A word is considered to be 8 bytes.

       -n  Displays nodes allocated to a job in addition to the basic information.

       -1  In combination with -n, the -1 option puts all of the nodes on the same line as the job id.

       -r  Displays information about running jobs. This includes jobs which are running or suspended.

       -u user_list
	   Display job information for all jobs owned by the specified user(s). The format of user_list is: user_name[,user_name...].

       -? | --help
	   brief help message

       --man
	   full documentation

STANDARD OUTPUT

       Displaying Job Status

       If the -a, -i, -f, -r, -u, -n, -G, and -M options are not specified, the brief single-line display format is used. The following items are
       displayed on a single line, in the specified order, separated by white space:

       the job id
       the job name
       the job owner
       the cpu time used
       the job state
	   C -	Job is completed after having run E -  Job is exiting after having run.  H -  Job is held.  Q -  job is queued, eligible to run or
	   routed.  R -  job is running.  T -  job is being moved to new location.  W -  job is waiting for its execution time (-a option) to be
	   reached.  S -  job is suspended.

       the queue that the job is in

       If the -f option is specified, the multi-line display format is used. The output for each job consists of the header line: Job Id:  job
       identifier followed by one line per job attribute of the form: attribute_name = value

       If any of the options -a, -i, -r, -u, -n, -G or -M are specified, the normal single-line display format is used. The following items are
       displayed on a single line, in the specified order, separated by white space:

       the job id
       the job owner
       the queue the job is in
       the job name
       the session id (if the job is running)
       the number of nodes requested by the job
       the number of cpus or tasks requested by the job
       the amount of memory requested by the job
       either the cpu time, if specified, or wall time requested  by the job, (in hh:mm)
       the job state
       The amount of cpu time or wall time used by the job (in hh:mm)

EXIT STATUS

       On success, qstat will exit with a value of zero. On failure, qstat will exit with a value greater than zero.

perl v5.14.2							    2012-04-10								  QSTAT(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

AWK: matching patterns in 2 different files

Discussion started by: asanjuan

2. Shell Programming and Scripting

matching patterns inside a condition in awk

Discussion started by: kristinu

3. UNIX for Dummies Questions & Answers

Search and extract matching patterns

Discussion started by: lucasvs

4. Shell Programming and Scripting

awk? extract quoted "" strings from multiple lines.

Discussion started by: genzo

5. UNIX for Dummies Questions & Answers

[SOLVED] awk: matching degenerate patterns

Discussion started by: heecha

6. Shell Programming and Scripting

Extract multiple occurance of strings between 2 patterns

Discussion started by: sameermohite

7. Shell Programming and Scripting

Extract multiple strings from line

Discussion started by: bombcan

8. Shell Programming and Scripting

Find files not matching multiple patterns and then delete anything older than 10 days

Discussion started by: wahi80

9. UNIX for Beginners Questions & Answers

How to extract the partial matching strings among two files?

Discussion started by: dineshkumarsrk

10. UNIX for Beginners Questions & Answers

Match patterns between two files and extract certain range of strings

Discussion started by: bunny_merah19

LEARN ABOUT PHP

qstat