I have many occurances of the following headers in a file. I need to grep for the word changed/inserted in the header, calculate the difference between the two numbers and list the count incrementally.
Headers in a file look like this:
----------[changed to 229-232]---------
-----------[inserted 294-296]----------
------------[changed to 861]-----------
I should grep for the word changed, also grep the numbers listed 229 and 232 and find their difference. The difference in the number of lines is the output. Similarly grep for inserted and the difference in the numbers. If the header contains just one number increment by one.
Output should be like:
Number of lines changed :4 (here 3 from first header and one from last header matching the word changed)
Number of lines inserted :2
Thanks for your reply!
This header simply means that only one line has been changed so i just need to increment by one for pattern in changed.
------------[changed to 861]-----------
$
$ cat f1
----------[changed to 229-232]---------
blah blah
some stuff here
-----------[inserted 294-296]----------
and some more here
and more
------------[changed to 861]-----------
and yet some more...
not the last line.
----------[changed to 679-689]---------
line 11
-----------[inserted 297]----------
and this is line 13
----------[changed to 789-795]---------
line 15
-----------[inserted 299]----------
this is the last line.
$
$
$ ##
$ perl -lne 'if (/\[changed to (\d+)]/) {$chgcnt++}
> elsif (/\[changed to ([\d-]+)]/) {$chgcnt += abs(eval($1))}
> elsif (/\[inserted (\d+)]/) {$inscnt++}
> elsif (/\[inserted ([\d-]+)]/) {$inscnt += abs(eval($1))}
> END {print "Number of lines changed : $chgcnt";
> print "Number of lines inserted: $inscnt"}' f1
Number of lines changed : 20
Number of lines inserted: 4
$
$
A tad shorter:
Code:
$
$ ##
$ perl -lne 'if (/\[changed to ([\d-]+)]/) {$chgcnt += index($1,"-") == -1 ? 1 : abs(eval($1))}
> elsif (/\[inserted ([\d-]+)]/) {$inscnt += index($1,"-") == -1 ? 1 : abs(eval($1))}
> END {print "Number of lines changed : $chgcnt";
> print "Number of lines inserted: $inscnt"}' f1
Number of lines changed : 20
Number of lines inserted: 4
$
$
tyler_durden
Last edited by durden_tyler; 09-29-2009 at 02:10 PM..
I do like the way that durden_tyler's works.
Some one better at perl than I might be able
to mash each line of the file down to the
three most needed values (changed or inserted
plus the one or two line values) as I have, in
fewer key strokes.
Code:
#!/bin/perl
open(FH, "/path/to_your/file") or die "Can't open file: $!";
%result = ();
while (<FH>) {
chomp;
s/\bto\b//;
tr/-\[\] / /s;
s/\s$//;
$cnt = ($action, $val1, $val2) = split (' ');
$result{$action}++ if $cnt == 2;
$result{$action} += ($val2 - $val1) + 1 if $cnt == 3;
}
foreach $action ( keys %result ) {
print "$action: $result{$action}\n";
}
close (FH)
Also, forgot to mention, and I may be wrong, as you certainly know
your data better than I do, but if you count the changed or inserted
lines including and between 229-232 that is 4 lines, not 3, so you'll
need to add one to the result to be accurate.
Like I said, you know your data better than I do, so I could be wrong
on you needing this, but I thought I'd point that out, just in case. ;-)
I am using Perl version 5.8.4 and trying to understand the use of regular expression. Following is my code and output.
$string = "Perl is a\nScripting language";
($start) = ($string =~ /\A(.*?) /);
@lines = ($string =~ /^(.*?) /gm);
print "First Word (using \\A): $start\n","Line... (4 Replies)
Hi,
I have a list of IP, eg :
192.168.0.15
192.168.0.24
192.168.2.110
192.168.2.200
And I would like the shortest pattern who match with '192.168.0' and '192.168.2' (without the last dot and number). (7 Replies)
My log file looks as given below, its actually a huge file around 1 GB and these are some of the line:
conn=5368758 op=10628050 msgId=64 - RESULT err=0 tag=101 nentries=1 etime=0
conn=7462122 op=-1 msgId=-1 - fd=247 slot=247 LDAPS connection from 10.13.18.12:37645 to 10.18.6.45
conn=7462122... (5 Replies)
I have a 2 files in .gz format and it consists of 5 million lines the format of the file would be
gzcat file1.gz | more
abcde
aerere
ffgh56
..
..
12345
gzcat file2.gz | more
abcde , 12345 , 67890,
ffgh56 , 45623 ,12334
whatever the string is in the file1 should be matched... (3 Replies)
I am doing a file patterhn matching for a text file in PERL
I am using this,,, but it says that no file is found
$filepattern = '\d{1,4}.*A0NW9693.NDM.HBIDT.*.AD34XADJ.txt';
Can anyone help me out with Perl Pattern Matching concepts and how to do pattern matching for this txt file:... (4 Replies)
Hello experts,
I have a file containing the following text(shortened here).
File Begin
----------
< # Billboard.d3fc1302a677.imagePath=S:\\efcm_T4
< Billboard.d3fc1302a677.imagePath=S:\\efcm_T4
---
> # Billboard.d3fc1302a677.imagePath=S:\\efcm_Cassini
>... (2 Replies)
Hi,
I got doubt in Pattern matching, could you tell me how the following differs in action ??
if ( $line1==/$line2/ )
if ( $line1=~/$line2/ )
if ( $line1=~m/$line2/)
What is the significance of '~' in matching.
Thanks in advance
CoolBhai (5 Replies)
Hello
I got the below one from in one of this forums
For Ex: Loading File System Networking in nature
now i need to extract the patterns between the words File and Networking :
i.e. sample output: System
cmd used : cat <file> | sed 's/.*File //' | sed 's/Closing.*$//'
Actually... (0 Replies)
hi i am trying to get digits inside brackes from file , whose structure is defined below
CREATE TABLE TELM
(SOC_NO CHAR (3) NOT NULL,
TXN_AMOUNT NUMBER (17,3)
SIGN_ON_TIME CHAR (8)
TELLER_APP_LIMIT NUMBER (17,3)
FIL01 ... (2 Replies)
i can only find the first occurance of a pattern how do i set it to loop untill all occurances have changed.
#! /usr/bin/perl
use POSIX;
open (DFH_FILE, "./dfh") or die "Can not read file ($!)";
foreach (<DFH_FILE>) {
if ($_ !~ /^#|^$/) {
chomp;
... (1 Reply)