Parsing a large log


 
# 8  
Old 05-30-2008
The fastest solution would be to write a program in a compiled language such as C.

With awk you can do something like this:
Code:
awk -v from="$(date --date=yesterday +'%D')" \
    -v   to="$(date +'%D')" '
$1 == from {              # yesterday: keep entries from 13:00 on
   if (int($2) >= 13)
      print;
   next;
}
$1 == to  {               # today: keep entries before 13:00
   if (int($2) < 13) {
      print;
      next;
   } else
      exit;               # past the window; stop reading
}
' inputfile

Input file:
Code:
05/29/08 01:56:53 nsrexecd: select() error: Invalid argument
05/29/08 01:56:53 nsrexecd: select() error: Invalid argument
05/29/08 01:56:53 nsrexecd: select() error: Invalid argument
05/29/08 12:59:50 not selected
05/29/08 13:00:00 selected 1
05/29/08 23:59:59 selected 2
05/30/08 00:00:01 selected 3
05/30/08 12:59:59 selected 4
05/30/08 13:00:00 not selected
06/01/08 00:00:01 not selected

Output (current date is 05/30/08):
Code:
05/29/08 13:00:00 selected 1
05/29/08 23:59:59 selected 2
05/30/08 00:00:01 selected 3
05/30/08 12:59:59 selected 4

Jean-Pierre.
# 9  
Old 06-02-2008
Thanks a lot.
But my problem is that my log is large: 300-400 MB.
I am unable to use awk, sed, grep, etc.
I need a solution in Perl or shell for parsing the log for the current date (24 hours)
and then searching for the string.
# 10  
Old 06-02-2008
None of the tools you mentioned are sensitive to the file size. Other things being equal, they read the file one line at a time and print each line if certain conditions are met. (Of course you can write an awk or sed script which consumes memory for every line, but in this case I don't think you need to.)
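To illustrate, here is a minimal sketch with a made-up two-line sample in the thread's log format (substitute your real log path):

```shell
# Streaming tools hold only the current line in memory, so file
# size does not matter. Demo on a tiny sample:
cat > /tmp/sample.log <<'EOF'
05/29/08 23:59:59 old entry
05/30/08 00:00:01 new entry
EOF

# awk prints matching lines as it reads; a 400 MB file is handled
# the same way, just more slowly.
awk '$1 == "05/30/08"' /tmp/sample.log
```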
# 11  
Old 06-03-2008
Quote:
Originally Posted by era
None of the tools you mentioned are sensitive to the file size. Other things being equal, they read the file one line at a time and print each line if certain conditions are met. (Of course you can write an awk or sed script which consumes memory for every line, but in this case I don't think you need to.)
Please help!
But my problem is that my log is large: 300-400 MB.
I am unable to use awk, sed, grep, etc.
I need a solution in Perl or shell for parsing the log for the current date (24 hours)
and then searching for the string.
# 12  
Old 06-03-2008
I'm sorry, no offense, but I cannot type this any more slowly: grep and sed and awk do not care what size the file is. They only read it one line at a time, just like cat.

Perl is unlikely to be any faster than grep. Here is a Perl script anyway.

Code:
perl -ne 'print if m!^05/(29/08 (1[3-9]|2[0-3])|30/08 (0|1[0-2]))!' file

Notice the similarity to the egrep solution I posted before. This one is probably going to be slower, and in any event will not be much faster.

Please answer the following questions:
  • What have you tried?
  • Have you tried the solutions various people have posted to this thread?
  • How long did it take to complete?
  • How long would you like it to take?
  • How quickly can you simply cat the file?
  • If you extract just one day's worth from the file, how long does that take to cat?
# 13  
Old 06-03-2008
Quote:
Originally Posted by era
I'm sorry, no offense, but I cannot type this any more slowly: grep and sed and awk do not care what size the file is. They only read it one line at a time, just like cat.

Perl is unlikely to be any faster than grep. Here is a Perl script anyway.

Code:
perl -ne 'print if m!^05/(29/08 (1[3-9]|2[0-3])|30/08 (0|1[0-2]))!' file

Notice the similarity to the egrep solution I posted before. This one is probably going to be slower, and in any event will not be much faster.

Please answer the following questions:
  • What have you tried?
  • Have you tried the solutions various people have posted to this thread?
  • How long did it take to complete?
  • How long would you like it to take?
  • How quickly can you simply cat the file?
  • If you extract just one day's worth from the file, how long does that take to cat?

What have you tried? -- I have tried "cat file | /bin/awk '$1 ~ /^$date/'"
Have you tried the solutions various people have posted to this thread? -- Yes, but as I have mentioned, even a simple cat is timing out.
How long did it take to complete? -- More than 5 minutes; I quit before it completed.
How long would you like it to take? -- A normal time, as it takes for cat or grep.
How quickly can you simply cat the file? -- I am unable to cat the file; it is not opening at all.
If you extract just one day's worth from the file, how long does that take to cat? -- I am unable to extract with awk or grep; I am only able to use the tail and head commands.
As mentioned earlier in the thread about using chunks of the file: I am unable to create the logic for chunking and finding the last 24 hours of the log.
# 14  
Old 06-03-2008
Quote:
Originally Posted by asth
What have you tried? --I have tried "cat file |/bin/awk '$1 ~ /^$date/'"
The cat is useless; simply run awk '$1 ~ /^06\/01\//' file
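A sketch of the same command with the date parameterised, so the one-liner works on any day (the sample log and path below are made up for illustration):

```shell
# Build a tiny sample log (stand-in for the real 300-400 MB file).
cat > /tmp/big.log <<'EOF'
05/31/08 23:59:59 yesterday
06/01/08 08:15:00 today
EOF

# Pass the date in with -v instead of hard-coding it; a string
# comparison on field 1 avoids having to escape the slashes.
d="06/01/08"                  # in practice: d=$(date +'%D')
awk -v d="$d" '$1 == d' /tmp/big.log
```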

Quote:
Have you tried the solutions various people have posted to this thread? -- Yes, but as I have mentioned, even a simple cat is timing out.
You have not mentioned this very explicitly. I think there may be an unrelated problem here.

Quote:
How long did it take to complete? -- More than 5 minutes; I quit before it completed.
How long would you like it to take? -- A normal time, as it takes for cat or grep.
How quickly can you simply cat the file? -- I am unable to cat the file; it is not opening at all.
If you extract just one day's worth from the file, how long does that take to cat? -- I am unable to extract with awk or grep; I am only able to use the tail and head commands.
So if you, say, run tail -n 10000 file | grep '^06/01/' do you get roughly what you want? How long does it take? Too long still?
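To check whether raw I/O itself is the bottleneck, one could time each stage separately (sample file and path below are hypothetical stand-ins for the real log):

```shell
# Build a small stand-in log (the real file would be 300-400 MB).
cat > /tmp/timing.log <<'EOF'
05/31/08 23:59:59 yesterday
06/01/08 08:15:00 today
EOF

# Compare raw read speed against the filtered pipeline; if the two
# times are close, grep adds almost nothing over plain reading.
time tail -n 10000 /tmp/timing.log > /dev/null
time tail -n 10000 /tmp/timing.log | grep '^06/01/' > /dev/null
```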

Last edited by era; 06-03-2008 at 09:43 AM.. Reason: Minor edit of regexes