Please help on "if" statement.

Thread Tools Search this Thread
# 1  
Help on Loops to Grep logs


Thanks RudiC for bringing up the order of the timestamp. Yes it does matter. Please help.

Current Setup:
Logs were rotated every 4hrs and they were compressed. If the logs exceed 10gb, it will be compressed (.Z), if not script will just zipped the logs (.gz). So archive directory may sometime contains *.gz and *.Z files, and there are times that it only contains *.gz files

Help on Loops to Grep logs per domain, in a single text file, regardless if logs are in *gz or *Z.

domainlist="4prd 5prd 6prd 7prd 8prd"

expected output will be:


I try to create one, but having problem on if statement.
domainlist="4prd 5prd 6prd 7prd 8prd"
for domain in $domainlist
           if [ ? ] ; then
                gzgrep "$domain" $dir_arch/logs/*gz >> $dir_arch/logs/$domain_$myhost.txt    << for zip files (e.q log.gz)
                 gzip -f $dir_arch/logs/$domain_$myhost.txt
                zcat $dir_arch/logs/*Z | grep "$domain" >> $dir_arch/logs/$domain_$myhost.txt << for compress files(e.q log.Z)
                gzip -f $dir_arch/logs/$domain_$myhost.txt

Kindly help. Thanks!


Last edited by chococrunch6; 01-21-2013 at 01:52 PM.. Reason: stating clearer problem.
# 2  
Not sure I understand your problem nor your code snippet.
Let's assume you have a directory $dir_arch/logs containing .gz and .Z files, all of which you want to search for 5 items in domainlist. The order of the resulting output does not seem to matter, does it? So sth. like this should do the job for you:
#               ^--- "or" in EREs 
gzgrep -E "$domainlist" $dir_arch/logs/*.gz >> $dir_arch/logs/$domain_$myhost.txt
#       ^--- tell grep to use ERE        ^--- run this on all gzipped files
zcat $dir_arch/logs/*.Z | grep -E "$domainlist" >> $dir_arch/logs/$domain_$myhost.txt
#                     ^--- run this on all compressed files
gzip -f $dir_arch/logs/$domain_$myhost.txt

# 3  
Hi RudiC,

Thanks for your reply, i rephrase the problem above, i hope it is clear this time. Kindly help.
# 4  
Let me paraphrase your problem: You have in one single directory $dir_arch/logs a bunch of zipped/compressed logfiles, which you want to scan line by line for a set of domains, outputting each line to the respective $domain_$myhost.txt file, concatenating all logfiles' entries into single respective domain.txt file.
I still don't see how the TIMESTAMP you refer to is reflected in your code snippet.

I rate it unwise to gzgrep/zcat the files several times for the different domains for performance reasons. (g)unzip once to a tmp dir, order according to timestamp required, and run sth like an awk script over all files that distributes the lines to the respective files.
# 5  
thanks for clarification RudiC.

Actually there are only 6 log files which covers the whole day of logs( sometimes a combination of *gz and *Z files or all *gz files or all *Z files depending on the log size upon rotation).
Logs will be huge when (g)unzip, im afraid of having an FS issue by doing so i decided to use zcat | grep and gzgrep. Im sorry but im such a newbie and not so familiar with
"awk" command as you recommended.

Example logs on the archive:

        364M Jan 10 00:02 log.2013-Jan-10.00-00-50.Z
         75M Jan 10 04:00 log.2013-Jan-10.04-00-23.gz
         98M Jan 10 08:01 log.2013-Jan-10.08-00-32.gz
       174M Jan 10 12:02 log.2013-Jan-10.12-01-08.gz
        176M Jan 10 16:02 log.2013-Jan-10.16-01-23.gz
        354M Jan 10 20:02 log.2013-Jan-10.20-01-23.Z

        373M Jan 11 00:02 log.2013-Jan-11.00-00-53.Z
         83M Jan 11 04:01 log.2013-Jan-11.04-00-26.gz
        100M Jan 11 08:02 log.2013-Jan-11.08-00-31.gz
        344M Jan 11 12:02 log.2013-Jan-11.12-01-07.Z
        340M Jan 11 16:02 log.2013-Jan-11.16-01-23.Z
        362M Jan 11 20:02 log.2013-Jan-11.20-01-27.Z

        371M Jan 18 00:02 log.2013-Jan-18.00-00-52.Z
        91M Jan 18 04:01 log.2013-Jan-18.04-00-27.gz
        119M Jan 18 08:01 log.2013-Jan-18.08-00-31.gz
        154M Jan 18 12:02 log.2013-Jan-18.12-01-18.gz
         87M Jan 18 16:02 log.2013-Jan-18.16-01-40.gz
        105M Jan 18 20:02 log.2013-Jan-18.20-01-07.gz

The scripts will look like this:
date=`date +"%Y-%h-%d"`        
domainlist="4prd 5prd 6prd 7prd 8prd"
for domain in $domainlist
     if [ -f /archive/$date/* ] ; then
                    # ^--- check if logs are present in the archive dir
        ls -l /archive/$date/* | grep Z
                    # ^--- check for *Z files
            if [ $? -eq 0 ] ; then
                zcat /archive/$date/*Z | grep "$domain" >> /archive/$date/$domain.txt
                    # ^--- grep "domain" for *Z files
                gzgrep "$domain" /archive/$date/*gz >> /archive/$date/$domain.txt
                    # ^--- grep "domain" for *gz files
                gzip -f /archive/$date/$domain.txt 
                  # ^--- archive has *Z files, perform the ff commands above, but the TIMESTAMP on the output file may sometimes won't be in order.
                gzgrep "$domain" /archive/$date/*gz >> /archive/$date/$domain.txt
                gzip -f /archive/$date/$domain.txt
                # ^--- archive has no *Z files
        echo ">>Logs not found, kindly check archive directory.."

Kindly let me know if you have better solution for this. I would really appreciate your response. Thanks!

# 6  
No matter what you do, disk space will come into game. zcat and gzgrep will need space to uncompress, probably temporary files somewhere on disk, or swap file space. So - find a disk that can accomodate your huge files and uncompress there to, if possible maintining file time stamps. Then run sth like
$ domainlist="4prd|5prd|6prd|7prd|8prd"
$ awk 'match($0, D){ print $0 > substr ($0,RSTART,RLENGTH)}' D=$domainlist *

If * does not supply your files in the correct order, try to rename the files so they show up as needed in e.g. ls. The pipes in domainlist are mandatory for the regex to work!
# 7  
can you please explain what does this awk code do?

$ awk 'match($0, D){ print $0 > substr ($0,RSTART,RLENGTH)}' D=$domainlist *


Previous Thread | Next Thread
Thread Tools Search this Thread
Search this Thread:
Advanced Search

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Bash script - Print an ascii file using specific font "Latin Modern Mono 12" "regular" "9"

Hello. System : opensuse leap 42.3 I have a bash script that build a text file. I would like the last command doing : print_cmd -o page-left=43 -o page-right=22 -o page-top=28 -o page-bottom=43 -o font=LatinModernMono12:regular:9 some_file.txt where : print_cmd ::= some printing... (1 Reply)
Discussion started by: jcdole
1 Replies

2. UNIX for Dummies Questions & Answers

Using "mailx" command to read "to" and "cc" email addreses from input file

How to use "mailx" command to do e-mail reading the input file containing email address, where column 1 has name and column 2 containing “To” e-mail address and column 3 contains “cc” e-mail address to include with same email. Sample input file, email.txt Below is an sample code where... (2 Replies)
Discussion started by: asjaiswal
2 Replies

3. Shell Programming and Scripting

"if" statement based off "grep"

Hello, I am somewhat new to Linux/Unix. I am currently working on a shell script that is suppose to cat a file, grep the same file for a certain line, if that line is found save the file in a different location, else remove the file. This is a rough example of what I want. $Dating = False... (13 Replies)
Discussion started by: Amzerik
13 Replies

4. UNIX for Dummies Questions & Answers

What is the meaning of "-s" option in "if" statement?

Hi Guys, I'm sorry but I can't find answer for this, what is the meaning of -s option in "if" statement on unix scipting. Please see sample below: opath=/home/output for i in N1 N2 N3 N4 do echo $i if then grep $i $opath/N5_CRAI > $opath/N5_$i.crai chmod 777 $opath/N5_$i.crai ... (7 Replies)
Discussion started by: rymnd_12345
7 Replies

5. Shell Programming and Scripting

awk command to replace ";" with "|" and ""|" at diferent places in line of file

Hi, I have line in input file as below: 3G_CENTRAL;INDONESIA_(M)_TELKOMSEL;SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL My expected output for line in the file must be : "1-Radon1-cMOC_deg"|"LDIndex"|"3G_CENTRAL|INDONESIA_(M)_TELKOMSEL"|LAST|"SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL" Can someone... (7 Replies)
Discussion started by: shis100
7 Replies

6. Shell Programming and Scripting

What "-a" operator means in "if" statement

Hi I am trying to figure out what the following line does, I work in ksh88: ] && LIST="$big $LIST" Not sure what "-a" means in that case. Thanks a lot for any advice -A (1 Reply)
Discussion started by: aoussenko
1 Replies

7. Red Hat

"if" and "then" statement is not working in RedHat

Dear experts, I'm trying to write a script to calculate the usage of Log Archive in a directory, so if it gets to a point where the directory size is 60%, then send out an FYI.. email. So if then it reaches to 80%, move the logs from that directory. I have written the script as follow but... (10 Replies)
Discussion started by: Afi_Linux
10 Replies

8. Shell Programming and Scripting

" :- " characters in "if" statement

Hi I am trying to understand the following "if" statement: if ] specifically what " :- " means. ANy idea? Thanks a lot -A (1 Reply)
Discussion started by: aoussenko
1 Replies

9. UNIX for Dummies Questions & Answers

Explain the line "mn_code=`env|grep ""|awk -F"=" '{print $2}'`"

Hi Friends, Can any of you explain me about the below line of code? mn_code=`env|grep ""|awk -F"=" '{print $2}'` Im not able to understand, what exactly it is doing :confused: Any help would be useful for me. Lokesha (4 Replies)
Discussion started by: Lokesha
4 Replies

Featured Tech Videos