Script to validate file header and trailer


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Script to validate file header and trailer
# 1  
Old 07-09-2012
Script to validate file header and trailer

Hi,

I need a script that validates a file header/detail/trailer. File layout is:

Header - Rec_Type|File_name|File_Date
Detail - Rec_Type|field1|field2|field3...
Trailder - Rec_Type|File_name|File_Date|Record_count

Sample Data:
Code:
HDR|customer_data.dat|20120709
LIN|value1|value2|value3...
LIN|value1|value2|value3...
LIN|value1|value2|value3...
LIN|value1|value2|value3...
TRL|customer_data.dat|20120709|4

The script should validate that only 1 header and trailer exists. If more, raise exception.
The script should verify that total detail lines equal to the trailder record (Record_count)

I really appreciate if someone can provide me the script

Thanks

Moderator's Comments:
Mod Comment Please view this code tag video for how to use code tags when posting code and data.

Last edited by vbe; 07-10-2012 at 10:23 AM..
# 2  
Old 07-09-2012
Can you show us whatever that you have done on this so far?
# 3  
Old 07-09-2012
A minimal solution:

Code:
awk -F "|" ' /^HDR/ { h++; next; } /^TRL/ { exit( (h > 1) || $NF != NR - 2 ); }' file-name

If the exit code is non-zero there is an error.
# 4  
Old 07-10-2012
I don't have Unix experience, I am using this script in an ETL program that loads the file to a DB. So I havent done anything, Smilie
# 5  
Old 07-10-2012
That awk script looks complicated.

Why don't you break the problem down into several parts?
1) Check if the header exist. Use grep & head
2) Check if the trailer record exist. Use grep & tail
3) Check if there is multiple records of header & trailers. Use grep & wc
# 6  
Old 07-10-2012
Equally important requirement is to verify record count matches in detail line and the record count sum in the trailer. How can this be done?
# 7  
Old 07-10-2012
Quote:
Originally Posted by ash_sh
Equally important requirement is to verify record count matches in detail line and the record count sum in the trailer. How can this be done?
The awk programme that I posted and a small bug -- didn't properly check for multiple trailers. This does better:

Code:
awk -F "|" ' /^HDR/ { h++; next; } /^TRL/ {t++; next} END { exit( (h != 1) || t != 1 || $NF != NR - 2 ); }' input-file

It checks both for the existence of exactly one header and one trailer, AND checks that the record count in the trailer matches the records observed. If either fails the exit code is non-zero to indicate that there is an error. If you need more precision, knowing exactly why there is an error, a longer programme can be used:

Code:
awk -F "|" '
/^HDR/ { h++; next; }
/^TRL/ { t++; next; }
END {
    ec = 1;
    if( h != 1 || t != 1 )
        printf( "header/trailer count error: %d headers; %d trailers\n", h, t ) >"/dev/fd/2";
    else
        if( $NF != NR - 2 )
            printf( "bad record count: %d(t) != %d(rec)\n", $NF, NR ) >"/dev/fd/2";
        else
            ec = 0;
    exit( ec );
}' input-file

I am not familiar with what an 'ETL program' is, so I don't know if you can invoke awk or not.

If ETL is anything like and of the standard *NIX shells, you probably can do something like this:

Code:
if ! awk -F "|" ' /^HDR/ { h++; next; } /^TRL/ {t++; next} END { exit( (h != 1) || t != 1 || $NF != NR - 2 ); }' $file_name
then
   echo "file did not pass verification test: $file_name"
   exit 1
fi

# put rest of your processing on success here.

@chedlee88-1 -- for small files, reading each three times might not have a noticeable impact, but if the input being verified is large, reading each three times might be so inefficient as not to be practical. It makes more sense to read the file once.

This programme will do the same thing, non-zero exit code, and write the failure reason onto standard err.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Script to validate header in a csv file

Hi All; I am struggling to write a script that validates file header. Header file would be like below with TAB separated TRX # TYPE REF # Source Piece Code Destination Piece Code every time I need to check the txt file if the header was same as above fields if validation success... (6 Replies)
Discussion started by: heye18
6 Replies

2. UNIX for Advanced & Expert Users

Removing Header and Trailer record of a EBCDIC file

I have a EBCDIC multi layout file which has a header record which is 21 bytes, The Detail records are 2427 bytes long and the trailer record is 9 bytes long. Is there a command to remove the header as well as trailer record and read only the detail records while at the same time not altering... (1 Reply)
Discussion started by: abhilashnair
1 Replies

3. Shell Programming and Scripting

Split and add header and trailer from input file

I need to split the file based on pattern from position 34-37 while retaining the header and trailer records in each individual split file Also is it possible to output the TOM and PAT records in the same output file ? I need the output file names same as xyz_pattern_Datetimestamp.txt ... (23 Replies)
Discussion started by: techedipro
23 Replies

4. Shell Programming and Scripting

Verify the header and trailer in file

please see my requirement, I hope I am clear. (9 Replies)
Discussion started by: mirwasim
9 Replies

5. Shell Programming and Scripting

Remove last few characters in a file but keeping Header and trailer intact

Hi All, I am trying write a simple command using AWK and SED to this but without any success. Here is what I am using: head -1 test1.txt>test2.txt|sed '1d;$d' test1.txt|awk '{print substr($0,0,(length($0)-2))}' >>test2.txt|tail -1 test1.txt>>test2.txt Input: Header 1234567 abcdefgh... (2 Replies)
Discussion started by: nvuradi
2 Replies

6. UNIX for Dummies Questions & Answers

Adding header and trailer into a file

Hi, I want to add the below Header to all the files in sequence File1,File2,File3...etc "ABC,<number of chracter in the file>" e,g - If File1 is as below pqrstuvdt abcdefgh then I want to add the above header into it ,So that File1 becomes as below ABC,17 pqrstuvdt abcdefgh ... (9 Replies)
Discussion started by: spari2
9 Replies

7. Shell Programming and Scripting

Adding Header and Trailer records to a appended file

How can we a shell script and pass date parameters .I have 3 files comming from Datastage with |" delimited I need append 3 files as above: File1: P0000|"47416954|"AU|"000|"INS|"0000|"|"20060601|"99991231|"|"|"|"|"01 File 2:... (2 Replies)
Discussion started by: e1994264
2 Replies

8. Shell Programming and Scripting

Creating Header & Trailer for bulk volume data file

Hi all, I have a requirement to create a Header &Trailer for a file which is having 20 millions of records. If I use the following method, i think it will take more time. cat "Header"> file1.txt cat Data_File>>file1.txt cat "Trailer">>file1.txt since second CAT command has to read all... (4 Replies)
Discussion started by: Raamc
4 Replies

9. Shell Programming and Scripting

Removing Header & Trailer from a file

Hi All, I am karthik. I am new to this forum. I have one requirement. I have a file with header and footer. Header may be like HDR0001 or FILE20090110 (Assume it is unknown so far, but i am sure there is a header in the file) likewise file has the trailer too. I just... (7 Replies)
Discussion started by: karthi_gana
7 Replies

10. UNIX for Dummies Questions & Answers

Copy all the files with time stamp and remove header,trailer from file

All, I am new to unix and i have the following requirement. I have file(s) landing into input directory with timestamp, first i want to copy all these files into seperate directory then i want to rename these files without timestamp and also remove header,trailer from that file.. Could... (35 Replies)
Discussion started by: ksrams
35 Replies
Login or Register to Ask a Question