Split and add header and trailer from input file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Split and add header and trailer from input file
# 22  
Old 07-07-2014
Thanks Chubler !

I do not have gawk on my machine , is there any other way we can accomplish the task ?

Also is there a way of retrieving what characters were replaced and how many times were they replaced ?

Can we do the replace operation as a stand alone script before we pass the input file to the split program , this way i can re-use the program

Appreciate your help!
# 23  
Old 07-07-2014
Here is a version that doesn't need gawk, in addition it produces "subcount.txt" which shows characters replaced and a replace count.


Code:
awk -v stamp=$(date +%Y%m%d%H%M%S) \
    -v f="TOM PAT SAM BOB KIM" \
    -v t="TOM TOM TOM BOB KIM" '
function hextoascii(str,i,ascii,ret) {
   for(i=1;i<length(str);i+=2) {
      ascii=index(HEXDIGITS,toupper(substr(str,i,1))) * 16 + \
            index(HEXDIGITS,toupper(substr(str,i+1,1)))
      ret=ret sprintf("%c", ascii)
   }
   return ret
}
BEGIN {
   HEXDIGITS="123456789ABCDEF"
   split(f,from)
   for(i=split(t,to);i;i--) CONV[from[i]]=to[i]
}
FNR==NR{
   v=split($0,vals,":")
   hexrepl[hextoascii(vals[1])]=vals[1]
   repl[hextoascii(vals[1])]=hextoascii(vals[2])
   next
}
{ for(i in repl) subcnt[i] += gsub(i, repl[i]) }
/^H/ {header=$0 ; next}
/^T/ {trailer=$0 ; next}
length>3{
   typ=substr($0,34,3)
   if (typ in CONV) {
       fname="xyz_" CONV[typ] "_" stamp ".txt"
       if (!(fname in A)) {A[fname]; print header > fname}
   } 
   else fname="xyz_error_" stamp ".txt"
   print $0 >> fname
   close(fname)
}
END {
  for (fname in A) print trailer >> fname
  for (i in subcnt) if(subcnt[i]>0)
     print hexrepl[i] "\t" subcnt[i] > "subcount.txt"
}' replace.txt Test.txt

---------- Post updated at 07:39 AM ---------- Previous update was at 06:16 AM ----------

Sorry didn't see the request for the replace program to be stand alone:

Code:
awk -v logfile=subcount.txt '
function hextoascii(str,i,ascii,ret) {
   for(i=1;i<length(str);i+=2) {
      ascii=index(HEXDIGITS,toupper(substr(str,i,1))) * 16 + \
            index(HEXDIGITS,toupper(substr(str,i+1,1)))
      ret=ret sprintf("%c", ascii)
   }
   return ret
}
BEGIN { HEXDIGITS="123456789ABCDEF" }
FNR==NR{
   v=split($0,vals,":")
   if(length(logfile)) hexrepl[hextoascii(vals[1])]=vals[1]
   repl[hextoascii(vals[1])]=hextoascii(vals[2])
   next
}
{ for(i in repl) subcnt[i] += gsub(i, repl[i]) }
1
END {
  if(length(logfile))
      for (i in subcnt) if(subcnt[i]>0)
         print hexrepl[i] "\t" subcnt[i] > logfile
}' replace.txt Test.txt

If you don't need counts, sed + tr may be all you need:

Code:
sed "s/"$'\x0D'"$//" | tr \
$'\xED'\
$'\xE9'\
$'\xD9'\
$'\xC2'\
$'\x80'\
$'\x99'\
$'\xE2'\
$'\xC3'\
$'\xC4'\
$'\xC9'\
$'\xA0'\
$'\xFF'\
$'\xB1'\
$'\x83'\
$'\xC1'\
$'\xE1'\
$'\xB7'\
$'\xF6'\
$'\xF1'\
$'\xF3'\
$'\xE3'\
$'\x1A'\
 " "


Last edited by Chubler_XL; 07-07-2014 at 07:15 PM..
This User Gave Thanks to Chubler_XL For This Post:
# 24  
Old 07-09-2014
Thanks Chubler!!

If I were to use the sed + tr combo , how can I loop through a set of characters which are stored in a file and then perform replace recursively on the input file ?

Probably a shell script ?

Appreciate your patience !
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Removing Header and Trailer record of a EBCDIC file

I have a EBCDIC multi layout file which has a header record which is 21 bytes, The Detail records are 2427 bytes long and the trailer record is 9 bytes long. Is there a command to remove the header as well as trailer record and read only the detail records while at the same time not altering... (1 Reply)
Discussion started by: abhilashnair
1 Replies

2. Shell Programming and Scripting

Verify the header and trailer in file

please see my requirement, I hope I am clear. (9 Replies)
Discussion started by: mirwasim
9 Replies

3. Shell Programming and Scripting

Script to validate file header and trailer

Hi, I need a script that validates a file header/detail/trailer. File layout is: Header - Rec_Type|File_name|File_Date Detail - Rec_Type|field1|field2|field3... Trailder - Rec_Type|File_name|File_Date|Record_count Sample Data: HDR|customer_data.dat|20120709... (7 Replies)
Discussion started by: ash_sh
7 Replies

4. Shell Programming and Scripting

Remove last few characters in a file but keeping Header and trailer intact

Hi All, I am trying write a simple command using AWK and SED to this but without any success. Here is what I am using: head -1 test1.txt>test2.txt|sed '1d;$d' test1.txt|awk '{print substr($0,0,(length($0)-2))}' >>test2.txt|tail -1 test1.txt>>test2.txt Input: Header 1234567 abcdefgh... (2 Replies)
Discussion started by: nvuradi
2 Replies

5. UNIX for Dummies Questions & Answers

Adding header and trailer into a file

Hi, I want to add the below Header to all the files in sequence File1,File2,File3...etc "ABC,<number of chracter in the file>" e,g - If File1 is as below pqrstuvdt abcdefgh then I want to add the above header into it ,So that File1 becomes as below ABC,17 pqrstuvdt abcdefgh ... (9 Replies)
Discussion started by: spari2
9 Replies

6. Shell Programming and Scripting

Adding Header and Trailer records to a appended file

How can we a shell script and pass date parameters .I have 3 files comming from Datastage with |" delimited I need append 3 files as above: File1: P0000|"47416954|"AU|"000|"INS|"0000|"|"20060601|"99991231|"|"|"|"|"01 File 2:... (2 Replies)
Discussion started by: e1994264
2 Replies

7. Shell Programming and Scripting

Creating Header & Trailer for bulk volume data file

Hi all, I have a requirement to create a Header &Trailer for a file which is having 20 millions of records. If I use the following method, i think it will take more time. cat "Header"> file1.txt cat Data_File>>file1.txt cat "Trailer">>file1.txt since second CAT command has to read all... (4 Replies)
Discussion started by: Raamc
4 Replies

8. Shell Programming and Scripting

Removing Header & Trailer from a file

Hi All, I am karthik. I am new to this forum. I have one requirement. I have a file with header and footer. Header may be like HDR0001 or FILE20090110 (Assume it is unknown so far, but i am sure there is a header in the file) likewise file has the trailer too. I just... (7 Replies)
Discussion started by: karthi_gana
7 Replies

9. Shell Programming and Scripting

Split large file and add header and footer to each small files

I have one large file, after every 200 line i have to split the file and the add header and footer to each small file? It is possible to add different header and footer to each file? (7 Replies)
Discussion started by: ashish4422
7 Replies

10. Shell Programming and Scripting

Split large file and add header and footer to each file

I have one large file, after every 200 line i have to split the file and the add header and footer to each small file? It is possible to add different header and footer to each file? (1 Reply)
Discussion started by: ashish4422
1 Replies
Login or Register to Ask a Question