Category and count with awk


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Category and count with awk
# 1  
Old 08-07-2015
Category and count with awk

I want to categorize and count the as below:

Input file:

Code:
A1 G1 C1 F1
A2 G1 C1 F1
A3 G1 C1 F2
A4 G1 C2 F2
A7 G1 C2 F2
A8 G1 C2 F3
A11 G1 C2 F3
A23 G1 C2 F3
B4 G1 C2 F3
AC4 G2 C3 F4
B6 G2 C4 F4
BB5 G2 C4 F4
A25 G2 C5 F4
B13 G2 C5 F5
D12 G2 C5 F5
D2 G2 C5 F5
B89 G2 C5 F6
B44 G2 C5 F6


Desired Output:
Code:
Total            : 18
               G1 : 9
               G2 : 9
               F1 : 2
               F2 : 3
               F3 : 4
               F4 : 4
               F5 : 3
               F6 : 2

G1[9]

F1(2)
C1=A1,A2

F2(3)
C1=A3
C2=A4,A7

F3(4)
C2=A8,A11,A23,B4

G2[9]

F4(4)
C4=B6,BB5
C5=A25
C3=AC4

F5(3)
C5=B13,D12,D2

F6(2)
C5=B89,B44


I have tried:

Code:
#!/usr/bin/ksh
awk '  {
          D[$2]++
          A[$4]++
          B[$4 FS $3 FS $2] = B[$4 FS $3 FS $2] ? B[$4 FS $3 FS $2] "," $1 : $1
       }
    END{
        {printf "%20s%-3s \n", "Total            : ", NR }
        {for (i in D)printf "%20s%-3s \n", i" : ",D[i]}
        {for (i in A)printf "%20s%-3s \n", i" : ",A[i]}
        {print " "}
        for(k in D){
                    print k"["D[k]"] "
                           for(i in A){
                                   print i"("A[i]")"
                    for(j in B){
                                   split(j,X)
                                   if(X[3]==k && X[1]==i)
                                   print X[2]"="B[j]
                               }
                                   print ""
                       }
                    }
       }
' file.txt

But I get:

Code:
Total            : 18
               G1 : 9
               G2 : 9
               F1 : 2
               F2 : 3
               F3 : 4
               F4 : 4
               F5 : 3
               F6 : 2

G1[9]
F1(2)
C1=A1,A2

F2(3)
C1=A3
C2=A4,A7

F3(4)
C2=A8,A11,A23,B4

F4(4)

F5(3)

F6(2)

G2[9]
F1(2)

F2(3)

F3(4)

F4(4)
C4=B6,BB5
C5=A25
C3=AC4

F5(3)
C5=B13,D12,D2

F6(2)
C5=B89,B44


How do I get rid of F4,F5 and F6 from G1 category, and F1,F2 and F3 from G2 category.
# 2  
Old 08-08-2015
You seem to have exactly the problem described here.

I hope this helps.

bakunin
# 3  
Old 08-08-2015
After some reformatting and outlining (it helps!) I came up with the following quick fix:
Code:
#!/usr/bin/ksh
awk '
    {
      D[$2]++
      A[$4]++
      C[$3]
      B[$4,$3,$2] = B[$4,$3,$2] ? B[$4,$3,$2] "," $1 : $1
    }
    END {
      printf "%20s%-3s \n", "Total            : ", NR
      for (i in D)
        printf "%20s%-3s \n", i" : ",D[i]
      for (i in A)
        printf "%20s%-3s \n", i" : ",A[i]
      print " "
      for(k in D) {
        print k"["D[k]"]" RS
        for(i in A) {
          entries=0
          for(j in C) {
            if((i,j,k) in B) {
              if(!entries++)
                print i"("A[i]")"
              print j"="B[i,j,k]
            }
          }
          if(entries)
            print ""
        }
      }
    }
' file.txt

Which hopefully will get you going again..
This User Gave Thanks to Scrutinizer For This Post:
# 4  
Old 08-08-2015
Thanks, Solves it.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. What is on Your Mind?

Which category do you belong to?

Hello All, I was thinking to start this POLL, sometime back but couldn't get time so starting it today. So we all work either as an Admin or as a Developer or as a QA etc. So let's have a thread(POLL) where we could share our experiences(if it doesn't come anyone's privacy category) so that we... (17 Replies)
Discussion started by: RavinderSingh13
17 Replies

2. Shell Programming and Scripting

Data filtering and category assigning

Please consider the following file, I have many groups which can be of 3 types, T1 (Serial_Number 1) T2 (Serial_Number 2) and T1*T2 (all other Serial_Number). I want to only consider groups that have both T1 and T2 present and their values are different from each other. In the example file,... (8 Replies)
Discussion started by: jianp83
8 Replies

3. Shell Programming and Scripting

Total count in each category for given file list

I have list of file names in filename.txt below is file format >>File1 _________________________ 01~12345~Y~YES~aaaaa~can 02~23456~N~NO~bbbbb~can . . . 99~23__________________________ Need to find total count from each file depending on specific string and add them to have total count... (17 Replies)
Discussion started by: santoshdrkr
17 Replies

4. Shell Programming and Scripting

Awk: Print count for column in a file using awk

Hi, I have the following input in a file & need output as mentioned below(need counter of every occurance of field which is to be increased by 1). Input: 919143110065 919143110065 919143110052 918648846132 919143110012 918648873782 919143110152 919143110152 919143110152... (2 Replies)
Discussion started by: siramitsharma
2 Replies

5. Shell Programming and Scripting

awk - count character count of fields

Hello All, I got a requirement when I was working with a file. Say the file has unloads of data from a table in the form 1|121|asda|434|thesi|2012|05|24| 1|343|unit|09|best|2012|11|5| I was put into a scenario where I need the field count in all the lines in that file. It was simply... (6 Replies)
Discussion started by: PikK45
6 Replies

6. Shell Programming and Scripting

extract the max value category

Hi, I have a file and I want the category for each row to be its highest value. gene highest medium lower lowest ABC 20 30 50 70 DEF 90 20 60 0 o/p gene highest medium lower lowest category ABC... (6 Replies)
Discussion started by: Diya123
6 Replies

7. UNIX for Dummies Questions & Answers

Grep bunch of gzip files to count based on category

Started using unix commands recently. I have 50 gzip files. I want to grep each of these files for a line count based particular category in column 3. How can I do that? For example Sr.No Date City Description Code Address 1 06/09 NY living here 0909 10st st nyc 2 ... (5 Replies)
Discussion started by: jinxx
5 Replies

8. Shell Programming and Scripting

Split file into given category and others using awk

Hi All, Would it be possible using awk to split a given file into two files based on a certain condition such that one output file will contain all lines that fit the condition while the other output file will contain lines that did not fit the condition? Here is a sample input file ... (6 Replies)
Discussion started by: cympaulife
6 Replies

9. Shell Programming and Scripting

Parsing out the first (top) data lines of each category

Hi All, I need some help in parsing out the first (top) data lines of each category (categories are based on the first column a, b, c, d, e.( see example file below) from a big file a dfg 3 6 8 9 a fgh 5 7 0 9 a gkl 5 2 4 7 a glo 7 0 1 5 b ghj 9 0 4 2 b mkl 7 8 0 5 b jkl 9 0 4 5 c jkl 2... (1 Reply)
Discussion started by: Lucky Ali
1 Replies

10. Post Here to Contact Site Administrators and Moderators

How to change the category?

Hi, I submitted my blog on UNIX in the links section. On submitting, i chose the category as Unix/Linux standards, which i now feel is incorrect. I would like to change the category of my link, but i don't find any option to change the category. Please help me in doing the needful. Thanks... (7 Replies)
Discussion started by: guruprasadpr
7 Replies
Login or Register to Ask a Question