Bin iteratively based on each row

08-25-2013

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

I'll try:

Code:

awk '
  NR==1{                                                               # skip the header record
    next
  }
  NR==FNR{                                                             # when reading the file for the first time ( that is when NR equals FNR )
    B[$2]=$2+100                                                       # create a representation of the bins in the form of arrays, witch index $2 and value $2 + 100
    next                                                               # do not process the rest which is meant for the second time the file is read
  }
  {                                                                    # process the file for the second time
    for(i in B) {                                                      # for each index in the bins
      r=i "-" B[i]                                                     # compose the string that represents the bin's range
      if ( i+0<=$2+0 && $2+0 < B[i]+0 ) print $0, r > ( "bin_" r )     # if $2 is witin the bin's range then print to the corresponding file the record and the range to the corresponding file
    }
  }
' OFS='\t' file file                                                   # use a tab to separate the record range. Read file twice, once for the bins second for the output.

--
note: If there are too many bin files, close() statements will need to added to intermediately close file, otherwise there will be "too many files open" errors.

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

Shell Programming and Scripting

Bin iteratively based on each row

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Splitting single row into multiple rows based on for every 10 digits of last field of the row

Discussion started by: kotra

2. Shell Programming and Scripting

Merge row based on replicates ID

Discussion started by: giuliangiuseppe

3. Shell Programming and Scripting

Delete duplicate row based on criteria

Discussion started by: shash

4. Shell Programming and Scripting

Field widths based on a row

Discussion started by: aydj

5. Shell Programming and Scripting

How to mark the row based on col value.?

Discussion started by: ken6503

6. Shell Programming and Scripting

Send email based on row count

Discussion started by: srini_106

7. Shell Programming and Scripting

Trying to remove duplicates based on field and row

Discussion started by: newbie2010

8. Shell Programming and Scripting

Deleting a row based on fetched value of column

Discussion started by: swasid

9. Shell Programming and Scripting

How to print column based on row number

Discussion started by: Surabhi_so_mh

10. UNIX for Dummies Questions & Answers

Search based on 1st char of a row.

Discussion started by: videsh77