Sponsored Content
Top Forums UNIX for Beginners Questions & Answers Split a file based on number sum at the second column and the third column. Post 303044707 by demo10 on Monday 2nd of March 2020 05:06:12 AM
Old 03-02-2020
Quote:
Originally Posted by nezabudka
Hi
Maybe just like that?
Code:
awk '
/^\S+\s+12109/,/^(\S+\s+){2}14678\s/ {print > "file1"}
/^\S+\s+15573/,/^(\S+\s+){2}15612\s/ {print > "file2"}
/^\S+\s+20498/,/^(\S+\s+){2}21668\s/ {print > "file3"}
' file


Thank you so much. This is what I need.


Quote:
Originally Posted by nezabudka
maybe so?
Code:
#!/bin/bash

step=2999
declare -i start=12109 end=start+step count=1
stop=$(awk '{if($3>max) max=$3} END {print max}' file)

while [ $end -le $stop ]; do
        awk -vA=$start -vZ=$end -vf="file$count" '
                $2>=A && $3<=Z {print > f}
        ' file
        start+=step
        end+=step
        count+=1
done

 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How do I sum one column based on another column?

Hi, I am new to this forum and new to awk. I have a file that contains 2 columns. Heres an example of what it looks like: 10 + 20 + 40 + 50 - 70 - So the file is tab-delimited. What I want to do is add 10 to column 1 whenever column 2 is + and substract 10 from column 1... (1 Reply)
Discussion started by: phil_heath
1 Replies

2. Shell Programming and Scripting

Split single file into multiple files based on the number in the column

Dear All, I would like to split a file of the following format into multiple files based on the number in the 6th column (numbers 1, 2, 3...): ATOM 1 N GLY A 1 -3.198 27.537 -5.958 1.00 0.00 N ATOM 2 CA GLY A 1 -2.199 28.399 -6.617 1.00 0.00 ... (3 Replies)
Discussion started by: tomasl
3 Replies

3. Shell Programming and Scripting

Split the file based on column

Hi, I have a file sample_1.txt (300k rows) which has data like below: * Also each record is around 64k bytes 11|1|abc|102553|125589|64k bytes of data 10|2|def|123452|123356|...... 13|2|geh|144351|121123|... 25|4|fgh|165250|118890|.. 14|1|abc|186149|116657|......... (6 Replies)
Discussion started by: sol_nov
6 Replies

4. Shell Programming and Scripting

Sum Of Column Based On Column Condition

I have a following inputfile MT,AP,CDM,TTML,MUM,GS,SUCC,3 MT,AP,CDM,TTSL,AP,GS,FAIL,9 MT,AP,CDM,RCom,MAH,GS,SUCC,3 MT,AP,CDM,RTL,HP,GS,SUCC,1 MT,AP,CDM,Uni,UPE,GS,SUCC,2 MT,AP,CDM,Uni,MUM,GS,SUCC,2 TTSL,AP,GS,MT,MAH,CDM,SUCC,20 TTML,AP,GS,MT,MAH,CDM,FAIL,10... (2 Replies)
Discussion started by: siramitsharma
2 Replies

5. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have a similar input format- A_1 2 B_0 4 A_1 1 B_2 5 A_4 1 and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks! letter number_of_letters... (5 Replies)
Discussion started by: prashob123
5 Replies

6. Shell Programming and Scripting

Sum column values based in common identifier in 1st column.

Hi, I have a table to be imported for R as matrix or data.frame but I first need to edit it because I've got several lines with the same identifier (1st column), so I want to sum the each column (2nd -nth) of each identifier (1st column) The input is for example, after sorted: K00001 1 1 4 3... (8 Replies)
Discussion started by: sargotrons
8 Replies

7. Shell Programming and Scripting

Split column data if the table has n number of column's

please write a shell script Table -------------------------- 1 2 3 a b c 3 4 5 c d e 7 8 9 f g h Output should be like this --------------- 1 2 3 3 4 5 7 8 9 a b c c d e f g h (1 Reply)
Discussion started by: Priti2277
1 Replies

8. Shell Programming and Scripting

Split column data if the table has n number of column's with some record

Split column data if the table has n number of column's with some record then how to split n number of colmn's line by line with records Table --------- Col1 col2 col3 col4 ....................col20 1 2 3 4 .................... 20 a b c d .................... v ... (11 Replies)
Discussion started by: Priti2277
11 Replies

9. Shell Programming and Scripting

Sum of a column as new column based on header in a script

Hello, I am trying to store sum of a column as a new column inside a file but have to find the column names dynamically I/p c1,c2,c3,c4,c5 10,20,30,40,50 20,30,40,50,60 If i want to find sum only column c1, c3 and output it as c6,c7 O/p c1,c2,c3,c4,c5,c6,c7 10,20,30,40,50,30,70... (6 Replies)
Discussion started by: mkathi
6 Replies

10. UNIX for Beginners Questions & Answers

Sum in file based column

Hi All, I have a file as below and want to sum based on the id in the first column Input 10264;ATE; 12 10265;SES;11 10266AUT;50 10264;ATE;10 10265;SES;13 10266AUT;89 10264;ATE;1 10265;SES;15 10266AUT;78 Output 10264;ATE; 23 10265;SES;39 10266AUT;139 (6 Replies)
Discussion started by: arunkumar_mca
6 Replies
Time::Progress(3)					User Contributed Perl Documentation					 Time::Progress(3)

NAME
Time::Progress - Elapsed and estimated finish time reporting. SYNOPSIS
use Time::Progress; # autoflush to get working $| = 1; # get new `timer' my $p = new Time::Progress; # restart and report progress $p->restart; sleep 5; # or do some work here print $p->report( "done %p elapsed: %L (%l sec), ETA %E (%e sec) ", 50 ); # set min and max values $p->attr( min => 2, max => 20 ); # restart `timer' $p->restart; my $c; for( $c = 2; $c <= 20; $c++ ) { # print progress bar and percentage done print $p->report( "eta: %E min, %40b %p ", $c ); sleep 1; # work... } # stop timer $p->stop; # report times print $p->elapsed_str; DESCRIPTION
Shortest time interval that can be measured is 1 second. The available methods are: new my $p = new Time::Progress; Returns new object of Time::Progress class and starts the timer. It also sets min and max values to 0 and 100, so the next report calls will default to percents range. restart restarts the timer and clears the stop mark. optionally restart() may act also as attr() for setting attributes: $p->restart( min => 1, max => 5 ); is the same as: $p->attr( min => 1, max => 5 ); $p->restart(); If you need to count things, you can set just 'max' attribute since 'min' is already set to 0 when object is constructed by new(): $p->restart( max => 42 ); stop Sets the stop mark. This is only useful if you do some work, then finish, then do some work that shouldn't be timed and finally report. Something like: $p->restart; # do some work here... $p->stop; # do some post-work here print $p->report; # `post-work' will not be timed Stop is useless if you want to report time as soon as work is finished like: $p->restart; # do some work here... print $p->report; continue Clears the stop mark. (mostly useless, perhaps you need to restart?) attr Sets and returns internal values for attributes. Available attributes are: min This is the min value of the items that will follow (used to calculate estimated finish time) max This is the max value of all items in the even (also used to calculate estimated finish time) format This is the default report format. It is used if report is called without parameters. attr returns array of the set attributes: my ( $new_min, $new_max ) = $p->attr( min => 1, max => 5 ); If you want just to get values use undef: my $old_format = $p->attr( format => undef ); This way of handling attributes is a bit heavy but saves a lot of attribute handling functions. attr will complain if you pass odd number of parameters. report report is the most complex method in this package. :) expected arguments are: $p->report( format, [current_item] ); format is string that will be used for the result string. Recognized special sequences are: %l elapsed seconds %L elapsed time in minutes in format MM:SS %e remaining seconds %E remaining time in minutes in format MM:SS %p percentage done in format PPP.P% %f estimated finish time in format returned by localtime() %b %B progress bar which looks like: ##############...................... %b takes optional width: %40b -- 40-chars wide bar %9b -- 9-chars wide bar %b -- 79-chars wide bar (default) Parameters can be ommited and then default format set with attr will be used. Sequences 'L', 'l', 'E' and 'e' can have width also: %10e %5l ... Estimate time calculations can be used only if min and max values are set (see attr method) and current item is passed to report! if you want to use the default format but still have estimates use it like this: $p->format( undef, 45 ); If you don't give current item (step) or didn't set proper min/max value then all estimate sequences will have value `n/a'. You can freely mix reports during the same event. elapsed estimate helpers -- return elapsed/estimate seconds. elapsed_str estimate_str helpers -- return elapsed/estimated string in format: "elapsed time is MM:SS min. " "remaining time is MM:SS min. " all helpers need one argument -- current item. FORMAT EXAMPLES
# $c is current element (step) reached # for the examples: min = 0, max = 100, $c = 33.3 print $p->report( "done %p elapsed: %L (%l sec), ETA %E (%e sec) ", $c ); # prints: # done 33.3% elapsed time 0:05 (5 sec), ETA 0:07 (7 sec) print $p->report( "%45b %p ", $c ); # prints: # ###############.............................. 33.3% print $p->report( "done %p ETA %f ", $c ); # prints: # done 33.3% ETA Sun Oct 21 16:50:57 2001 GITHUB REPOSITORY
git@github.com:cade-vs/perl-time-progress.git git clone git://github.com/cade-vs/perl-time-progress.git AUTHOR
Vladi Belperchinov-Shabanski "Cade" <cade@biscom.net> <cade@datamax.bg> <cade@cpan.org> http://cade.datamax.bg perl v5.18.2 2013-07-08 Time::Progress(3)
All times are GMT -4. The time now is 08:02 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy