Aggregation of huge data


 
# 8  
Old 04-07-2014
Hi Akshay,

I even removed the blank and tried it - still facing the same issue. I also copied a few records (say 5 lines) to a new file, and even then it is occurring!

Data:

Code:
21000000
-3000
3000
-670500
2963700

Command used:

Code:
 awk 'BEGIN { print "Z = 0;" } { sub(/-/, ""); print "Z += ",$1,";" } END { print "Z;" }' test.txt

Output:

Code:
Z = 0;
Z +=  21000000 ;
Z +=  3000 ;
Z +=  3000 ;
Z +=  670500 ;
Z +=  2963700 ;
Z;

From the above output, shouldn't the value of Z have been incremented?


Kindly advise me on the same.

Regards,
Ravichander

Last edited by Ravichander; 04-07-2014 at 02:56 AM..
# 9  
Old 04-07-2014
Three weeks ago I suggested the code:
Code:
awk -F'|' -v dqANDms='["-]' '
BEGIN {	f=156
	printf("s=0\n")
}
NR > 2 {gsub(dqANDms, "", $f)
	printf("s+=%s\n",  $f)
}
END {	printf("s\n")
}' file | bc

in another thread (Aggregation of Huge files) where you wanted to process the 156th field instead of the 1st, strip out double-quote characters if any were present, and skip the two header lines in your input. You said that when your input file contained 7 million records my code didn't work, but you weren't able to show any input that caused it to produce a wrong result. Instead of answering requests for sample input that made the suggested scripts fail, you started this new thread.
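
To make the moving parts concrete, here is what that script does on a small made-up pipe-delimited sample (purely hypothetical data, with the amount moved to field 3 and f=3 instead of f=156 just so the lines fit on screen; the first two lines stand in for your header records):
Code:
# hypothetical sample.txt -- two header lines, then three records with
# the amount in field 3 (field 156 in your real file):
#   HDR1|a|b
#   HDR2|c|d
#   rec1|x|"21000"
#   rec2|y|-3500
#   rec3|z|700

awk -F'|' -v dqANDms='["-]' '
BEGIN {	f=3
	printf("s=0\n")
}
NR > 2 {gsub(dqANDms, "", $f)
	printf("s+=%s\n", $f)
}
END {	printf("s\n")
}' sample.txt | bc

# Without the "| bc" the awk part just prints a little bc program:
#   s=0
#   s+=21000
#   s+=3500
#   s+=700
#   s
# bc then evaluates that program and prints the total: 25200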

Simplifying that code for the data you've presented here yields:
Code:
awk '
BEGIN {	printf("s=0\n")
}
{	sub(/-/, "")
	printf("s+=%s\n", $1)
}
END {	printf("s\n")
}' test.txt | bc

which, with the sample input you provided in message #8 in this thread, produces the output:
Code:
24640200

which still looks like the correct result to me. If this isn't the result you wanted, what were you expecting?
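
Checking by hand: 21000000 + 3000 + 3000 + 670500 + 2963700 = 24640200, so that figure matches the sum of the absolute values in your sample.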

If it matters, the output from awk that the above script feeds into bc is:
Code:
s=0
s+=21000000
s+=3000
s+=3000
s+=670500
s+=2963700
s
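
For completeness, the same total can also be computed inside awk without bc. A minimal alternative sketch (same test.txt); the trade-off to keep in mind is that awk does its arithmetic in double-precision floating point, so a running total that grows past roughly 2^53 (about 9.0e15) can silently lose exactness, whereas the bc pipeline above works with arbitrary precision:
Code:
# sum the absolute values directly in awk (no bc); fine for this sample,
# but subject to double-precision limits for very large totals
awk '{ sub(/-/, ""); s += $1 } END { printf("%.0f\n", s) }' test.txt

which prints 24640200 for the five sample values.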
