Help summing a file using awk

03-28-2013

Registered User

26, 0

Join Date: Oct 2008

Last Activity: 7 August 2014, 4:50 PM EDT

Posts: 26

Thanks Given: 3

Thanked 0 Times in 0 Posts

Thanks, but I'm not sure what that printf is doing.

I was able to sort of get it to working by changing my code to :

Code:

nawk 'BEGIN { FIELDWIDTHS = "15 10" } ; { arr[$1] += $2 } END {for (i in arr) {print i arr[i] } }' count_sort.txt > count_sum.txt

It looks like it is summing like I want, but it is removing the leading spaces for the last field. Any way to keep it from doing that?

---------- Post updated at 02:33 PM ---------- Previous update was at 02:07 PM ----------

Update -

I tried your printf suggestion. I think I got it to work using this:

Code:

nawk 'BEGIN { FIELDWIDTHS = "15 10" } ; { arr[$1] += $2 } END {for (i in arr) {printf "%15s%10s\n", substr(i, 1, 15), arr[i] } }' count_sort.txt > count_sum.txt

I had to remove the first %s from your example. Otherwise it was giving me a not enough arguments for printf error. What was the %s doing in your example?

The %s is a little confusing to me.

Thanks again.

Drenhead

View Public Profile for Drenhead

Find all posts by Drenhead

bup-margin(1) General Commands Manual bup-margin(1) NAME
bup-margin - figure out your deduplication safety margin SYNOPSIS
bup margin [options...] DESCRIPTION
bup margin iterates through all objects in your bup repository, calculating the largest number of prefix bits shared between any two entries. This number, n, identifies the longest subset of SHA-1 you could use and still encounter a collision between your object ids. For example, one system that was tested had a collection of 11 million objects (70 GB), and bup margin returned 45. That means a 46-bit hash would be sufficient to avoid all collisions among that set of objects; each object in that repository could be uniquely identified by its first 46 bits. The number of bits needed seems to increase by about 1 or 2 for every doubling of the number of objects. Since SHA-1 hashes have 160 bits, that leaves 115 bits of margin. Of course, because SHA-1 hashes are essentially random, it's theoretically possible to use many more bits with far fewer objects. If you're paranoid about the possibility of SHA-1 collisions, you can monitor your repository by running bup margin occasionally to see if you're getting dangerously close to 160 bits. OPTIONS
--predict Guess the offset into each index file where a particular object will appear, and report the maximum deviation of the correct answer from the guess. This is potentially useful for tuning an interpolation search algorithm. --ignore-midx don't use .midx files, use only .idx files. This is only really useful when used with --predict. EXAMPLE
$ bup margin Reading indexes: 100.00% (1612581/1612581), done. 40 40 matching prefix bits 1.94 bits per doubling 120 bits (61.86 doublings) remaining 4.19338e+18 times larger is possible Everyone on earth could have 625878182 data sets like yours, all in one repository, and we would expect 1 object collision. $ bup margin --predict PackIdxList: using 1 index. Reading indexes: 100.00% (1612581/1612581), done. 915 of 1612581 (0.057%) SEE ALSO
bup-midx(1), bup-save(1) BUP
Part of the bup(1) suite. AUTHORS
Avery Pennarun <apenwarr@gmail.com>. Bup unknown- bup-margin(1)

Shell Programming and Scripting

Help summing a file using awk

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk split columns after matching on rows and summing the last column

Discussion started by: jacobs.smith

2. Shell Programming and Scripting

Summing all fields in a file

Discussion started by: Jotne

3. UNIX Desktop Questions & Answers

Summing file sizes

Discussion started by: Alexander4444

4. Shell Programming and Scripting

awk summing specific lines and fields

Discussion started by: nakaedu

5. UNIX for Dummies Questions & Answers

Summing lines in a file

Discussion started by: LearningLinux2

6. Shell Programming and Scripting

Please Help!!!! Awk for summing columns based on selected column value

Discussion started by: BrownBob

7. Shell Programming and Scripting

Using awk to summing from a given line

Discussion started by: firelink

8. Shell Programming and Scripting

Summing up a matrix using awk

Discussion started by: JRodrigoF

9. Shell Programming and Scripting

Awk: Summing values with group criteria

Discussion started by: gianluca2

10. Shell Programming and Scripting

awk scripting - matching records and summing up time

Discussion started by: Gonik

LEARN ABOUT DEBIAN

bup-margin