Sponsored Content
Full Discussion: Sum of Coulmns
Homework and Emergencies Homework & Coursework Questions Sum of Coulmns Post 302475541 by bakunin on Monday 29th of November 2010 05:08:29 AM
Old 11-29-2010
Quote:
Originally Posted by RGD
The problem statement:
it takes a directory name as an argument and handles the files in that directory according to the following rule:

The files that end with extension .csv (comma-separated values) are moved to subfolder CSV. However, before doing the move, the columns of each file have to be summed and a new row containing the totals should be appended to each .csv file. Assume that the .csv files have 4 columns each with the following column.
Ok, lets start with some plan: your program layout will have to look something like

Code:
main ()
{
while (every file in the dir named *csv)
     process_file( file )
done
}


function process_file( filename )
{
insert_column_with_sums( filename )
move_to_subfolder( filename )
}


Quote:
Originally Posted by RGD
I already did the part that takes the file and moves it to the new subfolder, I also managed to sum all the columns using awk , it may not be the best way ,but I’m still learning
Good. lets go over your solution:

First off, some explanation how awk works: basically it is a rule-based language. A file is read, line by line, and one rule after the other is applied to each line (which may or may not alter the lines contents). After application of all the rules the next line is read in and the process starts anew. A "rule" usually consists of following: a regexp and some commands. If the line matches the regexp, the commands accompanying it are executed, otherwise they aren't. No regexp means the commands are executed for all lines.

There are three special rules, named "BEGIN", "END" and one with no name at all. "BEGIN" is executed before any lines are read from input, "END" is executed after all the lines are being read. The rule with no name is executed for every line of the input file.

What does that mean for your script?

Code:
#!/bin/sh

#string, value, value, value.
#gets the sums of each column

clear
myfile=$1

awk'{total1 += $3 } END {print total1}' $myfile
awk '{total2 += $5 } END {print total2}' $myfile
awk '{total3 += $7 } END {print total3}' $myfile

First off, you could do it all in one pass. Btw., it is good style to initialize variables you are going to use, instead of taking them for granted:

Code:
awk '
BEGIN {
     total1=0;
     total2=0;
     total3=0;
}

{
     total1 += $3;
     total2 += $5;
     total3 += $7;
}

END {
     print total1;
     print total2;
     print total3;
}' $myfile

It should be easy for you now to modify the script according to your requirement.

Another point is: Might there be lines which don't need processing? Lets consider the following:

Code:
# header line with no meaning
string, value, value, value
string, value, value, value
# another line with no meaning
string, value, value, value

You don't want to process the lines 1 and 4 in this case. Do you have an idea how to achieve this, from what i told you?

Still, there is a more subtle point I'd like to raise - one, which isn't explicitly covered by your requirement, but is best learned from the very beginning of ones programming career: If you get user input you should validate it! You write:

Code:
myfile=$1

From where do you know that "$1" is a legitimate directory (or a legitimate file, for that purposes)? Lets say i enter "yourscript /foo/bar/gnarble/furble/isnodirorfileatall". What would your script do?

You might want to read the man page for "test" (which is handy Unix utility and comes under two names: "test" and "[") to understand the following:

Code:
if [ -d "$1" -a -r "$1" ] ; then
     mydir="$1"
else
     echo "ERROR: no reasonable directory name given" >&2
fi

I hope this helps.

bakunin

Last edited by bakunin; 11-29-2010 at 06:15 AM..
 

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

sum of more than 1 column

Hi I want to sum of 3 columns in file. Example: I want to sum of 3 ,6,8 th columns in file(SUM(3,6,8)). Using awk can sum of single column awk '{a+=$3} END {printf ("%f\n",a)' file_name Thanks inadvance MR (2 Replies)
Discussion started by: mohan705
2 Replies

2. Shell Programming and Scripting

sum

Hello everyone I need to write a script that sums numbers passed to it as arguments on the command line and displays the results. I must use a for loop and then rewrite it using a while loop. It would have to output something like 10+20+30=60 this is what I have so far fafountain@hfc:~$ vi sum... (1 Reply)
Discussion started by: Blinky85
1 Replies

3. Shell Programming and Scripting

Print sum and relative value of the sum

Hi i data looks like this: student 1 Subject1 45 55 Subject2 44 55 Subject3 33 44 // student 2 Subject1 45 55 Subject2 44 55 Subject3 33 44 i would like to sum $2, $3 (marks) and divide each entry in $2 and $3 with their respective sums and print for each student as $4 and... (2 Replies)
Discussion started by: saint2006
2 Replies

4. Solaris

How to Sum

Hi I need to incorporate a 'sum' as follows into a script and not sure how. I have a variable per line and I need them to be summed, e.g below 1 23 1,456 1 1 34 46 How do I calculate the sum of all these numbers to ouptut the answer ( 1,562) Thanks in advance (3 Replies)
Discussion started by: rob171171
3 Replies

5. UNIX for Dummies Questions & Answers

Getting the sum

I am trying to get the sum of the first column of a file. When I use the same method for other files it works just fine... for some reason for the file below it gives me an error that I don't understand... I tried looking at different lines of the file and tried different things, but I still... (7 Replies)
Discussion started by: cosmologist
7 Replies

6. Shell Programming and Scripting

How to sum these value.

Hi, Unix Gurus, I need sum values from a file. file format like: 0004004 0000817 0045000 0045000 0045000 0045000 0045000 0045000 0045000 0045000 0045000 0045000 0004406 the result should be 459227 (817+45000 ... + 4406) anybody can help me out (7 Replies)
Discussion started by: ken002
7 Replies

7. Shell Programming and Scripting

Format the value of sum

I have a list of values ( in Kb) I have the following code to sum up the values and convert the total to GB cat list 701368101370 101370101370 801554101370 701636101370 101757101370 101876101370 901951101370 And this is the output of my script awk '{ s += $1 } END {... (3 Replies)
Discussion started by: Sara_84
3 Replies

8. Shell Programming and Scripting

Sum of all columns

Hi Friends, I have a file with fields separated with comma. How to print sum of each field of the file? Eg: input file 1,3,6,7 2,1,2,1 0,1,1,0 I want to sum each field separately. Output file 3,5,9,8 Thanks, Suresh (2 Replies)
Discussion started by: suresh3566
2 Replies

9. Shell Programming and Scripting

How to get the sum?

hi Gurus, I have an extract as seen below: INPUT 2015-08-24 15:00:00.0 |TCSERVER01 |ServiceEventHandler |2283 2015-08-24 15:01:00.0 |TCSERVER01 |ServiceEventHandler |576 2015-08-24 15:02:00.0 |TCSERVER01 |ServiceEventHandler |833 2015-08-24 15:03:00.0 |TCSERVER01 |ServiceEventHandler... (6 Replies)
Discussion started by: reignangel2003
6 Replies

10. Shell Programming and Scripting

Sum of Columns

HI Guys, I gave Input file F.Txt ID H1 H2 H3 H4 H5 A 5 6 7 8 9 B 4 65 4 4 7 C 4 4 4 4 4 D 4 4 4 4 4 Output :- ID H1 H2 H3 H4 H5 Total 17 79 19 20 24 Sum of Each Columns (8 Replies)
Discussion started by: pareshkp
8 Replies
All times are GMT -4. The time now is 01:24 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy