Is it possible to extract rows with the same first column and then calculate its percentage?
A short excerpt of my .txt file looks like:
This is a 2-part question:
1) Is there a way for me to first extract each unique ID from the first column, together with all of its rows (all the rows that start with 'CXRA3Z2J9MQKR', 'A162JX4ML69UIC', etc.), to a new .txt file?
2) After that, I ultimately need to calculate the percentage of A's, B's, and C's for each ID (ex: 'CXRA3Z2J9MQKR') in my data. So user 'BRNTTJUB8GXE9' would have: A=100%, B=0%, C=0%.
Is there a way to do math such as calculating percentage (not with numbers, but percentage of letters like in this case) in UNIX?
Thanks in advance for any help or feedback. I'm new to UNIX, and I'm being forced to learn it 'on the job' at my new workplace.
Last edited by Scott; 06-27-2010 at 06:29 AM..
Reason: Code tags, please...
This generates the extract you want. To get the analysis, put it into Excel, or download OpenOffice and use the spreadsheet in there:
Analysis is possible in UNIX. I think it would just be easier for you in Excel.
Thanks for your help. But that awk script tallies up all the occurrences of A, B, and C across the whole file, rather than counting the A's, B's, and C's separately for each unique ID in the 1st column. So basically the script gave me a result of 5 A's, 4 B's, and 3 C's for the excerpt I provided.
Is there a way to count them per unique ID in the first column instead? That's why I had thought it might be better to first extract all the rows by ID. Is there a quick way to do that as well?
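For the extraction step, here is a minimal sketch of one way it could be done with awk. It assumes the file is whitespace-separated with the ID in the first column; the function name `split_by_id` and the `ID.txt` naming scheme are made up for illustration.

```shell
# Hypothetical helper: copies every row of the input into a file named
# after the ID in column 1 (e.g. CXRA3Z2J9MQKR.txt) inside directory $1.
# Assumes whitespace-separated columns with the ID first.
split_by_id() {
    outdir=$1
    shift
    awk -v dir="$outdir" '
        NF {                              # skip blank lines
            file = dir "/" $1 ".txt"
            print >> file                 # append the row to its per-ID file
            close(file)                   # keep the number of open files small
        }
    ' "$@"
}
```

Something like `split_by_id out_dir data.txt` would then leave one file per ID in `out_dir` (both names are placeholders). Because the rows are appended, running it twice into the same directory duplicates them, so start with an empty directory.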
Can you please explain how you came up with it? You don't have to do every line, but just the general gist of the code, for learning purposes?
I'm wondering specifically what you did so that it counts each unique item in the first column separately while still including the responses from the 2nd column.
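The general idea behind counting per ID can be sketched as follows. This is not the script posted in the thread, just an illustration under the assumption that column 1 holds the ID and column 2 holds a single letter A, B, or C; `pct_by_id` is a made-up name.

```shell
# Hypothetical helper: prints the percentage of A, B and C rows per ID.
# Assumes column 1 is the ID and column 2 is one of A, B or C.
pct_by_id() {
    awk '
        NF >= 2 {
            total[$1]++        # number of rows seen for this ID
            count[$1, $2]++    # number of rows for this ID with this letter
        }
        END {
            for (id in total)
                printf "%s A=%.0f%% B=%.0f%% C=%.0f%%\n", id,
                    100 * count[id, "A"] / total[id],
                    100 * count[id, "B"] / total[id],
                    100 * count[id, "C"] / total[id]
        }
    ' "$@" | sort
}
```

The trick is the two-part array key `count[$1, $2]`: awk keeps a separate counter for every ID/letter pair, so the A's of one ID never get mixed with the A's of another. An ID whose rows are all A would come out as `A=100% B=0% C=0%`.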
I have an input text file in this format:
ITEM1 10.9 20.1
ITEM2 11.6 12
ITEM3 14 15.7
ITEM5 20 50.6
ITEM6 25 23.6
I want to print those lines which have more than 5% difference between second and third columns. (8 Replies)
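A rough sketch of one way to do this with awk. The excerpt does not say which column the 5% is relative to, so this assumes the difference is measured relative to the second column; `diff_over_pct` is a made-up name.

```shell
# Hypothetical helper: prints lines where column 3 differs from column 2
# by more than $1 percent, measured relative to column 2 (an assumption).
diff_over_pct() {
    limit=$1
    shift
    awk -v limit="$limit" '
        NF >= 3 && $2 + 0 != 0 {          # skip short lines and avoid dividing by zero
            d = ($3 - $2) / $2 * 100
            if (d < 0) d = -d             # absolute value
            if (d > limit) print
        }
    ' "$@"
}
```

Called as `diff_over_pct 5 input.txt` (file name is a placeholder), this would print the ITEM1, ITEM3, ITEM5 and ITEM6 lines from the sample above.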
Hello,
I've got a bunch of numbers here, e.g.:
6065
6094
6348
6297
6161
6377
6338
6290
How do I find out whether there is a difference of 10% or more between any of these numbers? I am trying to do this in Bash, but no luck so far. Does anyone have an idea?
Thanks,
- Pascal... (9 Replies)
I have 100 csv files like:
file_city_1 file_city_2 file_city_3 file_city_4
The city name is variable: there are 25 cities, and each city has 4 regions. Each of the 4 regions contains some statistics like:
parameter1 : number1
parameter1 : number2
.....
parameter50 : number50
... (7 Replies)
I have tried the following to no avail.
xargs -n8 < test.txt
awk '{if(NR%6!=0){p=""}else{p="\n"};printf $0" "p}' Mod_Alm_log.txt > test.txt
I have tried different variations of the above; the problem is that it mixes lines together.
And it includes the tags "%a" and "%A". I need them to be all tab... (16 Replies)
Input File:
5081
2058
175
8282
2358
7347
6612
3459
END OF INPUT FILE
I need to know how to calculate the minimum, maximum, and average of the values in the file, and also what percentage of the values is over some user-defined value, for example what percentage is over 1000 and what percentage is over 5000.
By... (2 Replies)
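One possible awk sketch for this kind of summary. It assumes one number per line and ignores anything that is not a plain non-negative number (such as a trailing marker line); the function name `stats` and the output format are made up.

```shell
# Hypothetical helper: prints min, max, average and the percentage of
# values above threshold $1. Assumes one non-negative number per line;
# lines that are not plain numbers are ignored.
stats() {
    threshold=$1
    shift
    awk -v t="$threshold" '
        $1 ~ /^[0-9]+(\.[0-9]+)?$/ {
            v = $1 + 0
            if (n == 0 || v < min) min = v
            if (n == 0 || v > max) max = v
            sum += v
            if (v > t) over++
            n++
        }
        END {
            if (n > 0)
                printf "min=%s max=%s avg=%.2f over_%s=%.1f%%\n",
                    min, max, sum / n, t, 100 * over / n
        }
    ' "$@"
}
```

For the eight sample values above, `stats 1000 file.txt` (file name is a placeholder) should report `min=175 max=8282 avg=4421.50 over_1000=87.5%`, and `stats 5000 file.txt` should report 50.0% over the threshold.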
hello,
please can you help me.
jj and kk are two numbers which are the result of an sql program.
I would like to calculate the ratio jj/kk*100.
I have done this:
ratio=$((jj/kk * 100)) or ratio=`expr $jj \/ expr $kk) but the result is 0
What can I do?
Thanks for help. (3 Replies)
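The 0 comes from integer division: shell arithmetic truncates jj/kk to 0 before the multiplication whenever jj is smaller than kk. A small sketch of two common workarounds follows; the function names are made up, and neither version guards against kk being 0.

```shell
# Hypothetical helpers. $1 plays the role of jj and $2 the role of kk.

# Integer result: multiply first so the division does not truncate to 0.
percent_int() {
    echo $(( $1 * 100 / $2 ))
}

# Decimal result: let awk do the floating-point arithmetic.
percent_dec() {
    awk -v j="$1" -v k="$2" 'BEGIN { printf "%.2f\n", j / k * 100 }'
}
```

Usage would look like `ratio=$(percent_int "$jj" "$kk")` or `ratio=$(percent_dec "$jj" "$kk")`.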
Hi
I need an awk script to calculate a percentage.
I have to pass the parameters into the awk script and calculate the percentage.
Sum = 50
passed = 43
failed = 7
I need to pass these values into the awk script and calculate the percentage.
Please advise me. (8 Replies)
I have 3 files like:
total.dat=18
equal.dat=14
notequal.dat=16
I need to find the equal percentage, meaning:
equalpercentage = ($equal.dat / $total.dat * 100)
How can I do this?
I tried some of the answers in this forum for calculating percentages, but they didn't work. Someone please... (6 Replies)
Hi, I have a file which contains the following two columns.
518 _factorial
256 _main
73 _atol
52 ___do_global_ctors
170 ___main
52 ___do_g
How can I calculate the percentage of each value in the first column?
First I need to get the sum of the first column and... (3 Replies)
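Since the total is only known after the last line has been read, one approach is to remember the rows and print them at the end. A minimal awk sketch, assuming the value is in column 1 and the name in column 2; `pct_of_total` is a made-up name.

```shell
# Hypothetical helper: prints each row followed by its share of the
# column-1 total. Assumes the value is in column 1 and a name in column 2.
pct_of_total() {
    awk '
        NF {                                  # remember every non-blank row
            value[NR] = $1
            name[NR] = $2
            sum += $1
            last = NR
        }
        END {
            if (sum == 0) exit                # nothing to divide by
            for (i = 1; i <= last; i++)
                if (i in value)
                    printf "%s %s %.2f%%\n", value[i], name[i], 100 * value[i] / sum
        }
    ' "$@"
}
```

This keeps the whole file in memory, which is fine for short listings like the one above. For very large files, reading the file twice (once for the sum, once for the output) would be an alternative.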
int percent (int a, int b)
{
    if (b/a*100 > 25)
        return TRUE;
    else
        return FALSE;
}
I want to calculate what percentage of a is b.
Say b = 48 and a = 100,
so b is 48% of a.
But wouldn't b/a give me 0? What can be done? (6 Replies)