Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Finding data value that contains x% of points Post 302743825 by Corona688 on Thursday 13th of December 2012 11:34:35 AM
Old 12-13-2012
Your data doesn't seem to have a normal distribution.

There's a much more obvious way anyway, don't know why it didn't occur to me before Smilie Sort it, then look past the % number of lines you want for the threshold.

Code:
sort -n data > sorted

awk 'NR==FNR { N++; next } FNR > (.8*N) { print $1 ; exit }' sorted sorted

rm -f sorted

This User Gave Thanks to Corona688 For This Post:
 

10 More Discussions You Might Find Interesting

1. UNIX and Linux Applications

Gnuplot question: how to plot 3D points as colored points in map view?

I have a simple gnuplot question. I have a set of points (list of x,y,z values; irregularly spaced, i.e. no grid) that I want to plot. I want the plot to look like this: - points in map view (no 3D view) - color of each point should depend on its z-value. - I want to define my own color scale -... (0 Replies)
Discussion started by: karman
0 Replies

2. Shell Programming and Scripting

to extarct data points

suppose u have a file which consist of many data points separated by asterisk Question is to extract third part in each line . 0.0002*0.003*-0.93939*0.0202*0.322*0.3332*0.2222*0.22020 0.003*0.3333*0.33322*-0.2220*0.3030*0.2222*0.3331*-0.3030 0.0393*0.3039*-0.03038*0.033*0.4033*0.30384*0.4048... (5 Replies)
Discussion started by: cdfd123
5 Replies

3. Shell Programming and Scripting

recoding data points using SED??

Hello all, I have a data file that needs some serious work...I have no idea how to implement the changes that are needed! The file is a genotypic file with >64,000 columns representing genetic markers, a header line, and >1100 rows that looks like this: ID 1 2 3 4 ... (7 Replies)
Discussion started by: doobedoo
7 Replies

4. Shell Programming and Scripting

Writing an algorithm to recode data points

I have a file that has been partially recoded so that data points that were formerly letter combinations are now -1, 0, or 1. I need to finish recoding the GG and CC data points. The file looks like this: ID 1 2 3 4 5 6 7 8 83845676 0 0 0 0 CC -1 CC CC 838469. -1 -1 1 GG CC 0 CC 1 83847041... (10 Replies)
Discussion started by: doobedoo
10 Replies

5. Shell Programming and Scripting

Group search (multiple data points) in Linux

Hi All I have a data set like this tab delimited: weft fgr-1 345 -1 fgrythdgd weft fgr-3 456 -2 ghjdklflllff weft fgr-11 456 -3 ghtjuffl weft fgr-1 213 -2 ghtyjdkl weft fgr-34 567 -5 fghytkflf frgt fgr-36 567 -1 ghrjufjf frgt fgr-45 678 -2 ghjruir frgt fgr-34 546 -5 gjjjgkldlld frgt... (4 Replies)
Discussion started by: Lucky Ali
4 Replies

6. UNIX for Dummies Questions & Answers

How to get data only inside polygon created by points which is part of whole data from file?

hiii, Help me out..i have a huge set of data stored in a file.This file has has 2 columns which is latitude & longitude of a region. Now i have a program which asks for the number of points & based on this number it asks the user to enter that latitude & longitude values which are in the same... (7 Replies)
Discussion started by: reva
7 Replies

7. Programming

GNUPLOT- how to change the style of data points

Hi, I am trying to arrange my graphs with GNUPLOT. Although it looked like simple at the beginning, I could not figure out an answer for the following: I want to change the style of my data points (not the line, just exact data points) The terminal assigns first + and then x to them but what I... (0 Replies)
Discussion started by: natasha
0 Replies

8. Shell Programming and Scripting

Calculate difference between consecutive data points in a column from a file

Hi, I have a file with one column data (sample below) and I am trying to write a shell script to calculate the difference between consecutive data valuse i.e Var = Ni -N(i-1) 0.3141 -3.6595 0.9171 5.2001 3.5331 3.7022 -6.1087 -5.1039 -9.8144 1.6516 -2.725 3.982 7.769 8.88 (5 Replies)
Discussion started by: malandisa
5 Replies

9. Shell Programming and Scripting

Grabbing data between 2 points in text file

I have a text file that shows the output of my solar inverters. I want to separate this into sections. overview , device 1 , device 2 , device 3. Each device has different number of lines. but they all have unique starting points. Overview starts with 6 #'s, Devices have 4#'s and their data starts... (6 Replies)
Discussion started by: Mikey
6 Replies

10. Shell Programming and Scripting

Ranking data points from multiple files

I need to rank a large number of data points that exist in multiple files. My data points (Column 3) are based on unique values in columns 1 and 2. I need to rank the values that are in File 1, Column 3. For instance: Input File 1 AAA BBB 10 CCC DDD 16 EEE FFF 20 Input File 2 ... (47 Replies)
Discussion started by: ncwxpanther
47 Replies
sortbib(1)						      General Commands Manual							sortbib(1)

Name
       sortbib - sort bibliographic database

Syntax
       sortbib [-sKEYS] database...

Description
       The  command sorts files of records containing refer key-letters by user-specified keys.  Records may be separated by blank lines, or by .[
       and .] delimiters, but the two styles may not be mixed together.  This program reads through each database and pulls out key fields,  which
       are  sorted  separately.   The sorted key fields contain the file pointer, byte offset, and length of corresponding records.  These records
       are delivered using disk seeks and reads, so may not be used in a pipeline to read standard input.

       By default, alphabetizes by the first %A and the %D fields, which contain the senior author and date.  The -s option is used to specify new
       KEYS.  For instance, -sATD will sort by author, title, and date, while -sA+D will sort by all authors, and date.  Sort keys past the fourth
       are not meaningful.  No more than 16 databases may be sorted together at one time.  Records longer than 4096 characters will be truncated.

       The command sorts on the last word on the %A line, which is assumed to be the author's last name.  A word in the final  position,  such	as
       ``jr.''	or  ``ed.'',  will be ignored if the name beforehand ends with a comma.  Authors with two-word last names or unusual constructions
       can be sorted correctly by using the convention ``'' in place of a blank.  A %Q field is considered to be the same as %A, except  sorting
       begins  with  the first, not the last, word.  The command sorts on the last word of the %D line, usually the year.  It also ignores leading
       articles (like ``A'' or ``The'') when sorting by titles in the %T or %J fields; it will ignore articles of any  modern  European  language.
       If a sort-significant field is absent from a record, places that record before other records containing that field.

Options
       -sKEYS
	  Specifies new sort KEYS.  For example, ATD sorts by author, title, and date.

See Also
       addbib(1), indxbib(1), lookbib(1), refer(1), roffbib(1)

																	sortbib(1)
All times are GMT -4. The time now is 10:13 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy