Please help me with this. I want to extract the rows whose column 1 value is repeated at least 4 times in a file, and then summarize each such group by its max, mean, median and min. The file is sorted by column 1, so all the repeated rows appear together.
If the number of elements n is odd, the median is the middle one: element number (n+1)/2 of the sorted values, e.g. the 4th element among 7 sorted numbers.
If the number of elements is even, the median is the average of the middle two: elements n/2 and n/2 + 1, e.g. the average of the 4th and 5th elements in a set of 8 sorted numbers.
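The request and the median rule above can be sketched in Python as follows. This is a minimal sketch under assumptions the question does not confirm: fields are whitespace-separated and the value to summarize sits in column 2. The function name `summarize_repeats` and the sample data are made up for illustration. Because the file is sorted by column 1, `itertools.groupby` can collect each run of equal keys in a single pass.

```python
from itertools import groupby
from statistics import mean, median


def summarize_repeats(lines, min_count=4):
    """Return (key, max, mean, median, min) for each column-1 key
    that occurs at least min_count times in consecutive rows.

    Assumes whitespace-separated fields with the numeric value in
    column 2 (an assumption, not stated in the question).
    """
    rows = (line.split() for line in lines if line.strip())
    result = []
    # groupby only merges adjacent rows, which is fine for sorted input.
    for key, group in groupby(rows, key=lambda r: r[0]):
        values = [float(r[1]) for r in group]
        if len(values) >= min_count:
            result.append(
                (key, max(values), mean(values), median(values), min(values))
            )
    return result


# Hypothetical sample data: "a" repeats 4 times, "b" twice, "c" 5 times.
sample = [
    "a 1", "a 2", "a 3", "a 4",
    "b 5", "b 6",
    "c 7", "c 1", "c 4", "c 2", "c 9",
]

for key, hi, avg, med, lo in summarize_repeats(sample):
    print(key, hi, avg, med, lo)
```

`statistics.median` applies the same odd/even rule described above, so it does not need to be coded by hand. For a real file, pass the open file object instead of `sample`.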
hi all
Can anyone please let me know if there is a way to find duplicate rows in a file? I have a file that contains hundreds of numbers, one per line.
I want to find the numbers that are repeated in the file.
eg.
123434
534
5575
4746767
347624
5575
i want 5575
please help (3 Replies)
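One possible way to get the repeated numbers, sketched in Python: count every line and keep the values seen more than once. The function name `repeated_values` is hypothetical, and the input is assumed to be one number per line, as in the example. No sorting is needed because `collections.Counter` counts occurrences wherever they appear.

```python
from collections import Counter


def repeated_values(lines):
    """Return each distinct value that occurs more than once,
    in order of first appearance."""
    counts = Counter(line.strip() for line in lines if line.strip())
    return [value for value, n in counts.items() if n > 1]


# The numbers from the example above.
numbers = ["123434", "534", "5575", "4746767", "347624", "5575"]
print(repeated_values(numbers))
```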
I have searched the internet for duplicate row extracting.
All I have seen is extracting good rows or eliminating duplicate rows.
How do I extract duplicate rows from a flat file in Unix?
I'm using Korn shell on HP Unix.
For example:
FlatFile.txt
========
123:456:678
123:456:678
123:456:876... (5 Replies)
Hi all,
I have written a shell script. The output file of this script contains SQL output.
From that file, I want to extract the rows that have multiple entries (duplicate rows).
For example, the output file will be like the following way.
... (7 Replies)
Hi! I have a file as below:
line1
line2
line2
line3
line3
line3
line4
line4
line4
line4
I would like to extract only the lines that appear exactly twice (not unique, triplicate or quadruplicate lines). The output should be as below:
line2
line2
I would appreciate if anyone can help. Thanks. (4 Replies)
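A minimal Python sketch of this filter, assuming "duplicate" means a line that occurs exactly twice in the whole file: count every line first, then keep only the lines whose count equals the wanted number. The function name `lines_with_count` is made up for illustration.

```python
from collections import Counter


def lines_with_count(lines, wanted=2):
    """Keep every line whose total number of occurrences equals wanted."""
    counts = Counter(lines)
    return [line for line in lines if counts[line] == wanted]


# The sample file from the post above.
data = [
    "line1",
    "line2", "line2",
    "line3", "line3", "line3",
    "line4", "line4", "line4", "line4",
]
for line in lines_with_count(data):
    print(line)
```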
Hi ,
I have a data file in this format.
p1 p2 p3
10 0
10 0 1000
I am using a SQL*Loader (sqlldr) script to load the data into the database table. There is a unique constraint on the columns p1 and p2.
So, sqlldr cannot load both the records. This eliminates duplicate records from being... (1 Reply)
I feel stupid for asking this, because it seems that my MySQL code isn't working the way I think it should.
Basically I wrote code like this:
select * from `Test_DC_Trailer` HAVING max(DR_RefKey);
Here DR_RefKey is a unique numeric field that is auto-incremented (like a primary key)... (7 Replies)
Hi all
I have a file that has two columns, and I need the maximum value in column 2 for each group of 4 positions or rows. For example, at each position {1..3} there are 4 characters (A, C, G and T), each with a value in column 2. I need the maximum value in column 2 and the corresponding... (2 Replies)
I want to duplicate each row in my file
Egfile.txt
Name State Age
Jack NJ 34
John MA 23
Jessica FL 45
I want the code to produce this output
Name State Age
Jack NJ 34
Jack NJ 34
John MA 23
John MA 23
Jessica FL 45
Jessica FL 45 (6 Replies)
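Judging by the expected output above, the header line is printed once and every data row twice. A minimal Python sketch under that assumption (the function name `duplicate_rows` and the `header_rows` parameter are hypothetical):

```python
def duplicate_rows(lines, header_rows=1):
    """Return the header lines once, followed by each data line twice."""
    out = list(lines[:header_rows])
    for line in lines[header_rows:]:
        out.extend([line, line])
    return out


# The rows of Egfile.txt from the post above.
rows = [
    "Name State Age",
    "Jack NJ 34",
    "John MA 23",
    "Jessica FL 45",
]
for line in duplicate_rows(rows):
    print(line)
```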
Hi,
I have a file that contains multiple records of the same database.
I need to search for the maximum size of the database. At the moment, I am doing as below:
Sample generated file to parse is as below. With the caret (^) delimiter, field 1 is the database name, 2 is the database ID and... (3 Replies)
Discussion started by: newbie_01
smokeping_matchers_medratio
..::lib::Smokeping::matchers::Medratio(3) SmokePing ..::lib::Smokeping::matchers::Medratio(3)
NAME
Smokeping::matchers::Medratio - detect changes in the latency median
OVERVIEW
The Medratio matcher establishes a historic median latency over several measurement rounds. It compares this median against a second
median latency value, again built over several rounds of measurement.
By looking at the median value, this matcher is largely immune to spikes and will only react to long-term developments.
DESCRIPTION
Call the matcher with the following sequence:
type = matcher
pattern = Medratio(historic=>a,current=>b,comparator=>o,percentage=>p)
historic
The number of values to use for building the 'historic' median.
current
The number of values to use for building the 'current' median.
comparator
Which comparison operator should be used to compare current/historic with percentage.
percentage
Right hand side of the comparison.
old <--- historic ---><--- current ---> now
EXAMPLE
Take the 12 last median values. Build the median out of the first 10 and the median from the other 2 values. Divide the results and decide
if it is bigger than 150 percent.
Medratio(historic=>10,current=>2,comparator=>'>',percentage=>150);
med(current)/med(historic) > 150/100
This means the matcher will activate when the current latency median is more than 1.5 times the historic latency median established over
the last 10 rounds of measurement.
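The comparison described above can be illustrated with a short Python sketch. This is not the Smokeping implementation (which is written in Perl); the function name `medratio_triggers`, the set of supported comparators, and the handling of short or zero-median windows are all assumptions made for illustration.

```python
import operator
from statistics import median

# Assumed set of comparison operators; the real matcher may accept others.
COMPARATORS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
    "==": operator.eq,
}


def medratio_triggers(values, historic, current, comparator, percentage):
    """Evaluate med(current) / med(historic) <comparator> percentage / 100
    over the last historic + current values (oldest first)."""
    window = values[-(historic + current):]
    if len(window) < historic + current:
        return False  # assumption: not enough data means no trigger
    historic_median = median(window[:historic])
    current_median = median(window[historic:])
    if historic_median == 0:
        return False  # assumption: avoid dividing by zero
    ratio = current_median / historic_median
    return COMPARATORS[comparator](ratio, percentage / 100)


# Ten steady rounds at 10 ms, then two slow rounds: 17 / 10 = 1.7 > 1.5.
latencies = [10] * 10 + [16, 18]
print(medratio_triggers(latencies, 10, 2, ">", 150))
```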
COPYRIGHT
Copyright (c) 2006 by OETIKER+PARTNER AG. All rights reserved.
SPONSORSHIP
The development of this matcher has been paid for by Virtela Communications, <http://www.virtela.net/>.
LICENSE
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation,
Inc., 675 Mass Ave, Cambridge, MA 02139, USA.
AUTHOR
Tobias Oetiker <tobi@oetiker.ch>
2.6.8 2012-02-26 ..::lib::Smokeping::matchers::Medratio(3)