02-24-2016
Hi ks_reddy,
Assuming that the Check values in your input are all numeric values, I note that RudiC's code will sort the header from your input file to the end of your output file. And by using the 2nd field as the primary sort key, the output will be grouped by (alphanumeric; not numeric) Check values while your input seems to be grouped by Key values.
Does your real input have all lines for each distinct Key value grouped together?
Do you want the header line in the output file? If so, does the header need to be kept as the first line in the output?
Does the order of other lines in the output matter? If so, does the input order need to be maintained in the output? Or is a different sort order required (and, if so, what order)?
Approximately how many distinct Key values are there in your real input? Approximately how many of those Key values will need to be removed?
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have got a file like this
003ABC00281020091005000100042.810001 ... (8 Replies)
Discussion started by: Maruti
8 Replies
2. Shell Programming and Scripting
Hi,
I want to get rid of multiple rows (duplicate, triplicate etc..) for only column 1.
e.g.
iu 2
iu 1
iu 3
k 4
jk 3
nm 4
nm 2
output
k 4
jk 3
thanks (7 Replies)
Discussion started by: phil_heath
7 Replies
3. Shell Programming and Scripting
Sorry I made a mistake in my last post (output is suppose to be the opposite). Here is a revised post.
Hi,
I am not sure if this has already been asked (I tried the search but the search was too broad). Basically I want to remove rows based on another file.
So file1 looks like this (tab... (3 Replies)
Discussion started by: kylle345
3 Replies
4. Shell Programming and Scripting
Hi,
I'm using AIX(ksh shell).
> cat temp.txt
"a","b",0
"c",bc",0
"a1","b1",0
"cc","cb",1
"cc","b2",1
"bb","bc",2
I want the output as:
"a","b","c","bc","a1","b1"
"cc","cb","cc","b2"
"bb","bc"
I want to combine multiple lines into single line where third column is same.
Is... (1 Reply)
Discussion started by: samuelray
1 Replies
5. Shell Programming and Scripting
HI all,
I have a simple challenge for you.. I have the following pipe delimited file
2345|98|1809||x|969|0
2345|98|0809||y|0|537
2345|97|9809||x|544|0
2345|97|0909||y|0|651
9685|98|7809||x|321|0
9685|98|7909||y|0|357
9685|98|7809||x|687|0
9685|98|0809||y|0|234
2315|98|0809||x|564|0
... (2 Replies)
Discussion started by: nithins007
2 Replies
6. Shell Programming and Scripting
Hello All,
I have a .CSV file where I expect all numeric data in all the columns other than column headers.
But sometimes I get the files (result of statistics computation by other persons) like below( sample data)
SNO,Data1,Data2,Data3
1,2,3,4
2,3,4,SOME STRING
3,4,Inf,5
4,5,4,4
I... (9 Replies)
Discussion started by: ks_reddy
9 Replies
7. Shell Programming and Scripting
Hello experts,
Shown below is the 2 column sample data(there are many data columns in actual input file),
Key, Data
A, 1
A, 2
A, 2
A, 3
A, 1
A, 1
A, 1
I need the below output.
Key, Data
A, 2
A, 2
A, 3
A, 1
A, 1
A, 1 (2 Replies)
Discussion started by: ks_reddy
2 Replies
8. Shell Programming and Scripting
Hi I have a matrix with n rows and m columns like below example. i want to extract all the pairs with values <200.
Input
A B C D
A 100 206 51 300
B 206 100 72 48
C 351 22 100 198
D 13 989 150 100
Output format
A,A:200
A,C:51
B,B:100... (2 Replies)
Discussion started by: anurupa777
2 Replies
9. Shell Programming and Scripting
I have a file some thing like this:
GN Name=YWHAB;
RC TISSUE=Keratinocyte;
RC TISSUE=Thymus;
CC -!- FUNCTION: Adapter protein implicated in the regulation of a large
CC spectrum of both general and specialized signaling pathways
GN Name=YWHAE;
RC TISSUE=Liver;
RC ... (13 Replies)
Discussion started by: raj_k
13 Replies
10. Shell Programming and Scripting
Hello
I want to collapse a file with multiple rows into consolidated lines of entries based on selected columns as the 'key'.
Example:
1 2 3 Abc def ghi
1 2 3 jkl mno p qrts
6 9 0 mno def Abc
7 8 4 Abc mno mno abc
7 8 9 mno mno abc
7 8 9 mno j k
So if columns 1, 2 and 3 are... (6 Replies)
Discussion started by: linuxlearner123
6 Replies
LEARN ABOUT OPENDARWIN
uniq
UNIQ(1) BSD General Commands Manual UNIQ(1)
NAME
uniq -- report or filter out repeated lines in a file
SYNOPSIS
uniq [-c | -d | -u] [-i] [-f num] [-s chars] [input_file [output_file]]
DESCRIPTION
The uniq utility reads the specified input_file comparing adjacent lines, and writes a copy of each unique input line to the output_file. If
input_file is a single dash ('-') or absent, the standard input is read. If output_file is absent, standard output is used for output. The
second and succeeding copies of identical adjacent input lines are not written. Repeated lines in the input will not be detected if they are
not adjacent, so it may be necessary to sort the files first.
The following options are available:
-c Precede each output line with the count of the number of times the line occurred in the input, followed by a single space.
-d Only output lines that are repeated in the input.
-f num Ignore the first num fields in each input line when doing comparisons. A field is a string of non-blank characters separated from
adjacent fields by blanks. Field numbers are one based, i.e. the first field is field one.
-s chars
Ignore the first chars characters in each input line when doing comparisons. If specified in conjunction with the -f option, the
first chars characters after the first num fields will be ignored. Character numbers are one based, i.e. the first character is
character one.
-u Only output lines that are not repeated in the input.
-i Case insensitive comparison of lines.
DIAGNOSTICS
The uniq utility exits 0 on success, and >0 if an error occurs.
COMPATIBILITY
The historic +number and -number options have been deprecated but are still supported in this implementation.
SEE ALSO
sort(1)
STANDARDS
The uniq utility is expected to be IEEE Std 1003.2 (``POSIX.2'') compatible.
HISTORY
A uniq command appeared in Version 3 AT&T UNIX.
BSD
June 6, 1993 BSD