02-24-2016
Hi ks_reddy,
Assuming that the Check values in your input are all numeric values, I note that RudiC's code will sort the header from your input file to the end of your output file. And by using the 2nd field as the primary sort key, the output will be grouped by (alphanumeric; not numeric) Check values while your input seems to be grouped by Key values.
Does your real input have all lines for each distinct Key value grouped together?
Do you want the header line in the output file? If so, does the header need to be kept as the first line in the output?
Does the order of other lines in the output matter? If so, does the input order need to be maintained in the output? Or is a different sort order required (and, if so, what order)?
Approximately how many distinct Key values are there in your real input? Approximately how many of those Key values will need to be removed?
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have got a file like this
003ABC00281020091005000100042.810001 ... (8 Replies)
Discussion started by: Maruti
8 Replies
2. Shell Programming and Scripting
Hi,
I want to get rid of multiple rows (duplicate, triplicate etc..) for only column 1.
e.g.
iu 2
iu 1
iu 3
k 4
jk 3
nm 4
nm 2
output
k 4
jk 3
thanks (7 Replies)
Discussion started by: phil_heath
7 Replies
3. Shell Programming and Scripting
Sorry I made a mistake in my last post (output is suppose to be the opposite). Here is a revised post.
Hi,
I am not sure if this has already been asked (I tried the search but the search was too broad). Basically I want to remove rows based on another file.
So file1 looks like this (tab... (3 Replies)
Discussion started by: kylle345
3 Replies
4. Shell Programming and Scripting
Hi,
I'm using AIX(ksh shell).
> cat temp.txt
"a","b",0
"c",bc",0
"a1","b1",0
"cc","cb",1
"cc","b2",1
"bb","bc",2
I want the output as:
"a","b","c","bc","a1","b1"
"cc","cb","cc","b2"
"bb","bc"
I want to combine multiple lines into single line where third column is same.
Is... (1 Reply)
Discussion started by: samuelray
1 Replies
5. Shell Programming and Scripting
HI all,
I have a simple challenge for you.. I have the following pipe delimited file
2345|98|1809||x|969|0
2345|98|0809||y|0|537
2345|97|9809||x|544|0
2345|97|0909||y|0|651
9685|98|7809||x|321|0
9685|98|7909||y|0|357
9685|98|7809||x|687|0
9685|98|0809||y|0|234
2315|98|0809||x|564|0
... (2 Replies)
Discussion started by: nithins007
2 Replies
6. Shell Programming and Scripting
Hello All,
I have a .CSV file where I expect all numeric data in all the columns other than column headers.
But sometimes I get the files (result of statistics computation by other persons) like below( sample data)
SNO,Data1,Data2,Data3
1,2,3,4
2,3,4,SOME STRING
3,4,Inf,5
4,5,4,4
I... (9 Replies)
Discussion started by: ks_reddy
9 Replies
7. Shell Programming and Scripting
Hello experts,
Shown below is the 2 column sample data(there are many data columns in actual input file),
Key, Data
A, 1
A, 2
A, 2
A, 3
A, 1
A, 1
A, 1
I need the below output.
Key, Data
A, 2
A, 2
A, 3
A, 1
A, 1
A, 1 (2 Replies)
Discussion started by: ks_reddy
2 Replies
8. Shell Programming and Scripting
Hi I have a matrix with n rows and m columns like below example. i want to extract all the pairs with values <200.
Input
A B C D
A 100 206 51 300
B 206 100 72 48
C 351 22 100 198
D 13 989 150 100
Output format
A,A:200
A,C:51
B,B:100... (2 Replies)
Discussion started by: anurupa777
2 Replies
9. Shell Programming and Scripting
I have a file some thing like this:
GN Name=YWHAB;
RC TISSUE=Keratinocyte;
RC TISSUE=Thymus;
CC -!- FUNCTION: Adapter protein implicated in the regulation of a large
CC spectrum of both general and specialized signaling pathways
GN Name=YWHAE;
RC TISSUE=Liver;
RC ... (13 Replies)
Discussion started by: raj_k
13 Replies
10. Shell Programming and Scripting
Hello
I want to collapse a file with multiple rows into consolidated lines of entries based on selected columns as the 'key'.
Example:
1 2 3 Abc def ghi
1 2 3 jkl mno p qrts
6 9 0 mno def Abc
7 8 4 Abc mno mno abc
7 8 9 mno mno abc
7 8 9 mno j k
So if columns 1, 2 and 3 are... (6 Replies)
Discussion started by: linuxlearner123
6 Replies
UNIQ(1) General Commands Manual UNIQ(1)
NAME
uniq - report repeated lines in a file
SYNOPSIS
uniq [ -udc [ +n ] [ -n ] ] [ input [ output ] ]
DESCRIPTION
Uniq reads the input file comparing adjacent lines. In the normal case, the second and succeeding copies of repeated lines are removed;
the remainder is written on the output file. Note that repeated lines must be adjacent in order to be found; see sort(1). If the -u flag
is used, just the lines that are not repeated in the original file are output. The -d option specifies that one copy of just the repeated
lines is to be written. The normal mode output is the union of the -u and -d mode outputs.
The -c option supersedes -u and -d and generates an output report in default style but with each line preceded by a count of the number of
times it occurred.
The n arguments specify skipping an initial portion of each line in the comparison:
-n The first n fields together with any blanks before each are ignored. A field is defined as a string of non-space, non-tab charac-
ters separated by tabs and spaces from its neighbors.
+n The first n characters are ignored. Fields are skipped before characters.
SEE ALSO
sort(1), comm(1)
UNIQ(1)