05-03-2012
remove duplicate lines with condition
hi to all
Does anyone know if there's a way to remove duplicate lines which we consider the same only if they have the first and the second column the same?
For example I have :
us2333 bbb 5
us2333 bbb 3
us2333 bbb 2
and I want to get
us2333 bbb 10
The thing is I cannot remove the 3rd line and then use uniq -c
as the 3rd line holds an information which is needed.Besides,it is the sum
of the 3rd column (the 3rd column is a result of a previous uniq -c)
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have following file content (3 fields each line):
23 888 10.0.0.1
dfh 787 10.0.0.2
dssf dgfas 10.0.0.3
dsgas dg 10.0.0.4
df dasa 10.0.0.5
df dag 10.0.0.5
dfd dfdas 10.0.0.5
dfd dfd 10.0.0.6
daf nfd 10.0.0.6
...
as can be seen, that the third field is ip address and sorted. but... (3 Replies)
Discussion started by: fredao
3 Replies
2. UNIX for Dummies Questions & Answers
I have a log file "logreport" that contains several lines as seen below:
04:20:00 /usr/lib/snmp/snmpdx: Agent snmpd appeared dead but responded to ping
06:38:08 /usr/lib/snmp/snmpdx: Agent snmpd appeared dead but responded to ping
07:11:05 /usr/lib/snmp/snmpdx: Agent snmpd appeared dead but... (18 Replies)
Discussion started by: Nysif Steve
18 Replies
3. Shell Programming and Scripting
Hi,
I came to know that using awk '!x++' removes the duplicate lines. Can anyone please explain the above syntax. I want to understand how the above awk syntax removes the duplicates.
Thanks in advance,
sudvishw :confused: (7 Replies)
Discussion started by: sudvishw
7 Replies
4. Shell Programming and Scripting
Hi, I have a huge file which is about 50GB. There are many lines. The file format likes
21 rs885550 0 9887804 C C T C C C C C C C
21 rs210498 0 9928860 0 0 C C 0 0 0 0 0 0
21 rs303304 0 9941889 A A A A A A A A A A
22 rs303304 0 9941890 0 A A A A A A A A A
The question is that there are a few... (4 Replies)
Discussion started by: zhshqzyc
4 Replies
5. Shell Programming and Scripting
Hey guys, need some help to fix this script. I am trying to remove all the duplicate lines in this file.
I wrote the following script, but does not work. What is the problem?
The output file should only contain five lines:
Later! (5 Replies)
Discussion started by: Ernst
5 Replies
6. Shell Programming and Scripting
Hi
Ive been scratching over this for some time with no solution.
I have a file like this
1 bla bla 1
2 bla bla 2
4 bla bla 3
5 bla bla 1
6 bla bla 1
I want to remove consecutive occurrences of lines like bla bla 1, but the first column may be different.
Any ideasss?? (23 Replies)
Discussion started by: jamie_123
23 Replies
7. UNIX for Dummies Questions & Answers
Hi
I need this output. Thanks.
Input:
TAZ
YET
FOO
FOO
VAK
TAZ
BAR
Output:
YET
VAK
BAR (10 Replies)
Discussion started by: tara123
10 Replies
8. Shell Programming and Scripting
Hi,
I have a csv file which contains some millions of lines in it.
The first line(Header) repeats at every 50000th line. I want to remove all the duplicate headers from the second occurance(should not remove the first line).
I don't want to use any pattern from the Header as I have some... (7 Replies)
Discussion started by: sudhakar T
7 Replies
9. Shell Programming and Scripting
Hi Gents,
Please can you help me to get the desired output .
In the first column I have some duplicate records, The condition is that all need to reject the duplicate record keeping the last occurrence. But the condition is. If the last occurrence is equal to value 14 or 98 in column 3 and... (2 Replies)
Discussion started by: jiam912
2 Replies
10. Shell Programming and Scripting
Hi All,
I am storing the result in the variable result_text using the below code.
result_text=$(printf "$result_text\t\n$name") The result_text is having the below text. Which is having duplicate lines.
file and time for the interval 03:30 - 03:45
file and time for the interval 03:30 - 03:45 ... (4 Replies)
Discussion started by: nalu
4 Replies
fixman(1M) fixman(1M)
NAME
fixman - fix manpages for faster viewing with man command
SYNOPSIS
alt-path]
DESCRIPTION
The command is a shell script that processes manpages in the directories to unexpand spaces to tabs where possible, and to remove all char-
acter-backspace pairs (which usually exist to cause overstriking or underscoring for printer output). Removal of unnecessary character
sequences improves the speed of man(1), and reduces disk space consumption. The command should be run after using to create formatted,
manpages from unformatted, source files (see catman(1M)).
By default, searches for subdirectories in the following parent directories in the order indicated:
o
o
o
If the environment variable is set, the directory paths specified by are searched instead of the default. See environ(5) for a description
of the environment variable.
The command does not remove duplicate blank lines. Thus, all files remain a multiple of one page (66 lines) long and can still be passed
directly to (see lp(1)). (Note that man(1) normally uses to accomplish this removal.)
To ensure success, should be run by a user who has appropriate privileges. It will take awhile to complete depending on system speed,
load, memory size, etc. As a side-effect, file ownerships and permissions may be changed.
Options
Perform actions based on the given alternate root.
With this option, alt-path will be prepended to all directory paths, including default paths or the paths defined by
EXTERNAL INFLUENCES
Environment Variables
if set, defines the directories to be searched for manpages.
WARNINGS
If the value of is not the same while is running as it was when was run or when manpage files were installed, some files may be missed and
not processed (see catman(1M)).
EXAMPLES
Run fixman from a server to fix the manpages on a diskless under the alternate root
This will fix manpages in directories under:
FILES
Directories containing [compressed]
nroff(1)-formatted versions of manpages
AUTHOR
was developed by HP.
SEE ALSO
catman(1M), chmod(1), expand(1), lp(1), man(1), mv(1), sed(1), environ(5).
fixman(1M)