Remove lines containing 2 or more duplicate strings
Post 302964812 by Scrutinizer on Tuesday 19th of January 2016 03:06:41 PM
@Bakunin, that would only work with adjacent words and would also match partial patterns:
Code:
$ echo foo foobar | sed '/\([^ ]*\) \1/d'
$

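And because [^ ]* can match zero characters, \1 can be empty, so the pattern matches any single space and every line that contains one gets deleted: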
Code:
$ echo abc def ghi | sed '/\([^ ]*\) \1/d'
$
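
One way around both problems is to compare whole words instead of using a back-reference. The following is only a minimal sketch, assuming that "strings" means whitespace-separated words and that a line should be dropped as soon as any word occurs twice anywhere on it. The function name remove_dup_word_lines is made up for illustration and is not from the thread.

```shell
# Hypothetical helper: print only the lines in which no
# whitespace-separated word occurs more than once.
remove_dup_word_lines() {
  awk '{
    split("", seen)            # clear the per-line lookup table (portable idiom)
    for (i = 1; i <= NF; i++)
      if (seen[$i]++) next     # word already seen on this line: skip the line
    print                      # every word was unique: keep the line
  }' "$@"
}
```

With this, "foo foobar" and "abc def ghi" are both kept, because fields are compared as whole words, while "foo bar foo" is removed even though the repeated words are not adjacent. It reads standard input when no file names are given.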

