Delete duplicate strings in a line Post: 302879523

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Delete lines with duplicate strings based on date

Hey all, a relative bash/script newbie trying solve a problem. I've got a text file with lots of lines that I've been able to clean up and format with awk/sed/cut, but now I'd like to remove the lines with duplicate usernames based on time stamp. Here's what the data looks like 2007-11-03...

2. Shell Programming and Scripting

delete repeated strings (tags) in a line and concatenate corresponding words

Hello friends! Each line of my input file has this format: word<TAB>tag1<blankspace>lemma<TAB>tag2<blankspace>lemma ... <TAB>tag3<blankspace>lemma Of this file I need to eliminate all the repeated tags (of the same word) in a line, as in the example here below, but conserving both (all) the...

3. UNIX for Dummies Questions & Answers

Delete strings in file1 based on the list of strings in file2

Hello guys, should be a very easy questn for you: I need to delete strings in file1 based on the list of strings in file2. like file2: word1_word2_ word3_word5_ word3_word4_ word6_word7_ file1: word1_word2_otherwords..,word3_word5_others...

4. Shell Programming and Scripting

How to delete a duplicate line and original with sed.

I am completely new to shell scripting but have been assigned the task of creating several batch files to manipulate data. My final task requires me to find lines that have duplicates present then delete not only the duplicate but the original as well. The script will be used in a windows...

5. UNIX for Dummies Questions & Answers

Delete duplicate second line

Hi ALL I need a help I need to retain only the first line of 035 if I have two line before =040 , if only one then need to take that Eg: Input =035 (ABC)12324141241 =035 (XYZPQR)704124 =040 AB$QS$WEWR =035 (ABC)08080880809 =035 (XYZPQR)9809314 =040 ...

6. Shell Programming and Scripting

Delete lines in file containing duplicate strings, keeping longer strings

The question is not as simple as the title... I have a file, it looks like this <string name="string1">RZ-LED</string> <string name="string2">2.0</string> <string name="string2">Version 2.0</string> <string name="string3">BP</string> I would like to check for duplicate entries of...

7. Shell Programming and Scripting

Delete Duplicate line (not really) from the file

I need help in figuring out hoe to delete lines in a data file. The data file is huge. I am currently using "vi" to search and delete the lines - which is cumbersome since it takes lots of time to save that file (due to its huge size). Here is the issue. I have a data file with the following...

8. Shell Programming and Scripting

Delete 2 strings from 1 line with sed?

Hi guys, I wonder if it's possible to search for a line containing 2 strings and delete that line and perhaps replace the source file with already deleted line(s). What I mean is something like this: sourcefile.txt line1: something 122344 somethin2 24334 45554676 line2: another something...

9. UNIX for Dummies Questions & Answers

Log file - Delete duplicate line & keep last date

Hello All ! I need your help on this case, I have a csv file with this: ITEM105;ARI FSR;2016-02-01 08:02;243 ITEM101;ARI FSR;2016-02-01 06:02;240 ITEM032;RNO TLE;2016-02-01 11:03;320 ITEM032;RNO TLE;2016-02-02 05:43;320 ITEM032;RNO TLE;2016-02-01 02:03;320 ITEM032;RNO...

10. Shell Programming and Scripting

Find duplicate values in specific column and delete all the duplicate values

Dear folks I have a map file of around 54K lines and some of the values in the second column have the same value and I want to find them and delete all of the same values. I looked over duplicate commands but my case is not to keep one of the duplicate values. I want to remove all of the same...

LEARN ABOUT DEBIAN

fdupes

FDUPES(1)						      General Commands Manual							 FDUPES(1)

NAME

       fdupes - finds duplicate files in a given set of directories

SYNOPSIS

       fdupes [ options ] DIRECTORY ...

DESCRIPTION

       Searches  the  given  path for duplicate files. Such files are found by comparing file sizes and MD5 signatures, followed by a byte-by-byte
       comparison.

OPTIONS

       -r --recurse
	      for every directory given follow subdirectories encountered within

       -R --recurse:
	      for each directory given after this option follow subdirectories encountered within (note the ':' at the	end  of  option;  see  the
	      Examples section below for further explanation)

       -s --symlinks
	      follow symlinked directories

       -H --hardlinks
	      normally, when two or more files point to the same disk area they are treated as non-duplicates; this option will change this behav-
	      ior

       -n --noempty
	      exclude zero-length files from consideration

       -f --omitfirst
	      omit the first file in each set of matches

       -A --nohidden
	      exclude hidden files from consideration

       -1 --sameline
	      list each set of matches on a single line

       -S --size
	      show size of duplicate files

       -m --summarize
	      summarize duplicate files information

       -q --quiet
	      hide progress indicator

       -d --delete
	      prompt user for files to preserve, deleting all others (see CAVEATS below)

       -N --noprompt
	      when used together with --delete, preserve the first file in each set of duplicates and delete the others without prompting the user

       -v --version
	      display fdupes version

       -h --help
	      displays help

SEE ALSO

       md5sum(1)

NOTES

       Unless -1 or --sameline is specified, duplicate files are listed together in groups, each file displayed on a separate line. The groups are
       then separated from each other by blank lines.

       When -1 or --sameline is specified, spaces and backslash characters  () appearing in a filename are preceded by a backslash character.

EXAMPLES

       fdupes a --recurse: b
	      will follow subdirectories under b, but not those under a.

       fdupes a --recurse b
	      will follow subdirectories under both a and b.

CAVEATS

       If  fdupes  returns  with  an error message such as fdupes: error invoking md5sum it means the program has been compiled to use an external
       program to calculate MD5 signatures (otherwise, fdupes uses internal routines for this purpose), and an error has occurred while attempting
       to execute it. If this is the case, the specified program should be properly installed prior to running fdupes.

       When using -d or --delete, care should be taken to insure against accidental data loss.

       When used together with options -s or --symlink, a user could accidentally preserve a symlink while deleting the file it points to.

       Furthermore, when specifying a particular directory more than once, all files within that directory will be listed as their own duplicates,
       leading to data loss should a user preserve a file without its "duplicate" (the file itself!).

AUTHOR

       Adrian Lopez <adrian2@caribe.net>

																	 FDUPES(1)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Delete lines with duplicate strings based on date

Discussion started by: mattv

2. Shell Programming and Scripting

delete repeated strings (tags) in a line and concatenate corresponding words

Discussion started by: mjomba

3. UNIX for Dummies Questions & Answers

Delete strings in file1 based on the list of strings in file2

Discussion started by: roussine

4. Shell Programming and Scripting

How to delete a duplicate line and original with sed.

Discussion started by: chino_1