Delete duplicate like pattern lines Post: 302996829

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

delete semi-duplicate lines from file?

Ok here's what I'm trying to do. I need to get a listing of all the mountpoints on a system into a file, which is easy enough, just using something like "mount | awk '{print $1}'" However, on a couple of systems, they have some mount points looking like this: /stage /stand /usr /MFPIS...

2. UNIX for Dummies Questions & Answers

Delete duplicate lines and print to file

OK, I have read several things on how to do this, but can't make it work. I am writing this to a vi file then calling it as an awk script. So I need to search a file for duplicate lines, delete duplicate lines, then write the result to another file, say /home/accountant/files/docs/nodup ...

3. UNIX for Dummies Questions & Answers

How to delete or remove duplicate lines in a file

Hi please help me how to remove duplicate lines in any file. I have a file having huge number of lines. i want to remove selected lines in it. And also if there exists duplicate lines, I want to delete the rest & just keep one of them. Please help me with any unix commands or even fortran...

4. Shell Programming and Scripting

Delete Lines between the pattern

Hi All, Below is my requirement. Whatever coming in between ' ', needs to delete. Input File Contents: ============== This is nice 'boy' This 'is bad boy.' Got it Expected Output =========== This is nice This Got it

5. UNIX for Dummies Questions & Answers

Delete lines with duplicate strings based on date

Hey all, a relative bash/script newbie trying solve a problem. I've got a text file with lots of lines that I've been able to clean up and format with awk/sed/cut, but now I'd like to remove the lines with duplicate usernames based on time stamp. Here's what the data looks like 2007-11-03...

6. UNIX for Dummies Questions & Answers

How to delete partial duplicate lines unix

hi :) I need to delete partial duplicate lines I have this in a file sihp8027,/opt/cf20,1980182 sihp8027,/opt/oracle/10gRelIIcd,155200016 sihp8027,/opt/oracle/10gRelIIcd,155200176 sihp8027,/var/opt/ERP,10376312 and need to leave it like this: sihp8027,/opt/cf20,1980182...

7. UNIX for Advanced & Expert Users

In a huge file, Delete duplicate lines leaving unique lines

Hi All, I have a very huge file (4GB) which has duplicate lines. I want to delete duplicate lines leaving unique lines. Sort, uniq, awk '!x++' are not working as its running out of buffer space. I dont know if this works : I want to read each line of the File in a For Loop, and want to...

8. Shell Programming and Scripting

sed pattern to delete lines containing a pattern, except the first occurance

Hello sed gurus. I am using ksh on Sun and have a file created by concatenating several other files. All files contain header rows. I just need to keep the first occurrence and remove all other header rows. header for file 1111 2222 3333 header for file 1111 2222 3333 header for file...

9. Shell Programming and Scripting

Delete duplicate lines... with a twist!

Hi, I'm sorry I'm no coder so I came here, counting on your free time and good will to beg for spoonfeeding some good code. I'll try to be quick and concise! Got file with 50k lines like this: "Heh, heh. Those darn ninjas. They're _____."*wacky The "canebrake", "timber" & "pygmy" are types...

10. Shell Programming and Scripting

How to delete all lines before a particular pattern when the pattern is defined in a variable?

I have a file Line 1 a Line 22 Line 33 Line 1 b Line 22 Line 1 c Line 4 Line 5 I want to delete all lines before last occurrence of a line which contains something which is defined in a variable. Say a variable var contains 'Line 1', then I need the following in the output. ...

LEARN ABOUT DEBIAN

fdupes

FDUPES(1)						      General Commands Manual							 FDUPES(1)

NAME

       fdupes - finds duplicate files in a given set of directories

SYNOPSIS

       fdupes [ options ] DIRECTORY ...

DESCRIPTION

       Searches  the  given  path for duplicate files. Such files are found by comparing file sizes and MD5 signatures, followed by a byte-by-byte
       comparison.

OPTIONS

       -r --recurse
	      for every directory given follow subdirectories encountered within

       -R --recurse:
	      for each directory given after this option follow subdirectories encountered within (note the ':' at the	end  of  option;  see  the
	      Examples section below for further explanation)

       -s --symlinks
	      follow symlinked directories

       -H --hardlinks
	      normally, when two or more files point to the same disk area they are treated as non-duplicates; this option will change this behav-
	      ior

       -n --noempty
	      exclude zero-length files from consideration

       -f --omitfirst
	      omit the first file in each set of matches

       -A --nohidden
	      exclude hidden files from consideration

       -1 --sameline
	      list each set of matches on a single line

       -S --size
	      show size of duplicate files

       -m --summarize
	      summarize duplicate files information

       -q --quiet
	      hide progress indicator

       -d --delete
	      prompt user for files to preserve, deleting all others (see CAVEATS below)

       -N --noprompt
	      when used together with --delete, preserve the first file in each set of duplicates and delete the others without prompting the user

       -v --version
	      display fdupes version

       -h --help
	      displays help

SEE ALSO

       md5sum(1)

NOTES

       Unless -1 or --sameline is specified, duplicate files are listed together in groups, each file displayed on a separate line. The groups are
       then separated from each other by blank lines.

       When -1 or --sameline is specified, spaces and backslash characters  () appearing in a filename are preceded by a backslash character.

EXAMPLES

       fdupes a --recurse: b
	      will follow subdirectories under b, but not those under a.

       fdupes a --recurse b
	      will follow subdirectories under both a and b.

CAVEATS

       If  fdupes  returns  with  an error message such as fdupes: error invoking md5sum it means the program has been compiled to use an external
       program to calculate MD5 signatures (otherwise, fdupes uses internal routines for this purpose), and an error has occurred while attempting
       to execute it. If this is the case, the specified program should be properly installed prior to running fdupes.

       When using -d or --delete, care should be taken to insure against accidental data loss.

       When used together with options -s or --symlink, a user could accidentally preserve a symlink while deleting the file it points to.

       Furthermore, when specifying a particular directory more than once, all files within that directory will be listed as their own duplicates,
       leading to data loss should a user preserve a file without its "duplicate" (the file itself!).

AUTHOR

       Adrian Lopez <adrian2@caribe.net>

																	 FDUPES(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

delete semi-duplicate lines from file?

Discussion started by: paqman

2. UNIX for Dummies Questions & Answers

Delete duplicate lines and print to file

Discussion started by: bfurlong

3. UNIX for Dummies Questions & Answers

How to delete or remove duplicate lines in a file

Discussion started by: reva

4. Shell Programming and Scripting

Delete Lines between the pattern

Discussion started by: susau_79