Removing duplicates depending on file size Post: 302830135

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

removing duplicates from a file

I have a file with some 1000 entries. It contains entries like 1000,ram 2000,pankaj 1001,rahim 1000,ram 2532,govind 2000,pankaj 3000,venkat 2532,govind. I want to extract only the distinct rows from this file, so my output should contain only 1000,ram...

2. Shell Programming and Scripting

Removing duplicates in a sorted file by field.

I have data like this: It's sorted by the 2nd field (TID). envoy,90000000000000634600010001,04/11/2008,23:19:27,RB00266,0015,DETAIL,ERROR, envoy,90000000000000634600010001,04/12/2008,04:23:45,RB00266,0015,DETAIL,ERROR,...

3. UNIX for Dummies Questions & Answers

removing duplicates of a pattern from a file

Hey all, I need some help. I have a text file with names in it. If a particular pattern occurs in that file more than once, I want to rename all the occurrences of that pattern with alternate patterns. E.g., if I have PATTERN occurring 5 times, then I want to...

4. Shell Programming and Scripting

Removing duplicates from log file?

I have a log file with posts looking like this: -- Messages can be delivered by different systems at different times. The id number is used to sort out duplicate messages. What I need is to strip the arrival time from each post, sort posts by id number, and reattach arrival time to respective...

5. Shell Programming and Scripting

Removing Duplicates from file

6. Shell Programming and Scripting

formatting a file and removing duplicates

Hi, I have a file that I want to change the format of. It is a large file in rows but I want it to be comma separated (comma then a space). The current file looks like this: HI, Joe, Bob, Jack, Jack After I would want to remove any duplicates so it would look like this: HI, Joe,...

7. UNIX for Dummies Questions & Answers

Removing duplicates from a file

Hi All, I am merging files coming from 2 different systems. While doing that, I am getting duplicate entries in the merged file: I,01,000131,764,2,4.00 I,01,000131,765,2,4.00 I,01,000131,772,2,4.00 I,01,000131,773,2,4.00 I,01,000168,762,2,2.00 I,01,000168,763,2,2.00...

8. UNIX for Dummies Questions & Answers

Grep from pattern file without removing duplicates?

9. Shell Programming and Scripting

Removing duplicates from new file

I have two files, and I want to remove/delete all the duplicate lines in file2, viz. unix,unix2,unix3.

10. Shell Programming and Scripting

Removing duplicates from new file

I have two files, and I want to remove/delete all the duplicate lines in file2, viz. unix,unix2,unix3. I have tried the previous post also, but there the complete line must be similar. In this case I have to check the first column only, regardless of the content of the succeeding columns.
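Two of the threads above describe the same task at different granularity: the first wants distinct whole rows, while the last wants only the first column compared. A minimal sketch with sort might look like the following. The function names are hypothetical, the output comes back sorted rather than in input order, and which of several lines sharing a key survives is best checked on your own system rather than assumed.

```shell
#!/bin/sh
# Illustrative helpers (names are hypothetical, not taken from the threads).
# Both read the named files, or standard input when no file is given.

# Keep one copy of each identical line.
distinct_lines() {
    LC_ALL=C sort -u -- "$@"
}

# Keep one line per distinct first comma-separated field.
# -t, sets the field separator, -k1,1 limits the key to field 1,
# and -u emits only one line from each run of equal keys.
distinct_by_first_field() {
    LC_ALL=C sort -t, -k1,1 -u -- "$@"
}

# Example use:
#   printf '1000,ram\n2000,pankaj\n1000,ram\n' | distinct_lines
```

If the original line order matters, sort is the wrong tool on its own, because its output is always reordered.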

LEARN ABOUT CENTOS

sort

SORT(1) 							   User Commands							   SORT(1)

NAME

       sort - sort lines of text files

SYNOPSIS

       sort [OPTION]... [FILE]...
       sort [OPTION]... --files0-from=F

DESCRIPTION

       Write sorted concatenation of all FILE(s) to standard output.

       Mandatory arguments to long options are mandatory for short options too.

       Ordering options:

       -b, --ignore-leading-blanks
	      ignore leading blanks

       -d, --dictionary-order
	      consider only blanks and alphanumeric characters

       -f, --ignore-case
	      fold lower case to upper case characters

       -g, --general-numeric-sort
	      compare according to general numerical value

       -i, --ignore-nonprinting
	      consider only printable characters

       -M, --month-sort
	      compare (unknown) < 'JAN' < ... < 'DEC'

       -h, --human-numeric-sort
	      compare human readable numbers (e.g., 2K 1G)

       -n, --numeric-sort
	      compare according to string numerical value

       -R, --random-sort
	      sort by random hash of keys

       --random-source=FILE
	      get random bytes from FILE

       -r, --reverse
	      reverse the result of comparisons

       --sort=WORD
	      sort according to WORD: general-numeric -g, human-numeric -h, month -M, numeric -n, random -R, version -V

       -V, --version-sort
	      natural sort of (version) numbers within text
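The ordering options above change how two lines compare, not which lines are kept. As a rough illustration, assuming GNU sort (the wrapper names below are made up for this sketch), each function applies one ordering mode to standard input:

```shell
#!/bin/sh
# Hypothetical wrappers, one per ordering mode, reading standard input.

# Numeric: 9 sorts before 10 and 100 (plain byte order would give 10, 100, 9).
sort_numeric() { LC_ALL=C sort -n; }

# Human-readable sizes: a bare number sorts before 2K, which sorts before 1G.
sort_human() { LC_ALL=C sort -h; }

# Version order: v1.2 sorts before v1.10.
sort_version() { LC_ALL=C sort -V; }

# Case-folded comparison, with the result reversed.
sort_nocase_reverse() { LC_ALL=C sort -f -r; }

# Example use:
#   printf '10\n9\n100\n' | sort_numeric
```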

       Other options:

       --batch-size=NMERGE
	      merge at most NMERGE inputs at once; for more use temp files

       -c, --check, --check=diagnose-first
	      check for sorted input; do not sort

       -C, --check=quiet, --check=silent
	      like -c, but do not report first bad line

       --compress-program=PROG
	      compress temporaries with PROG; decompress them with PROG -d

       --debug
	      annotate the part of the line used to sort, and warn about questionable usage to stderr

       --files0-from=F
	      read input from the files specified by NUL-terminated names in file F; If F is - then read names from standard input

       -k, --key=KEYDEF
	      sort via a key; KEYDEF gives location and type

       -m, --merge
	      merge already sorted files; do not sort

       -o, --output=FILE
	      write result to FILE instead of standard output

       -s, --stable
	      stabilize sort by disabling last-resort comparison

       -S, --buffer-size=SIZE
	      use SIZE for main memory buffer

       -t, --field-separator=SEP
	      use SEP instead of non-blank to blank transition

       -T, --temporary-directory=DIR
	      use DIR for temporaries, not $TMPDIR or /tmp; multiple options specify multiple directories

       --parallel=N
	      change the number of sorts run concurrently to N

       -u, --unique
	      with -c, check for strict ordering; without -c, output only the first of an equal run

       -z, --zero-terminated
	      end lines with 0 byte, not newline

       --help
	      display this help and exit

       --version
	      output version information and exit
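Several of the options above combine naturally when the goal is removing duplicates. The following is a sketch only, with hypothetical function names: it uses -c with -u to test whether a file is already sorted and duplicate-free, -m with -u to merge pre-sorted files, and -o to write the result back over the input.

```shell
#!/bin/sh
# Hypothetical helpers built on -c, -u, -m and -o.

# Succeeds only if FILE is already sorted AND has no repeated lines
# (-c alone checks order; adding -u makes the ordering check strict).
is_sorted_unique() {
    LC_ALL=C sort -c -u -- "$1" 2>/dev/null
}

# Merge two files that are each already sorted, dropping repeats.
merge_unique() {
    LC_ALL=C sort -m -u -- "$1" "$2"
}

# De-duplicate FILE in place. GNU sort reads all of its input before
# opening the -o target, so naming the same file twice is safe here.
dedupe_in_place() {
    LC_ALL=C sort -u -o "$1" -- "$1"
}
```

Checking with is_sorted_unique first can save a full sort on files that are already clean.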

       KEYDEF is F[.C][OPTS][,F[.C][OPTS]] for start and stop position, where F is a field number and C a character position in the field; both
       are origin 1, and the stop position defaults to the line's end. If neither -t nor -b is in effect, characters in a field are counted from
       the beginning of the preceding whitespace. OPTS is one or more single-letter ordering options [bdfgiMhnRrV], which override global
       ordering options for that key. If no key is given, use the entire line as the key.
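The KEYDEF syntax is easier to read with concrete cases. The sketch below assumes comma-separated input, and its function names are invented for illustration. It shows a field-limited numeric key, a character-range key, and a key with no stop position.

```shell
#!/bin/sh
# Hypothetical helpers showing KEYDEF forms on comma-separated input.

# Sort numerically on field 2 only: F=2 for both start and stop, OPTS=n.
sort_by_second_numeric() {
    LC_ALL=C sort -t, -k2,2n
}

# Sort on characters 2 through 4 of field 1 (the F.C form, both origin 1).
sort_by_chars_2_to_4() {
    LC_ALL=C sort -t, -k1.2,1.4
}

# With no stop position the key runs to the end of the line,
# so -k2 compares field 2 together with everything after it.
sort_from_second_field() {
    LC_ALL=C sort -t, -k2
}
```

Writing the stop position explicitly (-k2,2 rather than -k2) matters most with -u, because the key decides which lines count as duplicates.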

       SIZE may be followed by the following multiplicative suffixes: % 1% of memory, b 1, K 1024 (default), and so on for M, G, T, P, E, Z, Y.
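For large files, the buffer size and the temporary directory are the two settings most likely to matter. A minimal sketch, assuming GNU sort: the function name, argument order, and default values are illustrative assumptions and not recommendations.

```shell
#!/bin/sh
# Hypothetical helper: de-duplicate a large file with an explicit
# memory buffer and scratch directory.
#   $1 input file, $2 output file,
#   $3 SIZE for -S (assumed default: 25% of memory),
#   $4 directory for temporaries (assumed default: $TMPDIR or /tmp).
dedupe_large() {
    input=$1
    output=$2
    buffer=${3:-25%}
    scratch=${4:-${TMPDIR:-/tmp}}

    LC_ALL=C sort -u -S "$buffer" -T "$scratch" -o "$output" -- "$input"
}

# Example use:
#   dedupe_large big.txt big.dedup.txt 512M /var/tmp
```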

       With no FILE, or when FILE is -, read standard input.

       *** WARNING *** The locale specified by the environment affects sort order. Set LC_ALL=C to get the traditional sort order that uses
       native byte values.
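The effect of the warning above is easiest to see with mixed-case input. A small sketch follows; the wrapper name is hypothetical.

```shell
#!/bin/sh
# Hypothetical wrapper: sort by raw byte values regardless of the
# caller's locale settings. Reads the named files, or standard input.
sort_bytes() {
    LC_ALL=C sort -- "$@"
}

# In the C locale every uppercase ASCII letter sorts before every
# lowercase one; other locales may order the two cases differently.
# Example use:
#   printf 'b\nA\na\nB\n' | sort_bytes
```

Setting LC_ALL=C per command, as done here, keeps results reproducible across machines without changing the user's environment.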

       GNU coreutils online help: <http://www.gnu.org/software/coreutils/>
       Report sort translation bugs to <http://translationproject.org/team/>

AUTHOR

       Written by Mike Haertel and Paul Eggert.

COPYRIGHT

       Copyright (C) 2013 Free Software Foundation, Inc.  License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>.
       This is free software: you are free to change and redistribute it.  There is NO WARRANTY, to the extent permitted by law.

SEE ALSO

       uniq(1)

       The full documentation for sort is maintained as a Texinfo manual. If the info and sort programs are properly installed at your site, the
       command

	      info coreutils 'sort invocation'

       should give you access to the complete manual.

GNU coreutils 8.22						     June 2014								   SORT(1)
