How to remove duplicates without sorting Post: 302159411

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

fastest way to remove duplicates.

I have searched the FAQ - by using sort, duplicates, etc.... but I didn't get any articles or results on it. Currently, I am using: sort -u file1 > file2 to remove duplicates. For a file size of 1giga byte approx. time taken to remove duplicates is 1hr 21 mins. Is there any other faster way...

2. Shell Programming and Scripting

Remove duplicates

Hello Experts, I have two files named old and new. Below are my example files. I need to compare and print the records that only exist in my new file. I tried the below awk script, this script works perfectly well if the records have exact match, the issue I have is my old file has got extra...

3. Shell Programming and Scripting

Script to remove duplicates

Hi I need a script that removes the duplicate records and write it to a new file for example I have a file named test.txt and it looks like abcd.23 abcd.24 abcd.25 qwer.25 qwer.26 qwer.98 I want to pick only $1 and compare with the next record and the output should be abcd.23...

4. Shell Programming and Scripting

remove duplicates and sort

Hi, I'm using the below command to sort and remove duplicates in a file. But, i need to make this applied to the same file instead of directing it to another. Thanks

5. Shell Programming and Scripting

Perl, sorting and eliminating duplicates

Hi guys! I'm trying to eliminate some duplicates from a file but I'm like this :wall: !!! My file looks like this: ID_1 0.02 ID_2 2.4e-2 ID_2 4.3.e-9 ID_3 0.003 ID_4 0.2 ID_4 0.05 ID_5 1.2e-3 What I need is to eliminate all the duplicates considering the first column (in this...

6. Shell Programming and Scripting

bash - remove duplicates

I need to use a bash script to remove duplicate files from a download list, but I cannot use uniq because the urls are different. I need to go from this: http://***/fae78fe/file1.wmv http://***/39du7si/file1.wmv http://***/d8el2hd/file2.wmv http://***/h893js3/file2.wmv to this: ...

7. Shell Programming and Scripting

awk remove first duplicates

Hi All, I have searched many threads for possible close solution. But I was unable to get simlar scenario. I would like to print all duplicate based on 3rd column except the first occurance. Also would like to print if it is single entry(non-duplicate). i/P file 12 NIL ABD LON 11 NIL ABC...

8. Shell Programming and Scripting

Remove duplicates

9. Shell Programming and Scripting

Sort and Remove duplicates

Here is my task : I need to sort two input files and remove duplicates in the output files : Sort by 13 characters from 97 Ascending Sort by 1 characters from 96 Ascending If duplicates are found retain the first value in the file the input files are variable length, convert...

10. Shell Programming and Scripting

Remove duplicates

Hi I have a below file structure. 200,1245,E1,1,E1,,7611068,KWH,30, ,,,,,,,, 200,1245,E1,1,E1,,7611070,KWH,30, ,,,,,,,, 300,20140223,0.001,0.001,0.001,0.001,0.001 300,20140224,0.001,0.001,0.001,0.001,0.001 300,20140225,0.001,0.001,0.001,0.001,0.001 300,20140226,0.001,0.001,0.001,0.001,0.001...

LEARN ABOUT REDHAT

amplot

AMPLOT(8)						      System Manager's Manual							 AMPLOT(8)

NAME

       amplot - visualize the behavior of Amanda

SYNOPSIS

       amplot [ -c ] [ -e ] [ -g ] [ -l ] [ -p ] [ -t T ] amdump_files

DESCRIPTION

       Amplot  reads  an  amdump  output file that Amanda generates each run (e.g.  amdump.1) and translates the information into a picture format
       that may be used to determine how your installation is doing and if any parameters need to be changed.  Amplot also prints out amdump lines
       that  it  either  does  not understand or knows to be warning or error lines and a summary of the start, end and total time for each backup
       image.

       Amplot is a shell script that executes an awk program (amplot.awk) to scan the amdump output file.  It  then  executes  a  gnuplot  program
       (amplot.g)  to  generate the graph.  The awk program is written in an enhanced version of awk, such as GNU awk (gawk version 2.15 or later)
       or nawk.

       During execution, amplot generates a few temporary files that gnuplot uses.  These files are deleted at the end of execution.

       See the amanda(8) man page for more details about Amanda.

OPTIONS

       -c     Compress amdump_files after plotting.

       -e     Extend the X (time) axis if needed.

       -g     Direct gnuplot output directly to the X11 display (default).

       -p     Direct postscript output to file YYYYMMDD.ps (opposite of -g).

       -l     Generate landscape oriented output.

       -t T   Set the right edge of the plot to be T hours.

       The amdump_files may be in various compressed formats (compress, gzip, pact, compact).

INTERPRETATION

       The figure is divided into a number of regions.	There are titles on the top that show important statistical information about the configu-
       ration  and  from  this execution of amdump.  In the figure, the X axis is time, with 0 being the moment amdump was started.  The Y axis is
       divided into 5 regions:

	      QUEUES: How many backups have not been started, how many are waiting on space in the holding disk and how many have been transferred
	      successfully to tape.

	      %BANDWIDTH: Percentage of allowed network bandwidth in use.

	      HOLDING DISK: The higher line depicts space allocated on the holding disk to backups in progress and completed backups waiting to be
	      written to tape.	The lower line depicts the fraction of the holding disk containing completed backups waiting to be written to tape
	      including the file currently being written to tape.  The scale is percentage of the holding disk.

	      TAPE: Tape drive usage.

	      %DUMPERS: Percentage of active dumpers.

       The idle period at the left of the graph is time amdump is asking the machines how much data they are going to dump.  This process can take
       a while if hosts are down or it takes them a long time to generate estimates.

AUTHOR

       Olafur Gudmundsson ogud@tis.com
       Trusted Information Systems
       formerly at University of Maryland, College Park

BUGS

       Reports lines it does not recognize, mainly error cases but some are legitimate lines the program needs to be taught about.

SEE ALSO

       amanda(8), amdump(8), gawk(1), nawk(1), awk(1), gnuplot(1), sh(1), compress(1), gzip(1)

4th Berkeley Distribution														 AMPLOT(8)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

fastest way to remove duplicates.

Discussion started by: radhika

2. Shell Programming and Scripting

Remove duplicates

Discussion started by: forumthreads

3. Shell Programming and Scripting

Script to remove duplicates

Discussion started by: antointoronto

4. Shell Programming and Scripting

remove duplicates and sort

Discussion started by: dvah

5. Shell Programming and Scripting

Perl, sorting and eliminating duplicates

Discussion started by: gabrysfe

6. Shell Programming and Scripting

bash - remove duplicates

Discussion started by: locoroco

7. Shell Programming and Scripting

awk remove first duplicates

Discussion started by: sybadm

8. Shell Programming and Scripting

Remove duplicates

Discussion started by: dtdt

9. Shell Programming and Scripting

Sort and Remove duplicates

Discussion started by: ysvsr1

10. Shell Programming and Scripting

Remove duplicates

Discussion started by: tejashavele

LEARN ABOUT REDHAT

amplot