Sponsored Content
Top Forums Shell Programming and Scripting Removing duplicates depending on file size Post 302830547 by krishmaths on Tuesday 9th of July 2013 05:25:08 AM
Old 07-09-2013
@Error404, Please try below solution.

cd to the directory where you have the files and execute below command. You may redirect the output to a temporary file.


Code:
ls -l|sort -k9 | awk '{OFS="."}{print $5,$9}' | awk -F"." 'BEGIN{row=$0;T=$2;} {if ($2==T) {if($1>max){max=$1;row=$0;}} else {print row;row=$0;max=0}; T=$2} END{print row}'

The command first lists all the files under the directory and picks the filename ($9) and size ($5). You may adjust this if you are getting the filename and size in different positions.

The fiesize is output as first field and the filename follows. I have used "." as an output delimiter to easily fetch the file with maximum size.

I created below files in a directory called tempdir:
Code:
LAJ.g.gif-1.JPEG                    4
LAJ.g.gif-2.JPEG                   12
LKJFDA01.gf.gif-1.JPEG           0
LKJFDA01.gf.gif-2.JPEG           0
LKJFDA01.gif-3.JPEG               4
OLUSDN.gf.gif-1.JPEG             0


The output was as below.
Code:
12.LAJ.g.gif-2.JPEG
4.LKJFDA01.gif-3.JPEG
0.OLUSDN.gf.gif-1.JPEG

The first field in the output is the maximum size of the file starting with 2nd field (i.e., LAJ, etc) in bytes.
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

removing duplicates from a file

i have a file with some 1000 entries it will contain entries like 1000,ram 2000,pankaj 1001,rahim 1000,ram 2532,govind 2000,pankaj 3000,venkat 2532,govind what i want is i want to extract only the distinct rows from this file so my output should contain only 1000,ram... (2 Replies)
Discussion started by: trichyselva
2 Replies

2. Shell Programming and Scripting

Removing duplicates in a sorted file by field.

I have data like this: It's sorted by the 2nd field (TID). envoy,90000000000000634600010001,04/11/2008,23:19:27,RB00266,0015,DETAIL,ERROR, envoy,90000000000000634600010001,04/12/2008,04:23:45,RB00266,0015,DETAIL,ERROR,... (1 Reply)
Discussion started by: kinksville
1 Replies

3. UNIX for Dummies Questions & Answers

removing duplicates of a pattern from a file

hey all, I need some help. I have a text file with names in it. My target is that if a particular pattern exists in that file more than once..then i want to rename all the occurences of that pattern by alternate patterns.. for e.g if i have PATTERN occuring 5 times then i want to... (3 Replies)
Discussion started by: ashisharora
3 Replies

4. Shell Programming and Scripting

Removing duplicates from log file?

I have a log file with posts looking like this: -- Messages can be delivered by different systems at different times. The id number is used to sort out duplicate messages. What I need is to strip the arrival time from each post, sort posts by id number, and reattach arrival time to respective... (2 Replies)
Discussion started by: Ilja
2 Replies

5. Shell Programming and Scripting

Removing Duplicates from file

Hi Experts, Please check the following new requirement. I got data like the following in a file. FILE_HEADER 01cbbfde7898410| 3477945| home| 1 01cbc275d2c122| 3478234| WORK| 1 01cbbe4362743da| 3496386| Rich Spare| 1 01cbc275d2c122| 3478234| WORK| 1 This is pipe separated file with... (3 Replies)
Discussion started by: tinufarid
3 Replies

6. Shell Programming and Scripting

formatting a file and removing duplicates

Hi, I have a file that I want to change the format of. It is a large file in rows but I want it to be comma separated (comma then a space). The current file looks like this: HI, Joe, Bob, Jack, Jack After I would want to remove any duplicates so it would look like this: HI, Joe,... (2 Replies)
Discussion started by: kylle345
2 Replies

7. UNIX for Dummies Questions & Answers

Removing duplicates from a file

Hi All, I am merging files coming from 2 different systems ,while doing that I am getting duplicates entries in the merged file I,01,000131,764,2,4.00 I,01,000131,765,2,4.00 I,01,000131,772,2,4.00 I,01,000131,773,2,4.00 I,01,000168,762,2,2.00 I,01,000168,763,2,2.00... (5 Replies)
Discussion started by: Sri3001
5 Replies

8. UNIX for Dummies Questions & Answers

Grep from pattern file without removing duplicates?

I have been using grep to output whole lines using a pattern file with identifiers (fileA): fig|562.2322.peg.1 fig|562.2322.peg.3 fig|562.2322.peg.3 fig|562.2322.peg.3 fig|562.2322.peg.7 From fileB with corresponding identifiers in the second column: NODE_0 fig|562.2322.peg.1 peg ... (2 Replies)
Discussion started by: Mauve
2 Replies

9. Shell Programming and Scripting

Removing duplicates from new file

i hav two files like i want to remove/delete all the duplicate lines in file2 which are viz unix,unix2,unix3 (2 Replies)
Discussion started by: sagar_1986
2 Replies

10. Shell Programming and Scripting

Removing duplicates from new file

i hav two files like i want to remove/delete all the duplicate lines in file2 which are viz unix,unix2,unix3.I have tried previous post also,but in that complete line must be similar.In this case i have to verify first column only regardless what is the content in succeeding columns. (3 Replies)
Discussion started by: sagar_1986
3 Replies
img-jpeg(n)															       img-jpeg(n)

__________________________________________________________________________________________________________________________________________________

NAME
img-jpeg - Img, Joint Picture Expert Group format (jpeg) SYNOPSIS
package require Tk package require img::jpeg ?1.4? image create photo ?name? ?options? _________________________________________________________________ DESCRIPTION
The package img::jpeg is a sub-package of Img. It can be loaded as a part of the complete Img support, via package require Img, or on its own, via package require img::jpeg. Like all packages of Img it does not provide new commands, but extends the existing Tk command image so that it supports files containing raster images in the Joint Picture Expert Group format (jpeg). More specifically img::jpeg extends Tk's photo image type. The name of the new format handler is jpeg. This handler provides new additional configuration options. See section JPEG OPTIONS for more detailed explanations. All of the above means that in a call like image create photo ?name? ?options? [1] Image data in jpeg format (options -data and -file) is detected automatically. [2] The format name jpeg is recognized by the option -format. In addition the value for the option is treated as list and may contain any of the special options listed in section JPEG OPTIONS. JPEG OPTIONS
The handler provides six options, two effective when reading from a JPEG image, and five influencing the writing of such. One option is usable for both reading an writing. -fast This option is for reading from JPEG data. It usage activates a processing mode which is fast, but also provides only low-quality information. -grayscale This option can be used for both reading and writing of JPEG data. Usage of this option forces incoming images to grayscale, and written images will be monochrome. -quality n This option is for writing JPEG data. It specifies the compression level as a quality percentage. The higher the quality, the less the compression. The nominal range for n is 0...100. Useful values are in the range 5...95. The default value is 75. -smooth n This option is for writing JPEG data. When used the writer will smooth the image before performing the compression. Values in the 10...30 are usually enough. The default is 0, i.e no smoothing. -optimize This option is for writing JPEG data. It usage causes the writer to optimize the huffman table used to encode the jpeg coefficients. -progressive This option is for writing JPEG data. It usage causes the creation of a progressive JPEG file. SEE ALSO
img-bmp, img-dted, img-gif, img-ico, img-intro, img-jpeg, img-pcx, img-pixmap, img-png, img-ppm, img-ps, img-raw, img-sgi, img-sun, img- tga, img-tiff, img-window, img-xbm, img-xpm KEYWORDS
image handling, jpeg, tk COPYRIGHT
Copyright (c) 1995-2009 Jan Nijtmans <nijtmans@users.sourceforge.net> Img 1.4 img-jpeg(n)
All times are GMT -4. The time now is 12:35 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy