05-09-2013
Remove duplicates from a file
Can you tell me how to remove duplicate records from a file?
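One common approach, sketched here on the assumption that a "record" is a whole line and that the first occurrence of each line should be kept in its original position. The function name dedupe_lines and the file names in the comment are placeholders, not anything from the thread.

```shell
# Print each distinct line once, keeping the first occurrence and the input order.
# dedupe_lines is a hypothetical helper; it reads stdin and writes stdout.
dedupe_lines() {
    # seen[$0]++ is 0 (false) the first time a line appears, so !seen[$0]++ is true
    # only for first occurrences, and awk prints the line by default.
    awk '!seen[$0]++'
}

# Example (placeholder file names):
#   dedupe_lines < records.txt > records.dedup.txt
```

If the order of the records does not matter, sort -u on the file gives the same set of lines in sorted order.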
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
How can I remove the duplicate lines from a file? For example:
sample123456Sample
testing123456testing
XXXXX131323XXXXX
YYYYY423432YYYYY
fsdfdsf123456gsdfdsd
All the duplicates in columns 6-12 must be deleted. I want to keep the first row; if the same value comes up in the given range I want to... (1 Reply)
Discussion started by: gopikgunda
1 Replies
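A possible sketch for this kind of request, assuming the key is a fixed range of character positions and the first line with each key is the one to keep. The sample lines in the teaser do not line up at exactly the same offsets, so the positions are left as parameters; dedupe_by_columns is a made-up name.

```shell
# Keep the first line for each distinct value found in a fixed character range.
# $1 = first character position of the key, $2 = last position (1-based, inclusive).
# dedupe_by_columns is a hypothetical helper; it reads stdin and writes stdout.
dedupe_by_columns() {
    awk -v from="$1" -v upto="$2" '!seen[substr($0, from, upto - from + 1)]++'
}

# Example (placeholder file name):
#   dedupe_by_columns 6 12 < input.txt
```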
2. Shell Programming and Scripting
Hi, I have a file in the following format:
name-a
age -12
address-123
age-12
phone-22222
============
name-ab
age -11
address-123
age-11
phone-222223
=============
name-abc
age -12
address-1234
age-12
phone-2222223
============= (2 Replies)
Discussion started by: nipun_garg
2 Replies
3. Shell Programming and Scripting
Hi,
I am writing a shell script that needs to remove duplicate lines within a file by category.
example:
section a
a
c
b
a
section b
a
b
a
c
I need to remove the duplicates within the category without removing the duplicates from the 2 different sections (one of the a's in section... (1 Reply)
Discussion started by: RichElks
1 Replies
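A minimal sketch of per-section de-duplication, assuming every section starts with a line beginning with "section " as in the example above. The function name dedupe_within_sections is invented for illustration.

```shell
# Remove repeated lines inside each section, but let the same value
# appear again in a later section.
# dedupe_within_sections is a hypothetical helper; it reads stdin and writes stdout.
dedupe_within_sections() {
    awk '
        # A header line starts a new section: clear the lookup table and print it.
        /^section / { split("", seen); print; next }
        # Inside a section, print a line only the first time it is seen.
        !seen[$0]++
    '
}
```

With the sample above as input, this prints section a, a, c, b, section b, a, b, c (one per line).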
4. Shell Programming and Scripting
i/p
----
A
B
C
A
C
o/p
---
B
A
C
From the input file, it should remove duplicates, keeping the entries from the end, without changing the order (5 Replies)
Discussion started by: lavnayas
5 Replies
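Going by the i/p and o/p shown, the last occurrence of each line is the one that survives. A sketch under that assumption follows; keep_last_occurrence is a made-up name, and the whole input is held in memory, which may not suit very large files.

```shell
# Keep only the last occurrence of each line, preserving relative order.
# keep_last_occurrence is a hypothetical helper; it reads stdin and writes stdout.
keep_last_occurrence() {
    awk '
        # Remember every line and the number of the last row where it appeared.
        { line[NR] = $0; last[$0] = NR }
        END {
            # Replay the input and print a line only at its final position.
            for (i = 1; i <= NR; i++)
                if (last[line[i]] == i) print line[i]
        }
    '
}
```

For the input A, B, C, A, C this prints B, A, C, matching the o/p above.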
5. Shell Programming and Scripting
Hi,
I need to remove duplicates from a file. The file will be like this:
0003 10101 20100120 abcdefghi
0003 10101 20100121 abcdefghi
0003 10101 20100122 abcdefghi
0003 10102 20100120 abcdefghi
0003 10103 20100120 abcdefghi
0003 10103 20100121 abcdefghi
Here if the first column and... (6 Replies)
Discussion started by: gpaulose
6 Replies
6. Shell Programming and Scripting
Hi,
I am unable to find the duplicates in a file based on the 1st, 2nd, 4th, and 5th columns, and also remove the duplicates in the same file.
Source filename: Filename.csv
"1","ccc","information","5000","temp","concept","new"
"1","ddd","information","6000","temp","concept","new"... (2 Replies)
Discussion started by: onesuri
2 Replies
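One way this could be approached, as a sketch only: treat columns 1, 2, 4, and 5 as the key and keep the first row per key. This assumes no quoted field contains a comma, since awk splits on every comma. The helper name dedupe_csv_by_key is an illustration, not something from the thread.

```shell
# Keep the first row for each combination of columns 1, 2, 4 and 5.
# Assumption: fields never contain embedded commas.
# dedupe_csv_by_key is a hypothetical helper; it reads stdin and writes stdout.
dedupe_csv_by_key() {
    awk -F',' '!seen[$1 FS $2 FS $4 FS $5]++'
}

# awk should not write to the file it is reading, so to update a file "in place"
# write to a temporary file first and then move it over the original, e.g.:
#   dedupe_csv_by_key < Filename.csv > Filename.csv.tmp && mv Filename.csv.tmp Filename.csv
```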
7. Shell Programming and Scripting
All,
I have a file 1181CUSTOMER-L061411_003500.dat.Z having duplicate records in it.
bash-2.05$ zcat 1181CUSTOMER-L061411_003500.dat.Z|grep "90876251S"
90876251S|ABG, AN ADAYANA COMPANY|3550 DEPAUW BLVD|||US|IN|INDIANAPOLIS||DAL|46268||||||GEN|||||||USD|||ABG, AN ADAYANA... (3 Replies)
Discussion started by: Oracle_User
3 Replies
8. UNIX for Dummies Questions & Answers
Hi,
I have a tab-separated file and I want to remove all the rows that have duplicates. The duplicates I need to check are in column 13.
I have tried to use awk but I have no idea how to keep the duplicate file.
awk -F'\t' 'FNR==NR{a[$13]++;next}(a[$13]> 1)' tomodify.txt tomodify.txt > new.txt
... (4 Replies)
Discussion started by: flacchy
4 Replies
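A hedged sketch of the "remove every row whose key is repeated" reading of this request: the file is read twice, once to count the values in the key column and once to print only the rows whose value occurred exactly once. The name drop_repeated_keys and its arguments are assumptions made for the example.

```shell
# Print only the rows whose key column value is unique in the file.
# $1 = tab-separated input file, $2 = 1-based number of the key column.
# drop_repeated_keys is a hypothetical helper; it writes to stdout.
drop_repeated_keys() {
    awk -F'\t' -v col="$2" '
        # First pass over the file: count how often each key occurs.
        FNR == NR { count[$col]++; next }
        # Second pass: keep rows whose key was seen exactly once.
        count[$col] == 1
    ' "$1" "$1"
}

# Example (file names from the post above):
#   drop_repeated_keys tomodify.txt 13 > new.txt
```

Changing the last condition to count[$col] > 1 would instead keep only the rows that have duplicates.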
9. Shell Programming and Scripting
Hi, could someone please help me remove duplicates from a pipe-delimited file based on the first two columns?
123|asdf|sfsd|qwrer
431|yui|qwer|opws
123|asdf|pol|njio
Here my first record and last record are duplicates. As per my requirement, I want all the latest records in one file.
I want the... (12 Replies)
Discussion started by: ginrkf
12 Replies
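A possible sketch, assuming "latest" means the record that appears last in the file for each combination of the first two columns. The rest of the requirement is cut off above, so this covers only that part; keep_latest_by_key is an invented name.

```shell
# For each key (columns 1 and 2 of a pipe-delimited file), keep the last record.
# Assumption: "latest" means "appears last in the file".
# keep_latest_by_key is a hypothetical helper; it reads stdin and writes stdout.
keep_latest_by_key() {
    awk -F'|' '
        # Store each record, its key, and the last row number seen for that key.
        { key = $1 FS $2; line[NR] = $0; keyof[NR] = key; last[key] = NR }
        END {
            # Print a record only if it is the final one for its key.
            for (i = 1; i <= NR; i++)
                if (last[keyof[i]] == i) print line[i]
        }
    '
}
```

For the three sample records this prints 431|yui|qwer|opws followed by 123|asdf|pol|njio.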
10. UNIX for Advanced & Expert Users
Hi all,
I have an issue while loading a flat file to the DB. It is taking a long time.
When I analyzed it, I found that there are duplicate entries in the flat file.
There are 2 types of duplicate entry.
1) The entire row is a duplicate (I can use sort | uniq to remove the duplicated entries).
2) the... (4 Replies)
Discussion started by: samjoshuab
4 Replies
runacct(1M) runacct(1M)
NAME
runacct - run daily accounting
SYNOPSIS
[mmdd[state]]
DESCRIPTION
runacct is the main daily accounting shell procedure. It is normally initiated via cron(1M). runacct processes connect, fee, disk, and
process accounting files. It also prepares summary files for prdaily or billing purposes.
runacct takes care not to damage active accounting files or summary files in the event of errors. It records its progress by writing
descriptive diagnostic messages into When an error is detected, a message is written to mail (see mail(1), mailx(1), or elm(1)) is sent to
and and runacct terminates. runacct uses a series of lock files to protect against re-invocation. The files and are used to prevent
simultaneous invocation, and is used to prevent more than one invocation per day.
runacct breaks its processing into separate, restartable states using to remember the last state completed. It accomplishes this by writing
the state name into runacct then looks in to see what it has done and to determine what to process next. States are executed in the
following order:
Move active accounting files into working files.
Verify integrity of
file, correcting date changes if necessary.
Produce connect session records in
format.
Convert process accounting records into
format.
Merge the connect and process accounting records.
Convert output of
chargefee into format and merge with connect and process accounting records.
Merge disk accounting records with connect, process, and fee accounting
records.
Merge the daily total accounting records in
with the summary total accounting records in
Produce command summaries.
Any installation-dependent accounting programs can be
included here.
Cleanup temporary files and exit.
To restart runacct after a failure, first check the file for diagnostics, then fix up any corrupted data files such as or The files and
file must be removed before runacct can be restarted. The argument mmdd is necessary if runacct is being restarted, and specifies the
month and day for which runacct will rerun the accounting. Entry point for processing is based on the contents of to override this,
include the desired state on the command line to designate where processing should begin.
EXAMPLES
To start runacct.
To restart runacct.
To restart runacct at a specific state.
WARNINGS
Normally it is not a good idea to restart runacct in its state. Run manually, then restart via:
If runacct failed in its state, remove the last file because it will not be complete.
FILES
SEE ALSO
mail(1), acct(1M), acctcms(1M), acctcom(1M), acctcon(1M), acctmerg(1M), acctprc(1M), acctsh(1M), cron(1M), fwtmp(1M), acct(2), acct(4),
utmp(4).
STANDARDS CONFORMANCE