Sponsored Content
Top Forums Shell Programming and Scripting Remove Duplicates on multiple Key Columns and get the Latest Record from Date/Time Column Post 302799151 by Don Cragun on Thursday 25th of April 2013 07:55:09 PM
Old 04-25-2013
Can we assume that CDC_FLAG will always be the 3rd field, SRC_PMTN_I will always be the 5th field, and CDC_PRCS_TS will always be the 1st field; or do we have to match the strings against the header line to determine which fields to use?

Do the output records need to be in the same order as they appeared in the input file or can the output be in random order except that the 1st output line must be the 1st input line (the headings)?
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove duplicates based on the two key columns

Hi All, I needs to fetch unique records based on a keycolumn(ie., first column1) and also I needs to get the records which are having max value on column2 in sorted manner... and duplicates have to store in another output file. Input : Input.txt 1234,0,x 1234,1,y 5678,10,z 9999,10,k... (7 Replies)
Discussion started by: kmsekhar
7 Replies

2. Shell Programming and Scripting

Search based on 1,2,4,5 columns and remove duplicates in the same file.

Hi, I am unable to search the duplicates in a file based on the 1st,2nd,4th,5th columns in a file and also remove the duplicates in the same file. Source filename: Filename.csv "1","ccc","information","5000","temp","concept","new" "1","ddd","information","6000","temp","concept","new"... (2 Replies)
Discussion started by: onesuri
2 Replies

3. Shell Programming and Scripting

need to remove duplicates based on key in first column and pattern in last column

Given a file such as this I need to remove the duplicates. 00060011 PAUL BOWSTEIN ad_waq3_921_20100826_010517.txt 00060011 PAUL BOWSTEIN ad_waq3_921_20100827_010528.txt 0624-01 RUT CORPORATION ad_sade3_10_20100827_010528.txt 0624-01 RUT CORPORATION ... (13 Replies)
Discussion started by: script_op2a
13 Replies

4. Shell Programming and Scripting

finding duplicates in csv based on key columns

Hi team, I have 20 columns csv files. i want to find the duplicates in that file based on the column1 column10 column4 column6 coulnn8 coulunm2 . if those columns have same values . then it should be a duplicate record. can one help me on finding the duplicates, Thanks in advance. ... (2 Replies)
Discussion started by: baskivs
2 Replies

5. Shell Programming and Scripting

Removing duplicates in fixed width file which has multiple key columns

Hi All , I have a requirement where I need to remove duplicates from a fixed width file which has multiple key columns .Also , need to capture the duplicate records into another file . File has 8 columns. Key columns are col1 and col2. Col1 has the length of 8 col 2 has the length of 3. ... (5 Replies)
Discussion started by: saj
5 Replies

6. Shell Programming and Scripting

Remove the time from the date column

Hi, I have file named file1.txt with below contents cat file1.txt 1/29/2014 0:00,706886 1/30/2014 0:00,791265 1/31/2014 0:00,987087 2/1/2014 0:00,1098572 2/2/2014 0:00,572477 2/3/2014 0:00,701715 I want to display as below 1/29/2014,706886 1/30/2014,791265 1/31/2014,987087... (5 Replies)
Discussion started by: villain41
5 Replies

7. UNIX for Dummies Questions & Answers

Display latest record from file based on multiple columns combination

I have requirement to print latest record from file based on multiple columns combination. EWAPE EW1SLE0000 EW1SOMU01 ABORTED 03/16/2015 100004 03/16/2015 100005 001 EWAPE EW1SLE0000 EW1SOMU01 ABORTED 03/18/2015 140003 03/18/2015 140004 001 EWAPE EW1SLE0000 EW1SOMU01 ABORTED 03/18/2015 220006... (1 Reply)
Discussion started by: tmalik79
1 Replies

8. UNIX for Beginners Questions & Answers

Sort and remove duplicates in directory based on first 5 columns:

I have /tmp dir with filename as: 010020001_S-FOR-Sort-SYEXC_20160229_2212101.marker 010020001_S-FOR-Sort-SYEXC_20160229_2212102.marker 010020001-S-XOR-Sort-SYEXC_20160229_2212104.marker 010020001-S-XOR-Sort-SYEXC_20160229_2212105.marker 010020001_S-ZOR-Sort-SYEXC_20160229_2212106.marker... (4 Replies)
Discussion started by: gnnsprapa
4 Replies

9. Shell Programming and Scripting

awk to Sum columns when other column has duplicates and append one column value to another with Care

Hi Experts, Please bear with me, i need help I am learning AWk and stuck up in one issue. First point : I want to sum up column value for column 7, 9, 11,13 and column15 if rows in column 5 are duplicates.No action to be taken for rows where value in column 5 is unique. Second point : For... (1 Reply)
Discussion started by: as7951
1 Replies

10. UNIX for Beginners Questions & Answers

Remove duplicates in a dataframe (table) keeping all the different cells of just one of the columns

Hello all, I need to filter a dataframe composed of several columns of data to remove the duplicates according to one of the columns. I did it with pandas. In the main time, I need that the last column that contains all different data ( not redundant) is conserved in the output like this: A ... (5 Replies)
Discussion started by: pedro88
5 Replies
mhc(5)								File Formats Manual							    mhc(5)

NAME
mhc - Message Harmonized Calendaring system DESCRIPTION
This manual page documents briefly the mhc file format. This manual page was written for the Debian GNU/Linux distribution because the original program does not have a manual page. FIELDS
Mhc file format is based on STD11/RFC822: Standard for the format of ARPA Internet text messages. In mhc file, the following extra header fields are used. X-SC-Day: Date of event in format yyyymmdd. You can specify multiple date with space separated like: X-SC-Day: 19990409 19990413 which means April 9th 1999 and April 13th 1999. X-SC-Time: Time of event in format hh:mm-hh:mm or hh:mm. For event which has no meanings about time, you can leave it empty in this field. X-SC-Duration: Period of event in yyyymmdd-yyyymmdd. You can omit start date or end date, like "19991121-". X-SC-Duration: is used only to limit date specified by X-SC-Day: or X-SC-Cond, so you can't describe event date only by X-SC-Duration:. If X-SC-Duration: is empty, it means no limit. X-SC-Cond: Condition of event date by using following keyword. 00-31 Day of month. 1st 2nd 3rd 4th 5th Last Week in month. Sun Mon ... Sat Day of week. Jan Feb ... Dec Month Examples, X-SC-Cond: Tue Fri Every Tuesday and Friday. X-SC-Cond: 31 Aug Every August 31th. X-SC-Cond: 1st 3rd Fri Every 1st and 3rd Friday. X-SC-Cond: Fri X-SC-Day: !19990409 Every Friday, but except April 9th 1999. X-SC-Cond: Fri 13 Every 13th and Every Friday, not 13th Friday. X-SC-Alarm: Hint for alarm of event. Currently, mhc.el doesn't alaram, but gemcal will pop-up alarm window. It will be useful if you transfer this event to PalmOS. In X-SC-Alarm:, you can specify the number (1 to 99) with suffix such as minute, hour or day. For example, X-SC-Alarm: 10 minutes Alarm 10 minutes before event. X-SC-Alarm: 3 hour Alarm 3 hours before event. X-SC-Alarm: 3 day Alarm 3 day before event. X-SC-Subject: Subject of event. unstructured? X-SC-Location: Location of event. unstructured? X-SC-Category Category of event. Any keyword, space separeated. Case ignored. X-SC-Recoard-Id: Internal use only. SEE ALSO
adb2mhc(1) gemcal(1), mhc-sync(1), mhc2palm(1), palm2mhc(1), today(1). AUTHOR
This program was written by Yoshinari Nomura <nom@quickhack.net> and this manual page was written by Fumitoshi UKAI <ukai@debian.or.jp>, for the Debian GNU/Linux system (but may be used by others). 23 Jun 2000 mhc(5)
All times are GMT -4. The time now is 02:20 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy