Sponsored Content
Top Forums Shell Programming and Scripting Sorting data file by date and time Post 302837331 by Corona688 on Thursday 25th of July 2013 02:03:54 PM
Old 07-25-2013
Yeah... I'd have never guessed that from what you told me. Smilie

...Are they lines? XML is organized by tags, whether they are lines is purely incidental. This may have as much to do with your sorting problem as anything else.

This will work either way, prepending YYYY/MM/DD:HH:MM:SS times that sort nicely and are easily removed later and printing as individual lines.

Code:
$ cat inputfile

<c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 6:35:34 AM"><c id="xxx01345-383x-4630-b1b9-aaa505939734" t="10/25/2013 6:35:34 AM"><c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/2/2013 6:35:34 AM"><c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 10:35:34 AM"><c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 6:35:34 PM">

$ awk -v RS="<" -F'"' 'NR>1 {
        split($4, A, " ");
        split(A[1], D, "/");
        split(A[2], T, ":");

        if((A[3] == "PM") && (T[1] != "12")) T[1] += 12;
        if((A[3] == "AM") && (T[1] == "12")) T[1]="00";

        P=sprintf("%04d/%02d/%02d:%02d:%02d:%02d", D[3], D[1], D[2], T[1], T[2], T[3]);

        $0=P" <"$0
        sub(/\n/, "");
        print }' inputfile | sort

2013/07/02:06:35:34 <c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/2/2013 6:35:34 AM">
2013/07/25:06:35:34 <c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 6:35:34 AM">
2013/07/25:10:35:34 <c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 10:35:34 AM">
2013/07/25:18:35:34 <c id="xxx01345-383x-4630-b1b9-aaa505939734" t="7/25/2013 6:35:34 PM">
2013/10/25:06:35:34 <c id="xxx01345-383x-4630-b1b9-aaa505939734" t="10/25/2013 6:35:34 AM">

$

 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Sorting by date and time

Hi guys... I've been trying to do this for ages. Maybe you can help. I have log files like the examples below and I have grepped out certain lines from the files so that I can get an idea of who is logging on and how. So now I have the information in a new file but it is now in a different order... (7 Replies)
Discussion started by: padmundo
7 Replies

2. Shell Programming and Scripting

Processing a log file based on date/time input and the date/time on the log file

Hi, I'm trying to accomplish the following and would like some suggestions or possible bash script examples that may work I have a directory that has a list of log files that's periodically dumped from a script that is crontab that are rotated 4 generations. There will be a time stamp that is... (4 Replies)
Discussion started by: primp
4 Replies

3. Shell Programming and Scripting

To extract data of a perticular interval (date-time wise)

I want a shell script which extract data from a log file which contains date and time-wise data and i need the data for a perticular interval of time...what can i do??? (3 Replies)
Discussion started by: abhishek27
3 Replies

4. UNIX and Linux Applications

Matlab (more generally: sorting data by date)

I'm not sure if this really belongs anywhere on this forum but my previous experiences on here have shown me that you guys are very helpful so I figure I may as well try. I have a bunch of large 2d arrays in matlab and each has a column for a date that each row corresponds to. The format is... (1 Reply)
Discussion started by: skray
1 Replies

5. Shell Programming and Scripting

Split Date/time data to two

Hi all, I have logs(in a log file) with the following structure 20100916011501559;0.812;null;TRUE;;FALSE;0.812;0;0;;19 20100916011504762;0.015;null;TRUE;;FALSE;0;4|4;0.015;;4 20100916011504762;0;null;TRUE;;FALSE;0;0;0;;4 20100916011501731;3.343;null;TRUE;;FALSE;3.156;131|65;0.172;;11... (14 Replies)
Discussion started by: babai
14 Replies

6. Shell Programming and Scripting

compare date and time inside data of two files

i have two files with identical no of columns. 6th columns is date (MM/DD/YY format) and 7th columns is time (HH:MM:SS) format. I need to compare these two vaules and if the date & time is higher than fileA, save it on fileC; if the value is lower, then save it on fileD CONDITIONS... (7 Replies)
Discussion started by: ajiwww
7 Replies

7. Shell Programming and Scripting

Date and Time sorting

Hi Guys! i have a problem of sorting column chronologically because the data i have in column is in the following format 06/Dec/2006:18:09:54 and need to be sorted in the following way (upto seconds) 06/Dec/2005:18:09:50 06/Dec/2005:18:09:51 31/Mar/2006:19:30:41 24/Oct/2006:19:16:19... (4 Replies)
Discussion started by: me_newbie
4 Replies

8. Shell Programming and Scripting

Sorting the data with date

Hi, PFB the data: C_Random_130417 Java_Random_130518 Perl_Random_120519 Perl_Random_120528 so the values are ending with year,i.e.,130417 i want to sort the values with date. i want the output like this: Perl_Random_120519 Perl_Random_120528 C_Random_130417 Java_Random_130518 can... (5 Replies)
Discussion started by: arindam guha
5 Replies

9. UNIX for Dummies Questions & Answers

Plz Help in sorting the data on date basis

I have file having below data 01-MAY-13 2.38.11.00.100089 IN 4512 0000741881 01-JUN-13 2.38.11.00.100089 IN 1514 0000764631 01-NOV-13 2.38.11.00.100089 IN 1514 0000856571 01-NOV-13 2.38.15.00.100015 IN 300.32 0000856531 01-JUN-13 2.38.19.00.100000 IN 2698 0000764493 01-JUL-13... (2 Replies)
Discussion started by: ranabhavish
2 Replies

10. Shell Programming and Scripting

Plz Help in sorting the data on date basis

I have file a.txt having below data cat a.txt 01-MAY-13 2.38.11.00.100089 IN 4512 0000741881 01-JUN-13 2.38.11.00.100089 IN 1514 0000764631 01-NOV-13 2.38.11.00.100089 IN 1514 0000856571 01-NOV-13 2.38.15.00.100015 IN 300.32 0000856531 01-JUN-13 2.38.19.00.100000 IN 2698 0000764493... (5 Replies)
Discussion started by: ranabhavish
5 Replies
STRPTIME(3)						   BSD Library Functions Manual 					       STRPTIME(3)

NAME
strptime -- converts a character string to a time value LIBRARY
Standard C Library (libc, -lc) SYNOPSIS
#include <time.h> char * strptime(const char * restrict buf, const char * restrict format, struct tm * restrict tm); DESCRIPTION
The strptime() function converts the character string pointed to by buf to values which are stored in the tm structure pointed to by tm, using the format specified by format. The format string consists of zero or more conversion specifications, whitespace characters as defined by isspace(), and ordinary characters. All ordinary characters in format are compared directly against the corresponding characters in buf; comparisons which fail will cause strptime() to fail. Whitespace characters in format match any number of whitespace characters in buf, including none. A conversion specification consists of a percent sign '%' followed by one or two conversion characters which specify the replacement required. There must be white-space or other non-alphanumeric characters between any two conversion specifications. Conversion of alphanumeric strings (such as month and weekday names) is done without regard to case. Conversion specifications which cannot be matched will cause strptime() to fail. The LC_TIME category defines the locale values for the conversion specifications. The following conversion specifications are supported: %a the day of week, using the locale's weekday names; either the abbreviated or full name may be specified. %A the same as %a. %b the month, using the locale's month names; either the abbreviated or full name may be specified. %B the same as %b. %c the date and time, using the locale's date and time format. %C the century number [0,99]; leading zeros are permitted but not required. This conversion should be used in conjunction with the %y conversion. %d the day of month [1,31]; leading zeros are permitted but not required. %D the date as %m/%d/%y. %e the same as %d. %F the date as %Y-%m-%d (the ISO 8601 date format). %g the year corresponding to the ISO week number, without the century. (A NetBSD extension.) %G the year corresponding to the ISO week number, with the century. (A NetBSD extension.) %h the same as %b. %H the hour (24-hour clock) [0,23]; leading zeros are permitted but not required. %I the hour (12-hour clock) [1,12]; leading zeros are permitted but not required. %j the day number of the year [1,366]; leading zeros are permitted but not required. %k the same as %H. %l the same as %I. %m the month number [1,12]; leading zeros are permitted but not required. %M the minute [0,59]; leading zeros are permitted but not required. %n any white-space, including none. %p the locale's equivalent of a.m. or p.m. %r the time (12-hour clock) with %p, using the locale's time format. %R the time as %H:%M. %S the seconds [0,61]; leading zeros are permitted but not required. %s the number of seconds since the Epoch, UTC (see mktime(3)). (A NetBSD extension.) %t any white-space, including none. %T the time as %H:%M:%S. %u the day of the week as a decimal number, where Monday = 1. (A NetBSD extension.) %U the week number of the year (Sunday as the first day of the week) as a decimal number [0,53]; leading zeros are permitted but not required. All days in a year preceding the first Sunday are considered to be in week 0. %V the ISO 8601:1988 week number as a decimal number. If the week (starting on Monday) that contains January 1 has more than three days in the new year, then it is considered the first week of the year. If it has fewer than four days in the new year, then it is consid- ered the last week of the previous year. Weeks are numbered from 1 to 53. (A NetBSD extension.) %w the weekday as a decimal number [0,6], with 0 representing Sunday; leading zeros are permitted but not required. %W the week number of the year (Monday as the first day of the week) as a decimal number [0,53]; leading zeros are permitted but not required. All days in a year preceding the first Monday are considered to be in week 0. %x the date, using the locale's date format. %X the time, using the locale's time format. %y the year within the 20th century [69,99] or the 21st century [0,68]; leading zeros are permitted but not required. If specified in conjunction with %C, specifies the year [0,99] within that century. %Y the year, including the century (i.e., 1996). %z an ISO 8601 or RFC-2822 timezone specification. This is one of the following: the offset from Coordinated Universal Time ('UTC') spec- ified as: ``[+-]hhmm'', ``[+-]hh:mm'', or ``[+-]hh''; 'UTC' specified as: ``GMT'' ('Greenwich Mean Time'), ``UT'' ('Universal Time'), or ``Z'' ('Zulu Time'); a three character US timezone specified as: ``EDT'', ``EST'', ``CDT'', ``CST'', ``MDT'', ``MST'', ``PDT'', or ``PST'', with the first letter standing for 'Eastern' (``E''), 'Central' (``C''), 'Mountain' (``M'') or 'Pacific' (``P''), and the sec- ond letter standing for 'Daylight' (``D'' or summer) time or 'Standard' (``S'') time; a single letter military timezone specified as: ``A'' through ``I'' and ``K'' through ``Y''. (A NetBSD extension.) %Z timezone name or no characters when time zone information is unavailable. (A NetBSD extension.) %% matches a literal `%'. No argument is converted. Modified conversion specifications For compatibility, certain conversion specifications can be modified by the E and O modifier characters to indicate that an alternative for- mat or specification should be used rather than the one normally used by the unmodified conversion specification. As there are currently neither alternative formats nor specifications supported by the system, the behavior will be as if the unmodified conversion specification were used. Case is ignored when matching string items in buf, such as month and weekday names. RETURN VALUES
If successful, the strptime() function returns a pointer to the character following the last character parsed. Otherwise, a NULL pointer is returned. SEE ALSO
ctime(3), isspace(3), localtime(3), strftime(3), tm(3) STANDARDS
The strptime() function conforms to X/Open Portability Guide Issue 4 (``XPG4''). BUGS
The %Z format specifier only accepts timezone abbreviations of the local timezone, or the value ``GMT''. This limitation is caused by the ambiguity of overloaded timezone abbreviations, for example EST is both Eastern Standard Time and Eastern Australia Summer Time. BSD
April 12, 2011 BSD
All times are GMT -4. The time now is 08:50 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy