Sponsored Content
Full Discussion: Text manipulation help
Top Forums Shell Programming and Scripting Text manipulation help Post 302976047 by RudiC on Thursday 23rd of June 2016 04:00:56 AM
Old 06-23-2016
Not too elegant with awk as the Address and Creation fields are to be filled into several fields...
Code:
awk '
NR==1                   {HD="Ticket #|Creation Month|Creation Year|Ref|Name|Address |City|State|Zipcode|Phone|Player|DOB|IP"

                         print HD                               # print it
                         sub ("Month\|Creation Year", "Date", HD)
                         sub ("\|City\|State\|Zipcode", _, HD)
                         gsub ("\|", ",", HD)
                         HDCnt = split(HD, HDArr, ",")          # HDArr n HDCnt needed later for extracting and printing
                         HD    = "," HD ","
                        }

function PRT()          {DL = ""                                # clear delimiter

                         for (i=1; i<=HDCnt; i++)       {printf "%s%s", DL, RES[HDArr[i]]       # print fields in sequence, plus delimiter
                                                         DL=SEP                                 # set delimiter
                                                        }
                         printf "\n"
                         delete RES                             # clear for next record
                        }

NF == 0                 {PRT()                                  # empty line means: print complete record
                        }

HD ~ "," $1 ","         {gsub (",", SEP, $NF)                   # prepare Address field
                         if ($1 ~ "Date") sub ("-","|", $NF)    # prepare Creation field
                         RES[$1] = $NF                          # save it for print
                        }

END                  {PRT()}                                    # print last record
' FS="[[:]" SEP="|" file
Ticket #|Creation Month|Creation Year|Ref|Name|Address |City|State|Zipcode|Phone|Player|DOB|IP
 XXXXXXXXXXXXXXXXX| 03|2015| 299XXXXXX| Alexandra| XXXX N 37 Ave| Hollywood| FL| 33021| XXX3190XXX| XXXXXX| June-14-1995| 73.XXXXXXXX - 73.XXXXXXXX
 XXXXXXXXXXXXXXXXX| 03|2016| 449XXXXXX| GXX(1) JonXX(2)| XXXX XXXXXX SE| Washington (city)| DC (state)| 20032 (zipcode)| XXX5954XXX| XXXXXX| April-24-1986| 66.44.XXXX - XXXXX.md.cable.rcn.com

and, the record separator is expected to be one single empty line.
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Text Manipulation.

Hi I have only ever used awk and sed for basic requirements up until now. I have had to break a log down for multiple purposes. Using awk, sed and a date script. I am left with this: (message id, time of msg attempt, message id, domain name, time of msg completion) ... (4 Replies)
Discussion started by: Icepick
4 Replies

2. UNIX for Dummies Questions & Answers

text manipulation

I am tryin to figure out how to extract interested text from file example.txt blah blah blah a: child1 blah a: child2 blah b: parent1 blah blah blah .... blah a: child21 blah a: child22 blah a: child23 blah b: parent2 this kinda text repeats .. number of children is... (6 Replies)
Discussion started by: rajkishore
6 Replies

3. UNIX for Dummies Questions & Answers

Help with text manipulation

Hi there, I have some text files in unix format that processed by a program in windows, and when I open them with less or vi in linux, a warn for opening binary file is prompted, and as shown in vi, between every two characters there was inserted a "^@". How can I fix this. Plus, there are over... (2 Replies)
Discussion started by: dustinwang2003
2 Replies

4. UNIX for Dummies Questions & Answers

Text Manipulation

Greetings. Iīm a biologist and I donīt have mucho knowledge on Unix/Linux, but I need to use Cygwin to change some documents from a GenBank format to a FASTA format. GenBank format goes somthing like this: LOCUS NM_013964 2568 bp mRNA linear PRI 26-APR-2009... (2 Replies)
Discussion started by: vanesa1230
2 Replies

5. Shell Programming and Scripting

[HELP] Text manipulation... [HELP]

I need to know how can I remove all word after comma on each line. Like: jjkj,iiuiui,ijlkjkij,ookoo kijljlj,jhhkj,ijijkijkj,oijkijj kjkljlkj,kjkjlkjlkj,opok,okop to jjkj, kijljlj, ... (5 Replies)
Discussion started by: slutb3
5 Replies

6. UNIX for Dummies Questions & Answers

text manipulation help

Hello again unix.com How can I extract from a large file in format: steve@aol.com steve hawkins Location of this member is bla bla bla sun@hotmail.com Sun Ying This member is using browser bla bla bla to another text in format: steve@aol.com steve hawkins sun@hotmail.com sun ying ... (5 Replies)
Discussion started by: galford
5 Replies

7. UNIX for Dummies Questions & Answers

Text Manipulation Help

Hello Unix.com, I have a text in format: john sara lee How can I make it: john:john john:john1 john:john12 john:john123 sara:sara sara:sara12 sara:sara123 and so on (2 Replies)
Discussion started by: galford
2 Replies

8. UNIX for Dummies Questions & Answers

Text manipulation help

Hello unix.com users, I have a ip file (line-by-line). How can I delete the ips that keep repeating by mark XXX.XXX.XXX.* ... I want to erase only the lines that keep repeating more than 2 times. Example: 1.2.3.1 1.2.3.2 1.2.3.3 I want to erase all ips blocks that are repeating by C... (1 Reply)
Discussion started by: galford
1 Replies

9. UNIX for Dummies Questions & Answers

Text manipulation

i want to generate a list line-by-line of normal characters using letters . for example : dnds gnos mgod pets jnfp etc... i want to use all letters with all the posibilities is there a script that can do this ? (3 Replies)
Discussion started by: suppliernr1
3 Replies

10. Shell Programming and Scripting

Help text manipulation

Hello Forum , I need a help about text manupulation. I have a text file and I have to manipulate this file. Let's say source.txt source.txt UNB+UNOC:3+O0013000005MAN MN RVS:91+0098006688:92+190304:2313+F004169241' UNH+8146848+DELJIT:D:96A:UN' BGM+307:::JIS_SYNCRO_FIRM+2019030423234101+9'... (8 Replies)
Discussion started by: cemokam65
8 Replies
calendar(3erl)						     Erlang Module Definition						    calendar(3erl)

NAME
calendar - Local and universal time, day-of-the-week, date and time conversions DESCRIPTION
This module provides computation of local and universal time, day-of-the-week, and several time conversion functions. Time is local when it is adjusted in accordance with the current time zone and daylight saving. Time is universal when it reflects the time at longitude zero, without any adjustment for daylight saving. Universal Coordinated Time (UTC) time is also called Greenwich Mean Time (GMT). The time functions local_time/0 and universal_time/0 provided in this module both return date and time. The reason for this is that sepa- rate functions for date and time may result in a date/time combination which is displaced by 24 hours. This happens if one of the functions is called before midnight, and the other after midnight. This problem also applies to the Erlang BIFs date/0 and time/0 , and their use is strongly discouraged if a reliable date/time stamp is required. All dates conform to the Gregorian calendar. This calendar was introduced by Pope Gregory XIII in 1582 and was used in all Catholic coun- tries from this year. Protestant parts of Germany and the Netherlands adopted it in 1698, England followed in 1752, and Russia in 1918 (the October revolution of 1917 took place in November according to the Gregorian calendar). The Gregorian calendar in this module is extended back to year 0. For a given date, the gregorian days is the number of days up to and including the date specified. Similarly, the gregorian seconds for a given date and time, is the the number of seconds up to and including the specified date and time. For computing differences between epochs in time, use the functions counting gregorian days or seconds. If epochs are given as local time, they must be converted to universal time, in order to get the correct value of the elapsed time between epochs. Use of the function time_difference/2 is discouraged. There exists different definitions for the week of the year. The calendar module contains a week of the year implementation which conforms to the ISO 8601 standard. Since the week number for a given date can fall on the previous, the current or on the next year it is important to provide the information which year is it together with the week number. The function iso_week_number/0 and iso_week_number/1 returns a tuple of the year and the week number. DATA TYPES
date() = {Year, Month, Day} Year = int() Month = 1..12 Day = 1..31 Year cannot be abbreviated. Example: 93 denotes year 93, not 1993. Valid range depends on the underlying OS. The date tuple must denote a valid date. time() = {Hour, Minute, Second} Hour = 0..23 Minute = Second = 0..59 EXPORTS
date_to_gregorian_days(Date) -> Days date_to_gregorian_days(Year, Month, Day) -> Days Types Date = date() Days = int() This function computes the number of gregorian days starting with year 0 and ending at the given date. datetime_to_gregorian_seconds({Date, Time}) -> Seconds Types Date = date() Time = time() Seconds = int() This function computes the number of gregorian seconds starting with year 0 and ending at the given date and time. day_of_the_week(Date) -> DayNumber day_of_the_week(Year, Month, Day) -> DayNumber Types Date = date() DayNumber = 1..7 This function computes the day of the week given Year , Month and Day . The return value denotes the day of the week as 1 : Monday, 2 : Tuesday, and so on. gregorian_days_to_date(Days) -> Date Types Days = int() Date = date() This function computes the date given the number of gregorian days. gregorian_seconds_to_datetime(Seconds) -> {Date, Time} Types Seconds = int() Date = date() Time = time() This function computes the date and time from the given number of gregorian seconds. is_leap_year(Year) -> bool() This function checks if a year is a leap year. iso_week_number() -> IsoWeekNumber Types IsoWeekNumber = {int(), int()} This function returns the tuple {Year, WeekNum} representing the iso week number for the actual date. For determining the actual date, the function local_time/0 is used. iso_week_number(Date) -> IsoWeekNumber Types Date = date() IsoWeekNumber = {int(), int()} This function returns the tuple {Year, WeekNum} representing the iso week number for the given date. last_day_of_the_month(Year, Month) -> int() This function computes the number of days in a month. local_time() -> {Date, Time} Types Date = date() Time = time() This function returns the local time reported by the underlying operating system. local_time_to_universal_time({Date1, Time1}) -> {Date2, Time2} This function converts from local time to Universal Coordinated Time (UTC). Date1 must refer to a local date after Jan 1, 1970. Warning: This function is deprecated. Use local_time_to_universal_time_dst/1 instead, as it gives a more correct and complete result. Especially for the period that does not exist since it gets skipped during the switch to daylight saving time, this function still returns a result. local_time_to_universal_time_dst({Date1, Time1}) -> [{Date, Time}] Types Date1 = Date = date() Time1 = Time = time() This function converts from local time to Universal Coordinated Time (UTC). Date1 must refer to a local date after Jan 1, 1970. The return value is a list of 0, 1 or 2 possible UTC times: [] : For a local {Date1, Time1} during the period that is skipped when switching to daylight saving time, there is no corresponding UTC since the local time is illegal - it has never happened. [DstDateTimeUTC, DateTimeUTC] : For a local {Date1, Time1} during the period that is repeated when switching from daylight saving time, there are two corre- sponding UTCs. One for the first instance of the period when daylight saving time is still active, and one for the second instance. [DateTimeUTC] : For all other local times there is only one corresponding UTC. now_to_local_time(Now) -> {Date, Time} Types Now -- see erlang:now/0 Date = date() Time = time() This function returns local date and time converted from the return value from erlang:now() . now_to_universal_time(Now) -> {Date, Time} now_to_datetime(Now) -> {Date, Time} Types Now -- see erlang:now/0 Date = date() Time = time() This function returns Universal Coordinated Time (UTC) converted from the return value from erlang:now() . seconds_to_daystime(Seconds) -> {Days, Time} Types Seconds = Days = int() Time = time() This function transforms a given number of seconds into days, hours, minutes, and seconds. The Time part is always non-negative, but Days is negative if the argument Seconds is. seconds_to_time(Seconds) -> Time Types Seconds = int() < 86400 Time = time() This function computes the time from the given number of seconds. Seconds must be less than the number of seconds per day(86400). time_difference(T1, T2) -> {Days, Time} This function returns the difference between two {Date, Time} tuples. T2 should refer to an epoch later than T1 . Warning: This function is obsolete. Use the conversion functions for gregorian days and seconds instead. time_to_seconds(Time) -> Seconds Types Time = time() Seconds = int() This function computes the number of seconds since midnight up to the specified time. universal_time() -> {Date, Time} Types Date = date() Time = time() This function returns the Universal Coordinated Time (UTC) reported by the underlying operating system. Local time is returned if universal time is not available. universal_time_to_local_time({Date1, Time1}) -> {Date2, Time2} Types Date1 = Date2 = date() Time1 = Time2 = time() This function converts from Universal Coordinated Time (UTC) to local time. Date1 must refer to a date after Jan 1, 1970. valid_date(Date) -> bool() valid_date(Year, Month, Day) -> bool() Types Date = date() This function checks if a date is a valid. LEAP YEARS
The notion that every fourth year is a leap year is not completely true. By the Gregorian rule, a year Y is a leap year if either of the following rules is valid: * Y is divisible by 4, but not by 100; or * Y is divisible by 400. Accordingly, 1996 is a leap year, 1900 is not, but 2000 is. DATE AND TIME SOURCE
Local time is obtained from the Erlang BIF localtime/0 . Universal time is computed from the BIF universaltime/0 . The following facts apply: * there are 86400 seconds in a day * there are 365 days in an ordinary year * there are 366 days in a leap year * there are 1461 days in a 4 year period * there are 36524 days in a 100 year period * there are 146097 days in a 400 year period * there are 719528 days between Jan 1, 0 and Jan 1, 1970. Ericsson AB stdlib 1.17.3 calendar(3erl)
All times are GMT -4. The time now is 08:53 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy