Get 4 character each from 2 different fields concatenate and add as a new field Post: 302341802

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sorting on two fields time field and number field

Hi, I have a file that has data in it that says 00:01:48.233 1212 00:01:56.233 345 00:09:01.221 5678 00:12:23.321 93444 The file has more line than this but i just wanted to put in a snippet to ask how I would get the highest number with time stamp into another file. So from the above...

2. Shell Programming and Scripting

awk script to (un)/concatenate fields in file

Hi everyone, I'm trying to use the "join" function for more than 1 field. Since it's not possible as it is, I want to take my input files and concatenate the joining fields as 1 field (separated by "|"). I wrote 2 awk script to do and undo it (see below). However I'm new to awk and I'm certain I...

3. Shell Programming and Scripting

Count number of occurences of a character in a field defined by the character in another field

Hello, I have a text file with n lines in the following format (9 column fields): Example: contig00012 149606 G C 49 68 60 18 c$cccccacccccccccc^c I need to count the number of lower-case and upper-case occurences in column 9, respectively, of the...

4. Shell Programming and Scripting

AWK : Add Fields of lines with matching field

Dear All, I would like to add values of a field, if the lines match in a certain field. Then I would like to divide the sum though the number of lines that have a matched field. This is the Input: Input: Test1 5 Test1 10 Test2 2 Test2 5 Test2 13 Test3 4 Output: Test1 7.5 Test1 7.5...

5. Shell Programming and Scripting

Concatenate last field values for all occurences

Hello all, Maybe you can help me with an awk script to get what I need. I have the input file with format below: REQUEST|79023787741690|738227864597|985 REQUEST|79024002151717|738229423534|985 REQUEST|79024002151717|738229423534|*985 NDS-REQUEST|79024002151717|738229423534 ...

6. Shell Programming and Scripting

How to print 1st field and last 2 fields together and the rest of the fields after it using awk?

Hi experts, I need to print the first field first then last two fields should come next and then i need to print rest of the fields. Input : a1,abc,jsd,fhf,fkk,b1,b2 a2,acb,dfg,ghj,b3,c4 a3,djf,wdjg,fkg,dff,ggk,d4,d5 Expected output: a1,b1,b2,abc,jsd,fhf,fkk...

7. Shell Programming and Scripting

Display combination of 4 field uniqe record and along with concatenate 5th and 6th field.

Table ACN|NAME|CITY|CTY|NO1|NO2 115|AKKK|ASH|IND|10|15 115|AKKK|ASH|IND|20|20 115|AKKK|ASH|IND|30|35 115|AKKK|ASH|IND|30|35 112|ABC|FL|USA|15|15 112|ABC|FL|USA|25|20 112|ABC|FL|USA|25|45 i have written shell script using cut command and awk programming getting error correct it and add...

8. Shell Programming and Scripting

Inserting a field without disturbing field separator on other fields

Hi All, I have the input as below: cat input 032016002 2.891 97.109 16.605 27.172 24.017 32.207 0.233 0.021 39.810 0.077 0.026 19.644 13.882 0.131 11.646 0.102 11.449 76.265 23.735 16.991 83.009 8.840 91.160 0.020 99.980 52.102 47.898 44.004 55.996 39.963 18.625 0.121 1.126 40.189...

9. Shell Programming and Scripting

sed to add field heards to specific fields

I have tab delimited input that prints out in the format below: I am trying to add field headers to $5 and $6. Not sure if sed is the best tool but my attempt is below. Thank you :). $5 = REF $6 = ALT file ID CHR START STOP 123 1 100 200 A ...

10. Shell Programming and Scripting

awk to add plus or minus to fields and split another field

In the tab-delimited input below I am trying to use awk to -10 from $2 and +10 to $3. Something like awk -F'\t' -v OFS='\t' -v s=10 '{split($4,a,":"); print $1,$2-s,$3+s,a,$5,$6} | awk {split(a,b,"-"); print $1,$2-s,$3+s,b-s,b+s,$5,$6}' input should do that. I also need to -10 from $4...

LEARN ABOUT OSX

regexp::common::zip

Regexp::Common::zip(3)					User Contributed Perl Documentation				    Regexp::Common::zip(3)

NAME

       Regexp::Common::zip -- provide regexes for postal codes.

SYNOPSIS

	   use Regexp::Common qw /zip/;

	   while (<>) {
	       /^$RE{zip}{Netherlands}$/   and	print "Dutch postal code
";
	   }

DESCRIPTION

       Please consult the manual of Regexp::Common for a general description of the works of this interface.

       Do not use this module directly, but load it via Regexp::Common.

       This module offers patterns for zip or postal codes of many different countries. They all have the form "$RE{zip}{Country}[{options}]".

       The following common options are used:

   "{-prefix=[yes|no|allow]}" and "{-country=PAT}".
       Postal codes can be prefixed with a country abbreviation. That is, a dutch postal code of 1234 AB can also be written as NL-1234 AB.  By
       default, all the patterns will allow the prefixes. But this can be changed with the "-prefix" option. With "-prefix=yes", the returned
       pattern requires a country prefix, while "-prefix=no" disallows a prefix. Any argument that doesn't start with a "y" or a "n" allows a
       country prefix, but doesn't require them.

       The prefixes used are, unfortunally, not always the same. Officially, ISO country codes need to be used, but the usage of CEPT codes (the
       same ones as used on cars) is common too. By default, each postal code will recognize a country prefix that's either the ISO standard or
       the CEPT code. That is, German postal codes may prefixed with either "DE" or "D". The recognized prefix can be changed with the "-country"
       option, which takes a (sub)pattern as argument. The arguments "iso" and "cept" are special, and indicate the language prefix should be the
       ISO country code, or the CEPT code.

       Examples:
	/$RE{zip}{Netherlands}/;
		  # Matches '1234 AB' and 'NL-1234 AB'.
	/$RE{zip}{Netherlands}{-prefix => 'no'}/;
		  # Matches '1234 AB' but not 'NL-1234 AB'.
	/$RE{zip}{Netherlands}{-prefix => 'yes'}/;
		  # Matches 'NL-1234 AB' but not '1234 AB'.

	/$RE{zip}{Germany}/;
		  # Matches 'DE-12345' and 'D-12345'.
	/$RE{zip}{Germany}{-country => 'iso'}/;
		  # Matches 'DE-12345' but not 'D-12345'.
	/$RE{zip}{Germany}{-country => 'cept'}/;
		  # Matches 'D-12345' but not 'DE-12345'.
	/$RE{zip}{Germany}{-country => 'GER'}/;
		  # Matches 'GER-12345'.

   "{-sep=PAT}"
       Some countries have postal codes that consist of two parts. Typically there is an official way of separating those parts; but in practise
       people tend to use different separators. For instance, if the official way to separate parts is to use a space, it happens that the space
       is left off. The "-sep" option can be given a pattern as argument which indicates what to use as a separator between the parts.

       Examples:
	/$RE{zip}{Netherlands}/;
		  # Matches '1234 AB' but not '1234AB'.
	/$RE{zip}{Netherlands}{-sep => 's*'}/;
		  # Matches '1234 AB' and '1234AB'.

   $RE{zip}{Australia}
       Returns a pattern that recognizes Australian postal codes. Australian postal codes consist of four digits; the first two digits, which
       range from '10' to '97', indicate the state. Territories use '02' or '08' as starting digits; the leading zero is optional. '0909' is the
       only postal code starting with '09' (the leading zero is optional here as well) - this is the postal code for the Nothern Territory
       University).  The (optional) country prefixes are AU (ISO country code) and AUS (CEPT code).  Regexp::Common 2.107 and before used
       $RE{zip}{Australia}. This is still supported.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The state or territory.

       $5  The last two digits.

   $RE{zip}{Belgium}
       Returns a pattern than recognizes Belgian postal codes. Belgian postal codes consist of 4 digits, of which the first indicates the
       province.  The (optional) country prefixes are BE (ISO country code) and B (CEPT code).

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The digit indicating the province.

       $5  The last three digits of the postal code.

   $RE{zip}{Denmark}
       Returns a pattern that recognizes Danish postal codes. Danish postal codes consist of four numbers; the first digit (which cannot be 0),
       indicates the distribution region, the second the distribution district. The (optional) country prefix is DK, which is both the ISO country
       code and the CEPT code.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The digit indicating the distribution region.

       $5  The digit indicating the distribution district.

       $6  The last two digits of the postal code.

   $RE{zip}{France}
       Returns a pattern that recognizes French postal codes. French postal codes consist of five numbers; the first two numbers, which range from
       '01' to '98', indicate the department. The (optional) country prefixes are FR (ISO country code) and F (CEPT code).  Regexp::Common 2.107
       and before used $RE{zip}{French}. This is still supported.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The department.

       $5  The last three digits.

   $RE{zip}{Germany}
       Returns a pattern that recognizes German postal codes. German postal codes consist of five numbers; the first number indicating the
       distribution zone, the second the distribution region, while the latter three indicate the distribution district and the postal town.  The
       (optional) country prefixes are DE (ISO country code) and D (CEPT code).  Regexp::Common 2.107 and before used $RE{zip}{German}. This is
       still supported.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The distribution zone.

       $5  The distribution region.

       $6  The distribution district and postal town.

   $RE{zip}{Greenland}
       Returns a pattern that recognizes postal codes from Greenland.  Greenland, being part of Denmark, uses Danish postal codes.  All postal
       codes of Greenland start with 39.  The (optional) country prefix is DK, which is both the ISO country code and the CEPT code.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  39, being the distribution region and distribution district for Greenland.

       $5  The last two digits of the postal code.

   $RE{zip}{Italy}
       Returns a pattern recognizing Italian postal codes. Italian postal codes consist of 5 digits. The first digit indicates the region, the
       second the province. The third digit is odd for province capitals, and even for the province itself. The fourth digit indicates the route,
       and the fifth a place on the route (0 for small places, alphabetically for the rest).

       The country prefix is either IT (the ISO country code), or I (the CEPT code).

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The region.

       $5  The province.

       $6  Capital or province.

       $7  The route.

       $8  The place on the route.

   $RE{zip}{Netherlands}
       Returns a pattern that recognizes Dutch postal codes. Dutch postal codes consist of 4 digits and 2 letters, separated by a space.  The
       separator can be changed using the "{-sep}" option, as discussed above. The (optional) country prefix is NL, which is both the ISO country
       code and the CEPT code. Regexp::Common 2.107 and earlier used $RE{zip}{Dutch}. This is still supported.

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The digits part of the postal code.

       $5  The separator between the digits and the letters.

       $6  The letters part of the postal code.

   $RE{zip}{Norway}
       Returns a pattern that recognizes Norwegian postal codes. Norwegian postal codes consist of four digits.

       The country prefix is either NO (the ISO country code), or N (the CEPT code).

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

   $RE{zip}{Spain}
       Returns a pattern that recognizes Spanish postal codes. Spanish postal codes consist of 5 digits. The first 2 indicate one of Spains
       fifties provinces (in alphabetical order), starting with 00. The third digit indicates a main city or the main delivery rounds. The last
       two digits are the delivery area, secondary delivery route or a link to rural areas.

       The country prefix is either ES (the ISO country code), or E (the CEPT code).

       If "{-keep}" is used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The two digits indicating the province.

       $5  The digit indicating the main city or main delivery route.

       $6  The digits indicating the delivery area, secondary delivery route or a link to rural areas.

   $RE{zip}{Switzerland}
       Returns a pattern that recognizes Swiss postal codes. Swiss postal codes consist of 4 digits. The first indicates the district, starting
       with 1. The second indicates the area, the third, the route, and the fourth the post office number.

   $RE{zip}{US}{-extended => [yes|no|allow]}
       Returns a pattern that recognizes US zip codes. US zip codes consist of 5 digits, with an optional 4 digit extension. By default,
       extensions are allowed, but not required. This can be influenced by the "-extended" option. If its argument starts with a "y", extensions
       are required; if the argument starts with a "n", extensions will not be recognized. If an extension is used, a dash is used to separate the
       main part from the extension, but this can be changed with the "-sep" option.

       The country prefix is either US (the ISO country code), or USA (the CEPT code).

       If "{-keep}" is being used, the following variables will be set:

       $1  The entire postal code.

       $2  The country code prefix.

       $3  The postal code without the country prefix.

       $4  The first 5 digits of the postal code.

       $5  The first three digits of the postal code, indicating a sectional center or a large city. New in Regexp::Common 2.119.

       $6  The last 2 digits of the 5 digit part of the postal code, indicating a post office facility or delivery area. New in Regexp::Common
	   2.119.

       $7  The separator between the 5 digit part and the 4 digit part. Up to Regexp::Common 2.118, this used to be $5.

       $8  The 4 digit part of the postal code (if any). Up to Regexp::Common 2.118, this used to be $6.

       $9  The first two digits of the 4 digit part of the postal code, indicating a sector, or several blocks. New in Regexp::Common 2.119.

       $10 The last two digits of the 4 digit part of the postal code, indicating a segment or one side of a street. New in Regexp::Common 2.119.

       You need at least version 5.005_03 to be able to use US postal codes.  Older versions contain a bug that let the pattern match invalid US
       postal codes.

       Questions

       o   Can the 5 digit part of the zip code (in theory) start with 000?

       o   Can the 5 digit part of the zip code (in theory) end with 00?

       o   Can the 4 digit part of the zip code (in theory) start with 00?

       o   Can the 4 digit part of the zip code (in theory) end with 00?

SEE ALSO

       Regexp::Common for a general description of how to use this interface.

       <http://www.columbia.edu/kermit/postal.html>
	   Frank's compulsive guide to postal addresses.

       <http://www.upu.int/post_code/en/addressing_formats_guide.shtml>
	   Postal addressing systems.

       http://www.uni-koeln.de/~arcd2/33e.htm <http://www.uni-koeln.de/~arcd2/33e.htm>
	   Postal code information.

       <http://www.grcdi.nl/linkspc.htm>
	   Links to Postcode Pages.

       <http://www1.auspost.com.au/postcodes/>
	   Information about Australian postal codes.

       http://hdusps.esecurecare.net/cgi-bin/hdusps.cfg/php/enduser/std_adp.php?p_faqid=1014 <http://hdusps.esecurecare.net/cgi-
       bin/hdusps.cfg/php/enduser/std_adp.php?p_faqid=1014>
	   Information about US postal codes.

       <http://en.wikipedia.org/wiki/Postal_code>

AUTHORS

       Damian Conway (damian@conway.org) and Abigail (regexp-common@abigail.be).

MAINTAINANCE

       This package is maintained by Abigail (regexp-common@abigail.be).

BUGS AND IRRITATIONS

       Zip codes for most countries are missing.  Send them in to regexp-common@abigail.be.

LICENSE and COPYRIGHT
       This software is Copyright (c) 2001 - 2009, Damian Conway and Abigail.

       This module is free software, and maybe used under any of the following licenses:

	1) The Perl Artistic License.	  See the file COPYRIGHT.AL.
	2) The Perl Artistic License 2.0. See the file COPYRIGHT.AL2.
	3) The BSD Licence.		  See the file COPYRIGHT.BSD.
	4) The MIT Licence.		  See the file COPYRIGHT.MIT.

perl v5.16.2							    2010-02-23						    Regexp::Common::zip(3)