Post 302784517 by usshadowop on Friday 22nd of March 2013 11:33:33 AM
Complex string operation (awk, sed, other?)

I have a file that contains RewriteRules for 200 countries (2 examples for 1 country below):

Code:
RewriteRule ^/at(/|/index.html|)$ http://%{HTTP_HOST}/locate/index.html?locale=de_AT [R=301,L]

#&

RewriteRule ^/at_english(/|/index.html|)$ http://%{HTTP_HOST}/locate/index.html?locale=en_AT [R=301,L]

I have another list of redirects for the mobile versions of these sites in the following format:
Code:
RewriteRule ^/at_english(/|/index.html|)$ http://%{HTTP_HOST}m.website.com/www.website.com/at_english [R=301,L]

Bear in mind that at_english is just one of the country codes; there are many more.

So my goal is to go from

Code:
RewriteRule ^/at_english(/|/index.html|)$ http://%{HTTP_HOST}/locate/index.html?locale=en_AT [R=301,L]

#to

RewriteRule ^/at_english(/|/index.html|)$ http://%{HTTP_HOST}m.website.com/www.website.com/at_english [R=301,L]

Below is the awk pseudo code for one way I've thought of doing it.

Code:
awk '
{
newurl="m.website.com/www.website.com/"
one=substr($0,1,14)
two=substr($1,13,37)
rest=substr($4,1)

# The line below this comment is the section I'm having difficulty with, because
# I have country codes in multiple formats: at / at_english / at_french
# I want to select all characters between ^/ and (
code=substr($2,1)

printf ("%s%s%s%s%s %s\n", one,code,two,newurl,code, rest)
}' input

So I either need help converting the pseudo code into actual code, or suggestions on a better way to do this operation.

Thank you for any help
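
One possible way to do this, offered as a sketch rather than a tested production script: let awk's match() find everything between ^/ and the first ( in the second field, then rebuild the line around it. The function name to_mobile_rules is hypothetical, and the sketch assumes every rule has the layout shown above (the pattern in field 2, the flags in the last field, fields separated by spaces). The newurl value is the one from the pseudo code.

```shell
# Hypothetical helper: rewrite desktop locale rules into mobile redirects.
to_mobile_rules() {
  awk -v newurl='m.website.com/www.website.com/' '
    # Only touch RewriteRule lines whose pattern starts with ^/ followed by a code.
    $1 == "RewriteRule" && match($2, /^\^\/[^(]+/) {
      # match() sets RSTART and RLENGTH; skip the leading 2 characters (^ and /).
      code = substr($2, RSTART + 2, RLENGTH - 2)   # e.g. at, at_english, at_french
      rest = substr($2, RSTART + RLENGTH)          # e.g. (/|/index.html|)$
      printf "%s ^/%s%s http://%%{HTTP_HOST}%s%s %s\n", $1, code, rest, newurl, code, $NF
      next
    }
    { print }   # comments and blank lines pass through unchanged
  ' "$@"
}
```

Something like `to_mobile_rules input > mobile_rules.conf` would then write the converted rules to a new file (mobile_rules.conf is only an example name). Because the code is captured with [^(]+ rather than a fixed substr() offset, its length does not matter.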
 
