Matching number of syllables on right-hand and left side
I am developing a database for translating names. I have mapped through a rule engine syllables in English to syllables in Indic, delimited by an equal to sign.
An example will illustrate this
However it so happens that due to an error or inconsistency in syllable divisions the number of syllables on the right hand side do not match the number of syllables on the left hand side.
In the first two instances 2 on the left hand, 3 on the right. In the next two, three on the left and four on the right and in the last case, four on the left and five on the right
I need a script in Perl or Awk which can identify such discrepancies and separate out the database in two files: clean and inconsistent
I work in a Windows environment but have loaded Sed also; however, I am more comfortable with Awk or Perl. The database is around 200,000 entries.
Many thanks for your help
Hi,
I remember once seeing a way to get the left most string in a word.
Let's say: a="First.Second.Third" (separated by dot)
echo ${a#*.} shows --> Second.Third
echo ${a##*.} shows --> Third
How do I get the the left most string "First" Or "First.Second" ???
Tried to replace #... (2 Replies)
i am new to shell scripting. i want to keep on increamenting a 6 digit number. For eg. 000000 + 1 = 000001 But instead of 000001 i get only 1. How do i do this ? Pls help. (8 Replies)
guys,
i am writing a .ksh file to ssh to a remote machine and change all occurances of .ixf to .WIP like this :
-->>> for i in *.ixf do echo $i done mv $i $i.WIP exit <<---
--> this returns .ixf.WIP - i can live with that.
then i need to sftp from another remote machine, copy the files... (5 Replies)
does any one have any ideas how i would go about calculating the number of days left in the month from a bash script ?. I want to do some operations on a csv file according to the result (8 Replies)
I have a awk file which consists of the follwoing code in file select.awk :
/xxx/ {
time = gensub(/xxx \*\*\*(.*)/, "\\1", "g")
printf("%s\n",time)
next
}
and an input file with the following file file.txt :-
xxx ***Wed May 2 18:00:00 CDT 2012
AAA AAAA AAAA xxx... (4 Replies)
Hello,
I am working on an Urdu to Hindi dictionary which has the following structure:
a=b
a=c
n=d
n=q
and so on.
i.e. Headword separated from gloss by a =
I am giving below a live sample
بتا=बता
بتا=बित्ता
بتا=बुत्ता
بتان=बतान
بتان=बितान
بتانا=बिताना
I need the following... (3 Replies)
I need to work with records having #AX in the EXP1 , please see my data sample and my attempt below:
$ cat xx
08:30:33 KEY1 (1255) EXP1 VAL:20AX0030006
08:30:33 KEY1 (1255) EXP1 VAL:20AX0030006
08:30:33 KEY1 (1255) EXP1 VAL:20AW0030006
08:30:33 KEY1 (1255) EXP1 VAL:20AW0030006
$ gawk '{... (1 Reply)
Hello,
I have a large file with the following structure
Englishpseudo syllableEnglishpseudo syllable=IndicsyllableIndicsyllable
An example will make this clear:
la l=ला ल
gi ta=गी ता
ka la va ti=कa ला वa ती
ma h to=मa ह तो
ra je sh=रा जे श
a sha=आ शा
ra me sh=रa मे श
san ja y=सं जa य... (3 Replies)
Discussion started by: gimley
3 Replies
LEARN ABOUT CENTOS
locale::codes::langvar
Locale::Codes::LangVar(3) User Contributed Perl Documentation Locale::Codes::LangVar(3)NAME
Locale::Codes::LangVar - standard codes for language variation identification
SYNOPSIS
use Locale::Codes::LangVar;
$lvar = code2langvar('acm'); # $lvar gets 'Mesopotamian Arabic'
$code = langvar2code('Mesopotamian Arabic'); # $code gets 'acm'
@codes = all_langvar_codes();
@names = all_langvar_names();
DESCRIPTION
The "Locale::Codes::LangVar" module provides access to standard codes used for identifying language variations, such as those as defined in
the IANA language registry.
Most of the routines take an optional additional argument which specifies the code set to use. If not specified, the default IANA language
registry codes will be used.
SUPPORTED CODE SETS
There are several different code sets you can use for identifying language variations. A code set may be specified using either a name, or
a constant that is automatically exported by this module.
For example, the two are equivalent:
$lvar = code2langvar('arevela','alpha');
$lvar = code2langvar('arevela',LOCALE_LANGVAR_ALPHA);
The codesets currently supported are:
alpha
This is the set of alphanumeric codes from the IANA language registry, such as 'arevela' for Eastern Armenian.
This code set is identified with the symbol "LOCALE_LANGVAR_ALPHA".
This is the default code set.
ROUTINES
code2langvar ( CODE [,CODESET] )
langvar2code ( NAME [,CODESET] )
langvar_code2code ( CODE ,CODESET ,CODESET2 )
all_langvar_codes ( [CODESET] )
all_langvar_names ( [CODESET] )
Locale::Codes::LangVar::rename_langvar ( CODE ,NEW_NAME [,CODESET] )
Locale::Codes::LangVar::add_langvar ( CODE ,NAME [,CODESET] )
Locale::Codes::LangVar::delete_langvar ( CODE [,CODESET] )
Locale::Codes::LangVar::add_langvar_alias ( NAME ,NEW_NAME )
Locale::Codes::LangVar::delete_langvar_alias ( NAME )
Locale::Codes::LangVar::rename_langvar_code ( CODE ,NEW_CODE [,CODESET] )
Locale::Codes::LangVar::add_langvar_code_alias ( CODE ,NEW_CODE [,CODESET] )
Locale::Codes::LangVar::delete_langvar_code_alias ( CODE [,CODESET] )
These routines are all documented in the Locale::Codes::API man page.
SEE ALSO
Locale::Codes
The Locale-Codes distribution.
Locale::Codes::API
The list of functions supported by this module.
http://www.iana.org/assignments/language-subtag-registry
The IANA language subtag registry.
AUTHOR
See Locale::Codes for full author history.
Currently maintained by Sullivan Beck (sbeck@cpan.org).
COPYRIGHT
Copyright (c) 2011-2013 Sullivan Beck
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.16.3 2013-04-12 Locale::Codes::LangVar(3)