Glad to hear that you solved it yourself. May I suggest some optimization for your code?
Parsing fixed length records is easy with GNU awk. Working on the sample file you gave above:
With GNU awk, it is as easy as this:
With original awk or nawk, you need to create your own parse function:
Both snippets will yield:
When you define the fieldwidths values, don't forget to include the spaces.
In awk, I'd like to check if a field is blank.
And by blank I mean, the field could be "" or " "
In other words, the field could either be empty, or be filled with spaces.
Would the regex look like this?
$5 ~ // { Action }?
What other ways are there?
Hmm.. in any case I think... (7 Replies)
Hello,
I have a record which split with "," I would like to check..if the field is empty and it will field "NIL" into the field.
input
45111,40404,peter,,0303403,0,030304,john,,9,0,
output
45111,40404,peter,NIL,0303403,0,030304,john,NIL,9,0, (8 Replies)
Hello,
I have a file with a lot of record like below:
00001,CUSTR,CUSTOMER ADDRESS,02310,N,0:00,0,0,0,0,0,0,0,0,0,0,0,0:00,0,0,0,0,0,CSH,ACC
Can I validate each record in the file and output the incorrect result?
field 1 - customer number, should be "5 digit.
field 2 - should be 5... (9 Replies)
Hi
I use the following code to read the file and to fix the length of the column of the record in the file 'Sample.txt'
ls Samp* | awk '
{ a=$1 }
END{
FS="n"
for(i=1;i<=NR;i++)
{
while( getline < a )
{
f1=$0;
print("Line::",f1);
f2=substr(f1,1,10)
print("Field1::",f2);... (10 Replies)
Hi all,
I have a huge file and I need to get ride of the fields 6-11 and replace the blanks in field 5 with a missing value(99999).
159,93848,5354,343,67898,45,677,5443,434,5545,45
677,45545,3522,244,
554,54344,3342,456,
344,43443,2344,444,23477... (12 Replies)
Hi everyone..
I'm trying to eliminate multiple whitespaces from a file..
I must make use of shell script to eliminate whitespaces..
Take a look at the sample file
1 int main()
2 {
3 int a,b;
4 printf("Enter the values of a and b");
5 scanf("%d%d",&a,&b);
6 if(a>b) ... (6 Replies)
I have this
text.filecharles darwin sam delight
george washington johnson culper
darwin sam delight
micheal jackson penny lite and would like to remove the row, if the first field is blank. so the result would be:
result.filecharles darwin sam ... (4 Replies)
Hi,
I have the following file, I'd like to add an additional blank field to this file
This is a tab delimited file, I have tried the same thing on excel, but looking for a unix solution.
Here is my input:
Country Postal Admin4 StreetBaseName StreetType
HUN 2243 Kóka Dózsa György ... (3 Replies)
Find a blank field
Hi
I have set of fields that have some blank values, how to find that and get its line noumbers in output file.
Ex:
Col1 col2 col3
11 ss 103
12 104
13 105
14 se 106 (2 Replies)
Hi All,
i have a file like
col1 col2 col3
13 24 NA
12 13 14
11 12 13
14 22 NA
18 26 NA
in this file if i found "NA" other values in the line are also replace by NA
Could you help me! (7 Replies)
Discussion started by: Shenbaga.d
7 Replies
LEARN ABOUT DEBIAN
locale::script
Locale::Script(3perl) Perl Programmers Reference Guide Locale::Script(3perl)NAME
Locale::Script - standard codes for script identification
SYNOPSIS
use Locale::Script;
$script = code2script('phnx'); # 'Phoenician'
$code = script2code('Phoenician'); # 'Phnx'
$code = script2code('Phoenician',
LOCALE_CODE_NUMERIC); # 115
@codes = all_script_codes();
@scripts = all_script_names();
DESCRIPTION
The "Locale::Script" module provides access to standards codes used for identifying scripts, such as those defined in ISO 15924.
Most of the routines take an optional additional argument which specifies the code set to use. If not specified, the default ISO 15924
four-letter codes will be used.
SUPPORTED CODE SETS
There are several different code sets you can use for identifying scripts. The ones currently supported are:
alpha
This is a set of four-letter (capitalized) codes from ISO 15924 such as 'Phnx' for Phoenician.
This code set is identified with the symbol "LOCALE_SCRIPT_ALPHA".
The Zxxx, Zyyy, and Zzzz codes are not used.
This is the default code set.
numeric
This is a set of three-digit numeric codes from ISO 15924 such as 115 for Phoenician.
This code set is identified with the symbol "LOCALE_SCRIPT_NUMERIC".
ROUTINES
code2script ( CODE [,CODESET] )
script2code ( NAME [,CODESET] )
script_code2code ( CODE ,CODESET ,CODESET2 )
all_script_codes ( [CODESET] )
all_script_names ( [CODESET] )
Locale::Script::rename_script ( CODE ,NEW_NAME [,CODESET] )
Locale::Script::add_script ( CODE ,NAME [,CODESET] )
Locale::Script::delete_script ( CODE [,CODESET] )
Locale::Script::add_script_alias ( NAME ,NEW_NAME )
Locale::Script::delete_script_alias ( NAME )
Locale::Script::rename_script_code ( CODE ,NEW_CODE [,CODESET] )
Locale::Script::add_script_code_alias ( CODE ,NEW_CODE [,CODESET] )
Locale::Script::delete_script_code_alias ( CODE [,CODESET] )
These routines are all documented in the Locale::Codes man page.
SEE ALSO
Locale::Codes
Locale::Constants
http://www.unicode.org/iso15924/
Home page for ISO 15924.
AUTHOR
See Locale::Codes for full author history.
Currently maintained by Sullivan Beck (sbeck@cpan.org).
COPYRIGHT
Copyright (c) 1997-2001 Canon Research Centre Europe (CRE).
Copyright (c) 2001-2010 Neil Bowers
Copyright (c) 2010-2011 Sullivan Beck
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.14.2 2011-09-26 Locale::Script(3perl)