Check whether a given file is in ASCII format and data is tab-delimited
Post 302116097 by cfajohnson on Tuesday 1st of May 2007 07:30:02 PM
Quote:
Originally Posted by Mandab
Hi,
I am trying to execute the following script but I am getting error:

What is the error?
Quote:
My requirement is to check whether the data in the file is tab delimited and pass error as say "105" to var1 and desc as "not tab delimited" to var2 and also check for the data which starts from 3rd line of the file. If the above condition satisfies it should check whether field 1 datatype and length(numeric(9)) are same or not and also whether it is null. If not var1 = "101" and var2 desc "Missing/wrong field1", field 2 datatype and length(char(9)) are same or not also for null. if not then var1 ="102" var2 "Missing/wrong field2" and so on. Any help would be appreciated.


Here is the code:

If it's code, please put it inside [CODE] tags so that it is properly formatted.
Quote:
#!/bin/ksh
eval $(awk 'BEGIN { IFS = "\t" }

What is the output of the awk script that you expect to eval?

In order to use eval, you need to output valid shell code.
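
To illustrate what "valid shell code" means here, this is a minimal sketch, not a finished solution. It assumes the data starts on line 3 (as in the question) and uses a made-up sample file; the code "105" and the description come from the question. The awk script prints shell assignments, and eval runs them in the calling shell:

```shell
# Hypothetical sample data: two header lines, then a line with no tab in it.
datafile=$(mktemp)
printf 'header\nheader\n123456789 abcdefghi\n' > "$datafile"

# awk prints shell assignments; eval executes them in the current shell.
# \047 is the octal escape for a single quote, so the shell sees
# var2='not tab delimited'.
eval "$(awk 'NR >= 3 && !/\t/ {
    printf "var1=105\nvar2=\047not tab delimited\047\n"
    exit
}' "$datafile")"

echo "$var1"
echo "$var2"
rm -f "$datafile"
```

With the sample file above, this prints 105 and then "not tab delimited". If awk prints nothing, eval does nothing and the variables stay unset.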
Quote:
NR>=3 {print $1}
!/\t/ ## check whether lines contain tab else var1="105" and var2="No Tabs"
{
if ( length($1) == 0 || $1 !~ /[^0-9]/ ) ## check for null and numeric value and length(9)

You haven't checked that the length is 9. As written, the condition is true when the field is empty or when it contains nothing but digits, so a valid numeric field is the one that gets reported as an error.
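
For comparison, a test that does cover both "numeric" and "length 9" might look like the sketch below. The function name check_field1 is made up for this illustration, and I am assuming "numeric(9)" means exactly nine digits:

```shell
# check_field1 is a hypothetical helper used only for this illustration.
# It prints "ok" when the first tab-separated field is exactly nine digits.
check_field1() {
    printf '%s\n' "$1" | awk -F'\t' '{
        # Anchored regex: digits only; length() then enforces the size.
        if ($1 ~ /^[0-9]+$/ && length($1) == 9) print "ok"
        else print "bad"
    }'
}

check_field1 "$(printf '123456789\tabcdefghi')"   # nine digits
check_field1 "$(printf '12345\tabcdefghi')"       # too short
check_field1 "$(printf '12345678x\tabcdefghi')"   # contains a non-digit
check_field1 "$(printf '\tabcdefghi')"            # empty first field
```

The four calls print ok, bad, bad, bad. An empty field fails the regex, so no separate null check is needed.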
Quote:
then

That is not awk syntax.
Quote:
var1="101"
var2="Missing or wrong First Field"
elif ( length($2) == 0 || $2 !~ /[a-zA-Z]/ ) ## check for null and char value and length(9)

There is no 'then', 'elif', or 'fi' keyword in awk.

You still haven't (even after the syntax is fixed) checked that the length is 9. The condition is true only when the field is empty or contains no letters at all, so a value that mixes letters with other characters would pass.
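
In awk, the equivalent of the shell's if/then/elif/fi is if (...) { ... } else if (...) { ... } else { ... }. Here is a rough sketch of that shape using the codes 101 and 102 from the question. The function name validate_line, the reading of "char(9)" as exactly nine letters, and the var1=0 / var2=OK success values are my own assumptions:

```shell
# validate_line is a hypothetical helper; it prints shell assignments
# describing the first problem found in one tab-delimited line.
validate_line() {
    printf '%s\n' "$1" | awk -F'\t' '{
        if ($1 !~ /^[0-9]+$/ || length($1) != 9) {
            print "var1=101"
            print "var2=\047Missing/wrong field1\047"
        } else if ($2 !~ /^[a-zA-Z]+$/ || length($2) != 9) {
            print "var1=102"
            print "var2=\047Missing/wrong field2\047"
        } else {
            # Assumed success values; the question does not define any.
            print "var1=0"
            print "var2=OK"
        }
    }'
}

eval "$(validate_line "$(printf '123456789\tabc')")"
echo "$var1: $var2"
```

For the sample line, field 1 is fine and field 2 is too short, so this prints "102: Missing/wrong field2".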
Quote:
then
var1="102"
var2="Missing or Wrong Second Field"
fi
}
}' $1)

echo "$var1"
echo "$var2"

I suggest that you start with the code I posted, and tell us what it lacks. (Reply directly to that post, quoting relevant segments.)

 
