Sponsored Content
Top Forums Shell Programming and Scripting Csv file parsing and validating Post 302898821 by shree11 on Thursday 24th of April 2014 07:56:37 AM
Old 04-24-2014
Hi, If i execute the below code i'm getting all the records in bad file(including good and bad records) and getting only the headers in good file. I'm not getting validated good records along with the headers in good file.
Code:
awk -F "," 'NR == FNR {h = (h == "") ? $1 : (h OFS $1); gsub("[)(]", "-", $2); split($2, a, "-"); d[NR] = a[1]; l[NR] = a[2]; n[NR] = ($3 == "NOT NULL") ? 1 : 0; next}
  FNR == 1 {print h > "goodrec"}
 {for(i = 1; i <= NF; i++)
  {if(((d[i] == "Numeric" && (($i + 0) == $i || $i == "")) || d[i] == "String") && (length($i) <= l[i]) && (length($i) >= n[i]))
      {f = 1} else {f = 0};
    if(f == 0) {print $0 > "badrec"; next}} print $0 > "goodrec"}' cf.txt df.txt

Output getting in the goodrec is :
Quote:
id,name,state,street_No
But the expected output in the goodrec is
Quote:
id,name,state,street_No
22,Lilly,CA,405
55,Taylor,GI,3333
77,,FL,22
Thanks,
Shree
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Help in parsing a CSV file with Shell script

I have a CSV file which contains number series as one of the fields. Some of the records of the field look like : 079661/3 I have to convert the above series as 079661 079662 079663 and store it as 3 different records. Looking for help on how to achieve this. Am a newbie at Shell... (10 Replies)
Discussion started by: mihirk
10 Replies

2. Shell Programming and Scripting

Parsing a csv file

I am trying to parse a csv file in the below 'name-value pair' format and then use the values corresponding to the name. Type:G,Instance:instance1,FunctionalID:funcid,Env:dev,AppName:appname... (6 Replies)
Discussion started by: chiru_h
6 Replies

3. Shell Programming and Scripting

CSV file parsing and validation

I have a CSV file that needs to through two seperate processes (in the end there will be 2 files (Dload.unl and Tload.unl and we'll say the input file name is mass.csv). I have a processfile() function that will call the process Dload funtion. In Dload I want to read mass.csv into Dload and then... (1 Reply)
Discussion started by: dolo21taf
1 Replies

4. Shell Programming and Scripting

2 problems: Mailing CSV file / parsing CSV for display

I have been trying to find a good solution for this seemingly simple task for 2 days, and I'm giving up and posting a thread. I hope someone can help me out! I'm on HPUX, using sqlplus, mailx, awk, have some other tools available, but can't install stuff that isn't already in place (without a... (6 Replies)
Discussion started by: soldstatic
6 Replies

5. Shell Programming and Scripting

Parsing a CSV File

Hey guys, I'm in the process of learning PHP and BASH scripting. I'm getting there, slowly ;) I would like some help with parsing a CSV file. This file contains a list of hostnames, dates, and either Valid, Expired, or Expired Soon in the last column. Basically, I want to parse the file,... (12 Replies)
Discussion started by: dzl
12 Replies

6. Shell Programming and Scripting

Parsing complicated CSV file with sed

Yes, there is a great doc out there that discusses parsing csv files with sed, and this topic has been covered before but not enough to answer my question (unix.com forums). I'm trying to parse a CSV file that has optional quotes like the following: "Apple","Apples, are fun",3.60,4.4,"I... (3 Replies)
Discussion started by: analog999
3 Replies

7. Shell Programming and Scripting

Parsing a CSV file and deleting all rows on condition

Hello list, I am working on a csv file which contains two fields per record which contain IP addresses. What I am trying to do is find records which have identical fields(IP addresses) which occur 4(four) times, and if they do, delete all records with that specific identical field(ip address). ... (4 Replies)
Discussion started by: landossa
4 Replies

8. Shell Programming and Scripting

Help required in parsing a csv file

Hi Members, I am stuck with the following problem. Request your kind help I have an csv file which contains, 1 header record, data records and 1 footer record. Sample is as below Contents of cm_update_file_101010.csv -------------------------------------------------- ... (6 Replies)
Discussion started by: ramakanth_burra
6 Replies

9. Shell Programming and Scripting

Parsing csv file and pass to a variable

Hi, Newbie here and I need some help to parse a csv file that contains fields separated by ",". What I need to achieve here is, read the 1 line file and extract 240 fields and pass to a variable and then read the next 240 fields and pass to a variable, over and over. If anyone can assist that... (4 Replies)
Discussion started by: tmslixx
4 Replies

10. Shell Programming and Scripting

Help with Parsing a CSV File

Hello All, I have an input CSV file like below, where first row data can be in different position after every run of the tool, i.e. pzTest in below example is in column 1, but it can be also in 3 column and same for all the headers in the first row. pzTest, pzExtract, pxUpdate, pzInfo... (1 Reply)
Discussion started by: asirohi
1 Replies
Bad(3)							User Contributed Perl Documentation						    Bad(3)

NAME
PDL::Bad - PDL does process bad values DESCRIPTION
PDL has been compiled with WITH_BADVAL set to 1. Therefore, you can enter the wonderful world of bad value support in PDL. This module is loaded when you do "use PDL", "Use PDL::Lite" or "PDL::LiteF". Implementation details are given in PDL::BadValues. SYNOPSIS
use PDL::Bad; print " Bad value support in PDL is turned " . $PDL::Bad::Status ? "on" : "off" . ". "; Bad value support in PDL is turned on. and some other things VARIABLES
There are currently three variables that this module defines which may be of use. $PDL::Bad::Status Set to 1 $PDL::Bad::UseNaN Set to 1 if PDL was compiled with "BADVAL_USENAN" set, 0 otherwise. $PDL::Bad::PerPdl Set to 1 if PDL was compiled with the experimental "BADVAL_PER_PDL" option set, 0 otherwise. FUNCTIONS
badflag switch on/off/examine bad data flag if ( $a->badflag() ) { print "Data may contain bad values. "; } $a->badflag(1); # set bad data flag $a->badflag(0); # unset bad data flag A return value of 1 does not guarantee the presence of bad data in a piddle; all it does is say that we need to check for the presence of such beasties. To actually find out if there are any bad values present in a piddle, use the check_badflag method. Does support bad values. badvalue returns the value used to indicate a missing (or bad) element for the given piddle type. You can give it a piddle, a PDL::Type object, or one of $PDL_B, $PDL_S, etc. $badval = badvalue( float ); $a = ones(ushort,10); print "The bad data value for ushort is: ", $a->badvalue(), " "; If a new value is supplied via a piddle (e.g. "$a->badvalue(23)"), then the data in the supplied piddle is converted to use the new bad value as well if the data type is an integer or "$PDL::Bad::UseNaN == 0". Currently there is no way of automatically converting the bad values of already existing piddles. This could be supported - e.g. by having a per-piddle bad value or by storing a time index in the piddle structure - if required. If the $PDL::Bad::PerPdl flag is set then it is possible to change the bad value on a per-piddle basis, so $a = sequence(10); $a->badvalue(3); $a->badflag(1); $b = sequence(10); $b->badvalue(4); $b->badflag(1); will set $a to be "[0 1 2 BAD 4 5 6 7 8 9]" and $b to be "[0 1 2 3 BAD 5 6 7 8 9]". If the flag is not set then both $a and $b will be set to "[0 1 2 3 BAD 5 6 7 8 9]". Please note that the code to support per-piddle bad values is experimental in the current release. Does support bad values. orig_badvalue returns the original value used to represent bad values for a given type. This routine operates the same as badvalue, except you can not change the values. It also has an awful name. $orig_badval = orig_badvalue( float ); $a = ones(ushort,10); print "The original bad data value for ushort is: ", $a->orig_badvalue(), " "; Does support bad values. check_badflag clear the bad-value flag of a piddle if it does not contain any bad values Given a piddle whose bad flag is set, check whether it actually contains any bad values and, if not, clear the flag. It returns the final state of the bad-value flag. print "State of bad flag == ", $pdl->check_badflag; Does support bad values. isbad Signature: (a(); int [o]b()) Is a value bad? Returns a 1 if the value is bad, 0 otherwise. Also see isfinite. $a = pdl(1,2,3); $a->badflag(1); set($a,1,$a->badvalue); $b = isbad($a); print $b, " "; [0 1 0] isbad does handle bad values. The output piddles will NOT have their bad-value flag set. isgood Signature: (a(); int [o]b()) Is a value good? Returns a 1 if the value is good, 0 otherwise. Also see isfinite. $a = pdl(1,2,3); $a->badflag(1); set($a,1,$a->badvalue); $b = isgood($a); print $b, " "; [1 0 1] isgood does handle bad values. The output piddles will NOT have their bad-value flag set. nbadover Signature: (a(n); int+ [o]b()) Find the number of bad elements along the 1st dimension. This function reduces the dimensionality of a piddle by one by finding the number of bad elements along the 1st dimension. By using xchg etc. it is possible to use any dimension. $a = nbadover($b); $spectrum = nbadover $image->xchg(0,1) nbadover does handle bad values. It will set the bad-value flag of all output piddles if the flag is set for any of the input piddles. ngoodover Signature: (a(n); int+ [o]b()) Find the number of good elements along the 1st dimension. This function reduces the dimensionality of a piddle by one by finding the number of good elements along the 1st dimension. By using xchg etc. it is possible to use any dimension. $a = ngoodover($b); $spectrum = ngoodover $image->xchg(0,1) ngoodover does handle bad values. It will set the bad-value flag of all output piddles if the flag is set for any of the input piddles. nbad Returns the number of bad values in a piddle $x = nbad($data); Does support bad values. ngood Returns the number of good values in a piddle $x = ngood($data); Does support bad values. setbadat Set the value to bad at a given position. setbadat $piddle, @position @position is a coordinate list, of size equal to the number of dimensions in the piddle. This is a wrapper around set and is probably mainly useful in test scripts! perldl> $x = sequence 3,4 perldl> $x->setbadat 2,1 perldl> p $x [ [ 0 1 2] [ 3 4 BAD] [ 6 7 8] [ 9 10 11] ] Supports badvalues. setbadif Signature: (a(); int mask(); [o]b()) Set elements bad based on the supplied mask, otherwise copy across the data. $a = sequence(5,5); $a = $a->setbadif( $a % 2 ); print "a badflag: ", $a->badflag, " "; a badflag: 1 Unfortunately, this routine can not be run inplace, since the current implementation can not handle the same piddle used as "a" and "mask" (eg "$a->inplace->setbadif($a%2)" fails). Also see setvaltobad and setnantobad. The output always has its bad flag set, even if it does not contain any bad values (use check_badflag to check whether there are any bad values in the output). Any bad values in the input piddles are copied across to the output piddle. setvaltobad Signature: (a(); [o]b(); double value) Set bad all those elements which equal the supplied value. $a = sequence(10) % 3; $a->inplace->setvaltobad( 0 ); print "$a "; [BAD 1 2 BAD 1 2 BAD 1 2 BAD] This is a simpler version of setbadif, but this function can be done inplace. See setnantobad if you want to convert NaN/Inf to the bad value. The output always has its bad flag set, even if it does not contain any bad values (use check_badflag to check whether there are any bad values in the output). Any bad values in the input piddles are copied across to the output piddle. setnantobad Signature: (a(); [o]b()) Sets NaN/Inf values in the input piddle bad (only relevant for floating-point piddles). Can be done inplace. $b = $a->setnantobad; $a->inplace->setnantobad; Supports bad values. setbadtonan Signature: (a(); [o]b()) Sets Bad values to NaN (only relevant for floating-point piddles). Can be done inplace and it clears the bad flag. $b = $a->setbadtonan; $a->inplace->setbadtonan; Supports bad values. setbadtoval Signature: (a(); [o]b(); double newval) Replace any bad values by a (non-bad) value. Can be done inplace. Also see badmask. $a->inplace->setbadtoval(23); print "a badflag: ", $a->badflag, " "; a badflag: 0 The output always has its bad flag cleared. If the input piddle does not have its bad flag set, then values are copied with no replacement. copybad Signature: (a(); mask(); [o]b()) Copies values from one piddle to another, setting them bad if they are bad in the supplied mask. Can be done inplace. $a = byte( [0,1,3] ); $mask = byte( [0,0,0] ); set($mask,1,$mask->badvalue); $a->inplace->copybad( $mask ); p $a; [0 BAD 3] It is equivalent to: $c = $a + $mask * 0 Handles bad values. CHANGES
The experimental "BADVAL_PER_PDL" configuration option, which - when set - allows per-piddle bad values, was added after the 2.4.2 release of PDL. The "" variable can be inspected to see if this feature is available. AUTHOR
Doug Burke (djburke@cpan.org), 2000, 2001, 2003, 2006. The per-piddle bad value support is by Heiko Klein(2006). All rights reserved. There is no warranty. You are allowed to redistribute this software / documentation under certain conditions. For details, see the file COPYING in the PDL distribution. If this file is separated from the PDL distribution, the copyright notice should be included in the file. perl v5.12.1 2010-07-05 Bad(3)
All times are GMT -4. The time now is 11:30 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy