Removing duplicates in fixed width file which has multiple key columns Post: 302745161

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Combining Two fixed width columns to a variable length file

Hi, I have two files. File1: File1 contains two fixed width columns ID of 15 characters length and Name is of 100 characters length. ID Name 1-43<<11 spaces>>Swapna<<94 spaces>> 1-234<<10 spaces>>Mani<<96 spaces>> 1-3456<<9 spaces>>Kapil<<95 spaces>> File2: ...

2. Shell Programming and Scripting

Removing \n within a fixed width record

I am trying to remove a line feed (\n) within a fixed width record. I tried the tr -d �\n' command, but it also removes the record delimiter. Is there a way to remove the line feed without removing the record delimiter?

3. Shell Programming and Scripting

Removing inserted newlines from a fileld of fixed width file.

Hi champs! I have a fixed width file in which the records appear like this 11111 <fixed spaces such as 6> description for 11111 <fixed spaces such as 6> some more field to the record of 11111 22222 <fixed spaces such as 6> description for 22222 <fixed spaces such as 6> some more field to the...

4. Shell Programming and Scripting

Printing Fixed Width Columns

Hi everyone, I have been working on a pretty laborious shellscript (with bash) the last couple weeks that parses my firewall policies (from a Juniper) for me and creates a nifty little columned output. It does so using awk on a line by line basis to pull out the appropriate pieces of each...

5. UNIX for Dummies Questions & Answers

Remove duplicates based on a column in fixed width file

Hi, How to output the duplicate record to another file. We say the record is duplicate based on a column whose position is from 2 and its length is 11 characters. The file is a fixed width file. ex of Record: DTYU12333567opert tjhi kkklTRG9012 The data in bold is the key on which...

6. UNIX for Dummies Questions & Answers

Removing duplicates based on key

Hi, I have the input file with the below data: 12345|12|34 12345|13|23 3456|12|90 15670|12|13 12345|10|14 3456|12|13 I need to remove the duplicates based on the first field only. I need the output like: 12345|12|34 3456|12|90 15670|12|13 The first field needs to be unique .

7. Shell Programming and Scripting

How to parse fixed-width columns which may include empty fields?

I am trying to selectively display several columns from a db2 query, which gives me a fixed-width output (partial output listed here): --------- -------------------------- ------------ ------ 000 0000000000198012 702 29 000 0000000000198013 ...

8. Shell Programming and Scripting

Remove Duplicates on multiple Key Columns and get the Latest Record from Date/Time Column

Hi Experts , we have a CDC file where we need to get the latest record of the Key columns Key Columns will be CDC_FLAG and SRC_PMTN_I and fetch the latest record from the CDC_PRCS_TS Can we do it with a single awk command. Please help....

9. Shell Programming and Scripting

Removing duplicates from delimited file based on 2 columns

Hi guys,Got a bit of a bind I'm in. I'm looking to remove duplicates from a pipe delimited file, but do so based on 2 columns. Sounds easy enough, but here's the kicker... Column #1 is a simple ID, which is used to identify the duplicate. Once dups are identified, I need to only keep the one...

LEARN ABOUT DEBIAN

dbix::class::helper::schema::lintcontents

DBIx::Class::Helper::Schema::LintContents(3pm)		User Contributed Perl Documentation	    DBIx::Class::Helper::Schema::LintContents(3pm)

NAME

       DBIx::Class::Helper::Schema::LintContents - Check the data in your database match your constraints

VERSION

       version 2.013002

SYNOPSIS

	package MyApp::Schema;

	use parent 'DBIx::Class::Schema';

	__PACKAGE__->load_components('Helper::Schema::LintContents');

	1;

       And later, somewhere else:

	say "Incorrectly Null Users:";
	for ($schema->null_check_source_auto('User')->all) {
	   say '* ' . $_->id
	}

	say "Duplicate Users:";
	my $duplicates = $schema->dup_check_source_auto('User');
	for (keys %$duplicates) {
	   say "Constraint: $_";
	   for ($duplicates->{$_}->all) {
	      say '* ' . $_->id
	   }
	}

	say "Users with invalid FK's:";
	my $invalid_fks = $schema->fk_check_source_auto('User');
	for (keys %$invalid_fks) {
	   say "Rel: $_";
	   for ($invalid_fks->{$_}->all) {
	      say '* ' . $_->id
	   }
	}

DESCRIPTION

       Some people think that constraints make their databases slower.	As silly as that is, I have been in a similar situation!  I'm here to help
       you, dear developers!  Basically this is a suite of methods that allow you to find violated "constraints."  To be clear, the constraints I
       mean are the ones you tell DBIx::Class about, real constraints are fairly sure to be followed.

METHODS

   fk_check_source
	my $busted = $schema->fk_check_source(
	  'User',
	  'Group',
	  { group_id => 'id' },
	);

       "fk_check_source" takes three arguments, the first is the from source moniker of a relationship.  The second is the to source or source
       moniker of a relationship.  The final argument is a hash reference representing the columns of the relationship.  The return value is a
       resultset of the from source that do not have a corresponding to row.  To be clear, the example given above would return a resultset of
       "User" rows that have a "group_id" that points to a "Group" that does not exist.

   fk_check_source_auto
	my $broken = $schema->fk_check_source_auto('User');

       "fk_check_source_auto" takes a single argument: the source to check.  It will check all the foreign key (that is, "belongs_to")
       relationships for missing...  "foreign" rows.  The return value will be a hashref where the keys are the relationship name and the values
       are resultsets of the respective violated relationship.

   dup_check_source
	my $smashed = $schema->fk_check_source( 'Group', ['id'] );

       "dup_check_source" takes two arguments, the first is the source moniker to be checked.  The second is an arrayref of columns that "should
       be" unique.  The return value is a resultset of the source that duplicate the passed columns.  So with the example above the resultset
       would return all groups that are "duplicates" of other groups based on "id".

   dup_check_source_auto
	my $ruined = $schema->dup_check_source_auto('Group');

       "dup_check_source_auto" takes a single argument, which is the name of the resultsource in which to check for duplicates.  It will return a
       hashref where they keys are the names of the unique constraints to be checked.  The values will be resultsets of the respective duplicate
       rows.

   null_check_source
	my $blarg = $schema->null_check_source('Group', ['id']);

       "null_check_source" tales two arguments, the first is the name of the source to check.  The second is an arrayref of columns that should
       contain no nulls.  The return value is simply a resultset of rows that contain nulls where they shouldn't be.

   null_check_source_auto
	my $wrecked = $schema->null_check_source_auto('Group');

       "null_check_source_auto" takes a single argument, which is the name of the resultsource in which to check for nulls.  The return value is
       simply a resultset of rows that contain nulls where they shouldn't be.  This method automatically uses the configured columns that have
       "is_nullable" set to false.

AUTHOR

       Arthur Axel "fREW" Schmidt <frioux+cpan@gmail.com>

COPYRIGHT AND LICENSE

       This software is copyright (c) 2012 by Arthur Axel "fREW" Schmidt.

       This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

perl v5.14.2							    2012-06-18			    DBIx::Class::Helper::Schema::LintContents(3pm)