Finding longest line in a Record Post: 302515078

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Finding a character in first line of a record

HI, I am pretty new to Unix scripting. I will need help in Finding a character in first line of a file or a set of files. The scenario is as follows: Lets consider a set of files which is having a character "ID"(without quotes) in the first line of each file.I need to find this character...

2. Shell Programming and Scripting

Find the length of the longest line

Dear All, To find the length of the longest line from a file i have used wc -L which is giving the proper output... But the problem is AIX os does not support wc -L command. so is there any other way 2 to find out the length of the longest line using awk or sed ? Regards, Pankaj

3. Shell Programming and Scripting

Finding longest common substring among filenames

I will be performing a task on several directories, each containing a large number of files (2500+) that follow a regular naming convention: YYYY_MM_DD_XX.foo_bar.A.B.some_different_stuff.EXT What I would like to do is automatically discover the part of the filenames that are common to all...

4. Shell Programming and Scripting

Bash script find longest line/lines in several files

Hello everyone... I need to find out, how to find longest line or possibly lines in several files which are arguments for script. The thing is, that I tried some possibilities before, but nothing worked correctly. Example when i use: awk ' { if ( length > L ) { L=length ;s=$0 } }END{ print...

5. Shell Programming and Scripting

Reject the record if the record in the next line does not satisfy the pattern

Hi, I have a input file with the following entries: 1one 2two 3three 1four 2five 3six 1seven 1eight 1nine 2ten The output should be 1one 2two 3three 1four 2five 3six

6. Shell Programming and Scripting

Reject the record if the record in the next line does not begin with 2.

Hi, I have a input file with the following entries: 1one 2two 3three 1four 2five 3six 1seven 1eight 1nine 2ten 2eleven 2twelve 1thirteen 2fourteen The output should be:

7. Shell Programming and Scripting

finding the incorrect record

Hi , I have a scenario where i have to find the incorrect records in the file. In our comma delimited file , we have the following to be taken care : 1) if there is new line character then we have to capture the current line and the next line as error record Ex : In a comma...

8. Shell Programming and Scripting

Finding the length of the longest column

Hi, I am trying to figure out how to get the length of the longest column in the entire file (because the length varies from one row to the other) I was doing this at first to check how many fields I have for the first row: awk '{print NF; exit}' file Now, I can do this: awk '{ if...

9. Shell Programming and Scripting

wc -L giving incorrect length of longest line

Running below line gives 3957 as length of longest line in file 20121119_SRMNotes_init.dat awk ' { if ( length > 3950 ) { x = length } }END{ print x }' 20121119_SRMNotes_init.dat While wc -L 20121119_SRMNotes_init.dat gives output as 4329. Why is there a difference between these two commands....

10. Shell Programming and Scripting

Finding the Latest record

Dear All, I have getting data as follows, the second field signifies table name and last one is time stamp. I have return always latest record based on time stamp. Could you please help me ? I/P ==== ...

LEARN ABOUT DEBIAN

xml::sax::byrecord

XML::SAX::ByRecord(3pm) 				User Contributed Perl Documentation				   XML::SAX::ByRecord(3pm)

NAME

       XML::SAX::ByRecord - Record oriented processing of (data) documents

SYNOPSIS

	   use XML::SAX::Machines qw( ByRecord ) ;

	   my $m = ByRecord(
	       "My::RecordFilter1",
	       "My::RecordFilter2",
	       ...
	       {
		   Handler => $h, ## optional
	       }
	   );

	   $m->parse_uri( "foo.xml" );

DESCRIPTION

       XML::SAX::ByRecord is a SAX machine that treats a document as a series of records.  Everything before and after the records is emitted as-
       is while the records are excerpted in to little mini-documents and run one at a time through the filter pipeline contained in ByRecord.

       The output is a document that has the same exact things before, after, and between the records that the input document did, but which has
       run each record through a filter.  So if a document has 10 records in it, the per-record filter pipeline will see 10 sets of (
       start_document, body of record, end_document ) events.  An example is below.

       This has several use cases:

       o   Big, record oriented documents

	   Big documents can be treated a record at a time with various DOM oriented processors like XML::Filter::XSLT.

       o   Streaming XML

	   Small sections of an XML stream can be run through a document processor without holding up the stream.

       o   Record oriented style sheets / processors

	   Sometimes it's just plain easier to write a style sheet or SAX filter that applies to a single record at at time, rather than having to
	   run through a series of records.

   Topology
       Here's how the innards look:

	  +-----------------------------------------------------------+
	  |		     An XML:SAX::ByRecord		      |
	  |    Intake						      |
	  |   +----------+    +---------+	  +--------+  Exhaust |
	--+-->| Splitter |--->| Stage_1 |-->...-->| Merger |----------+----->
	  |   +----------+    +---------+	  +--------+	      |
	  |		  			       ^	      |
	  |		   			       |	      |
	  |		    +---------->---------------+	      |
	  |		      Events not in any records 	      |
	  |							      |
	  +-----------------------------------------------------------+

       The "Splitter" is an XML::Filter::DocSplitter by default, and the "Merger" is an XML::Filter::Merger by default.  The line that bypasses
       the "Stage_1 ..." filter pipeline is used for all events that do not occur in a record.	All events that occur in a record pass through the
       filter pipeline.

   Example
       Here's a quick little filter to uppercase text content:

	   package My::Filter::Uc;

	   use vars qw( @ISA );
	   @ISA = qw( XML::SAX::Base );

	   use XML::SAX::Base;

	   sub characters {
	       my $self = shift;
	       my ( $data ) = @_;
	       $data->{Data} = uc $data->{Data};
	       $self->SUPER::characters( @_ );
	   }

       And here's a little machine that uses it:

	   $m = Pipeline(
	       ByRecord( "My::Filter::Uc" ),
	       $out,
	   );

       When fed a document like:

	   <root> a
	       <rec>b</rec> c
	       <rec>d</rec> e
	       <rec>f</rec> g
	   </root>

       the output looks like:

	   <root> a
	       <rec>B</rec> c
	       <rec>C</rec> e
	       <rec>D</rec> g
	   </root>

       and the My::Filter::Uc got three sets of events like:

	   start_document
	   start_element: <rec>
	   characters:	  'b'
	   end_element:   </rec>
	   end_document

	   start_document
	   start_element: <rec>
	   characters:	  'd'
	   end_element:   </rec>
	   end_document

	   start_document
	   start_element: <rec>
	   characters:	 'f'
	   end_element:   </rec>
	   end_document

METHODS

       new
	       my $d = XML::SAX::ByRecord->new( @channels, \%options );

	   Longhand for calling the ByRecord function exported by XML::SAX::Machines.

CREDIT

       Proposed by Matt Sergeant, with advise by Kip Hampton and Robin Berjon.

Writing an aggregator.
       To be written.  Pretty much just that "start_manifold_processing" and "end_manifold_processing" need to be provided.  See
       XML::Filter::Merger and it's source code for a starter.

perl v5.10.0							    2009-06-11						   XML::SAX::ByRecord(3pm)