How to remove page breaks from a flat file??? Post: 302114472

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Insert page breaks into .csv file

I have large .csv files that I need to get page breaks into. I am taking comma delimited files of over a million records and putting them into a pdf file. Is there a way, using sed or otherwise, to insert some type of page break character into my file?

2. Programming

Page Breaks

Hi, I have a program in Pro*C. When I run it directly, I have no problem with the output, but when it runs via the at command, the output has page breaks every 66 lines. I don't want those page breaks in the output. Any ideas?

3. Shell Programming and Scripting

Help on page breaks

Hi, I am new to Unix (AIX). I have a header (in a text file) that needs to be written on every page of a result file (text file). After the header is written, data needs to be read from a file A (text file) and inserted into the result file. If the number of lines reaches 80 in a page, page...

4. UNIX for Dummies Questions & Answers

how to remove the first line from a flat file ?

Hi, I want to remove the first line from a flat file using a Unix command that is as simple as possible. Can anybody give me a hand? Thanks in advance. xli

5. UNIX for Dummies Questions & Answers

How to remove numeric characters in the flat file

Hi, can anyone help me please? I have a flat file like this: qwer123rt ass3242ccf jjk654 kjh838ppp nhdg453ok hdkk34. I want to remove the numeric characters from the flat file, so that the output looks like this: qwerrt assccf jjk kjhppp nhdgok hdkk. Help me...

6. Shell Programming and Scripting

Remove line breaks in csv file using shell script

Hi All, I have a CSV file in which a record is sometimes broken across more than one line. I want to combine those pieces into one line and remove the unwanted character in the record, i.e. the double quote symbol ("). The line breaks only when the record contains double...

7. Shell Programming and Scripting

script for adding page number before page breaks

Hi, if there is an expert who can help: I have many txt files produced by pdftotext that include page breaks. The page breaks seem to be Unix-style form feeds (hex 0C). I want to add a page number before each page break, as in: Page XXXX. Regards, antman

8. UNIX for Advanced & Expert Users

Remove duplicates in flat file

Hi all, I have an issue while loading a flat file into the DB: it takes a long time. When I analyzed it, I found that there are duplicate entries in the flat file. There are 2 types of duplicate entry. 1) The entire row is a duplicate (I can use sort | uniq to remove the duplicated entry). 2) the...

9. UNIX for Dummies Questions & Answers

Page breaks and line breaks

Hi All, Need an urgent solution to an issue. We have created a ksh shell script which generates one DAT file. The DAT file contains the extract of a select statement. Now the issue is, when we execute the ksh file, the output comes with page breaks and line breaks. We have...

10. Shell Programming and Scripting

Remove first NULL Character in Flat File

We have a flat file with the data below: ^@^@^@^@00000305^@^@^@^@^@^@430^@430^@^@^@^@^@^@^@^@^@09079989530 As we can see, ^@ is the null character in this file. I want to remove only the first few null characters before the string 00000305. How can we do that? Any ideas? I want a new file without first few...

XML::SAX::ByRecord(3pm) 				User Contributed Perl Documentation				   XML::SAX::ByRecord(3pm)

NAME

       XML::SAX::ByRecord - Record oriented processing of (data) documents

SYNOPSIS

	   use XML::SAX::Machines qw( ByRecord ) ;

	   my $m = ByRecord(
	       "My::RecordFilter1",
	       "My::RecordFilter2",
	       ...
	       {
		   Handler => $h, ## optional
	       }
	   );

	   $m->parse_uri( "foo.xml" );

DESCRIPTION

       XML::SAX::ByRecord is a SAX machine that treats a document as a series of records.  Everything before and after the records is emitted
       as-is, while the records are excerpted into little mini-documents and run one at a time through the filter pipeline contained in ByRecord.

       The output is a document that has the same exact things before, after, and between the records that the input document did, but which has
       run each record through a filter.  So if a document has 10 records in it, the per-record filter pipeline will see 10 sets of (
       start_document, body of record, end_document ) events.  An example is below.

       This has several use cases:

       o   Big, record oriented documents

	   Big documents can be treated a record at a time with various DOM oriented processors like XML::Filter::XSLT.

       o   Streaming XML

	   Small sections of an XML stream can be run through a document processor without holding up the stream.

       o   Record oriented style sheets / processors

	   Sometimes it's just plain easier to write a style sheet or SAX filter that applies to a single record at a time, rather than having to
	   run through a series of records.

   Topology
       Here's how the innards look:

	  +-----------------------------------------------------------+
	  |		     An XML::SAX::ByRecord		      |
	  |    Intake						      |
	  |   +----------+    +---------+	  +--------+  Exhaust |
	--+-->| Splitter |--->| Stage_1 |-->...-->| Merger |----------+----->
	  |   +----------+    +---------+	  +--------+	      |
	  |		  			       ^	      |
	  |		   			       |	      |
	  |		    +---------->---------------+	      |
	  |		      Events not in any records 	      |
	  |							      |
	  +-----------------------------------------------------------+

       The "Splitter" is an XML::Filter::DocSplitter by default, and the "Merger" is an XML::Filter::Merger by default.  The line that bypasses
       the "Stage_1 ..." filter pipeline is used for all events that do not occur in a record.	All events that occur in a record pass through the
       filter pipeline.
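
       The routing in the diagram can be sketched outside Perl.  The following is a rough Python analogy using the standard xml.sax module,
       not the XML::SAX::ByRecord API.  It assumes that a record is any direct child of the root element; RecordRouter, run_by_record and the
       stage argument are hypothetical names made up for this illustration.

```python
import io
import xml.sax
from xml.sax.saxutils import XMLGenerator


class RecordRouter(xml.sax.ContentHandler):
    """Toy stand-in for the Splitter/Merger pair (not the Perl API).

    Assumption: a record is any element that is a direct child of the
    root element, so depth >= 2 means "inside a record".
    """

    def __init__(self, stage, out):
        super().__init__()
        self.stage = stage  # hypothetical per-record text filter
        self.merged = XMLGenerator(out, encoding="utf-8")  # plays the Merger
        self.depth = 0

    def startElement(self, name, attrs):
        self.depth += 1
        self.merged.startElement(name, attrs)

    def endElement(self, name):
        self.merged.endElement(name)
        self.depth -= 1

    def characters(self, content):
        if self.depth >= 2:
            # Inside a record: the event goes through the filter stage.
            content = self.stage(content)
        # Outside any record: the event takes the bypass line unchanged.
        self.merged.characters(content)


def run_by_record(xml_text, stage):
    """Parse xml_text and return the merged output as a string."""
    out = io.StringIO()
    xml.sax.parseString(xml_text.encode("utf-8"), RecordRouter(stage, out))
    return out.getvalue()


print(run_by_record("<root> a <rec>b</rec> c <rec>d</rec> e</root>", str.upper))
```

       Only the text inside the rec elements reaches the stage, so the call above prints <root> a <rec>B</rec> c <rec>D</rec> e</root>.  A real
       ByRecord machine forwards every SAX event of a record through the pipeline, not only character data.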

   Example
       Here's a quick little filter to uppercase text content:

	   package My::Filter::Uc;

	   use vars qw( @ISA );
	   @ISA = qw( XML::SAX::Base );

	   use XML::SAX::Base;

	   ## Uppercase the text of each characters event, then pass it on.
	   sub characters {
	       my $self = shift;
	       my ( $data ) = @_;
	       $data->{Data} = uc $data->{Data};
	       $self->SUPER::characters( @_ );	## forward to the next handler
	   }

       And here's a little machine that uses it:

	   use XML::SAX::Machines qw( Pipeline ByRecord );

	   $m = Pipeline(
	       ByRecord( "My::Filter::Uc" ),
	       $out,	## output destination, set up elsewhere
	   );

       When fed a document like:

	   <root> a
	       <rec>b</rec> c
	       <rec>d</rec> e
	       <rec>f</rec> g
	   </root>

       the output looks like:

	   <root> a
	       <rec>B</rec> c
	       <rec>D</rec> e
	       <rec>F</rec> g
	   </root>

       and the My::Filter::Uc got three sets of events like:

	   start_document
	   start_element: <rec>
	   characters:	  'b'
	   end_element:   </rec>
	   end_document

	   start_document
	   start_element: <rec>
	   characters:	  'd'
	   end_element:   </rec>
	   end_document

	   start_document
	   start_element: <rec>
	   characters:	  'f'
	   end_element:   </rec>
	   end_document
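
       The per-record framing shown above can be reproduced with a small Python sketch, again using xml.sax as an analogy and not the Perl
       module.  It assumes that a record is any direct child of the root element; RecordTrace and trace_records are hypothetical names.  Each
       record gets its own event list, wrapped in start_document and end_document markers, and events outside any record are not traced.

```python
import xml.sax


class RecordTrace(xml.sax.ContentHandler):
    """Collect one list of event descriptions per record.

    Assumption: a record is any direct child of the root element.
    """

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.records = []  # one list of event strings per record

    def startElement(self, name, attrs):
        self.depth += 1
        if self.depth == 2:
            # A new record begins: open a fresh mini-document.
            self.records.append(["start_document"])
        if self.depth >= 2:
            self.records[-1].append(f"start_element: <{name}>")

    def characters(self, content):
        if self.depth >= 2:
            self.records[-1].append(f"characters: {content!r}")

    def endElement(self, name):
        if self.depth >= 2:
            self.records[-1].append(f"end_element: </{name}>")
        if self.depth == 2:
            # The record is complete: close its mini-document.
            self.records[-1].append("end_document")
        self.depth -= 1


def trace_records(xml_text):
    """Return the per-record event lists for xml_text."""
    handler = RecordTrace()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.records


for events in trace_records("<root> a<rec>b</rec> c<rec>d</rec> e</root>"):
    print("\n".join(events))
    print()
```

       The text between records (" a", " c", " e") never appears in any list, which mirrors the bypass line in the topology diagram.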

METHODS

       new
	       my $d = XML::SAX::ByRecord->new( @channels, \%options );

	   Longhand for calling the ByRecord function exported by XML::SAX::Machines.

CREDIT

       Proposed by Matt Sergeant, with advice from Kip Hampton and Robin Berjon.

Writing an aggregator.
       To be written.  Pretty much just that "start_manifold_processing" and "end_manifold_processing" need to be provided.  See
       XML::Filter::Merger and its source code for a starter.

perl v5.10.0							    2009-06-11						   XML::SAX::ByRecord(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Insert page breaks into .csv file

Discussion started by: welsht

2. Programming

Page Breaks

Discussion started by: rama71

3. Shell Programming and Scripting

Help on page breaks

Discussion started by: simhasuri

4. UNIX for Dummies Questions & Answers

how to remove the first line from a flat file ?

Discussion started by: xli