Sponsored Content
Top Forums Shell Programming and Scripting Split long record into csv file Post 302310947 by wvdeijk on Monday 27th of April 2009 11:48:19 AM
Old 04-27-2009
Split long record into csv file

Hi

I receive a mainframe file which has very long records (1100 chars) with no field delimiters. I need to parse each record and output a comma delimited (csv) file. The record layout is fixed. If there weren't so many fields and records I would read the file into Excel, as a "fixed width" file and manually split the record into it's separate components, but that is too time consuming and anyway, there are way too many records.

I was thinking of doing something in awk, like
read line
a=substr(line,1,5)
b=substr(line,6,2)
etc for each of the 226 fields
write a,b,c.......

but I'm sure there is a better way.

Any help will be much appreciated.
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How to delete a record from a csv file

Hi Guys I have downloaded a table from oracle database in .csv format. it has many fields as Title, First Name, Last Name etc. I have to download distinct titles from database and now i have to check all those titles from data of First Name one by one. and then i have to delete matched record.... (1 Reply)
Discussion started by: Rajeev Agrawal
1 Replies

2. UNIX for Dummies Questions & Answers

how to get a file name & record count of csv file

HI , I am new to shell scripting , I have a requirement that I send a file for data quality ( original.csv) & i will be getting 4 files daily into a particular directory in return with cleansed data . the files may be clean.csv, unclean.csv , ... (2 Replies)
Discussion started by: sirik
2 Replies

3. Shell Programming and Scripting

How to split a file record

-Hi, I have a problem with parcing/spliting a file record into two parts and assigning the split parts to two viriables. The record is as follows: ftrn facc ttrd feed xref fsdb fcp ruldb csdb omom fordr ftxn fodb fsdc texc oxox reng ttrn ttxn fqdb ... (5 Replies)
Discussion started by: aoussenko
5 Replies

4. Shell Programming and Scripting

Record count of a csv file

Hello Gurus, We have a requirement to count the valid number of records in a comma delimited file with double quotes. The catch here is..few records have a new line carriage within the double quotes,,say for ex:we have a file called accounts the record count is 4827..but the actual valid count... (5 Replies)
Discussion started by: ajaykk
5 Replies

5. Shell Programming and Scripting

csv file - adding total to a trailer record

Hi, I have a script which creates and modifies a csv file. I have managed to do everything I need to do apart from 1 thing. I need to append a trailer record to the file. I need this line to hold the total of an entire column of the csv file (skipping the 1st line which is a header). Can... (2 Replies)
Discussion started by: mcclunyboy
2 Replies

6. Shell Programming and Scripting

Split a large file in n records and skip a particular record

Hello All, I have a large file, more than 50,000 lines, and I want to split it in even 5000 records. Which I can do using sed '1d;$d;' <filename> | awk 'NR%5000==1{x="F"++i;}{print > x}'Now I need to add one more condition that is not to break the file at 5000th record if the 5000th record... (20 Replies)
Discussion started by: ibmtech
20 Replies

7. Shell Programming and Scripting

Output first unique record in csv file

Hi, I have to output a new csv file from an input csv file with first unique value in the first column. input csv file color product id status green 102 pass yellow 201 hold yellow 202 keep green 101 ok green 103 hold yellow 203 ... (5 Replies)
Discussion started by: Chris LAU
5 Replies

8. Shell Programming and Scripting

Long file record

riends I have the following problem: test.txt I have a file that has the following contents: is a fixed-length file to the end of the number 12 has spaces, so that it is fixed length 123456789 123456789 123456789 12 This code shows me the length of each record, but in... (2 Replies)
Discussion started by: tricampeon81
2 Replies

9. Shell Programming and Scripting

EBCDIC File Split Based On Record Key

I was wondering if anyone could explain to me how to split a variable length EBCDIC file into seperate files based on the record key. I have the COBOL layout, and so I need to split the file into 13 different EBCDIC files so that I can run each one through a C++ converter I have, and get the... (11 Replies)
Discussion started by: hanshot1stx
11 Replies

10. UNIX for Advanced & Expert Users

How to split large file with different record delimiter?

Hi, I have received a file which is 20 GB. We would like to split the file into 4 equal parts and process it to avoid memory issues. If the record delimiter is unix new line, I could use split command either with option l or b. The problem is that the line terminator is |##| How to use... (5 Replies)
Discussion started by: Ravi.K
5 Replies
sortbib(1)						      General Commands Manual							sortbib(1)

Name
       sortbib - sort bibliographic database

Syntax
       sortbib [-sKEYS] database...

Description
       The  command sorts files of records containing refer key-letters by user-specified keys.  Records may be separated by blank lines, or by .[
       and .] delimiters, but the two styles may not be mixed together.  This program reads through each database and pulls out key fields,  which
       are  sorted  separately.   The sorted key fields contain the file pointer, byte offset, and length of corresponding records.  These records
       are delivered using disk seeks and reads, so may not be used in a pipeline to read standard input.

       By default, alphabetizes by the first %A and the %D fields, which contain the senior author and date.  The -s option is used to specify new
       KEYS.  For instance, -sATD will sort by author, title, and date, while -sA+D will sort by all authors, and date.  Sort keys past the fourth
       are not meaningful.  No more than 16 databases may be sorted together at one time.  Records longer than 4096 characters will be truncated.

       The command sorts on the last word on the %A line, which is assumed to be the author's last name.  A word in the final  position,  such	as
       ``jr.''	or  ``ed.'',  will be ignored if the name beforehand ends with a comma.  Authors with two-word last names or unusual constructions
       can be sorted correctly by using the convention ``'' in place of a blank.  A %Q field is considered to be the same as %A, except  sorting
       begins  with  the first, not the last, word.  The command sorts on the last word of the %D line, usually the year.  It also ignores leading
       articles (like ``A'' or ``The'') when sorting by titles in the %T or %J fields; it will ignore articles of any  modern  European  language.
       If a sort-significant field is absent from a record, places that record before other records containing that field.

Options
       -sKEYS
	  Specifies new sort KEYS.  For example, ATD sorts by author, title, and date.

See Also
       addbib(1), indxbib(1), lookbib(1), refer(1), roffbib(1)

																	sortbib(1)
All times are GMT -4. The time now is 04:06 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy