Formatting data in a raw file by using another mapping file


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Formatting data in a raw file by using another mapping file
# 1  
Old 01-28-2013
Formatting data in a raw file by using another mapping file

Hi All,

i have a requirement where i need to format the input RAW file ( which is CSV) by using another mapping file(also CSV file). basically i am getting feed file with dynamic headers by using mapping file (in that target field is mapped with source filed) i have to convert the raw file into a standard format (target format).

currently i am using ETL tool to achieve this but it's taking long time.
i am trying to write generic shell script for this. could anyone help me in a high level view. i am new to scripting. the target file also be a CSV file.

thanks in advance

Ravi
# 2  
Old 01-28-2013
Provide sample input and target files.
# 3  
Old 01-28-2013
I suspect we need some idea of what the one csv file's table does to the other csv file. For instance, an associative array can deal with a two column translate. PERL might be a good choice, as it has CSV handlers. Shell can only deal with "well behaved" CSV, e.g., no quoted commas. PERL can draw on many resources to deal with your problem, can call compiled code and has much of the speed of compiled code.
# 4  
Old 01-29-2013
Hi all,

Thanks for the prompt response. Below is the sample data for both files

mapping file:

Header: AU_Template_Field,RawFile_Field

Data:
VENDOR_NAME,VENDOR NAME
SHIP_TO_PARTY_NAME, PARTY_NAME
SHIP_TO_PARTY_ADDRESS1,PARTY_ADDRESS1
SHIP_TO_PARTY_ADDRESS2,PARTY_ADDRESS2
SHIP_TO_PARTY_CITY,PARTY_CITY
SHIP_TO_PARTY_STATE,PARTY_STATE
SHIP_TO_PARTY_ZIP,PARTY_ZIP

Raw File:

Header: VENDOR NAME,PARTY_NAME,PARTY_ADDRESS1,PARTY_ADDRESS2,PARTY_CITY,PARTY_STATE,PARTY_ZIP

data: abc,xyz,aaa,lny,london,london,abc123

output file:

header: VENDOR_NAME,SHIP_TO_PARTY_NAME,SHIP_TO_PARTY_ADDRESS1,SHIP_TO_PARTY_ADDRESS2,SHIP_TO_PARTY_CITY,SHIP _TO_PARTY_STATE,SHIP_TO_PARTY_ZIP

the out file strcture is standard but raw file structure might vary each time that the reason business is providing mapping file to map the columns for target file.

i hope this explanation would be clear to all.

awaiting for the responses.

Thanks in advance

Ravi
# 5  
Old 01-29-2013
Put the input keys in a simple array, then for each record, put each field in an associative array keyed to the input name, then write the output using the unchanging ordered list of input names for each output field to fetch the fields from the associative array.
# 6  
Old 02-02-2013
Hi,

Thanks for the prompt response. if you don't mind could you please provide an sample code for overview so that i can use to rewrite my script.

i am new to the scripting hence i am requesting to provide a sample snippet.

Thanks in advance.

Ravi
# 7  
Old 02-04-2013
Bash, PERL, awk, c, c++, JAVA, ???
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Replacing 12 columns of one file by second file based on mapping in third file

i have a real data prod file with 80+ fields containing 1k -2k records. i have to extract say 12 columns out of this which are sensitive fields along with one primary key say SEQ_ID (like DOB,account no, name, SEQ_ID, govtid etc) in a lookup file. i have to replace these sensitive fields in... (11 Replies)
Discussion started by: megh12
11 Replies

2. Shell Programming and Scripting

Formatting data to put it in the excel file

Hello, I have a file with the below contents : Policy Name: Backup_bkp Policy Type: Catalog_bkp Active: yes Effective date: 08/07/2013 02:02:12 Mult. Data Streams: no Client Encrypt: no Checkpoint: no Policy Priority: ... (11 Replies)
Discussion started by: rahul2662
11 Replies

3. Shell Programming and Scripting

Data formatting in CSV file to EXCEL

Hello friends I want to convert an csv file on unix (which is generated by a ETL application) to a formatted excel sheet like .I have roughly like 28 columns 1)All numbers need to be stored as numbers with leading zeros-like format as text for this column to preserve leading zeroes e.g... (6 Replies)
Discussion started by: etldev
6 Replies

4. Shell Programming and Scripting

Formatting file data to another file (control character related)

I have to write a program to read data from files and then format into another file. However, I face a strange problem related to control character that I can't understand and solve. The source file is compose of many lines with such format: T_NAME|P_NAME|P_CODE|DOCUMENT_PATH|REG_DATE ... (3 Replies)
Discussion started by: hk6279
3 Replies

5. UNIX for Dummies Questions & Answers

Mapping a data in a file and delete line in source file if data does not exist.

Hi Guys, Please help me with my problem here: I have a source file: 1212 23232 343434 ASAS1 4 3212 23232 343434 ASAS2 4 3234 23232 343434 QWQW1 4 1134 23232 343434 QWQW2 4 3212 23232 343434 QWQW3 4 and a mapping... (4 Replies)
Discussion started by: kokoro
4 Replies

6. Shell Programming and Scripting

AWK/Shell script for formatting data in a file

Hi All, Need an urgent help to convert a unix file in to a particular format: **source file:** 1111111 2d2f2h2 3dfgsd3 ........... 1111111 <-- repeats in every nth line. remaining all lines will be different 123ss41 432ff45 ........... 1111111 <-- repetition qwe1234 123weq3... (1 Reply)
Discussion started by: rajivnairfis
1 Replies

7. Shell Programming and Scripting

Formatting Report and Reading data and fetching the details from contents file

Data I was trying to write shell script which will be return the output in the below format First i was trying to do these using sed. sed -n '/.ksh/p' mainksh.ksh sed -e 's/*\(.*\)/\1/g' mainksh.ksh $RUN_DIR, $SUB_DIR and the variables which will be defined in the profile file. when i am... (0 Replies)
Discussion started by: rameshds
0 Replies

8. UNIX for Dummies Questions & Answers

How to make a CSV file from a Raw data

Hi Please help me on this.i have the Following data i want to make it CSV file by a Unix Shell Script. •msgType : 234 ( m_code : 0 # m_name : type # m_data : LOG ) pls help me on this (4 Replies)
Discussion started by: Aditya.Gurgaon
4 Replies

9. Shell Programming and Scripting

formatting data file with awk or sed

Hi, I have a (quite large) data file which looks like: _____________ header part.. more header part.. x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 x11 x12 x13 ... ... x59 x60 y1 y2 y3 y4... ... y100 ______________ where x1, x2,...,x60 and y1, y2,...y100 are numbers of 10 digits (so each line... (5 Replies)
Discussion started by: lego
5 Replies

10. Shell Programming and Scripting

Awk formatting of a data file - nested for loops?

Hello - is there any way in awk I can do... 4861 x(1) y(1) z(1) 4959 x(1) y(1) z(1) 5007 x(1) y(1) z(1) 4861 x(2) y(2) z(2) 4959 x(2) y(2) z(2) 5007 x(2) y(2) z(2) 4861 x(3) y(3) z(3) 4959 x(3) y(3) z(3) 5007 x(3) y(3) z(3) to become... 4861 x(1) y(1) z(1) 4861 x(2) y(2) z(2)... (3 Replies)
Discussion started by: catwoman
3 Replies
Login or Register to Ask a Question