Merge CSV files and create a column with the filename from the original file

06-23-2011

Registered User

5, 0

Join Date: May 2011

Last Activity: 23 June 2011, 6:54 AM EDT

Posts: 5

Thanks Given: 2

Thanked 0 Times in 0 Posts

Hi Klashxx,

CSVfix was working great, but unfortunately the data contained many special characters and it ended up with a few messed rows. I gave your script a go and it works fine, I only have a couple of issues:

1. For some reason it only opens and output one file (100_LARSSON_KRISTIAN.csv) although there are over 300 in the directory, all with the format number_surname_firstname.csv. I used
'/home/fran/Desktop/ex.pl' '/home/fran/Desktop/chalmers test.csv' /home/fran/Desktop/chalmers/*.csv

2. Path name is displayed after /n, which is great, but many fields contains embedded new lines, which complicate things. Do you know how to output the name only in the last /n? The difference between the last /n and the rest is that is not enclosed by "" (perhaps use CSV_XS? I'm trying to figure out how to use it)

I'm running ubuntu by the way.

Many thanks!

Last edited by fransanchezoria; 06-23-2011 at 07:49 AM..

fransanchezoria

View Public Profile for fransanchezoria

Find all posts by fransanchezoria

TM::Serializable::CSV(3pm) User Contributed Perl Documentation TM::Serializable::CSV(3pm) NAME
TM::Serializable::CSV - Topic Maps, trait for parsing (and later dumping) CSV stream SYNOPSIS
# 1) bare bones my $tm = .....; # get a map from somewhere (can be empty) Class::Trait->apply ($tm, "TM::Serializable::CSV"); use Perl6::Slurp; $tm->deserialize (slurp 'myugly.csv'); # 2) exploiting the timed sync in/out mechanism my $tm = new TM::.... (url => 'file:myugly.csv'); # get a RESOURCEABLE map from somewhere $tm->sync_in; DESCRIPTION
This trait provides parsing and dumping from CSV formatted text streams. INTERFACE
Methods deserialize $tm->deserialize ($text) This method consumes the text string passed in and interprets it as CSV formatted information. What topic map information is generated, depends on the header line (the first line): o If the header line contains a field called "association-type", then all rows will be interpreted as assertions. In that the remaining header fields (in that order) are interpreted as roles (role types). For all rows in the CSV stream, the position where the "association-type" field was is ignored. The other fields (in that order) are affiliated with the corresponding roles. Example: association-type,location,bio-unit is-born,gold-coast,rumsti is-born,vienna,ramsti Scoping cannot be controlled. Also all players and roles (obviously) are directly interpreted as identifiers. Subject identifiers and locators are not (yet) implemented. o If the header line contains a field called "id", then all further rows will be interpreted as topic characteristics, with each topic on one line. The column position where the "id" field in the header is will be interpreted as toplet identifier. All further columns will be interpreted according to the following: o If the header column is named "name", the values will be used as topic names. o Otherwise if the value looks like a URI, an occurrence with that URI value will be be added to the topic. o Otherwise an occurrence with a string value will be added to the topic. Example: name,id,location,homepage "Rumsti",rumsti,gold-coast,http://rumsti.com "Ramsti",ramsti,vienna,http://ramsti.com serialize $tm->serialize [Since TM 1.53] This method serializes a fragment of a topic map into CSV. Which fragment can be controlled with the header line and options (see constructor). "header_line" (only for serialization) This string contains a comma separated list (CSV parseable) of headings. If one of the headings is "association-type", then the generated CSV content will contain associations only. Nothing else is implemented yet. The other headings control which roles (and in which order) should be included in the CSV content. If a particular role type has more than one player, then all players are included. NOTE: As this is inconsistent, this will have to change. "type" (only for serialization) If existing, then this controls which association type is to be taken. "baseuri" (only for serialization) If existing and non-zero, the base URI of the map will remain in the identifiers. Otherwise it will be removed. "specification" If existing (and when selecting only associations), this specification will be interpreted in the sense of "asserts" (see TM). Example: $tm->serialize (header_line => 'association-type,location,bio-unit', type => 'is-born', baseuri => 0); SEE ALSO
TM, TM::Serializable AUTHOR INFORMATION
Copyright 2010 Robert Barta. This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. http://www.perl.com/perl/misc/Artistic.html perl v5.10.1 2012-06-05 TM::Serializable::CSV(3pm)

Shell Programming and Scripting

Merge CSV files and create a column with the filename from the original file

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

I am trying to merge all csv files from source path into 1 file

Discussion started by: cplusplus1

2. UNIX for Dummies Questions & Answers

Merge two csv files using column name

Discussion started by: Nivas

3. Shell Programming and Scripting

Compare 2 files of csv file and match column data and create a new csv file of them

Discussion started by: refrain

4. Shell Programming and Scripting

Merge different files into the original file

Discussion started by: Pratik4891

5. UNIX for Dummies Questions & Answers

How to create a .csv file from 2 different .txt files?

Discussion started by: alisrpp

6. Shell Programming and Scripting

create new column for filename

Discussion started by: danieladna

7. Shell Programming and Scripting

How to create a CSV File by reading fields from separate files

Discussion started by: mayanksargoch

8. Shell Programming and Scripting

Filename from splitting files to have the same filename of the original file with counter value

Discussion started by: natalie23

9. Shell Programming and Scripting

Merging files to create CSV file

Discussion started by: Ravendark

10. Shell Programming and Scripting

merge two two txt files into one file based on one column

Discussion started by: techmoris

LEARN ABOUT DEBIAN

tm::serializable::csv