Merge CSV files and create a column with the filename from the original file Post: 302520900

Sponsored Content

Top Forums Shell Programming and Scripting Merge CSV files and create a column with the filename from the original file Post 302520900 by fransanchezoria on Monday 9th of May 2011 01:52:33 PM

05-09-2011

Registered User

Thank you Klashxx!!

That was a step in the right direction. Actually if I run the command with the example files as I posted in the example it works without any problems. I must have oversimplified my data because when I run it on the real thing the result file is huge, 2,3 GB although the source file are less than 100 Kb altogether. I tried to open with openoffice and in the preview window before importing I can see that there is a column with the file name which is great, then some stuff gets scrambled and repeated.

Here is a sample code of one file:

Code:

"Innography URL",Assignee,"Publication Number","Publication Country","Publication Date",Source,Title,Abstract,"Application Number",Citations,"Est. Expiration Date","Family ID","File Date","First Claim","All Claims",Inventors,"First IP Classification","All IPC Classifications","Kind Code","Priority Date","Normalized Assignee","Number of Claims","Number of Backward References","Number of Forward References","Original Assignee",Strength,"Ultimate Parent","US Classification"
"=Hyperlink(""https://app.innography.com/patents/14530190"",""Innography Link"")","Biora Ab",CA2226570,CA,1997-01-30,"CA Patents","Enamel matrix related polypeptide","The invention relates to novel nucleic acid fragments encoding polypeptides which are capable of mediating contact between enamel and cell surface. The invention also relates to expression vectors containing the nucleic acid fragments according to the invention for production of the protein organisms containing said expression vector methods for producing the polypeptide compositions comprising the polypeptides antibodies or antibody fragments recognizing the polypeptides and methods for treating various hard tissue diseases or disorders.",CA22265," ",2006-12-18,27221209,1996-06-26,,,"Wurtz, Tilmann  | Hammarstr, M Lars | Slaby, Ivan  | Cerny, Radim  | Fong, Cheng Dan",C12N01500900,"C12N 15/09|A61K 31/00|A61K 38/00|A61P 1/00|A61P 1/02|C07K 14/435|C07K 14/47|C07K 14/78|C12N 1/15|C12N 1/19|C12N 1/21|C12N 5/10|C12P 21/02",A1,1995-07-13,"Biora Ab",0,0,0,"BIORA AB","0th-10th Percentile",,435317100

As you can see one column contains an "abstract", the file may have all sort of special characters, could that have been the cause why it didnt work?

Thank you again for your help, really appreciated!

fransanchezoria

View Public Profile for fransanchezoria

Find all posts by fransanchezoria

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

merge two two txt files into one file based on one column

Hi, I have file1.txt and file2.txt and would like to create file3.txt based on one column in UNIX Eg: file1.txt 17328756,0000786623.pdf,0000786623 20115537,0000793892.pdf,0000793892 file2.txt 12521_74_4.zip,0000786623.pdf 12521_15_5.zip,0000793892.pdf Desired Output ...

2. Shell Programming and Scripting

Merging files to create CSV file

Hi, I have different files of the same type, as: Time: 100 snr: 88 perf: 10 other: 222 Each of these files are created periodically. What I need to do is to merge all of them into one but having the following form:

3. Shell Programming and Scripting

Filename from splitting files to have the same filename of the original file with counter value

Hi all, I have a list of xml file. I need to split the files to a different files when see the <ko> tag. The list of filename are B20090908.1100-20090908.1200_CDMA=1,NO=2,SITE=3.xml B20090908.1200-20090908.1300_CDMA=1,NO=2,SITE=3.xml B20090908.1300-20090908.1400_CDMA=1,NO=2,SITE=3.xml ...

4. Shell Programming and Scripting

How to create a CSV File by reading fields from separate files

SHELL SCRIPT Hi, I have 3 separate files within a folder. Every File contains data in a single column like File1 contains data mayank sushant dheeraj File2 contains DSA_AT MG_AT FLAT_09 File3 contains data 123123 232323

5. Shell Programming and Scripting

create new column for filename

Hi, I created a list with 2 columns. Each line is from a different file. I am getting these with a loop in Perl. I would like to add a 3rd column with the name of the file that the line is coming from. I usually use pr to print the filename but this is not working here ... I was wondering if...

6. UNIX for Dummies Questions & Answers

How to create a .csv file from 2 different .txt files?

Hi, I need to create a .csv file from information that i have in two different tab delimited .txt file. I just want to select some of the columns of each .txt file and paste them into a .cvs file. My files look like: File 1 transcript_id Seq. Description Seq. Length ...

7. Shell Programming and Scripting

Merge different files into the original file

Hello Below is my requirement I have 3 files A1.txt , A2.txt and A3.txt . A2 is dynamically generating file I want the merge of A1,A2 and A3 in A2.txt Could you please help?

8. Shell Programming and Scripting

Compare 2 files of csv file and match column data and create a new csv file of them

Hi, I am newbie in shell script. I need your help to solve my problem. Firstly, I have 2 files of csv and i want to compare of the contents then the output will be written in a new csv file. File1: SourceFile,DateTimeOriginal /home/intannf/foto/IMG_0713.JPG,2015:02:17 11:14:07...

9. UNIX for Dummies Questions & Answers

Merge two csv files using column name

Hi all, I have two separate csv files(comma delimited) file 1 and file 2. File 1 contains PAN,NAME,Salary AAAAA5467D,Raj,50000 AAFAC5467D,Ram,60000 BDCFA5677D,Kumar,90000 File 2 contains PAN,NAME,Dept,Salary ASDFG6756T,Karthik,ABC,450000 QWERT8765Y,JAX,CDR,780000...

10. Shell Programming and Scripting

I am trying to merge all csv files from source path into 1 file

I am trying to merge all csv files from source path into one single csv file in target. but getting error message: hadoop fs -cat /user/hive/warehouse/stage.db/PK_CLOUD_CHARGE/TCH-charge_*.csv > /user/hive/warehouse/stage.db/PK_CLOUD_CHARGE/final/TCH_pb_charge.csv getting error message:...

LEARN ABOUT DEBIAN

bib1-attr

BIB-1 ATTRIBUTE SET(7)					   Conventions and miscellaneous				    BIB-1 ATTRIBUTE SET(7)

NAME

       bib1-attr - Bib-1 Attribute Set

DESCRIPTION

       This reference entry lists the Bib-1 attribute set types and values.

TYPES

       The Bib-1 attribute defines six attribute types: Use (1), Relation (2), Position (3), Structure (4), Truncation (5) and completeness (6).

USE (1)
	       1     Personal-name
	       2     Corporate-name
	       3     Conference-name
	       4     Title
	       5     Title-series
	       6     Title-uniform
	       7     ISBN
	       8     ISSN
	       9     LC-card-number
	       10    BNB-card-number
	       11    BGF-number
	       12    Local-number
	       13    Dewey-classification
	       14    UDC-classification
	       15    Bliss-classification
	       16    LC-call-number
	       17    NLM-call-number
	       18    NAL-call-number
	       19    MOS-call-number
	       20    Local-classification
	       21    Subject-heading
	       22    Subject-Rameau
	       23    BDI-index-subject
	       24    INSPEC-subject
	       25    MESH-subject
	       26    PA-subject
	       27    LC-subject-heading
	       28    RVM-subject-heading
	       29    Local-subject-index
	       30    Date
	       31    Date-of-publication
	       32    Date-of-acquisition
	       33    Title-key
	       34    Title-collective
	       35    Title-parallel
	       36    Title-cover
	       37    Title-added-title-page
	       38    Title-caption
	       39    Title-running
	       40    Title-spine
	       41    Title-other-variant
	       42    Title-former
	       43    Title-abbreviated
	       44    Title-expanded
	       45    Subject-precis
	       46    Subject-rswk
	       47    Subject-subdivision
	       48    Number-natl-biblio
	       49    Number-legal-deposit
	       50    Number-govt-pub
	       51    Number-music-publisher
	       52    Number-db
	       53    Number-local-call
	       54    Code-language
	       55    Code-geographic
	       56    Code-institution
	       57    Name-and-title
	       58    Name-geographic
	       59    Place-publication
	       60    CODEN
	       61    Microform-generation
	       62    Abstract
	       63    Note
	       1000  Author-title
	       1001  Record-type
	       1002  Name
	       1003  Author
	       1004  Author-name-personal
	       1005  Author-name-corporate
	       1006  Author-name-conference
	       1007  Identifier-standard
	       1008  Subject-LC-childrens
	       1009  Subject-name-personal
	       1010  Body-of-text
	       1011  Date/time-added-to-db
	       1012  Date/time-last-modified
	       1013  Authority/format-id
	       1014  Concept-text
	       1015  Concept-reference
	       1016  Any
	       1017  Server-choice
	       1018  Publisher
	       1019  Record-source
	       1020  Editor
	       1021  Bib-level
	       1022  Geographic-class
	       1023  Indexed-by
	       1024  Map-scale
	       1025  Music-key
	       1026  Related-periodical
	       1027  Report-number
	       1028  Stock-number
	       1030  Thematic-number
	       1031  Material-type
	       1032  Doc-id
	       1033  Host-item
	       1034  Content-type
	       1035  Anywhere
	       1036  Author-Title-Subject

RELATION (2)
	       1 Less than
	       2 Less than or equal
	       3 Equal
	       4 Greater or equal
	       5 Greater than
	       6 Not equal
	       100 Phonetic
	       101 Stem
	       102 Relevance
	       103 AlwaysMatches

POSITION (3)
	       1 First in field
	       2 First in subfield
	       3 Any position in field

STRUCTURE (4)
	       1 Phrase
	       2 Word
	       3 Key
	       4 Year
	       5 Date (normalized)
	       6 Word list
	       100 Date (un-normalized)
	       101 Name (normalized)
	       102 Name (un-normalized)
	       103 Structure
	       104 Urx
	       105 Free-form-text
	       106 Document-text
	       107 Local-number
	       108 String
	       109 Numeric-string

TRUNCATION (5)
	       1 Right truncation
	       2 Left truncation
	       3 Left and right truncation
	       100 Do not truncate
	       101 Process # in search term  . regular #=.*
	       102 RegExpr-1
	       103 RegExpr-2
	       104 Process # ?n . regular: #=., ?n=.{0,n} or ?=.* Z39.58

       Thw 105-106 truncation attributes below are only supported by Index Data's Zebra server.

	       105 Process * ! regular: *=.*, !=. and right truncate
	       106 Process * ! regular: *=.*, !=.

COMPLETENSS (6)
	       1 Incomplete subfield
	       2 Complete subfield
	       3 Complete field

SORTING (7)
	       1 ascending
	       2 descending

       Type 7 is an Index Data extension to RPN queries that allows embedding a sort critieria into a query.

SEE ALSO

       Bib-1 Attribute Set[1]

       Attibute Set Bib-1 Semantics[2].

NOTES

	1. Bib-1 Attribute Set
	   http://www.loc.gov/z3950/agency/defns/bib1.html

	2. Attibute Set Bib-1 Semantics
	   http://www.loc.gov/z3950/agency/bib1.html

YAZ 4.2.30							    04/16/2012						    BIB-1 ATTRIBUTE SET(7)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

merge two two txt files into one file based on one column

Discussion started by: techmoris

2. Shell Programming and Scripting

Merging files to create CSV file

Discussion started by: Ravendark

3. Shell Programming and Scripting

Filename from splitting files to have the same filename of the original file with counter value

Discussion started by: natalie23

4. Shell Programming and Scripting

How to create a CSV File by reading fields from separate files

Discussion started by: mayanksargoch