Comparing two CSV files Post: 302974643

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Last field problem while comparing two csv files

Hi All, I've two .csv files as below file1.csv abc, tdf, 223, tpx jgsd, tex, 342, rpy a, jdjdsd, 423, djfkld Where as file2.csv is the new version of file1.csv with some added fields in the end of each line and some additional lines. lfj, eru, 98, jkldj, 39, jdkj9 abc, tdf, 223, tpx,...

2. Shell Programming and Scripting

Comparing 2 csv files and matching content

Hello, I have the following problem: There are two csv files csv-file #1: aaa1, aaa2, ... aaan aaa1, bbb2, ... bbbn aaa1, ccc2, ... cccn bbb1, bbb2, ... bbbn ... zzz1, zzz2, ... zzzn csv-file #2: aaa1, matchvalue1 ccc1, matchvalue2

3. Shell Programming and Scripting

Comparing Strings in 2 .csv/txt files?

EDIT: My problems have been solved thanks to the help of bartus11 and pravin27 This code is just to help me learn. It serves no purpose other than that. Here's a sample csv that I'm working with - #listofpeeps.csv Jackie Chan,1954,M Chuck Norris,1930,M Bruce Lee,1940,M This code is...

4. Shell Programming and Scripting

comparing csv files

Hi! I'm just new to shell scripting n simple tasks looks so tough in initial stage. i need to write a script which will read a property file, property file will be containing count of the csv files, and in a folder(same folder) there will be respective csv files. like Property file data1=100...

5. Shell Programming and Scripting

removing duplicate records comparing 2 csv files

Hi All, I want to remove the rows from File1.csv by comparing a column/field in the File2.csv. If both columns matches then I want that row to be deleted from File1 using shell script(awk). Here is an example on what I need. File1.csv: RAJAK,ACTIVE,1 VIJAY,ACTIVE,2 TAHA,ACTIVE,3...

6. Shell Programming and Scripting

Comparing 2 difference csv files

Hello, I have about 10 csv files which range from csv1 - csv10. Each csv file has same type/set of tabs and we have around 5-6 tabs for each of the csv file which have slightly different content(data). A sample of CSV1 is shown below: Joins: Data related to Joins, it can be any number of...

7. Shell Programming and Scripting

Comparing 2 CSV files and sending the difference to a new csv file

(say) I have 2 csv files - file1.csv & file2.csv as mentioned below: file1.csv ID,version,cost 1000,1,30 2000,2,40 3000,3,50 4000,4,60 file2.csv ID,version,cost 1000,1,30 2000,2,45 3000,4,55 6000,5,70 ...

8. Shell Programming and Scripting

Comparing two large unsorted csv files

Hi All, My requirement is to write a shell script to compare two large csv files. I've created sample files for explaining my problem i.e., a.csv and b.csv contents of files: ----------------- a.csv ------ Type,Memory (Kb),Location HD,Size (Mb),Serial # XT,640,D402,0,MG0010...

9. Shell Programming and Scripting

Comparing Select Columns from two CSV files in UNIX and create a third file based on comparision

Hi , I want to compare first 3 columns of File A and File B and create a new file File C which will have all rows from File B and will include rows that are present in File A and not in File B based on First 3 column comparison. Thanks in advance for your help. File A A,B,C,45,46...

10. UNIX for Beginners Questions & Answers

awk assistance - Comparing 2 csv files

Hello all, I have searched high and low for a solution to this, many have come really close but not quite what I'm after. I have 2 files. One contains GUID's, for example: 8121E002-96FE-4C9C-BC5A-6AFF20DACECD 84468F30-F3B7-418B-81F0-0908E80792BF A second file, contains a path to the...

LEARN ABOUT OPENSOLARIS

join

join(1) 							   User Commands							   join(1)

NAME

       join - relational database operator

SYNOPSIS

       join [-a filenumber | -v filenumber] [-1 fieldnumber]
	    [-2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

       join [-a filenumber] [-j fieldnumber] [-j1 fieldnumber]
	    [-j2 fieldnumber] [-o list] [-e string] [-t char] file1 file2

DESCRIPTION

       The join command forms, on the standard output, a join of the two relations specified by the lines of file1 and file2.

       There  is  one  line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally con-
       sists of the common field, then the rest of the line from file1, then the rest of the line from file2. This format can be changed by  using
       the  -o	option	(see  below).  The  -a	option	can be used to add unmatched lines to the output. The -v option can be used to output only
       unmatched lines.

       The default input field separators are blank, tab, or new-line. In this case, multiple separators count as one field separator, and leading
       separators are ignored. The default output field separator is a blank.

       If the input files are not in the appropriate collating sequence, the results are unspecified.

OPTIONS

       Some of the options below use the argument filenumber. This argument should be a 1 or a 2 referring to either file1 or file2, respectively.

       -a filenumber	   In  addition to the normal output, produce a line for each unpairable line in file filenumber, where filenumber is 1 or
			   2. If both -a 1 and -a 2 are specified, all unpairable lines will be output.

       -e string	   Replace empty output fields in the list selected by option -o with the string string.

       -j fieldnumber	   Equivalent to -1fieldnumber -2fieldnumber.

       -j1 fieldnumber	   Equivalent to -1fieldnumber.

       -j2 fieldnumber	   Equivalent to -2fieldnumber. Fields are numbered starting with 1.

       -o list		   Each output line includes the fields specified in list. Fields selected by list that do not appear in the input will be
			   treated as empty output fields. (See the -e option.) Each element of which has the either the form filenumber.fieldnum-
			   ber, or 0, which represents the join field. The common field is not printed unless specifically requested.

       -t char		   Use character char as a separator. Every appearance of char in a line is significant. The character char is used as the
			   field  separator  for  both input and output. With this option specified, the collating term should be the same as sort
			   without the -b option.

       -v filenumber	   Instead of the default output, produce a line only for each unpairable line in filenumber, where filenumber is 1 or	2.
			   If both -v 1 and -v 2 are specified, all unpairable lines will be output.

       -1 fieldnumber	   Join on the fieldnumberth field of file 1. Fields are decimal integers starting with 1.

       -2fieldnumber	   Join on the fieldnumberth field of file 2. Fields are decimal integers starting with 1.

OPERANDS

       The following operands are supported:

       file1

       file2	 A path name of a file to be joined. If either of the file1 or file2 operands is -, the standard input is used in its place.

       file1  and  file2 must be sorted in increasing collating sequence as determined by LC_COLLATE on the fields on which they are to be joined,
       normally the first in each line (see sort(1)).

USAGE

       See largefile(5) for the description of the behavior of join when encountering files greater than or equal to 2 Gbyte (2^31 bytes).

EXAMPLES

       Example 1 Joining the password file and group file

       The following command line will join the password file and the group file, matching on the numeric group ID, and outputting the login name,
       the group name and the login directory. It is assumed that the files have been sorted in ASCII collating sequence on the group ID fields.

	 example% join -j1 4-j2 3 -o 1.1 2.1 1.6 -t:/etc/passwd /etc/group

       Example 2 Using the -o option

       The -o 0 field essentially selects the union of the join fields. For example, given file phone:

	 !Name		 Phone Number
	 Don		 +1 123-456-7890
	 Hal		 +1 234-567-8901
	 Yasushi	 +2 345-678-9012

       and file fax:

	 !Name		 Fax Number

	 Don		 +1 123-456-7899

	 Keith		 +1 456-789-0122

	 Yasushi	 +2 345-678-9011

       where the large expanses of white space are meant to each represent a single tab character), the command:

	 example% join -t"tab" -a 1 -a 2 -e '(unknown)' -o 0,1.2,2.2 phone fax

       would produce

	 !Name		 Phone Number		Fax Number
	 Don		 +1 123-456-7890	 +1 123-456-7899
	 Hal		 +1 234-567-8901	 (unknown
	 Keith		 (unknown)		 +1 456-789-012
	 Yasushi	 +2 345-678-9012	 +2 345-678-9011

ENVIRONMENT VARIABLES

       See  environ(5)	for descriptions of the following environment variables that affect the execution of join: LANG, LC_ALL, LC_CTYPE, LC_MES-
       SAGES, LC_COLLATE, and NLSPATH.

EXIT STATUS

       The following exit values are returned:

       0      All input files were output successfully.

       >0     An error occurred.

ATTRIBUTES

       See attributes(5) for descriptions of the following attributes:

       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWcsu			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Enabled			   |
       +-----------------------------+-----------------------------+
       |Interface Stability	     |Standard			   |
       +-----------------------------+-----------------------------+

SEE ALSO

       awk(1), comm(1), sort(1), uniq(1), attributes(5), environ(5), largefile(5), standards(5)

NOTES

       With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.

       The conventions of the join, sort, comm, uniq, and awk commands are wildly incongruous.

SunOS 5.11							    8 Feb 2000								   join(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Last field problem while comparing two csv files

Discussion started by: ganapati

2. Shell Programming and Scripting

Comparing 2 csv files and matching content

Discussion started by: ghl10000

3. Shell Programming and Scripting

Comparing Strings in 2 .csv/txt files?

Discussion started by: chickeneaterguy

4. Shell Programming and Scripting

comparing csv files

Discussion started by: sukhdip