Comparing two large unsorted csv files Post: 302822423

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Last field problem while comparing two csv files

Hi All, I've two .csv files as below file1.csv abc, tdf, 223, tpx jgsd, tex, 342, rpy a, jdjdsd, 423, djfkld Where as file2.csv is the new version of file1.csv with some added fields in the end of each line and some additional lines. lfj, eru, 98, jkldj, 39, jdkj9 abc, tdf, 223, tpx,...

2. Shell Programming and Scripting

Comparing 2 csv files and matching content

Hello, I have the following problem: There are two csv files csv-file #1: aaa1, aaa2, ... aaan aaa1, bbb2, ... bbbn aaa1, ccc2, ... cccn bbb1, bbb2, ... bbbn ... zzz1, zzz2, ... zzzn csv-file #2: aaa1, matchvalue1 ccc1, matchvalue2

3. Shell Programming and Scripting

Comparing Strings in 2 .csv/txt files?

EDIT: My problems have been solved thanks to the help of bartus11 and pravin27 This code is just to help me learn. It serves no purpose other than that. Here's a sample csv that I'm working with - #listofpeeps.csv Jackie Chan,1954,M Chuck Norris,1930,M Bruce Lee,1940,M This code is...

4. Shell Programming and Scripting

Comparing two unsorted files

Hi Guys, I'm a complete shell scripting newbie and need some help with comparing a file against a master file and outputting the results. master.txt would look something like this: 000123 000345 000341 000927 000762 000235 000155 000452 000846 000623 file.txt would look like...

5. Shell Programming and Scripting

comparing csv files

Hi! I'm just new to shell scripting n simple tasks looks so tough in initial stage. i need to write a script which will read a property file, property file will be containing count of the csv files, and in a folder(same folder) there will be respective csv files. like Property file data1=100...

6. Shell Programming and Scripting

Merging Very large CSV files in Unix

Hi, I have two very large CSV files, which I want to merge (equi-join) based on a key (column). One of the file (say F1) would have ~30 MM records and 700 columns. The other file (~f2) would have same # of records and lesser columns (say 50). I want to create an output file joining on a...

7. Shell Programming and Scripting

Comparing 2 difference csv files

Hello, I have about 10 csv files which range from csv1 - csv10. Each csv file has same type/set of tabs and we have around 5-6 tabs for each of the csv file which have slightly different content(data). A sample of CSV1 is shown below: Joins: Data related to Joins, it can be any number of...

8. Shell Programming and Scripting

Comparing 2 CSV files and sending the difference to a new csv file

(say) I have 2 csv files - file1.csv & file2.csv as mentioned below: file1.csv ID,version,cost 1000,1,30 2000,2,40 3000,3,50 4000,4,60 file2.csv ID,version,cost 1000,1,30 2000,2,45 3000,4,55 6000,5,70 ...

9. Shell Programming and Scripting

Comparing two CSV files

I have two csv files and im trying to compare them. e.g. SAMPLE DATA: file one: ZipCode Name 20878 Washington 10023 Missouri 20304 Maryland file two: ID Name City ZipCode 11654 ...

10. UNIX for Beginners Questions & Answers

awk assistance - Comparing 2 csv files

Hello all, I have searched high and low for a solution to this, many have come really close but not quite what I'm after. I have 2 files. One contains GUID's, for example: 8121E002-96FE-4C9C-BC5A-6AFF20DACECD 84468F30-F3B7-418B-81F0-0908E80792BF A second file, contains a path to the...

LEARN ABOUT DEBIAN

largefile

largefile(5)                                            Standards, Environments, and Macros                                           largefile(5)

NAME

       largefile - large file status of utilities

DESCRIPTION

       A  large file is a regular file whose size is greater than or equal to 2 Gbyte ( 2**31 bytes). A small file is a regular file whose size is
       less than 2 Gbyte.

   Large file aware utilities
       A utility is called large file aware if it can process large files in the same manner as it does small files. A utility that is large  file
       aware is able to handle large files as input and generate as output large files that are being processed. The exception is where additional
       files  are used as system configuration files or support files that can augment the processing. For example, the file utility supports  the
       -m  option  for  an  alternative "magic" file and the -f option for a support file that can contain a list of file names. It is unspecified
       whether a utility that is large file aware will accept configuration or support files that are large files. If a large file  aware  utility
       does  not accept configuration or support files that are large files, it will cause no data loss or corruption upon encountering such files
       and will return an appropriate error.

       The following /usr/bin utilities are large file aware:

       adb           awk           bdiff         cat           chgrp
       chmod         chown         cksum         cmp           compress
       cp            csh           csplit        cut           dd
       dircmp        du            egrep         fgrep         file
       find          ftp           getconf       grep          gzip
       head          join          jsh           ksh           ln
       ls            mdb           mkdir         mkfifo        more
       mv            nawk          page          paste         pathchck
       pg            rcp           remsh         rksh          rm
       rmdir         rsh           sed           sh            sort
       split         sum           tail          tar           tee
       test          touch         tr            uncompress    uudecode
       uuencode      wc            zcat

       The following /usr/xpg4/bin utilities are large file aware:

       awk           cp            chgrp         chown         du
       egrep         fgrep         file          grep          ln
       ls            more          mv            rm            sed
       sh            sort          tail          tr

       The following /usr/xpg6/bin utilities are large file aware:

       getconf       ls            tr

       The following /usr/sbin utilities are large file aware:

       install       mkfile        mknod         mvdir         swap

       See the USAGE section of the swap(1M) manual page for limitations of swap on block devices greater than 2 Gbyte on a 32-bit operating  sys-
       tem.

       The following /usr/ucb utilities are large file aware:

       chown         from          ln            ls            sed
       sum           touch

       The /usr/bin/cpio and /usr/bin/pax utilities are large file aware, but cannot archive a file whose size exceeds 8 Gbyte - 1 byte.

       The /usr/bin/truss utilities has been modified to read a dump file and display information relevant to large files, such as offsets.

   cachefs file systems
       The following /usr/bin utilities are large file aware for cachefs file systems:

       cachefspack      cachefsstat

       The following /usr/sbin utilities are large file aware for cachefs file systems:

       cachefslog       cachefswssize   cfsadmin         fsck
       mount            umount

   nfs file systems
       The following utilities are large file aware for nfs file systems:

       /usr/lib/autofs/automountd    /usr/sbin/mount
       /usr/lib/nfs/rquotad

   ufs file systems
       The following /usr/bin utility is large file aware for ufs file systems:

              df

       The following /usr/lib/nfs utility is large file aware for ufs file systems:

              rquotad

       The following /usr/xpg4/bin utility is large file aware for ufs file systems:

              df

       The following /usr/sbin utilities are large file aware for ufs file systems:

       clri          dcopy         edquota       ff            fsck
       fsdb          fsirand       fstyp         labelit       lockfs
       mkfs          mount         ncheck        newfs         quot
       quota         quotacheck    quotaoff      quotaon       repquota
       tunefs        ufsdump       ufsrestore    umount

   Large file safe utilities
       A  utility  is called large file safe if it causes no data loss or corruption when it encounters a large file. A utility that is large file
       safe is unable to process properly a large file, but returns an appropriate error.

       The following /usr/bin utilities are large file safe:

       audioconvert     audioplay    audiorecord    comm          diff
       diff3            diffmk       ed             lp            mail

       mailcompat       mailstats    mailx          pack          pcat
       red              rmail        sdiff          unpack        vi
       view

       The following /usr/xpg4/bin utilities are large file safe:

       ed            vi            view

       The following /usr/xpg6/bin utility is large file safe:

       ed

       The following /usr/sbin utilities are large file safe:

       lpfilter      lpforms

       The following /usr/ucb utilities are large file safe:

       Mail          lpr

       The following /usr/lib utility is large file safe:

              sendmail

SEE ALSO

       lf64(5), lfcompile(5), lfcompile64(5)

SunOS 5.10                                                          7 Nov 2003                                                        largefile(5)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Last field problem while comparing two csv files

Discussion started by: ganapati

2. Shell Programming and Scripting

Comparing 2 csv files and matching content

Discussion started by: ghl10000

3. Shell Programming and Scripting

Comparing Strings in 2 .csv/txt files?

Discussion started by: chickeneaterguy

4. Shell Programming and Scripting

Comparing two unsorted files

Discussion started by: ven