Finding total distinct count from multiple csv files through UNIX script Post: 303002569

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

finding duplicate files by size and finding pattern matching and its count

Hi, I have a challenging task in which I have to find duplicate files by name and size, then keep any one of them. Then I need to open the file, search for more than one pattern, and count the occurrences of each pattern. Note: These are samples of two files, but I can have more...

2. UNIX for Dummies Questions & Answers

grep running total/ final total across multiple files

Ok, another fun hiccup in my UNIX learning curve. I am trying to count the number of occurrences of an IP address across multiple files named example.hits. I can extract the number of occurrences from the files individually but when you use grep -c with multiple files you get the output similar to...

3. Shell Programming and Scripting

How to use the programming in UNIX to count the total G+C and the GC%?What command li

It seems awk or perl can be used, but I don't know how to write the command line. Thanks for all your advice. For example, if I have a file whose contents are: Sample 1. ATAGCAGAGGGAGTGAAGAGGTGGTGGGAGGGAGCT Sample 2. ACTTTTATTTGAATGTAATATTTGGGACAATTATTC Sample 3....

4. Shell Programming and Scripting

perl script on how to count the total number of lines of all the files under a directory

How can I count the total number of lines of all the files under a directory using a Perl script? I mean, if I have 10 files under a directory, I want the total number of lines that all 10 files contain. Please help me write a Perl script for this.

5. Shell Programming and Scripting

Search and find total count from multiple files

Please advise how we can search for a string, say (abc), in multiple files and get the total number of occurrences of that string. (I need the number of records that exist in a period of time.) The files look like this (read as filename.yyyymmdd): a.20100101 b.20100108 c.20100115 d.20100122 e.20100129...

6. Shell Programming and Scripting

Finding total count of a word.

I want to find the number of occurrences of a word in a file. cat 1.txt unix script unix script unix script unix script unix script unix script unix script unix script unix unix script unix script unix script Now I want to find how many times 'unix' occurred. Please help me, thanks...

7. Shell Programming and Scripting

Script to compare count of two csv files

Hi guys, I need to write a script to compare the record counts of two CSV files, each having 5 columns. Every day a CSV file is received. We need to compare the count of today's CSV file with yesterday's CSV file, and if the total count of records is the same in today's CSV file and yesterday's CSV file out...

8. Shell Programming and Scripting

Shell script for field wise record count for different Files .csv files

Hi, very good wishes to all! Please help by providing a shell script that generates field-wise record counts from a .csv file. My question: Source file: Field1 Field2 Field3 abc 12f sLm 1234 hjd 12d Hyd 34 Chn My target file should generate the .csv file with the...

9. Shell Programming and Scripting

Help with Getting distinct record count from a .dat file using UNIX command

Hi, I have a .dat file with contents like the below: Input file ============SEQ NO-1: COLUMN1========== 9835619 7152815 ============SEQ NO-2: COLUMN2 ========== 7615348 7015548 9373086 ============SEQ NO-3: COLUMN3=========== 9373086 Expected Output: (I just...

10. UNIX for Beginners Questions & Answers

Export Oracle multiple tables to multiple csv files using UNIX shell scripting

Hello all, I want to export multiple tables from Oracle SQL to CSV files using a UNIX shell script, but the code below exports only the first table. Can you please suggest why, or offer a better idea? export FILE="/abc/autom/file/geo_JOB.csv" Export= `sqlplus -s dev01/password@dEV3...

LEARN ABOUT NETBSD

join

JOIN(1) 						    BSD General Commands Manual 						   JOIN(1)

NAME

     join -- relational database operator

SYNOPSIS

     join [-a file_number | -v file_number] [-e string] [-j file_number field] [-o list] [-t char] [-1 field] [-2 field] file1 file2

DESCRIPTION

     The join utility performs an ``equality join'' on the specified files and writes the result to the standard output.  The ``join field'' is
     the field in each file by which the files are compared.  The first field in each line is used by default.  There is one line in the output
     for each pair of lines in file1 and file2 which have identical join fields.  Each output line consists of the join field, the remaining
     fields from file1 and then the remaining fields from file2.
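
As a rough illustration of the default behaviour described above, the sketch below joins two small files on their first field. The file names (names, depts) and their contents are made up for this example, and setting LC_ALL=C is an assumption used only to keep the collating order predictable.

```shell
#!/bin/sh
# Hypothetical sample data; file names and contents are illustrative only.
tmp=$(mktemp -d)
printf '%s\n' '1 alice' '2 bob' '3 carol' > "$tmp/names"
printf '%s\n' '1 sales' '3 ops' '4 legal' > "$tmp/depts"

# Join on the first field (the default). Both inputs are already sorted on it.
# Only keys present in both files (1 and 3) produce an output line.
basic=$(LC_ALL=C join "$tmp/names" "$tmp/depts")
printf '%s\n' "$basic"

rm -rf "$tmp"
```

Each output line carries the join field, then the rest of the line from the first file, then the rest of the line from the second file.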

     The default field separators are tab and space characters.  In this case, multiple tabs and spaces count as a single field separator, and
     leading tabs and spaces are ignored.  The default output field separator is a single space character.
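
The separator rules above can be checked with a small experiment. This is only a sketch: the file names (a, b) and the data are invented, and LC_ALL=C is assumed for a predictable collating order.

```shell
#!/bin/sh
# Hypothetical input with leading blanks and runs of spaces/tabs between fields.
tmp=$(mktemp -d)
printf '   k1   left\n\tk2\t\tleft2\n' > "$tmp/a"
printf 'k1 right\nk2    right2\n' > "$tmp/b"

# With the default separators, leading blanks are ignored, each run of blanks
# counts as one separator, and output fields are separated by a single space.
spaced=$(LC_ALL=C join "$tmp/a" "$tmp/b")
printf '%s\n' "$spaced"

rm -rf "$tmp"
```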

     Many of the options use file and field numbers.  Both file numbers and field numbers are 1 based, i.e. the first file on the command line is
     file number 1 and the first field is field number 1.  The following options are available:

     -a file_number
		 In addition to the default output, produce a line for each unpairable line in file file_number.  (The argument to -a must not be
		 preceded by a space; see the COMPATIBILITY section.)

     -e string	 Replace empty output fields with string.

     -o list	 The -o option specifies the fields that will be output from each file for each line with matching join fields.  Each element of
		 list has the form 'file_number.field', where file_number is a file number and field is a field number.  The elements of list must
		 be either comma (``,'') or whitespace separated.  (The latter requires quoting to protect it from the shell, or, a simpler
		 approach is to use multiple -o options.)

     -t char	 Use character char as a field delimiter for both input and output.  Every occurrence of char in a line is significant.

     -v file_number
		 Do not display the default output, but display a line for each unpairable line in file file_number.  The options -v 1 and -v 2
		 may be specified at the same time.

     -1 field	 Join on the field'th field of file 1.

     -2 field	 Join on the field'th field of file 2.
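
The options listed above can be tried out on comma-separated input. The following is a sketch under stated assumptions: the CSV file names and contents are hypothetical, NONE is an arbitrary filler string, and LC_ALL=C is assumed for a predictable collating order. The -a argument is written without a space, as the text above requires.

```shell
#!/bin/sh
# Hypothetical CSV samples; names and contents are illustrative only.
tmp=$(mktemp -d)
printf '%s\n' '1,alice' '2,bob' '3,carol' > "$tmp/names.csv"
printf '%s\n' '1,sales' '3,ops' '4,legal' > "$tmp/depts.csv"
printf '%s\n' 'alice,1' 'bob,2' 'carol,3' > "$tmp/staff.csv"

# -t, : comma is the delimiter for both input and output.
# -a1 : also print lines of file 1 that have no partner in file 2.
left_outer=$(LC_ALL=C join -t, -a1 "$tmp/names.csv" "$tmp/depts.csv")

# -v 2 : suppress the default output, print only unpairable lines of file 2.
only_in_2=$(LC_ALL=C join -t, -v 2 "$tmp/names.csv" "$tmp/depts.csv")

# -o selects the output fields; -e supplies a string for the empty ones.
filled=$(LC_ALL=C join -t, -a1 -e NONE -o 1.1,1.2,2.2 "$tmp/names.csv" "$tmp/depts.csv")

# -1 2 -2 1 : join field 2 of file 1 against field 1 of file 2.
by_field=$(LC_ALL=C join -t, -1 2 -2 1 "$tmp/staff.csv" "$tmp/depts.csv")

printf '%s\n' "$left_outer" "$only_in_2" "$filled" "$by_field"
rm -rf "$tmp"
```

Using -o together with -e keeps every output line at the same number of fields, which tends to matter when the result is fed to another CSV tool.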

     When the default field delimiter characters are used, the files to be joined should be ordered in the collating sequence of sort(1), using
     the -b option, on the fields on which they are to be joined, otherwise join may not report all field matches.  When the field delimiter
     characters are specified by the -t option, the collating sequence should be the same as sort(1) without the -b option.
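
The ordering requirement above might look like this in practice. It is a minimal sketch: the file names and data are hypothetical, and LC_ALL=C is assumed so that sort and join agree on one collating sequence.

```shell
#!/bin/sh
tmp=$(mktemp -d)

# Case 1: delimiter given with -t, so sort without -b on the join field.
printf '%s\n' '3,carol' '1,alice' '2,bob' > "$tmp/names.csv"
printf '%s\n' '4,legal' '3,ops' '1,sales' > "$tmp/depts.csv"
LC_ALL=C sort -t, -k1,1 "$tmp/names.csv" > "$tmp/names.sorted"
LC_ALL=C sort -t, -k1,1 "$tmp/depts.csv" > "$tmp/depts.sorted"
sorted_join=$(LC_ALL=C join -t, "$tmp/names.sorted" "$tmp/depts.sorted")

# Case 2: default separators, so sort with -b to ignore leading blanks.
printf '  b x\na y\n' > "$tmp/ws1"
printf 'b 2\na 1\n' > "$tmp/ws2"
LC_ALL=C sort -b -k1,1 "$tmp/ws1" > "$tmp/ws1.sorted"
LC_ALL=C sort -b -k1,1 "$tmp/ws2" > "$tmp/ws2.sorted"
blank_join=$(LC_ALL=C join "$tmp/ws1.sorted" "$tmp/ws2.sorted")

printf '%s\n' "$sorted_join" "$blank_join"
rm -rf "$tmp"
```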

     If one of the arguments file1 or file2 is ``-'', the standard input is used.
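
Reading one input from standard input makes it possible to sort on the fly. A small sketch, with an invented file name (depts) and invented data, and LC_ALL=C assumed for a predictable collating order:

```shell
#!/bin/sh
tmp=$(mktemp -d)
printf '%s\n' '1 sales' '3 ops' > "$tmp/depts"

# The unsorted lines are sorted in the pipeline and passed to join as "-".
from_stdin=$(printf '%s\n' '3 carol' '1 alice' \
    | LC_ALL=C sort -b -k1,1 \
    | LC_ALL=C join - "$tmp/depts")
printf '%s\n' "$from_stdin"

rm -rf "$tmp"
```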

     The join utility exits 0 on success, and >0 if an error occurs.

COMPATIBILITY

     For compatibility with historic versions of join, the following options are available:

     -a 	 In addition to the default output, produce a line for each unpairable line in both file 1 and file 2.  (To distinguish between
		 this and -a file_number, join currently requires that the latter not include any white space.)

     -j1 field	 Join on the field'th field of file 1.

     -j2 field	 Join on the field'th field of file 2.

     -j field	 Join on the field'th field of both file 1 and file 2.

     -o list ...
		 Historical implementations of join permitted multiple arguments to the -o option.  These arguments were of the form
		 ``file_number.field_number'' as described for the current -o option.  This has obvious difficulties in the presence of files
		 named ``1.2''.

     These options are available only so that historic shell scripts do not require modification; they should not be used.

SEE ALSO

     awk(1), comm(1), paste(1), sort(1), uniq(1)

STANDARDS

     The join command is expected to be IEEE Std 1003.2 (``POSIX.2'') compatible.

BSD
								  April 28, 1995							       BSD
