02-05-2009
compare file columns
I need help in file comparision.
I have two files in below format:
FILE_A:
-------
COL1 COL2 COL3 COL4 COL5
FILE_B:
-------
COL1A COL1B COL1C COL1D COL1E
i want to compare for a for each row in FILE_A and FILE_B
COL1 of FILE_A with COL1B of FILE_B
COL3 of FILE_A with COL1E of FILE_B
COL5 of FILE_A with COL1C of FILE_B
and in case they are matching, print the column values COL1 COL2 COL3 from FILE_A into FILE_OK,
and if they do not match then print them into FILE_ERR.txt
for example
FILE_A:
-------
A B C D 1
E F G H 2
I J K L 3
FILE_B:
-------
X E 2 M G
Y A 1 O T
for above two files, first row column of FILE_A (COL1, COL3 and COL5) are matching with
2nd row columns COL1B, COL1E and COL1C respectively of FILE_B, but for second row in FILE_A
they are not matching (COL3 not matching with COL1E).
what is the fastest possible way to do it using awk (or otherwise) ?
if i read line by line, it takes a lot of time as each file contains 50k++ records
Thanks in advance
:-)
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have learned file comparison from my previous post here. Then, it is comparing the whole line. Now, i have a new problem.
I have two files with 3 columns separated with a "|". What i want to do is to compare the second and third column of file 1, and the second and third column of file 2. And... (4 Replies)
Discussion started by: kingpeejay
4 Replies
2. Shell Programming and Scripting
Hi,
I have two tab separated files;
file1:
S.No ddi fi cu o/l t+ t-
1 0.5 0.6 o 0.1 0.2
2 0.2 0.3 l 0.3 0.4
3 0.5 0.8 l 0.1 0.6
... (5 Replies)
Discussion started by: vasanth.vadalur
5 Replies
3. UNIX for Dummies Questions & Answers
Hi All,
i have a excel sheet with two columns as below.
column1 column2
100 100
200 300
300 400
400 400
500 600
i need to compare the values these two columns and the output should be printed in the third column...if these values are equal the output should be green and if these... (2 Replies)
Discussion started by: arunmanas
2 Replies
4. Shell Programming and Scripting
Dear everyone,
I need any sort of shell script or perl script would do the following. I have a txt file as follows:
;Stretnumber Resident Resdient (not in file)
16 John Mary
16 Mary Parker
16 Nancy Smith
16 Mary John
18 Trey ... (5 Replies)
Discussion started by: sasharma
5 Replies
5. Shell Programming and Scripting
Hello guys, I am quite new to Shell Scripting and I need help for this
I have a CSV file like this:
Requisition,Order,RequisitionLineNumber,OrderLineNumber
REQ1,Order1,1,1
REQ1,Order1,1,3
REQ2,Order2,1,5
Basically what I want to do is compare the first 3 fields
If all 3 fields are the same... (5 Replies)
Discussion started by: jeffreybsu
5 Replies
6. Shell Programming and Scripting
Hi All,
Need to compare two date columns from the filname FinalDate.txt. My data's are like below
D_OT_START D_EXP_STR Amount
1/3/2012 1/3/2012 5000
6/21/2011 6/25/2011 6000
2/28/2011 2/28/2011 7000
7/16/2010 8/16/2010 8000
7/14/2010 10/26/2010 9000
... (3 Replies)
Discussion started by: suresh_target
3 Replies
7. Shell Programming and Scripting
i have the following files (all separated by tabs):
file 1.txt
1 yes
2 no
3 yes
4 yes
file 2.txt
a no
b no
c yes
d no
i combine the above files in file 3 which looks like
file 3.txt
1 yes a no
2 no b no
3 yes c yes
4 yes d no
now, i need to compare the values between column 2... (3 Replies)
Discussion started by: msonoth
3 Replies
8. Shell Programming and Scripting
Hello Unix gurus,
I have a file with this format (example values):
label1 1 0
label2 1 0
label3 0.4 0.6
label4 0.5 0.5
label5 0.1 0.9
label6 0.9 0.1
in which:
column 1 is a row label
column 2 and 3 are values
I would like to do a simple operation on this table and get the... (8 Replies)
Discussion started by: ksennin
8 Replies
9. UNIX for Beginners Questions & Answers
I Have two files as below,
first file:
Start State |Next State |Session Count |Transition%
LA_product_view |home |694 |28.660%
LA_product_view | searchresults |54 |2.230%
home | 1101260 | 2 | 0.050%
second file:
Start State Next State Session Count Transition%
... (7 Replies)
Discussion started by: Raghuram717
7 Replies
10. UNIX for Beginners Questions & Answers
I'm trying to learn awk, but I've hit a roadblock with this problem. I have a hierarchy stored in a file with 3 columns:
id name parentID
4 D 2
2 B 1
3 C 1
1 A 5
I need to check if there are any values in column 3 that are not represented anywhere in column 1. I've tried this:
awk '{arr;}... (7 Replies)
Discussion started by: kaktus
7 Replies
PSC(1) General Commands Manual PSC(1)
NAME
psc - prepare sc files
SYNOPSIS
psc [-fLkrSPv] [-s cell] [-R n] [-C n] [-n n] [-d c]
DESCRIPTION
Psc is used to prepare data for input to the spreadsheet calculator sc(1). It accepts normal ascii data on standard input. Standard out-
put is a sc file. With no options, psc starts the spreadsheet in cell A0. Strings are right justified. All data on a line is entered on
the same row; new input lines cause the output row number to increment by one. The default delimiters are tab and space. The column for-
mats are set to one larger than the number of columns required to hold the largest value in the column.
OPTIONS
-f Omit column width calculations. This option is for preparing data to be merged with an existing spreadsheet. If the option is not
specified, the column widths calculated for the data read by psc will override those already set in the existing spreadsheet.
-L Left justify strings.
-k Keep all delimiters. This option causes the output cell to change on each new delimiter encountered in the input stream. The
default action is to condense multiple delimiters to one, so that the cell only changes once per input data item.
-r Output the data by row first then column. For input consisting of a single column, this option will result in output of one row
with multiple columns instead of a single column spreadsheet.
-s cell
Start the top left corner of the spreadsheet in cell. For example, -s B33 will arrange the output data so that the spreadsheet
starts in column B, row 33.
-R n Increment by n on each new output row.
-C n Increment by n on each new output column.
-n n Output n rows before advancing to the next column. This option is used when the input is arranged in a single column and the
spreadsheet is to have multiple columns, each of which is to be length n.
-d c Use the single character c as the delimiter between input fields.
-P Plain numbers only. A field is a number only when there is no imbedded [-+eE].
-S All numbers are strings.
-v Print the version of psc
SEE ALSO
sc(1)
AUTHOR
Robert Bond
PSC 7.16 19 September 2002 PSC(1)