Compare 2 csv files by columns, then extract certain columns of matcing rows
Hi all, I'm pretty much a newbie to UNIX. I would appreciate any help with UNIX coding on comparing two large csv files (greater than 10 GB in size), and output a file with matching columns.
I want to compare file1 and file2 by 'id' and 'chain' columns, then extract exact matching rows' remaining columns from file2 and add them to file1's columns, and remove no match rows. Also create new entries in file1 for multiple row matches from file2
For example:
and the desired output would be:
If you leave a code please explain them a little bit
Thanks
Last edited by Scrutinizer; 06-01-2014 at 02:07 PM..
Reason: code tags
Hello guys,
Please help me to solve this problem. I have tried some awk commands but couldn't succeed.
I have a tab delimited file where each record is separated by ------ and 4th column of each record is same.
<INPUT FILE>
------
peon 53931587 53931821 ... (12 Replies)
Hi
I have to compare two .csv files which having 4 columns and i am expecting the output if there is difference in the 3,4columns in two files with respect to the first column. if my statement is not clear please refer the example.
Input:
-----
File 1 :
hostname MAC SWITCH_IP SWITCH_PORT... (7 Replies)
Hi All,
I have a huge (and its really huge!) matrix about 400GB in size (2 million rows by 1.5 million columns) . I am trying to optimize its space by creating a sparse representation of it.
Miniature version of the matrix looks like this (matrix.mtx):
3.4543 65.7876 54.564
2.12344... (4 Replies)
Hi all,
Could someone help me with the following issue:
1st
I have an CSV file delimiter is ";" I I have a column 7 where I need to do some multiple mathem. operation, I need all values in this columns to be multiplied by 1.5 and create a new CSV file with the replaced values.
2nd.
I... (3 Replies)
Hello,
I want to extract several columns and rows from a huge tab delimited file
for example: I want to print from from column 3 to 68 till row number 30.
I have tried using cut command but it was extracting whole 3rd and 68th column.
Please suggest a solution.
Ryan (8 Replies)
Hi,
I have a requirement to convert rows into columns. data looks like:
c1,c2,c3,..
r1,r2,r3,..
p1,p2,p3,..
and so on..
output shud be like this:
c1,r1,p1,..
c2,r2,p2,..
c3,r3,p3,..
Thanks in advance, (12 Replies)
Hi Friends,
I have come across some files where some of the columns don not have data.
Key, Data1,Data2,Data3,Data4,Data5
A,5,6,,10,,
A,3,4,,3,,
B,1,,4,5,,
B,2,,3,4,,
If we see the above data on Data5 column do not have any row got filled. So remove only that column(Here Data5) and... (4 Replies)
Hallo,
I would need to extract only rows which has different value in the second and third column.
Thank you very much for any advices
Input:
A 0 0
B 0 1
C 1 1
D 1 3
Output
B 0 1
D 1 3 (4 Replies)
I have a series of csv files in the following format
eg file1
Experiment Name,XYZ_07/28/15,
Specimen Name,Specimen_001,
Tube Name, Control,
Record Date,7/28/2015 14:50,
$OP,XYZYZ,
GUID,abc,
Population,#Events,%Parent
All Events,10500,
P1,10071,95.9
Early Apoptosis,1113,11.1
Late... (6 Replies)