I need help getting the difference between two .csv files.
I have two large .csv files with an equal number of columns. I need to compare them and write only the differences to a new file.
E.g.
File1.csv
File2.csv
Then the output should look like:
It's just an example. What I am trying to say is that I should get only the values of the columns that differ, not the whole line.
Assume the files are in sorted order.
There can be any number of columns, but both files will always have the same columns. If the values differ, those values should be written to the output.
It would also work to get the difference as a comma-separated file, with a field left blank wherever the values match between the two files, like:
,16-12-2011,,
I hope I have explained the issue clearly.
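One possible way to produce the blank-where-equal output described above is sketched below in Python. It rests on several assumptions that the post does not confirm: the two files line up row by row (same sort order, same number of rows), the value written for a differing field is the one from the second file, and rows with no differences are skipped entirely. The function names `diff_fields` and `diff_csv` and the sample data are made up for illustration.

```python
import csv
import io
from itertools import zip_longest


def diff_fields(row1, row2):
    """Return row2's value where the fields differ, '' where they match."""
    return [b if a != b else "" for a, b in zip_longest(row1, row2, fillvalue="")]


def diff_csv(in1, in2, out):
    """Stream two row-aligned CSV files and write only the differing fields."""
    writer = csv.writer(out, lineterminator="\n")
    # zip() reads one row from each file at a time, so large files are not
    # loaded into memory. Assumption: both files have the same number of rows.
    for row1, row2 in zip(csv.reader(in1), csv.reader(in2)):
        if row1 != row2:  # assumption: identical rows are left out of the output
            writer.writerow(diff_fields(row1, row2))


if __name__ == "__main__":
    # Made-up sample data, not the files from the post.
    file1 = io.StringIO("1,15-12-2011,abc,10\n2,17-12-2011,def,20\n")
    file2 = io.StringIO("1,16-12-2011,abc,10\n2,17-12-2011,def,20\n")
    out = io.StringIO()
    diff_csv(file1, file2, out)
    print(out.getvalue(), end="")  # prints ",16-12-2011,,"
```

With real files, the three arguments would be file objects opened with `newline=""`, for example `open("File1.csv", newline="")`.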
Last edited by Franklin52; 10-08-2012 at 04:07 AM..
Reason: Please use code tags for data and code samples
Hi,
As per my requirement, I need to take the difference between two big files (around 6.5 GB) and write it to an output file without any line numbers or '<' or '>' in front of each new line.
As the diff command won't work for big files, I tried to use bdiff instead.
I am getting incorrect... (13 Replies)
IBM RS6000 F50
AIX 4.3.2
I am having trouble calculating the actual size of a set of directories and reconciling the results with the actual hard disk space used.
I have a 33GB disk which shows 7.8GB used, but a byte count of the files in the directories/sub-directories I'm interested in is 48GB,... (4 Replies)
On my Linux system there seems to be a massive difference between the value of _POSIX_OPEN_MAX and what sysconf(_SC_OPEN_MAX) returns and also what I'd expect from the table of examples of configuration limits from Advanced Programming In The UNIX Environment, 2nd Ed.
_POSIX_OPEN_MAX: 16... (5 Replies)
Hi,
I am new to awk and trying to count the difference between the first columns of two CSV files.
--------
Sample input (header is:name, id1,id2):
file1.csv
name, id1,id2
sss,34,56
yyy,3,56
www,56,78
pppp,43,12
file2.csv
name,id1,id2
sss,32,56
yyy,12,7
ttt,4,8
uuu,7,9 (0 Replies)
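The excerpt above is cut off, so what "count the difference" means is an assumption here: the sketch below counts the first-column names that appear in one file but not in the other. It uses Python rather than the awk the poster asked about, and the helper name `first_column` is made up for illustration.

```python
import csv
import io


def first_column(f):
    """Collect the first-column values of a CSV file, skipping the header row."""
    reader = csv.reader(f, skipinitialspace=True)
    next(reader, None)  # skip the header line
    return {row[0] for row in reader if row}


# Sample input copied from the post above.
file1 = io.StringIO("name, id1,id2\nsss,34,56\nyyy,3,56\nwww,56,78\npppp,43,12\n")
file2 = io.StringIO("name,id1,id2\nsss,32,56\nyyy,12,7\nttt,4,8\nuuu,7,9\n")

names1 = first_column(file1)
names2 = first_column(file2)

only_in_1 = sorted(names1 - names2)
only_in_2 = sorted(names2 - names1)

print(len(only_in_1), only_in_1)  # 2 ['pppp', 'www']
print(len(only_in_2), only_in_2)  # 2 ['ttt', 'uuu']
```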
I have two CSV files with 90K records each, and each row has around 50 columns. Let's say the file names are FILE1 and FILE2. I have to compare both files and generate a new file that contains the rows from FILE2 that differ.
FILE1
-----
2001,"John",25,19901130,21211.41,Unix Forum... (3 Replies)
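For files of this size (90K rows of about 50 columns), one option is to hold every FILE1 row in memory and keep the FILE2 rows that are not among them. The sketch below assumes that "differs" means the exact row does not occur anywhere in FILE1. The excerpt above is truncated, so the sample data is made up, and `changed_rows` is a hypothetical name.

```python
import csv
import io


def changed_rows(f1, f2):
    """Return the rows of f2 that do not appear anywhere in f1."""
    # Rows are stored as tuples so they can be placed in a set.
    seen = {tuple(row) for row in csv.reader(f1)}
    return [row for row in csv.reader(f2) if tuple(row) not in seen]


# Made-up sample data, not the rows from the post.
file1 = io.StringIO('2001,"John",25\n2002,"Mary",31\n')
file2 = io.StringIO('2001,"John",25\n2002,"Mary",32\n2003,"Alex",40\n')

for row in changed_rows(file1, file2):
    print(row)
# ['2002', 'Mary', '32']
# ['2003', 'Alex', '40']
```

Because the `csv` module parses quoted fields, `"John"` and `John` compare as equal. Comparing raw lines instead would treat them as different.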
Hello,
I have about 10 csv files which range from csv1 - csv10.
Each csv file has same type/set of tabs and we have around 5-6 tabs for each of the csv file which have slightly different content(data).
A sample of CSV1 is shown below:
Joins: Data related to Joins, it can be any number of... (2 Replies)
I have two CSV files that I need to compare, and the output file should show the differences at the field level.
For Example,
File 1:
A,B,C,D,E,F
1,2,3,4,5,6
File 2:
A,C,B,D,E,F
1,2,4,5,5,6
Output file: (12 Replies)
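The expected output is missing from the excerpt above, so the following is only a sketch of one reading. The headers of the two files are in a different order (B and C are swapped), so it assumes fields should be matched by column name rather than by position, and that rows line up one to one. The function name `field_differences` and the tuple layout are made up for illustration.

```python
import csv
import io


def field_differences(f1, f2):
    """Yield (row_number, column, value1, value2) for each differing field.

    Columns are matched by header name, not by position.
    """
    rows = zip(csv.DictReader(f1), csv.DictReader(f2))
    for number, (row1, row2) in enumerate(rows, start=1):
        for column, value1 in row1.items():
            value2 = row2.get(column)  # None if the column is absent in file 2
            if value1 != value2:
                yield number, column, value1, value2


# Sample input copied from the post above.
file1 = io.StringIO("A,B,C,D,E,F\n1,2,3,4,5,6\n")
file2 = io.StringIO("A,C,B,D,E,F\n1,2,4,5,5,6\n")

diffs = list(field_differences(file1, file2))
for diff in diffs:
    print(diff)
# (1, 'B', '2', '4')
# (1, 'C', '3', '2')
# (1, 'D', '4', '5')
```

If the comparison should be positional instead, `csv.reader` with a plain field-by-field `zip` would report differences in the second, third and fourth fields.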
(say) I have 2 csv files - file1.csv & file2.csv as mentioned below:
file1.csv
ID,version,cost
1000,1,30
2000,2,40
3000,3,50
4000,4,60
file2.csv
ID,version,cost
1000,1,30
2000,2,45
3000,4,55
6000,5,70
The... (7 Replies)
Discussion started by: Naresh101
processcsv.py
PROCESSCSV.PY(1) Virtualization Support PROCESSCSV.PY(1)
NAME
processcsv.py - process virt-top CSV files
SUMMARY
virt-top --csv data.csv
processcsv.py < data.csv
DESCRIPTION
virt-top is a top(1)-like utility for showing stats of virtualized domains.
processcsv.py is a simple Python script that post-processes the output of "virt-top --csv".
It is used like this:
virt-top --csv data.csv
processcsv.py < data.csv
The second command will overwrite the following files in the current directory:
"global.csv"
This contains the global (host) statistics columns from the CSV file.
"domainNN.csv" (multiple files)
For each libvirt domain ID NN, a file is created containing the per-domain statistics from the CSV file.
SEE ALSO
virt-top(1)
AUTHORS
Richard W.M. Jones <rjones @ redhat . com>
COPYRIGHT
(C) Copyright 2007-2012 Red Hat Inc., Richard W.M. Jones http://libvirt.org/
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation,
Inc., 675 Mass Ave, Cambridge, MA 02139, USA.
virt-top-1.0.8 2013-12-29 PROCESSCSV.PY(1)