Since you're using a Solaris/SunOS system, you'll need to use /usr/xpg4/bin/awk or nawk instead of awk. I don't see the need for the three arrays that rdrtx1 used... I think the following should run a tiny bit faster and use less memory while it is running:
This User Gave Thanks to Don Cragun For This Post:
Hi,
I need to compare two flat files (yesterday & today's data) and get only the changed data from flat files. In flat file i dont have data column or anything its just a string data in flat file.Can any one please let me know the script
With Regds
Shashi (3 Replies)
I'm trying to compare the first column values in two different files that use a numerical value as the key and output the more meaningful value found in the second column of file1 in front of the matching line(s) in file2. My problem is that file2 has multiple records. For example given:
FILE1... (4 Replies)
I have searched about 30 threads, a load of Google pages and cannot find what I am looking for. I have some of the parts but not the whole. I cannot seem to get the puzzle fit together.
I have three folders, two of which contain different versions of multiple files, dist/file1.php dist/file2.php... (4 Replies)
I am not an expert in awk, SED, etc... but I really hope there is a way to do this, because I don't want to have to right a program. I am using C shell.
FILE 1 FILE 2
H0000000 H0000000
MA1 MA1
CA1DDDDDD CA1AAAAAA
MA2 ... (2 Replies)
I did some searches, but couldn't really find what I'm looking for. I have a file formatted as below:
BOF ABC CO - XYZ COMM DATA OF 07/05/2011
EBA00000001 sdfa rtyus uyml
EBB00000001 54682 984w3
EBA00000002 mkiyuasdf 98234
I want to pull the date from the header record and add it... (4 Replies)
I have 2 zip files which have about 20 million records in each file. file 2 will have additional records than file 1. I want to compare the records in both the files and capture the new records from file 2 into another file file3. Please help me with a command/script which provides me the desired... (8 Replies)
Hi ,
My requirement is to Compare 2 files having different number of columns and records and get the ouptut containing all the non-matching records from File A(with all column values ) .Example data below :
File A contains following :
Aishvarya |1234... (4 Replies)
Good morning all,
I have a problem that is one step beyond a standard awk compare.
I would like to compare three files which have several thousand records against a fourth file. All of them have a value in each row that is identical, and one value in each of those rows which may be duplicated... (1 Reply)
hi.. I want to compare records present in 1 file with those in 3 other files and print those records of file 1 which are not present in any of the files. for eg -
file1 file2 file3 file4
1 1 5 7
2 2 6 9
3
4
5
6
7
8
9
... (3 Replies)
Discussion started by: Abhiraj Singh
3 Replies
LEARN ABOUT DEBIAN
vcf-compare
VCF-COMPARE(1) User Commands VCF-COMPARE(1)NAME
vcf-compare - compare bgzipped and tabix indexed VCF files
SYNOPSIS
compare-vcf [OPTIONS] file1.vcf file2.vcf ...
DESCRIPTION
About: Compare bgzipped and tabix indexed VCF files. (E.g. bgzip file.vcf; tabix -p vcf file.vcf.gz)
OPTIONS -c, --chromosomes <list|file>
Same as -r, left for backward compatibility. Please do not use as it will be dropped in the future.
-d, --debug
Debugging information. Giving the option multiple times increases verbosity
-H, --cmp-haplotypes
Compare haplotypes, not only positions
-m, --name-mapping <list|file>
Use with -H when comparing files with differing column names. The argument to this options is a comma-separated list or one mapping
per line in a file. The names are colon separated and must appear in the same order as the files on the command line.
-R, --refseq <file>
Compare the actual sequence, not just positions. Use with -w to compare indels.
-r, --regions <list|file>
Process the given regions (comma-separated list or one region per line in a file).
-s, --samples <list>
Process only the listed samples. Excluding unwanted samples may increase performance considerably.
-w, --win <int>
In repetitive sequences, the same indel can be called at different positions. Consider records this far apart as matching (be it a
SNP or an indel).
-h, -?, --help
This help message.
vcf-compare 0.1.5 July 2011 VCF-COMPARE(1)