Hi all,
I have difficulty to solve the followign problem.
mydata:
StartPoint EndPoint
22 55
2222 2230
33 66
44 58
222 240
11 25
22 60
33 45
The union of above... (2 Replies)
Dear Gurus,
I have 57 tab-delimited different text files, each one containing entries in 3 columns. The first column in each file contains names of objects. Some names are present in more than one file. I would like to find those names and store them in a separate text file, preferably with a... (6 Replies)
Hi, I have a file1 of many long sequences, each preceded by a unique header line. file2 is 3-columns list: headers name, start position, end position. I'd like to extract the sequence region of file1 specified in file2.
Based on a post elsewhere, I found the code:
awk... (2 Replies)
Discussion started by: pathunkathunk
2 Replies
4. Forum Support Area for Unregistered Users & Account Problems
The forums have been seeing a sharp increase in spam bots, forum robots, and malicious registrations from certain countries. If you have been directed to this thread due to a "No Permission Error" when trying to register please post in this thread and request permission to register, including... (1 Reply)
Hi I have 2 files; usually the end position in the file1 is the start position in the file2 and the end position in file2 will be the start position in file1 (flanks)
file1
Id start end
aaa1 0 3000070
aaa1 3095270 3095341
aaa1 3100822 3100894
aaa1 ... (1 Reply)
Hello, here I am posting my query again with modified data input files.
see my query is :
i have two input files file1 and file2.
file1 is smalldata.fasta
>gi|546671471|gb|AWWX01449637.1| Bubalus bubalis breed Mediterranean WGS:AWWX01:contig449636, whole genome shotgun sequence... (20 Replies)
Hi,
I have been trying to retrieve the names of files present in a directory one by one but the names of files are getting overlapped on one another.
I tried the below command.
ls -1 > filename
please help me in getting the file names line by line without overlapping. I am using korn... (6 Replies)
Hi all,
I have a file like this I want to extract only those regions which are big and continous
chr1 3280000 3440000
chr1 3440000 3920000
chr1 3600000 3920000 # region coming within the 3440000 3920000. so i don't want it to be printed in output
chr1 3920000 4800000
chr1 ... (2 Replies)
Discussion started by: amrutha_sastry
2 Replies
LEARN ABOUT DEBIAN
vcf-compare
VCF-COMPARE(1) User Commands VCF-COMPARE(1)NAME
vcf-compare - compare bgzipped and tabix indexed VCF files
SYNOPSIS
compare-vcf [OPTIONS] file1.vcf file2.vcf ...
DESCRIPTION
About: Compare bgzipped and tabix indexed VCF files. (E.g. bgzip file.vcf; tabix -p vcf file.vcf.gz)
OPTIONS -c, --chromosomes <list|file>
Same as -r, left for backward compatibility. Please do not use as it will be dropped in the future.
-d, --debug
Debugging information. Giving the option multiple times increases verbosity
-H, --cmp-haplotypes
Compare haplotypes, not only positions
-m, --name-mapping <list|file>
Use with -H when comparing files with differing column names. The argument to this options is a comma-separated list or one mapping
per line in a file. The names are colon separated and must appear in the same order as the files on the command line.
-R, --refseq <file>
Compare the actual sequence, not just positions. Use with -w to compare indels.
-r, --regions <list|file>
Process the given regions (comma-separated list or one region per line in a file).
-s, --samples <list>
Process only the listed samples. Excluding unwanted samples may increase performance considerably.
-w, --win <int>
In repetitive sequences, the same indel can be called at different positions. Consider records this far apart as matching (be it a
SNP or an indel).
-h, -?, --help
This help message.
vcf-compare 0.1.5 July 2011 VCF-COMPARE(1)