OK; and what be your results when applying either of the above proposals?
With the second set of test data:
Scrutinizers script incorrectly identifies the second line of file 2 as a match:
Your script correctly identifies all records
If you have the time could you please help me understand the code you've very kindly provided? Hopefully then I can write my own for similar tasks in the future.
Cheers
---------- Post updated at 11:31 AM ---------- Previous update was at 10:59 AM ----------
Quote:
Originally Posted by Scrutinizer
Hi, see if this works:
Hi - With an expanded dataset this script incorrectly matched the second line in file 2.
Last edited by Scrutinizer; 03-25-2018 at 11:18 PM..
Reason: Code Tags
I have a large text-file with tab-delimited genetic data that looks like:
KSC112 KSC234 0 0 1 1 A G C T
I simply wan to delete the first column, but since the file has 600 000 columns, it is not possible with awk (seems to be limited at 32k columns).
Does anyone have an idea how to do this? (2 Replies)
I want to add a new column to a tab delimited text file. It will be the first column and it will just be 1's. How do I go about doing that? Thanks! (1 Reply)
Hi all,
I'm new to Unix and work primarily in bioinformatics. I am in need of a script which will allow me to replace "1" with "chr1" in only the first column of a file which looks like such:
1 10327 rs112750067 T C . PASS ASP;RSPOS=10327;... (4 Replies)
I have a file which looks like this:
73450 articles and news developmental psychology 2006-03-30 16:22:40 1 http://www.usnews.com
73450 articles and news developmental psychology 2006-03-30 16:22:40 2 http://www.apa.org
73450 articles and news developmental psychology 2006-03-30... (1 Reply)
I have a file having the following entries:
test1 test2 test3
11 22 33
22 44 66
99 99 44
---
I want to add a column so that the above file becomes:
test1 test2 test3 notest
11 22 33 *
22 44 66 *
99 99 44 *
---
Thanks (6 Replies)
Hi,
Can anyone please tell me about how we can delete an entire column from a tab delimited file?
Mu input_file.txt looks like this:
And I want the output as:
I used the below code
nawk -v d="1" 'BEGIN{FS=OFS="\t"}{$d=""}{print}' input_file.txtBut in the output, the first column is... (5 Replies)
I have tried the following to no avail.
xargs -n8 < test.txt
awk '{if(NR%6!=0){p=""}else{p="\n"};printf $0" "p}' Mod_Alm_log.txt > test.txt
I have tried different variations of the above, the problem is mixes lines together.
And it includes the tags "%a and %A" I need them to be all tab... (16 Replies)
Hello Everyone..
I want to replace the retail col from FileI with cstp1 col from FileP if the strpno matches in both files
FileP.txt
... (2 Replies)
Discussion started by: YogeshG
2 Replies
LEARN ABOUT DEBIAN
vcf-isec
VCF-ISEC(1) User Commands VCF-ISEC(1)NAME
vcf-isec - create intersections, unions, complements on bgzipped and tabix indexed VCF or tab-delimited files
SYNOPSIS
vcf-isec [OPTIONS] file1.vcf file2.vcf ...
DESCRIPTION
About: Create intersections, unions, complements on bgzipped and tabix indexed VCF or tab-delimited files.
Note that lines from all files can be intermixed together on the output, which can yield unexpected results.
OPTIONS -C, --chromosomes <list|file>
Process the given chromosomes (comma-separated list or one chromosome per line in a file).
-c, --complement
Output positions present in the first file but missing from the other files.
-d, --debug
Debugging information
-f, --force
Continue even if the script complains about differing columns.
-o, --one-file-only
Print only entries from the left-most file. Without -o, all unique positions will be printed.
-n, --nfiles [+-=]<int>
Output positions present in this many (=), this many or more (+), or this many or fewer (-) files.
-p, --prefix <path>
If present, multiple files will be created with all possible isec combinations. (Suitable for Venn Diagram analysis.)
-t, --tab <chr:pos:file>
Tab-delimited file with indexes of chromosome and position columns. (1-based indexes)
-w, --win <int>
In repetitive sequences, the same indel can be called at different positions. Consider records this far apart as matching (be it a
SNP or an indel).
-h, -?, --help
This help message.
EXAMPLES
bgzip file.vcf; tabix -p vcf file.vcf.gz bgzip file.tab; tabix -s 1 -b 2 -e 2 file.tab.gz
vcf-isec 0.1.5 July 2011 VCF-ISEC(1)