03-08-2013
Match ids and print original file
Hello,
I have two files
Original: ( 5000 entries)
Chr Position
chr1 879108
chr1 881918
chr1 896874 ...
and a file with allele freq ( 2000 entries)
Chr Position MAF
chr1 881918 0.007
chr1 979748 0.007
chr1 1120377 0.007
chr1 1178925 0.036
I would like the original file matched with the allele freq and print out the output file with 5000 entries.
Chr Position MAF
chr1 879108 NULL
chr1 881918 0.007
chr1 896874 NULL
...
Any help is appreciated. Thank you.
Last edited by nans; 03-08-2013 at 04:59 AM..
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi, i was looking for unix command(s) for :
find the first occurrence of a given pattern with in a file and print the remaining part.
below is an example of what i am looking for:
lets say, a file named myfile.txt
now, the command i am looking for will do the following (4 Replies)
Discussion started by: nurulamin862
4 Replies
2. Shell Programming and Scripting
Hi,
I have two files. 1st file has 1 column (huge file containing ~19200000 lines) and 2nd file has 2 columns (small file containing ~6000 lines).
#################################
huge_file.txt
a
a
ab
b
##################################
small_file.txt
a 1.5
b 2.5
ab ... (4 Replies)
Discussion started by: AshwaniSharma09
4 Replies
3. Shell Programming and Scripting
Hello all, please help. There are two file like this:
file1:
1197510.0 294777.7 9666973.0 21.6 1839.8
1197510.0 294777.7 9666973.0 413.2 2075.9
1197510.0 294777.7 9666973.0 689.3 2260.0
... (1 Reply)
Discussion started by: attila
1 Replies
4. UNIX for Dummies Questions & Answers
Hello,
I am trying to modify 2 files, to yield results in a 3rd file.
File-1 is a 8-columned file, separted with tab.
1234:1 xyz1234 blah blah blah blah blah blah
1234:1 xyz1233 blah blah blah blah blah blah
1234:1 abc1234 blah blah blah blah blah blah
n/a RRR0000 blah blah blah... (1 Reply)
Discussion started by: ad23
1 Replies
5. Shell Programming and Scripting
I have a file with very specific column spacing formatting,
I wish to do the following:
awk '{print $1, $2, $3, $4, $5, $6, $19-$7, $20-$8, $21-$9, $10, $11, $12}' merge.pdb > vector.pdb
but the format gets ruined.
I have tried with print -f but to no avail.... (7 Replies)
Discussion started by: chrisjorg
7 Replies
6. Shell Programming and Scripting
Hi All,
I have to match each row in file 1 with 1st row in file 2 and print the corresponding column from file2. I am trying to use an awk script to do this. For example
cat File1
X1
X3
X4
cat File2
ID X1 X2 X3 X4
A 1 6 2 1
B 2 7 3 3
C 3 8 4 1
D 4 9 1 1 (3 Replies)
Discussion started by: newpro
3 Replies
7. Shell Programming and Scripting
Hello,
I have two files
File 1 with 10 columns
rsid position ........
xx 1:10000
File 2
position
1:10000
2:2000
....
I need to extract the IDs given in file 2(column1) from file 1 (column2) and print all columns from file1. I am trying this command (1 Reply)
Discussion started by: nans
1 Replies
8. UNIX for Beginners Questions & Answers
Hello, I have two tab files with headers
File1: with 4 columns
header1 header2 header3 header4
44 a bb 1
57 c ab 4
64 d d 5
File2: with 26 columns
header1.. header5 header6 header7 ... header 22...header26
id1 44 a bb
id2 57 ... (6 Replies)
Discussion started by: nans
6 Replies
9. UNIX for Beginners Questions & Answers
I have two text files. File 1 has 150 ids but all the ids exists in duplicates so it has 300 ids in total. File 2 has 1500 ids but all exists in duplicates so file 2 has 300 ids in total. i want to match the first occurance of every id in file 1 with first occurance of thet id in file 2 and 2nd... (2 Replies)
Discussion started by: limd
2 Replies
10. Shell Programming and Scripting
In the awk below I am trying to output those lines that Match between file1 and file2, those Missing in file1, and those missing in file2. Using each $1,$2,$4,$5 value as a key to match on, that is if those 4 fields are found in both files the match, but if those 4 fields are not found then missing... (0 Replies)
Discussion started by: cmccabe
0 Replies
LEARN ABOUT DEBIAN
bio::graphics::glyph::allele_tower
Bio::Graphics::Glyph::allele_tower(3pm) User Contributed Perl Documentation Bio::Graphics::Glyph::allele_tower(3pm)
NAME
Bio::Graphics::Glyph::allele_tower - The "allele_tower" glyph
SYNOPSIS
See <Bio::Graphics::Panel> and <Bio::Graphics::Glyph>.
DESCRIPTION
This glyph draws a letter for each allele found at a SNP position, one above the other (i.e. in a column). For example:
A
G
See also http://www.hapmap.org/cgi-perl/gbrowse/gbrowse 'genotyped SNPs' for an example.
The common options are available (except height which is calculated based on the number of alleles). In addition, if you give the glyph
the minor allele frequency (MAF) and indicate which is the minor allele, the glyph will display these differences.
GETTING THE ALLELES
To specify the alleles, create an "Alleles" attribute for the feature. There should be two such attributes. For example, for a T/G
polymorphism, the GFF load file should look like:
Chr3 . SNP 12345 12345 . . . SNP ABC123; Alleles T ; Alleles G
Alternatively, you can pass an "alleles" callback to the appropriate section of the config file. This option should return the two alleles
separated by a slash:
alleles = sub {
my $snp = shift;
my @d = $snp->get_tag_values('AllelePair');
return join "/",@d;
}
OPTIONS
. Glyph Colour
. Different colour for alleles on the reverse strand
. Print out the complement for alleles on the reverse strand
. Major allele shown in bold
. Horizontal histogram to show allele frequency
GLYPH COLOR
The glyph color can be configured to be different if the feature is on the plus or minus strand. Use fgcolor to define the glyph color for
the plus strand and bgcolor for the minus strand. For example:
fgcolor = blue
bgcolor = red
For this option to work, you must also set ref_strand to return the strand of the feature:
ref_strand = sub {shift->strand}
REVERSE STRAND ALLELES
If the alleles on the negative strand need to be the complement of what is listed in the GFF files, (e.g. A/G becomes T/C), set the
complement option to have value 1
complement = 1
For this option to work, you must also set ref_strand to return the strand of the feature:
ref_strand = sub {shift->strand}
MAJOR/MINOR ALLELE
Use the 'minor_allele' option to return the minor allele for the SNP. If you use this option, the major allele will appear in bold type.
ALLELE FREQUENCY HISTOGRAMS
Use the 'maf' option to return the minor allele frequency for the SNP. If you use this option, a horizontal histogram will be drawn next
to the alleles, to indicate their relative frequencies. e.g.
A______
C__
Note: The 'label' option must be set to 1 (i.e. on) and the 'minor_allele' option must return a valid allele for this to work.
BUGS
Please report them.
SEE ALSO
Bio::Graphics::Panel, Bio::Graphics::Glyph, Bio::Graphics::Glyph::arrow, Bio::Graphics::Glyph::cds, Bio::Graphics::Glyph::crossbox,
Bio::Graphics::Glyph::diamond, Bio::Graphics::Glyph::dna, Bio::Graphics::Glyph::dot, Bio::Graphics::Glyph::ellipse,
Bio::Graphics::Glyph::extending_arrow, Bio::Graphics::Glyph::generic, Bio::Graphics::Glyph::graded_segments,
Bio::Graphics::Glyph::heterogeneous_segments, Bio::Graphics::Glyph::line, Bio::Graphics::Glyph::pinsertion, Bio::Graphics::Glyph::primers,
Bio::Graphics::Glyph::rndrect, Bio::Graphics::Glyph::segments, Bio::Graphics::Glyph::ruler_arrow, Bio::Graphics::Glyph::toomany,
Bio::Graphics::Glyph::transcript, Bio::Graphics::Glyph::transcript2, Bio::Graphics::Glyph::translation, Bio::Graphics::Glyph::allele_tower,
Bio::DB::GFF, Bio::SeqI, Bio::SeqFeatureI, Bio::Das, GD
AUTHOR
Fiona Cunningham <cunningh@cshl.edu> in Lincoln Stein's lab <steinl@cshl.edu>.
Copyright (c) 2003 Cold Spring Harbor Laboratory
This package and its accompanying libraries is free software; you can redistribute it and/or modify it under the terms of the GPL (either
version 1, or at your option, any later version) or the Artistic License 2.0. Refer to LICENSE for the full license text. In addition,
please see DISCLAIMER.txt for disclaimers of warranty.
perl v5.14.2 2012-02-20 Bio::Graphics::Glyph::allele_tower(3pm)