Perl: filtering lines based on duplicate values in a column
Hi I have a file like this. I need to eliminate lines with first column having the same value 10 times.
The value 13 in the first column is repeated 10 times in the consecutive lines. I need to eliminate all those lines in the output.
so the desired output will be
Thank you much in advance. If it is possible a code in Perl would be much appreciated.
Does anybody know a command that filters duplicate lines out of a file. Similar to the uniq command but can handle duplicate lines no matter where they occur in a file? (9 Replies)
Hi, I've got a file that I'd like to uniquely sort based on column 2 (values in column 2 begin with "comp").
I tried sort -t -nuk2,3 file.txtBut got:
sort: multi-character tab `-nuk2,3'
"man sort" did not help me out
Any pointers?
Input:
Output: (5 Replies)
Hi experts, I have a tab-delimited file with one column containing values separated by a comma. I wish to duplicate the entire line for every value in that comma-delimited field.
For example:
$cat file
4444 4444 4444 4444
9990 2222,7777 6666 2222 ... (3 Replies)
Hi,
I have a similar input format-
A_1 2
B_0 4
A_1 1
B_2 5
A_4 1
and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks!
letter number_of_letters... (5 Replies)
Hi,
I have tried to remove dublicate lines based on first column with pipe delimiter . but i ma not able to get some uniqu lines
Command : sort -t'|' -nuk1 file.txt
Input :
38376KZ|09/25/15|1.057
38376KZ|09/25/15|1.057
02006YB|09/25/15|0.859
12593PS|09/25/15|2.803... (2 Replies)
Dear folks
I have a map file of around 54K lines and some of the values in the second column have the same value and I want to find them and delete all of the same values. I looked over duplicate commands but my case is not to keep one of the duplicate values. I want to remove all of the same... (4 Replies)
Hi there,
I am trying to filter a big file with several columns using values on a column with values like (AC=5;AN=10;SF=341,377,517,643,662;VRT=1). I wont to filter the data based on SF= values that are (bigger than 400)
... (25 Replies)
I have a file with 5 columns. I want to pull out all records where the value in column 4 is not unique. For example in the sample below, I would want it to print out all lines except for the last two.
40991764 2419 724 47182 Cand A
40992936 3591 724 47182 Cand B
40993016 3671 724 47182 Cand C... (5 Replies)
Discussion started by: kaktus
5 Replies
LEARN ABOUT DEBIAN
ace::sequence::gene
Ace::Sequence::Gene(3pm) User Contributed Perl Documentation Ace::Sequence::Gene(3pm)NAME
Ace::Sequence::Gene - Simple "Gene" Object
SYNOPSIS
# open database connection and get an Ace::Object sequence
use Ace::Sequence;
# get a megabase from the middle of chromosome I
$seq = Ace::Sequence->new(-name => 'CHROMOSOME_I,
-db => $db,
-offset => 3_000_000,
-length => 1_000_000);
# get all the genes
@genes = $seq->genes;
# get the exons from the first one
@exons = $genes[0]->exons;
# get the introns
@introns = $genes[0]->introns
# get the CDSs (NOT IMPLEMENTED YET!)
@cds = $genes[0]->cds;
DESCRIPTION
Ace::Sequence::Gene is a subclass of Ace::Sequence::Feature. It inherits all the methods of Ace::Sequence::Feature, but adds the ability
to retrieve the annotated introns and exons of the gene.
OBJECT CREATION
You will not ordinarily create an Ace::Sequence::Gene object directly. Instead, objects will be created in response to a genes() call to
an Ace::Sequence object.
OBJECT METHODS
Most methods are inherited from Ace::Sequence::Feature. The following methods are also supported:
exons()
@exons = $gene->exons;
Return a list of Ace::Sequence::Feature objects corresponding to annotated exons.
introns()
@introns = $gene->introns;
Return a list of Ace::Sequence::Feature objects corresponding to annotated introns.
cds()
@cds = $gene->cds;
Return a list of Ace::Sequence::Feature objects corresponding to coding sequence. THIS IS NOT YET IMPLEMENTED.
relative()
$relative = $gene->relative;
$gene->relative(1);
This turns on and off relative coordinates. By default, the exons and intron features will be returned in the coordinate system used
by the gene. If relative() is set to a true value, then coordinates will be expressed as relative to the start of the gene. The first
exon will (usually) be 1.
SEE ALSO
Ace, Ace::Object, Ace::Sequence,Ace::Sequence::Homol, Ace::Sequence::Feature, Ace::Sequence::FeatureList, GFF
AUTHOR
Lincoln Stein <lstein@cshl.org> with extensive help from Jean Thierry-Mieg <mieg@kaa.crbm.cnrs-mop.fr>
Copyright (c) 1999, Lincoln D. Stein
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See DISCLAIMER.txt for
disclaimers of warranty.
POD ERRORS
Hey! The above document had some coding errors, which are explained below:
Around line 148:
You forgot a '=back' before '=head1'
perl v5.14.2 2001-02-18 Ace::Sequence::Gene(3pm)