Grep or awk a unique and specific word across many fields
Hi there,
I have data with similar structure as this:
is there a way to report a string the whole string if words such homo or hetero is found across columns not minding fields with dots (.) which mean unknown. So the data looks like this:
for example:
searches only for single word
for single word
this is line three
match=$(grep -n -e "single" data.txt)
this command will stored "..... single ...... single" into search.
how can i grep the single word just from line 2 only?? (3 Replies)
ok, so this is proving to be kind of difficult even though it should not be.
say for instance I want to grep out ONLY the word fkafal from the below output, how do I do it?
echo ajfjf fjfjf iafjga fkafal foeref afoafahfia | grep -w "fkafal"
If i run the above command, i get back all the... (4 Replies)
Hi.
I have a tab separated file that has a couple nearly identical lines. When doing:
sort file | uniq > file.new
It passes through the nearly identical lines because, well, they still are unique.
a)
I want to look only at field x for uniqueness and if the content in field x is the... (1 Reply)
Hi,
I have gone through may posts and dint find exact solution for my requirement.
I have file which consists below data and same file have lot of other data.
<MAPPING DESCRIPTION ='' ISVALID ='YES' NAME='m_TASK_UPDATE' OBJECTVERSION ='1'>
<MAPPING DESCRIPTION ='' ISVALID ='NO'... (11 Replies)
Is it possible to modify file like this.
1. Remove all the duplicate names in a define column i.e 4th col
2. Count the no.of unique names separated by ";" and print as a 5th col
thanx in advance!!
Q
input
c1 30 3 Eh2
c10 96 3 Frp
c41 396 3 Ua5;Lop;Kol;Kol
c62 2 30 Fmp;Fmp;Fmp
... (5 Replies)
Hi,
Below is an example :
ST1 PREF: int1 AVAIL: int2
ST2 PREF :int1 AVAIL: int2
I need int1 to come in preferred variable while programming and int2 in available variable
Please help me doing so
Best regards,
Vishal (10 Replies)
Trying to use awk to find a keyword and return the matches in the row, but also $1 and $2, which are the unique id's, but they only appear once. Thank you :).
file
name 31 Index Chromosomal Position Gene Inheritance
122 2106725 TSC2 AD
124 2115481 TSC2 AD
121 2105400 TSC2 AD... (6 Replies)
Hello All,
Here is am trying to get maximum value of third field depending on
first,second and fourth fields with awk command . delimeter is pipe(|) .
input
0221|09|14.25|aaa
0221|09|44.27|aaa
0221|09|44.33|aaa
0221|09|44.53|bbb
0221|09|34.32|bbb
0221|09|37.13|bbb... (5 Replies)
Discussion started by: sayami00
5 Replies
LEARN ABOUT DEBIAN
bio::cluster::familyi
Bio::Cluster::FamilyI(3pm) User Contributed Perl Documentation Bio::Cluster::FamilyI(3pm)NAME
Bio::Cluster::FamilyI - Family Interface
SYNOPSIS
# see the implementations of this interface for details
my $cluster= $cluster->new(-description=>"POLYUBIQUITIN",
-members =>[$seq1,$seq2]);
my @members = $cluster->get_members();
my @sub_members = $cluster->get_members(-species=>"homo sapiens");
DESCRIPTION
This interface if for a Family object representing a family of biological objects. A generic implementation for this may be found a
Bio::Cluster::Family.
FEEDBACK
Mailing Lists
User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the
Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/wiki/Mailing_lists - About the mailing lists
Support
Please direct usage questions or support issues to the mailing list:
bioperl-l@bioperl.org
rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address
it. Please include a thorough description of the problem with code and data examples if at all possible.
Reporting Bugs
Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the
web:
https://redmine.open-bio.org/projects/bioperl/
AUTHOR - Shawn Hoon
Email shawnh@fugu-sg.org
APPENDIX
The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _
new
We don't mandate but encourage implementors to support at least the
following named parameters upon object initialization.
Arguments Description
--------------------
-family_id the name of the family
-description the consensus description of the family
-annotation_score the confidence by which the consensus description is
representative of the family
-members the members belonging to the family
-alignment the multiple alignment of the members
family_id
Title : family_id
Usage : Bio::Cluster::FamilyI->family_id("znfp");
Function: get/set for the family id
Returns : the family id
Args : the family id
family_score
Title : family_score
Usage : Bio::Cluster::FamilyI->family_score(95);
Function: get/set for the score of algorithm used to generate
the family if present
Returns : the score
Args : the score
Methods inherited from Bio::ClusterI
display_id
Title : display_id
Usage :
Function: Get the display name or identifier for the cluster
Returns : a string
Args :
get_members
Title : get_members
Usage : Bio::Cluster::FamilyI->get_members();
Function: get the members of the family
Returns : the array of members
Args : the array of members
description
Title : description
Usage : Bio::Cluster::FamilyI->description("Zinc Finger Protein");
Function: get/set for the description of the family
Returns : the description
Args : the description
size
Title : size
Usage : Bio::Cluster::FamilyI->size();
Function: get/set for the description of the family
Returns : size
Args :
cluster_score
Title : cluster_score
Usage : $cluster ->cluster_score(100);
Function: get/set for cluster_score which
represent the score in which the clustering
algorithm assigns to this cluster.
Returns : a number
perl v5.14.2 2012-03-02 Bio::Cluster::FamilyI(3pm)