Full Discussion: comparing multiple files
Post 302338295 by karla on Monday 27th of July 2009 01:04:26 PM

Hi, quick question. I have one file that pairs each file with its reference file.
It looks like this:
KB0000 KB207418
KB0001 KB244904
KB0002 KB215027
KB0003 KB215027
KB0004 KB215027
KB0005 KB204320
KB0006 KB207074
KB0007 KB215204
KB0008 KB223809
KB0009 KB236640
KB0010 KB244506
....
Then I have all these files, which should be compared pairwise, and the difference, if any, should be printed. The files look like this:
>KB0000 1658 amino acids
#
#
#
# Sequence # x Context Score Kinase Answer
# -------------------------------------------------------------------
# KB0000 10 S RRWASGSRG 0.978 unsp YES
# KB0000 10 S RRWASGSRG 0.637 PKA YES
# KB0000 10 S RRWASGSRG 0.528 RSK YES
# KB0000 10 S RRWASGSRG 0.519 cdc2 YES
# KB0000 10 S RRWASGSRG 0.468 CaM-II .
# KB0000 10 S RRWASGSRG 0.441 GSK3 .
# KB0000 10 S RRWASGSRG 0.416 DNAPK .
# KB0000 10 S RRWASGSRG 0.359 CKI YES
# KB0000 10 S RRWASGSRG 0.356 PKG .
# KB0000 10 S RRWASGSRG 0.281 p38MAPK .
# KB0000 10 S RRWASGSRG 0.252 ATM .
# KB0000 10 S RRWASGSRG 0.232 PKC .
# KB0000 10 S RRWASGSRG 0.223 CKII .
# KB0000 10 S RRWASGSRG 0.168 cdk5 .
# KB0000 10 S RRWASGSRG 0.147 PKB .
#
# KB0000 12 S WASGSRGAA 0.757 PKC YES



>KB207418 1658 amino acids
#
#
# Sequence # x Context Score Kinase Answer
# -------------------------------------------------------------------
# KB207418 10 S RRWASGSRG 0.978 unsp YES
# KB207418 10 S RRWASGSRG 0.637 PKA YES
# KB207418 10 S RRWASGSRG 0.528 RSK YES
# KB207418 10 S RRWASGSRG 0.519 cdc2 YES
# KB207418 10 S RRWASGSRG 0.468 CaM-II .
# KB207418 10 S RRWASGSRG 0.441 GSK3 .
# KB207418 10 S RRWASGSRG 0.416 DNAPK .
# KB207418 10 S RRWASGSRG 0.359 CKI .
# KB207418 10 S RRWASGSRG 0.356 PKG .
# KB207418 10 S RRWASGSRG 0.281 p38MAPK .
# KB207418 10 S RRWASGSRG 0.252 ATM .
# KB207418 10 S RRWASGSRG 0.232 PKC .
# KB207418 10 S RRWASGSRG 0.223 CKII .
# KB207418 10 S RRWASGSRG 0.168 cdk5 .
# KB207418 10 S RRWASGSRG 0.147 PKB .
#
# KB207418 12 S WASGSRGAA 0.757 PKC YES



So in this case the output should be:
# KB0000 10 S RRWASGSRG 0.359 CKI YES


Thanks in advance for the help.
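
One possible way to approach the comparison described above, written as a minimal Python sketch rather than a shell one-liner. It rests on several assumptions that are not stated in the post: each prediction file is stored under its ID as the file name (for example a file called KB0000), data rows always have eight whitespace-separated fields, and two rows correspond when position, residue, context and kinase match. The ID column is ignored because it always differs between a file and its reference. Only rows from the first file of each pair are reported, which matches the expected output shown. The function names are made up for illustration.

```python
from pathlib import Path


def parse_predictions(lines):
    """Map (position, residue, context, kinase) -> (score, answer, raw line)."""
    rows = {}
    for line in lines:
        fields = line.split()
        # Data rows look like: # ID POS RESIDUE CONTEXT SCORE KINASE ANSWER
        # The column header also has eight fields, so require a numeric position.
        if len(fields) != 8 or fields[0] != "#" or not fields[2].isdigit():
            continue
        _, _seq_id, pos, residue, context, score, kinase, answer = fields
        rows[(pos, residue, context, kinase)] = (score, answer, line.rstrip("\n"))
    return rows


def diff_predictions(query_lines, reference_lines):
    """Return query rows that are missing from, or differ from, the reference."""
    query = parse_predictions(query_lines)
    reference = parse_predictions(reference_lines)
    differing = []
    for key, (score, answer, raw) in query.items():
        ref = reference.get(key)
        # Scores are compared as text, so 0.50 and 0.5 would count as different.
        if ref is None or ref[:2] != (score, answer):
            differing.append(raw)
    return differing


def compare_pairs(mapping_path, directory="."):
    """Walk the two-column mapping file and print differences for each pair."""
    base = Path(directory)
    for entry in Path(mapping_path).read_text().splitlines():
        ids = entry.split()
        if len(ids) != 2:
            continue  # skip blank lines and anything that is not an ID pair
        # Assumption: each file is named exactly after its ID.
        with open(base / ids[0]) as query, open(base / ids[1]) as reference:
            for row in diff_predictions(query, reference):
                print(row)
```

Calling `compare_pairs("pairs.txt")`, where `pairs.txt` is a placeholder name for the two-column list of IDs, would print the differing rows for every pair in turn. If differences in both directions are wanted, `diff_predictions` could be called a second time with the arguments swapped.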
 
