Shell Programming and Scripting: post 302702177 by kaav06 on Tuesday 18th of September 2012 03:41:44 AM
Retrieve data from one file comparing the ID in the second file

Hi all,

I have one file with IDs:
Code:
Q8NDM7
P0C1S8
Q8TF30
Q9BRP8
O00258
Q6AWC2
Q9ULE0
Q702N8
A4UGR9
Q13426
Q6P2D8
Q9ULM3
A8MXQ7

I want to compare the ID file with a second file that holds complete information about these IDs, and also about other IDs that are not in the ID file. As a result, I want only the information for the entries listed in the ID file. The second file contains records such as:
Code:
ID   3BP5L_HUMAN             Reviewed;         393 AA.
AC   Q7L8J4; Q96FI5; Q9BQH8; Q9C0E3;
DT   05-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT   05-JUL-2004, sequence version 1.
DT   05-SEP-2012, entry version 71.
DE   RecName: Full=SH3 domain-binding protein 5-like;
DE            Short=SH3BP-5-like;
GN   Name=SH3BP5L; Synonyms=KIAA1720; ORFNames=UNQ2766/PRO7133;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   TISSUE=Brain;
RX   MEDLINE=21082932; PubMed=11214970; DOI=10.1093/dnares/7.6.347;
RA   Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.;
RT   "Prediction of the coding sequences of unidentified human genes. XIX.
RT   The complete sequences of 100 new cDNA clones from brain which code
RT   for large proteins in vitro.";
RL   DNA Res. 7:347-355(2000).
RN   [2]
//
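
One possible approach, as a minimal sketch rather than a definitive answer: read the ID file into an awk array, buffer each record of the second file up to its `//` terminator, and print the record only if one of the accessions on its `AC` line is in the array. This assumes the second file is a UniProt flat file, that the IDs in the first file are accession numbers (so they are matched against the `AC` lines, not the `ID` line), and that a match on any accession of a record, primary or secondary, should keep it. The function name `filter_uniprot` is hypothetical.

```shell
#!/bin/sh
# filter_uniprot IDS_FILE FLATFILE
# Hypothetical helper: print only the records of FLATFILE whose AC line
# contains at least one accession listed in IDS_FILE (one ID per line).
filter_uniprot() {
    awk '
        # First file: remember every non-empty ID.
        NR == FNR { if ($1 != "") want[$1] = 1; next }

        # Second file: buffer the current record line by line.
        { rec = rec $0 "\n" }

        # AC lines look like "AC   Q7L8J4; Q96FI5;" so strip the
        # trailing semicolon from each field before the lookup.
        $1 == "AC" {
            for (i = 2; i <= NF; i++) {
                acc = $i
                sub(/;$/, "", acc)
                if (acc in want) keep = 1
            }
        }

        # "//" ends a record: print it if it matched, then reset.
        $1 == "//" {
            if (keep) printf "%s", rec
            rec = ""
            keep = 0
        }
    ' "$1" "$2"
}
```

It could be called as `filter_uniprot ids.txt uniprot.dat > subset.dat`, where both input file names are placeholders. Two caveats of this sketch: an empty ID file breaks the `NR == FNR` test, and a final record without a closing `//` is not printed.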

 
