Sponsored Content
Operating Systems Linux Comparing specific columns between two files Post 302956327 by Syeda Sumayya on Tuesday 29th of September 2015 05:17:02 AM
Old 09-29-2015
Actually both the columns of file-B are interacting-protein-partners.
I want to match the column4 of file-A with both the columns of file-B, to see which of the protein of column4, file-A is also present in any of the column of file-B along with its corresponding protein partner.

Following is how I want the output to be like,

Code:
30.00  17  gi|49484107|ref|YP_041331.1|   DIP-46805N|uniprot:P70388  1e-06 DIP-44775N|refseq:NP_006210|uniprotkb:P42338    DIP-46805N|uniprotkb:P70388

i.e all columns of file-A + the matching LINE (both columns) of file-B (if either column contains a value same as that in column4 of file-A).
In the given output note that column2 of file-B had the same value as that of column4 file-A.

Hope I was able to explain my question better.
 

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Comparing Columns of two FIles

Dear all, I have two files in UNIX File1 and File2 as below: File1: 1,1234,.,67.897,,0 1,4134,.,87.97,,4 0,1564,.,97.8,,1 File2: 2,8798,.,67.897,,0 2,8879,.,77.97,,4 0,1564,.,97.8,,1 I want to do the following: (1) Make sure that both the files have equal number of columns and if... (1 Reply)
Discussion started by: ggopal
1 Replies

2. Shell Programming and Scripting

Comparing Columns of two FIles

Dear all, I have two files in UNIX File1 and File2 as below: File1: 1,1234,.,67.897,,0 1,4134,.,87.97,,4 0,1564,.,97.8,,1 File2: 2,8798,.,67.897,,0 2,8879,.,77.97,,4 0,1564,.,97.8,,1 I want to do the following: (1) Make sure that both the files have equal number of columns and if... (4 Replies)
Discussion started by: ggopal
4 Replies

3. Shell Programming and Scripting

comparing the columns in two files

I have two files file1 and file 2 both are having multiple coloumns.i want to select only two columns. i used following code to get the desired columns,with ',' as delimiter cut -d ',' -f 1,2 file1 | sort > file1.new cut -d ',' -f 1,2 file2 | sort > file2.new I want to get the coloums... (1 Reply)
Discussion started by: bab123
1 Replies

4. Shell Programming and Scripting

comparing 2 columns from 2 files

Hey, I have 2 files that have a name and then a number: File 1: dog 21 dog 24 cat 33 cat 27 dog 76 cat 65 File 2: dog 109 dog 248 cat 323 cat 207 cat 66 (2 Replies)
Discussion started by: dcfargo
2 Replies

5. UNIX for Dummies Questions & Answers

Comparing 2 columns from 2 files

Hi, I have two files with the same number of columns. Basically I want to print the 2 columns that match between the two files. File1 looks like this: dr12 12 6 abn dr14 12 7 abn File2 looks something like this: dr12 12 8 abn dr12 14 7 abn So basically if the first... (1 Reply)
Discussion started by: kylle345
1 Replies

6. UNIX for Dummies Questions & Answers

Comparing columns in two files

Hi, I have two files. File1.txt has 2 columns and looks like: 458739 122345 4456 122657 34200 122600 File2.txt has many columns with column 1 the same as column2 of File1.txt, but with lot more rows: 122786 abcdefg user1@email 122778 uuhjeufh user2@email... (1 Reply)
Discussion started by: ursaan
1 Replies

7. Shell Programming and Scripting

comparing two columns from two different files

Hello, I have two files as 1.txt and 2.txt with number as columns. 1.txt 0 53.7988 1 -30.0859 2 20.1632 3 14.2135 4 14.6366 5 -37.6258 . . . 31608 -8.57333 31609 -2.58554 31610 -24.2857 2.txt (1 Reply)
Discussion started by: AKD
1 Replies

8. Shell Programming and Scripting

AWK: Comparing two columns from two different files

Hi - I have two files as follows: File 1: chr5 118464905 118465027 ENST00000514151 utr5 0 + chr5 118464903 118465118 ENST00000504031 utr5 0 + chr5 118468826 118469180 ENST00000504031 utr5 0 + chr5 118469920 118470084 ... (14 Replies)
Discussion started by: polsum
14 Replies

9. Shell Programming and Scripting

Comparing two columns from two different files

Hi, I have a single-column file1 having records like: 00AB01/11 43TG22/00 78RC09/34 ...... ...... and a second file , file 2 having two columns like 78RC09/34 1 45FD11/11 2 00AB01/11 3 43TG22/00 4 ...... ...... (8 Replies)
Discussion started by: amarn
8 Replies

10. UNIX for Advanced & Expert Users

Need help in comparing multiple columns from two files.

Hi all, I have two files as below. I need to compare field 2 of file 1 against field 1 of file 2 and field 5 of file 1 against filed 2 of file 2. If both matches , then create a result file 1 with first file data and if not matches , then create file with first fie data. Please help me in... (12 Replies)
Discussion started by: sivarajb
12 Replies
PROBALIGN(1)							   User Commands						      PROBALIGN(1)

NAME
probalign - Align sequences in MFAFILE(s) and print result to standard output DESCRIPTION
PROBALIGN Version 1.4 (Nov 2010) aligns multiple protein sequences and prints to the standard output. Written by Satish Chikkagoudar and Usman Roshan using code from PROBCONS version 1.1 (written by Chuong Do) and based upon probA (written by Ulrike Muckstein). PROBALIGN 1.4 comes with ABSOLUTELY NO WARRANTY. This is free software, and you are welcome to redistribute it under certain conditions. See the README file for details. Usage: probalign [OPTION]... [MFAFILE]... Description: Align sequences in MFAFILE(s) and print result to standard output -clustalw use CLUSTALW output format instead of MFA -v, --verbose report progress while aligning (default: off) -a, --alignment-order print sequences in alignment order rather than input order (default: off) -T, -temperature Sets the thermodynamic temperature parameter (default: 5 (for protein data mode), 1 ( for nucleotide data mode)). -score_matrix, --score_matrix Sets the type of scoring matrix used to calculate the posterior probabilities (default: gonnet_160, representing gonnet 160, refer README for details) -go, --gap-open This option can be used to specify the gap open parameter. The default for Gonnet 160 (protein) is 22 and nucleotide (simple matrix) is 4. -ge, --gap-extension This option can be used to specify the gap extension parameter. The default for Gonnet 160 (protein) is 1 and nucleotide (simple matrix) is 0.25. -nuc Specify this option to indicate that inputted sequences are nucleotide sequences -prot Specify this option to indicate that inputted sequences are protein sequences [DEFAULT] -showPP Outputs the posterior probabilities of alignment columns as a new sequence named Posterior Probabilities (The probability values are scaled to be between integers between 0 and 9). probalign 1.4 May 2011 PROBALIGN(1)
All times are GMT -4. The time now is 04:23 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy