Comparing and merging two text files


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Comparing and merging two text files
# 1  
Old 04-28-2013
Comparing and merging two text files

Hey everybody,

I am new here and already a question to ask, I just recently started some bioinformatic work for my PhD so I am slowly learning

Anyway, here is my problem, I have two text files, one contains the complete data file with 43000 genes and their read counts for all my samples (actually 13 columns in total):

Code:
Ant-TAContig10	2025	1276	731	308	237	63	296	
Ant-TAContig100	5340	1711	7322	1088	5018	757	2788	
Ant-TAContig1000	1101	428	650	164	358	67	301	
Ant-TAContig10000	3585	1882	2621	532	838	216

And the other text file is a list of the 21000 genes which have high expression value that I want to extract

Code:
Ant-TAContig10
Ant-TAContig100
Ant-TAContig1000
Ant-TAContig10000
Ant-TAContig10003
Ant-TAContig10005
Ant-TAContig10006

Okay so all I want to do is extract the genes in file 2 from the file 1 so I get expression value of only these 21000 genes

I sorted the files and tried join
Code:
join file1 file2 > out.txt

But the outfile is empty

I tried comm and diff but nothing seems to work

Any ideas why it does not work and hints for me please?

Thanks again everybody, have a nice sunday

Last edited by Scrutinizer; 04-28-2013 at 01:37 PM.. Reason: code tags
# 2  
Old 04-28-2013
ant55,
You will need to read the rules on how to post and use coding tags.

Your initial solution works on Solaris 10 but not sure on the version of Unix you are using.

Code:
join file1 file2 > out1                                                        
cat out1
Ant-TAContig10 2025 1276 731 308 237 63 296 
Ant-TAContig100 5340 1711 7322 1088 5018 757 2788 
Ant-TAContig1000 1101 428 650 164 358 67 301 
Ant-TAContig10000 3585 1882 2621 532 838 216

You can also try these solutions:

Code:
join -t " " -1 1 -2 1 file1 file2 
Ant-TAContig10 2025 1276 731 308 237 63 296 
Ant-TAContig100 5340 1711 7322 1088 5018 757 2788 
Ant-TAContig1000 1101 428 650 164 358 67 301 
Ant-TAContig10000 3585 1882 2621 532 838 216

Code:
nawk 'FNR==NR{arr[$1]=$0;next} {if (arr[$1] != "")  print arr[$1]}' file1 file2
Ant-TAContig10 2025 1276 731 308 237 63 296 
Ant-TAContig100 5340 1711 7322 1088 5018 757 2788 
Ant-TAContig1000 1101 428 650 164 358 67 301 
Ant-TAContig10000 3585 1882 2621 532 838 216

Note that your solution and the other 2 above will not tell you if a record is in file2 but not file1 which is present in your sample files.

Last edited by mjf; 04-28-2013 at 03:39 PM..
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Merging two text files by two columns

Hi, I have two text files that I would like to merge/join. I would like to join them if the first columns of both text files match and the second column of the first text file matches the third column of the second text file. Example input: First file: 1334 10 0 0 1 5.2 1334 12 0 0 1 4.5... (4 Replies)
Discussion started by: evelibertine
4 Replies

2. UNIX for Dummies Questions & Answers

Merging two text files by a column

I have two text files. One has two columns and looks like below: rs# otherallele_freq rs10399749 0 rs4030303 0 rs4030300 0 rs940550 1.000 rs13328714 0 rs11490937 0 rs6683466 0 rs12025928 1.000 rs6650104 0 rs11240781 0... (5 Replies)
Discussion started by: evelibertine
5 Replies

3. UNIX for Dummies Questions & Answers

Merging two text files by a column

So I have two text files. The first one looks like this: refsnp_id chr_name chrom_start 1 rs1000000 12 126890980 2 rs10000010 4 21618674 3 rs10000012 4 1357325 4 rs10000013 4 37225069 5 rs1000002 3 183635768 And the second one looks like this: AUC rs1000000 0.03 0.1240 AUC ... (4 Replies)
Discussion started by: evelibertine
4 Replies

4. UNIX for Dummies Questions & Answers

Merging two text files with variable coloumns

Hi All, I have two text files containing space delimited columns. The first file contains 9 columns and the second one contain 3 columns. I want to copy the 3 coloumns from the 2nd file and paste them in 1st file after 9 coloumns. Ex. File1.txt contains 9 coloumns C1 C2 C3 C4 C5 C6 C7 C8 C9... (6 Replies)
Discussion started by: Unilearn
6 Replies

5. Shell Programming and Scripting

AWK Script For Merging Text Files

Hello, I am trying to merge data from two text files. One file (File1) contains a listing of data which includes the trial number in Column 5, while the other text file (File2) contains what category the trial belongs to. Here is a snippet of what File1 looks like. 1 Arrow_ST 9.738 0.905... (2 Replies)
Discussion started by: Jahn
2 Replies

6. Shell Programming and Scripting

merging text files into one

Hi good day Is there a shell script I can use to join multiple text files in a folder and put the name(excluding the extension) of the text file before it's contents and put EOF at the end of each portion. so for example, say I have file1.dat, file2.dat, file3.dat i'll get file1 {text... (7 Replies)
Discussion started by: johnstrong
7 Replies

7. Shell Programming and Scripting

Merging all text files in the order which they recieved

Hi There, I am new to UNIX. requirement is merging all text files from one folder in the order which they have received. EX: under /Test folder files received in certain intervals. abc.txt @ 1:00 AM 123.txt @ 2:00 AM xyz.txt @ 4:00 AM . . . 456.txt @ 10:00 PM so need to develop a... (3 Replies)
Discussion started by: arund_01
3 Replies

8. Shell Programming and Scripting

Merging two files by comparing three fields

Hi Experts, I need your timely help. I have a problem with merging two files. Here my situation : Here I have to compare first three fields from FILE1 with FILE2. If they are equal, I have to append the remaining values from FILE2 with FILE1 to create the output. FILE1: Class ... (3 Replies)
Discussion started by: Hunter85
3 Replies

9. Shell Programming and Scripting

merging text files

hi, i need to merge 2 lakh text files ..... can somebody please help me with a script/program for it... (8 Replies)
Discussion started by: code19
8 Replies

10. Shell Programming and Scripting

merging few columns of two text files to a new file

hi i need to select a few columns of two txt files and write it to a new file. there is one common field for both of these files. plz help me in this thanks in advance (4 Replies)
Discussion started by: kolvi
4 Replies
Login or Register to Ask a Question