Replace text in column1 of a file matching columns of another file
Hi all,
I have 2 files:
species-names.txt
sequence-names.txt
I need to replace text in 1st column of file sequence-names.txt which matches 1st column of file species-names.txt to text of 2nd column of species-names.txt. Output will be:
Hello Gurus,
I have a text file containing nearly 12,000 tab delimited characters with 4000 rows. If the file size is small, excel can convert the text into coloumns. However, the file that I have is very big. Can some body help me in solving this problem?
The input file example,
... (6 Replies)
HELLO! This is my first post here! By the way, I think it is great that people do this.
My question:
I have two files, one is a .dilm and one is a .txt. It is my understanding that the .dilm file can be treated as a .txt file. I wrote another program where I was able to manipulate it as if it... (3 Replies)
Hi All,
I have two sets of files.
Set 1: 100 text files with extension .txt with names like 1.txt, 2.txt, 3.txt until 100.txt
Set 2: One big file with extension .dat
The text files have some records in columns like this:
0.7316431 82628
0.7248189 82577
0.7248182 81369
0.7222999... (1 Reply)
I have a text file that has three columns. But at the end of the text file, there are trailing lines that have missing second and third columns:
4 0.04972604 KLHL28
4 0.0497332 CSTB
4 0.04979822 AIF1
4 0.04983331 DECR2
4 0.04990344 KATNB1
4
4
4
4
How can I remove the trailing... (3 Replies)
Hi,
I want to achieve something similar to what described in another post:
The difference is I want to add the line if the pattern is not found.
File 1:
A123, valueA, valueB
B234, valueA, valueB
C345, valueA, valueB
D456, valueA, valueB
E567, valueA, valueB
F678, valueA, valueB
... (11 Replies)
Hello,
I want to concatenate 2-columns by 2-columns separated by colon. How can I do so? For example, I have a text file containing 6 columns separated by tab. I want to concatenate column 1 and 2; column 3 and 4; column 5 and 6, respectively, and put a colon in between.
input file:
1 0 0 1... (10 Replies)
Not sure if this is the correct forum for this question. I have two files. file1.zip, file2
Input:
file1.zip
col1, col2 , col3
a , b , 0:0:0:0:0:c436:9346:d40b
x, y, 0:0:0:0:0:880:39f9:c9a7
m, n , 0:0:0:0:0:80c7:9161:fe00
file2.txt
col1
c4:36:93:46:d4:0b... (1 Reply)
In the awk below I am trying to cp and paste each matching line in f2 to $3 in f1 if $2 of f1 is in the line in f2 somewhere. There will always be a match (usually more then 1) and my actual data is much larger (several hundreds of lines) in both f1 and f2. When the line in f2 is pasted to $3 in... (4 Replies)
I have file1 and file2 as shown below,
file1:
((org14/1-131541:0.11535,((org29/1-131541:0.00055,org7/1-131541:0.00055)1.000:0.10112,((org17/1-131541:0.07344,(org23/1-131541:0.07426,((org10/1-131541:0.00201,org22/1-131541:0.00243)1.000:0.02451,
file2:
org14=india
org29=america... (5 Replies)
Discussion started by: dineshkumarsrk
5 Replies
LEARN ABOUT DEBIAN
dotter
DOTTER:(1) General Commands Manual DOTTER:(1)NAME
dotter - Sequence dotplots with image enhancement tools.
SYNOPSIS
dotter [options] <horizontal_sequence> <vertical_sequence> [X options]
DESCRIPTION
Reference: Sonnhammer ELL & Durbin R (1995). A dot-matrix program with dynamic threshold control suited for genomic DNA and protein
sequence analysis. Gene 167(2):GC1-10.
Allowed types:
Protein - Protein DNA - DNA DNA - Protein
Options:
-b <file>
Batch mode, write dotplot to <file>
-l <file>
Load dotplot from <file>
-m <float>
Memory usage limit in Mb (default 0.5)
-z <int>
Set zoom (compression) factor
-p <int>
Set pixel factor manually (ratio pixelvalue/score)
-W <int>
Set sliding window size. (K => Karlin/Altschul estimate)
-M <file>
Read in score matrix from <file> (Blast format; Default: Blosum62).
-F <file>
Read in sequences and data from <file> (replaces sequencefiles).
-f <file>
Read feature segments from <file>
-H Do not calculate dotplot at startup.
-R Reversed Greyramp tool at start.
-r Reverse and complement horizontal_sequence (DNA vs Protein)
-D Don't display mirror image in self comparisons
-w For DNA: horizontal_sequence top strand only (Watson)
-c For DNA: horizontal_sequence bottom strand only (Crick)
-q <int>
Horizontal_sequence offset
-s <int>
Vertical_sequence offset
Some X options: -acefont <font> Main font. -font <font> Menu font.
SEE ALSO
http://www.cgb.ki.se/cgb/groups/sonnhammer/Dotter.html for more info.
AUTHOR
Erik.Sonnhammer@cgb.ki.se Version 3.1, compiled Jul 10 2010
July 2010 DOTTER:(1)