I have 2 files; file 1 having smaller positions that overlap with the positions with positions in file2.
file1
file2
I want to get something like this
output
Hi all,
I have difficulty to solve the followign problem.
mydata:
StartPoint EndPoint
22 55
2222 2230
33 66
44 58
222 240
11 25
22 60
33 45
The union of above... (2 Replies)
Dear Gurus,
I have 57 tab-delimited different text files, each one containing entries in 3 columns. The first column in each file contains names of objects. Some names are present in more than one file. I would like to find those names and store them in a separate text file, preferably with a... (6 Replies)
Hi, I have a file1 of many long sequences, each preceded by a unique header line. file2 is 3-columns list: headers name, start position, end position. I'd like to extract the sequence region of file1 specified in file2.
Based on a post elsewhere, I found the code:
awk... (2 Replies)
Discussion started by: pathunkathunk
2 Replies
4. Forum Support Area for Unregistered Users & Account Problems
The forums have been seeing a sharp increase in spam bots, forum robots, and malicious registrations from certain countries. If you have been directed to this thread due to a "No Permission Error" when trying to register please post in this thread and request permission to register, including... (1 Reply)
Hi I have 2 files; usually the end position in the file1 is the start position in the file2 and the end position in file2 will be the start position in file1 (flanks)
file1
Id start end
aaa1 0 3000070
aaa1 3095270 3095341
aaa1 3100822 3100894
aaa1 ... (1 Reply)
Hello, here I am posting my query again with modified data input files.
see my query is :
i have two input files file1 and file2.
file1 is smalldata.fasta
>gi|546671471|gb|AWWX01449637.1| Bubalus bubalis breed Mediterranean WGS:AWWX01:contig449636, whole genome shotgun sequence... (20 Replies)
Hi,
I have been trying to retrieve the names of files present in a directory one by one but the names of files are getting overlapped on one another.
I tried the below command.
ls -1 > filename
please help me in getting the file names line by line without overlapping. I am using korn... (6 Replies)
Hi all,
I have a file like this I want to extract only those regions which are big and continous
chr1 3280000 3440000
chr1 3440000 3920000
chr1 3600000 3920000 # region coming within the 3440000 3920000. so i don't want it to be printed in output
chr1 3920000 4800000
chr1 ... (2 Replies)
Discussion started by: amrutha_sastry
2 Replies
LEARN ABOUT OSF1
diff3
diff3(1) General Commands Manual diff3(1)NAME
diff3 - Compares three files
SYNOPSIS
diff3 [-e | -x | -E | -X | -3] file1 file2 file3
The diff3 command reads three versions of a file and writes to standard output the ranges of text that differ.
OPTIONS
Creates an edit script for use with the ed command to incorporate into file1 all changes between file2 and file3 (that is, the changes that
normally would be flagged ==== and ====3). Produces an edit script to incorporate only changes flagged ====. These are similar to -e and
-x, respectively, but treat overlapping changes (that is, changes that are flagged ==== in the normal listing) differently. The overlap-
ping lines from both files are inserted by the edit script, bracketed by <<<<<< and >>>>>> lines. The -E option is used by RCS merge to
ensure that overlapping changes in the merged files are preserved and brought to someone's attention. Produces an edit script to incorpo-
rate only changes flagged ====3.
DESCRIPTION
The diff3 command reads three versions of a file and writes to standard output the ranges of text that differ, flagged with the following
codes: All three files differ. file1 differs. file2 differs. file3 differs.
The type of change needed to convert a given range of a given file to match another file is indicated in one of these two ways in the out-
put: Text is to be added after line number number1 in file, where file is 1, 2, or 3. Text in the range line number1 to line number2 is to
be changed. If number1 = number2, the range may be abbreviated to number1.
The original contents of the range follow immediately after a c indication. When the contents of two files are identical, diff3 does not
show the contents of the lower-numbered file, although it shows the location of the identical lines for each.
NOTES
Editing scripts produced by the -e option cannot create lines consisting only of a single . (dot).
EXAMPLES
To list the differences among three files, enter: diff3 fruit.a fruit.b fruit.c
fruit.a, fruit.b, and fruit.c contain the following data:
fruit.a:
banana grape kiwi lemon mango orange peach pare
fruit.b:
apple banana grapefruit kiwi orange peach pear
fruit.c:
grape grapefruit kiwi lemon mango orange peach pear
The output from diff3 shows the differences between these files as follows. (The comments on the right do not appear in the output.)
==== All three files are different. 1:1,2c - Lines 1 and 2 of the first file, fruit.a
banana
grape 2:1,3c - Lines 1 through 3 of fruit.b
apple
banana
grapefruit 3:1,2c - Lines 1 and 2 of fruit.c
grape
grapefruit ====2 The second file, fruit.b, is different. 1:4,5c - Lines 4 and 5 are the same in fruit.a and fruit.c. 2:4a
3:4,5c - To make fruit.b look the same, add text after line 4.
lemon
mango ====1 The first file, fruit.a, is different. 1:8c
pare 2:7c - Line 7 of fruit.b and line 8 of fruit.c are the same. 3:8c
pear
FILES
Helper program.
SEE ALSO
Commands: bdiff(1), cmp(1), comm(1), diff(1), ed(1)diff3(1)