Hi Guys...
Please Could you help me with the following ?
aaaa bbbb cccc sdsd
aaaa bbbb cccc qwer
as you can see, the 2 lines are matched in three fields...
how can I delete this pupicate ? I mean to delete the second one if 3 fields were duplicated ?
Thanks (14 Replies)
I am trying to figure out how to scan a file like so:
1 ralphs office","555-555-5555","ralph@mail.com","www.ralph.com
2 margies office","555-555-5555","ralph@mail.com","www.ralph.com
3 kims office","555-555-5555","kims@mail.com","www.ralph.com
4 tims... (17 Replies)
I have a file1 that looks like this:
File 1
a b
b c
c e
d e
and a file 2 that looks like this:
File 2
b
c
e
e
Note that file 2 is the right hand column from file1. I want to remove any lines from file1 that begin with the column in file2. In this case the desired output... (6 Replies)
hello all,
I have an input file with four columns like this with a lot of lines
and for example, line 1 and line 5 match because the first 4 characters match and the fourth column matches too. I want to keep the line that has the lowest number in the third column. So I discard line 5.... (5 Replies)
Hi,
I need to concatenate some lines in a file based on the First 4 coloumns of a file .. (For Eg.)
Consider a file ...
I,01,000002,0666,00000.00,000,00,000,000, ,0
I,01,000002,0667,00000.00,000,00,000,000, ,0
I,01,000002,0666,00056.10
I,01,000002,0667,00056.10
I,01,000002,0666,00001... (6 Replies)
Consider i have 2 directories a1 and a2.
under a1, i have below files
test1
test2
test3.
Under a2,i have below files.
test1
test2
test3
test4
test5My requirement is i will pass the directory names(2 parameters) and directory in which files needs to be removed.(3rd parameter)
a)first... (11 Replies)
Hi guys,Got a bit of a bind I'm in. I'm looking to remove duplicates from a pipe delimited file, but do so based on 2 columns. Sounds easy enough, but here's the kicker...
Column #1 is a simple ID, which is used to identify the duplicate.
Once dups are identified, I need to only keep the one... (2 Replies)
Hi,
I have tried to remove dublicate lines based on first column with pipe delimiter . but i ma not able to get some uniqu lines
Command : sort -t'|' -nuk1 file.txt
Input :
38376KZ|09/25/15|1.057
38376KZ|09/25/15|1.057
02006YB|09/25/15|0.859
12593PS|09/25/15|2.803... (2 Replies)
Discussion started by: parithi06
2 Replies
LEARN ABOUT DEBIAN
theseus_align
THESEUS_ALIGN(1) General Commands Manual THESEUS_ALIGN(1)NAME
theseus_align - quick-and-dirty way to superimpose proteins
SYNOPSIS
theseus_align [theseus options] -f pdbfile1.pdb pdbfile2.pdb ...
OPTIONS
The options given to the script will be passed on to theseus. For a complete description, see the man page for theseus (1).
DESCRIPTION
This manual page briefly documents briefly the script theseus_align, designed for a quick-and-dirty way to ML superposition proteins with
different sequences. It should work very well when the protein sequences are relatively similar, although the ML method will still give
much better results than least-squares when the sequences are moderately divergent. Technically, this procedure gives a structure-based
superposition of a sequence-based alignment. It does not perform a structure-based alignment.
First, the script uses theseus to create FASTA formatted sequence files corresponding to the exact protein sequences found in the pdb files
that you supply.
Second, these sequences are aligned using the multiple sequence alignment program of your choice. The script can easily be modified for
CLUSTALW, T_COFFEE, KALIGN, DIALIGN2, or MAFFT. Any multiple sequence alignment program can be used, as long as it can generate clustal-
formatted files. However, I highly recommend Bob Edgar's MUSCLE program for both its speed and accuracy. (For more info see
http://www.drive5.com/muscle/ .)
Third, theseus performs a superposition of the structures using the sequence alignment as a guide.
The installed version of theseus_align uses muscle (1) for doing the multiple sequence alignment. If you wish to use one of the other pro-
grams mentioned above, you'll have to copy the script to your own directory and edit it.
SEE ALSO
theseus (1), muscle (1), clustalw (1), t_coffee (1), kalign (1), dialign2 (1), mafft (1). All of these programs can be installed on Debian
or Ubuntu systems using apt-get (8).
AUTHOR
theseus_align was written by Douglas L. Theobald, Department of Biochemistry, Brandeis University.
November, 2008 THESEUS_ALIGN(1)