Hopefully someone here can point me in the correct direction.
I'm working on a username migration and am trying to map my users ols usernames to the new ones.
Right now every user has a username of firstname.lastname i.e. john.doe
I'm trying to create a bash or python script that will take... (3 Replies)
Hi,
I am having a file of dna sequences in fasta format which look like this:
>admin_1_45
atatagcaga
>admin_1_46
atatagcagaatatatat
with many such thousands of sequences in a single file. I want to the replace the accession Id "admin_1_45" similarly in following sequences to... (5 Replies)
I have two files. File1 is shown below.
>153L:B|PDBID|CHAIN|SEQUENCE
RTDCYGNVNRIDTTGASCKTAKPEGLSYCGVSASKKIAERDLQAMDRYKTIIKKVGEKLCVEPAVIAGIISRESHAGKVL
KNGWGDRGNGFGLMQVDKRSHKPQGTWNGEVHITQGTTILINFIKTIQKKFPSWTKDQQLKGGISAYNAGAGNVRSYARM
DIGTTHDDYANDVVARAQYYKQHGY
>16VP:A|PDBID|CHAIN|SEQUENCE... (7 Replies)
I have a fasta file as follows
>sp|O15090|FABP4_HUMAN Fatty acid-binding protein, adipocyte OS=Homo sapiens GN=FABP4 PE=1 SV=3
MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN
TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVECVM
KGVTSTRVYERA
>sp|L18484|AP2A2_RAT AP-2... (3 Replies)
Hi
How can I extract sequences from a fasta file with respect a certain criteria? The beginning of my file (containing in total more than 1000 sequences) looks like this:
>H8V34IS02I59VP
SDACNDLTIALLQIAREVRVCNPTFSFRWHPQVKDEVMRECFDCIRQGLG
YPSMRNDPILIANCMNWHGHPLEEARQWVHQACMSPCPSTKHGFQPFRMA... (6 Replies)
Hi,
I need some help with modifying fasta headers.
I have a fasta file with thousands of contigs and I need to modify their headers with the information obtained from a second file.
File 1 contains the fasta sequences:
>contig0001 length=11115 numreads=10777
agatgtagatctct... (6 Replies)
Hi,
I have a fasta file with multiple sequences. How can i get only unique sequences from the file.
For example
my_file.fasta
>seq1
TCTCAAAGAAAGCTGTGCTGCATACTGTACAAAACTTTGTCTGGAGAGATGGAGAATCTCATTGACTTTACAGGTGTGGACGGTCTTCAGAGATGGCTCAAGCTAACATTCCCTGACACACCTATAGGGAAAGAGCTAAC
>seq2... (3 Replies)
I could calculate the length of entire fasta sequences by following command,
awk '/^>/{if (l!="") print l; print; l=0; next}{l+=length($0)}END{print l}' unique.fasta
But, I need to calculate the length of a particular fasta sequence specified/listed in another txt file. The results to to be... (14 Replies)
I've been struggling with this one for quite a while and cannot seem to find a solution for this find/replace scenario. Perhaps I'm getting rusty.
I have a file that contains a number of metrics (exactly 3 fields per line) from a few appliances that are collected in parallel. To identify the... (3 Replies)
Hi,
I have to add 7 bases of specific nucleotide at the beginning and ending of all the fasta sequences of a file. For example, I have a multi fasta file namely test.fasta as given below
test.fasta
>TalAA18_Xoo_CIAT_NZ_CP033194.1:_2936369-2939570:+1... (1 Reply)
Discussion started by: dineshkumarsrk
1 Replies
LEARN ABOUT DEBIAN
muscle
MUSCLE(1) Muscle Manual MUSCLE(1)NAME
muscle - Multiple Protein Sequence Alignment
SYNOPSIS
muscle -in input file (fasta) [-out output file (default fasta)] [-diags] [-log log file] [-maxiters n] [-maxhours n] [-maxmb m] [-html]
[-msf] [-clw] [-clwstrict] [-log[a] logfile] [-quiet] [-stable] [-group] [-version]
DESCRIPTION
This manual page documents briefly the muscle command.
muscle aligns protein sequences and is considered superior and faster than Clustal W.
OPTIONS -in input file
Path to FASTA formatted input file
-out output file
Path to output file, FASTA formatted by default
-diags
Find diagonals (faster for similar sequences)
-maxiters n
Maximum number of iterations (integer, default 16)
-maxhours n
Maximum time to iterate in hours (default no limit)
-maxmb m
Maximum memory to allocate in Mb (default 80% of RAM)
-html
Write output in HTML format (default FASTA)
-msf
Write output in MSF format (default FASTA)
-clw
Write output in Clustal W format (default FASTA)
-clwstrict
As -clw, with 'CLUSTAL W (1.81)' header
-log[a] logfile
Log to file (append if -loga, overwrite if -log)
-quiet
Do not write progress messages to stderr
-stable
Output sequences in input order (default is -group)
-group
Group sequences by similarity (this is the default)
-version
Display version information and exit
SEE ALSO clustalw(1), seaview(1), t_coffee(1).
AUTHORS
Robert Elgar
Wrote Muscle.
Steffen Moeller <moeller@debian.org>
Wrote this manpage.
Charles Plessy <charles-debian-nospam@plessy.org>
Updated this manpage.
COPYRIGHT
Copyright (C) 2003, 2004 Steffen Moeller (manpage)
Copyright (C) 2007, 2008 Charles Plessy (manpage)
Muscle is in the public domain, and therefore not subjected to copyright.
This manual page was written by Steffen Moeller moeller@debian.org for the Debian(TM) system (but may be used by others). Permission is
granted to copy, distribute and/or modify this document as if it were in public domain.
muscle 3.7 02/06/2008 MUSCLE(1)