Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Delete specific strings in a file Post 302931142 by senhia83 on Sunday 11th of January 2015 01:25:29 AM
Old 01-11-2015
This is a protein sequence fasta file, pretty sure the sequences are in one line unless word wrapped
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Delete strings in a file

Hi, I have a file named status.txt that looks like the file below. What I want to do is to delete the part <status> and </status> and just leave the number and print each number per line. How can I do it? If I will use sed or awk how can I do it? I tried with sed but it didn't work. Maybe I... (8 Replies)
Discussion started by: ayhanne
8 Replies

2. Shell Programming and Scripting

delete strings till specific string

Hello i want to know a way so i can delete all the strings in file from the begning till a specific string (1 Reply)
Discussion started by: modcan
1 Replies

3. Shell Programming and Scripting

recursively delete the text between 2 strings from a file

i have 200000bytes size of a unix file i need to delete some text between two strings recursively using a loop with sed or awk . these two strings are : 1st string getting from a file :::2 nd string is fi...its constant . can anyone help me sed -n'/<1 st string >/,/fi/' <input_filename> is the... (2 Replies)
Discussion started by: santosh1234
2 Replies

4. Shell Programming and Scripting

Delete Strings that are present in another file

HI, if a String is present in file1.txt, i want to delete that String from file2.txt. How can i do this?? I am sure that the file1.txt is a subset of file2.txt. (2 Replies)
Discussion started by: jathin12
2 Replies

5. UNIX for Dummies Questions & Answers

Delete strings in file1 based on the list of strings in file2

Hello guys, should be a very easy questn for you: I need to delete strings in file1 based on the list of strings in file2. like file2: word1_word2_ word3_word5_ word3_word4_ word6_word7_ file1: word1_word2_otherwords..,word3_word5_others... (7 Replies)
Discussion started by: roussine
7 Replies

6. Shell Programming and Scripting

output strings to specific positions in a file

Been searching for about 3 hours for similar functionality that I can get examples of how to output text from variables into certain locations in a file. I would like to incorporate this into a script. I have not been able to find a command example that does it all in one method. I find part of... (1 Reply)
Discussion started by: bennu_500
1 Replies

7. Shell Programming and Scripting

Delete lines in file containing duplicate strings, keeping longer strings

The question is not as simple as the title... I have a file, it looks like this <string name="string1">RZ-LED</string> <string name="string2">2.0</string> <string name="string2">Version 2.0</string> <string name="string3">BP</string> I would like to check for duplicate entries of... (11 Replies)
Discussion started by: raidzero
11 Replies

8. UNIX for Dummies Questions & Answers

Add strings from one file at the end of specific lines in text file

Hello All, this is my first post so I don't know if I am doing this right. I would like to append entries from a series of strings (contained in a text file) consecutively at the end of specifically labeled lines in another file. As an example: - the file that contains the values to be... (3 Replies)
Discussion started by: gus74
3 Replies

9. Shell Programming and Scripting

How to delete strings in a file?

hi, i have a big file like this: >s31 length=12 numreads=6 gene=isotig454 status=igo ldfddfdfdfdkkkkkkfdfdkkkksdfdkkkkkkkkkksdfd dfdfdfldfdkdffdlfddflfdjkkkkkkfdgkkgfhghfgkkk ldfddfdfdfdkkkkkkfdfdkkkksdfdkkkkkkkkkksdfd dfdfdfldfdkdffdlfddflfdjkkkkkkfdgkkgfhghfgkkk >c2 length =344... (4 Replies)
Discussion started by: the_simpsons
4 Replies

10. Shell Programming and Scripting

Count specific character of a file in each line and delete this character in a specific position

I will appreciate if you help me here in this script in Solaris Enviroment. Scenario: i have 2 files : 1) /tmp/TRANSACTIONS_DAILY_20180730.txt: 201807300000000004 201807300000000005 201807300000000006 201807300000000007 201807300000000008 2)... (10 Replies)
Discussion started by: teokon90
10 Replies
HMMER(1)							   HMMER Manual 							  HMMER(1)

NAME
HMMER - profile HMMs for biological sequence analysis SYNOPSIS
hmmalign Align sequences to a profile hmmbuild Construct profile(s) from multiple sequence alignment(s) hmmconvert Convert profile file to various HMMER and non-HMMER formats hmmemit Sample sequences from a profile hmmfetch Retrieve profile HMM(s) from a file hmmpress Prepare an HMM database for hmmscan hmmscan Search sequence(s) against a profile database hmmsearch Search profile(s) against a sequence database hmmsim Collect profile score distributions on random sequences hmmstat Summary statistics for a profile file jackhmmer Iterative search of a protein sequence against a protein sequence database phmmer Search a protein sequence against a protein sequence database DESCRIPTION
HMMER is a suite of several programs for biological sequence alignment and database homology search. It uses probabilistic models called "profile hidden Markov models" (profile HMMs) to represent the likely evolutionary homologs of a single sequence or a multiple alignment of a sequence family. A main avenue of research is to improve the evolutionary predictive models in HMMER to be able to recognize and accu- rately align increasingly remote homologs, distant in time. HMMER is also used as an organizational tool, to group the exponentially growing number of biological sequences into a vastly smaller set of well-annotated sequence families. New sequences can be annotated by comparison against curated sequence family databases of prebuilt HMMER profiles, in addition or instead of comparison to the entire sequence database. Databases such as Pfam, SMART, and TIGRfams, among others, are based on this principle. HMMER is used in three main modes: to search a sequence database for new homologs of a sequence or a sequence family; to search a profile database (like Pfam) to find what known family a query sequence belongs to, or what domains it has; and to automatically construct large multiple alignments (i.e. with an effectively unlimited number of sequences) using a profile representative of a sequence family. Suppose you have a multiple sequence alignment of a sequence family of interest, and you want to search a sequence database for additional homologs. The hmmbuild program builds profile(s) from multiple alignment(s). The hmmsearch program searches profile(s) against a sequence database. Suppose you have a single sequence of interest, and you want to search a sequence database for additional homologs. The phmmer program searches a single protein sequence against a protein sequence database. The jackhmmer program does the same thing but iteratively -- homologs detected in a previous round are incorporated into a new profile, and the new profile is searched again. phmmer is used like BLASTP, and jackhmmer is used like a protein PSI-BLAST. Currently these two programs support only protein/protein comparison, but this will change with time. Suppose you have sequence(s) that you want to analyze using a HMMER-based profile HMM database like Pfam (http://pfam.sanger.ac.uk). The hmmpress program formats a profile HMM flatfile (such as the file you would download from Pfam) into a HMMER binary database. The hmmscan program searches sequence(s) against that database. Suppose you want to align lots of sequences. You can construct a manageably small alignment of a representative set of sequences, build a profile with hmmbuild, and use the hmmalign program to align any number of sequences to that profile. HMMER also includes some auxiliary tools for working with large profile databases. hmmfetch fetches one or more profiles from a database. hmmstat prints summary statistics about a profile file. For compatibility with other profile software and previous versions of HMMER, the hmmconvert program converts profiles to a few other for- mats. We intend to add more support for other formats over time. The hmmemit program generates (simulates) "homologous" sequences by sampling from a profile. It can also generate a "consensus" sequence. The hmmsim program is a simulator used for collecting statistics about score distributions on random sequences. Each program has its own man page. SEE ALSO
This is a summary man page for the entire HMMER3 package. See individual man pages [hmmbuild(1), for example] for usage, options, and description of each program in the package. For complete documentation, see the user guide that came with your HMMER distribution (Userguide.pdf); or see the HMMER web page (@HMMER_URL@). COPYRIGHT
@HMMER_COPYRIGHT@ @HMMER_LICENSE@ For additional information on copyright and licensing, see the file called COPYRIGHT in your HMMER source distribution, or see the HMMER web page (@HMMER_URL@). AUTHOR
Eddy/Rivas Laboratory Janelia Farm Research Campus 19700 Helix Drive Ashburn VA 20147 USA http://eddylab.org HMMER
@HMMER_VERSION@ @HMMER_DATE@ HMMER(1)
All times are GMT -4. The time now is 05:18 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy