Full Discussion: Recalculating frequencies
Top Forums Shell Programming and Scripting Recalculating frequencies Post 302433450 by Xterra on Tuesday 29th of June 2010 05:29:53 PM
06-29-2010
It is not working on my end

I tried one more time and it did not combine the last 2. The order is random, but I can still see those 2 sequences. Instead of ending up with 5 different sequences, my file contains 6. I have modified the test data and it is definitely not working. I added 1 more sequence (freq 10), identical to the first 2, at the very end of the file, and it was not combined with the other 2.

Last edited by Xterra; 06-29-2010 at 06:34 PM..
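For reference, here is a minimal Python sketch of the merge being asked for: records with identical sequences are collapsed into one and their Freq values are summed. It assumes the two-line record layout quoted elsewhere in this thread (a `>ID Freq N` header followed by one sequence line); the function name `merge_identical` and the sample IDs are made up for illustration. It also strips trailing whitespace and carriage returns before comparing, since invisible characters are one common reason why visually identical lines fail to match. That is only a guess at the cause here, not a diagnosis.

```python
def merge_identical(lines):
    """Merge records whose sequences are identical, summing their Freq values.

    Assumes each record is a '>ID Freq N' header followed by one sequence line.
    Returns (id, summed_freq, sequence) tuples in first-seen order.
    """
    totals = {}    # sequence -> [id of first record seen, summed frequency]
    header = None
    for raw in lines:
        # strip() removes trailing spaces and '\r' left by Windows line endings
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line
        elif header is not None:
            parts = header[1:].split()          # e.g. ['A', 'Freq', '10']
            name, freq = parts[0], int(parts[2])
            if line in totals:
                totals[line][1] += freq
            else:
                totals[line] = [name, freq]
            header = None
    return [(name, freq, seq) for seq, (name, freq) in totals.items()]


# Hypothetical test data: the last record repeats the first sequence
# but carries trailing spaces, and the second has a Windows line ending.
sample = [
    ">A Freq 10\n", "ACGT-A\n",
    ">B Freq 1\n",  "ACGT-A\r\n",
    ">C Freq 1\n",  "TTGA-A\n",
    ">D Freq 10\n", "ACGT-A  \n",
]
for name, freq, seq in merge_identical(sample):
    print(f">{name} Freq {freq}\n{seq}")
```

If the real file still shows duplicates after a merge like this, comparing the raw lines with something like `repr()` should reveal any hidden characters that differ.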
 

4 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Appending lines with word frequencies, ordering and indexing a column

Dear All, I have the following input data: w1 20 g1 w1 10 g1 w2 12 g1 w2 23 g1 w3 10 g1 w3 17 g1 w3 12.5 g1 w3 21 g1 w4 11 g1 w4 13.2 g1 w4 23 g1 w4 18 g1 First I seek to find the word frequencies in col1 and sort col2 in ascending order for each change in a col1 word. Second,... (5 Replies)
Discussion started by: Ghetz
5 Replies

2. Shell Programming and Scripting

Merging Frequencies in a File

hello, I have a file which has the following structure: word <TAB> frequency The same word can have multiple frequencies: John <TAB> 60 John <TAB> 20 John <TAB> 30 Mary <TAB> 1000 Mary <TAB> 800 Mary <TAB> 20 What I need is a script which could merge all these frequencies into one single... (10 Replies)
Discussion started by: gimley
10 Replies

3. Shell Programming and Scripting

Removal of extra spaces in *.log files to allow extraction of frequencies

Our university has upgraded its version of a computational chemistry program that our group uses quite regularly. In the past we have been able to extract frequency spectra from log files that are generated. Since the upgrade, the viewing program errors out. I've been able to trace down the changes... (16 Replies)
Discussion started by: wsuchem
16 Replies

4. UNIX for Dummies Questions & Answers

Gaps and frequencies

I have this infile: >GHL8OVD01BNNCA Freq 10 TAGATGTGCCCGTGGGTTTCCCGTCAACACCGGATAGT-GCAGCA-TA >GHL8OVD01CMQVT Freq 1 TTGATGTCGTGGGTTTCCCGTCAACACCGGCAAATAGT-GCAGCA-TA >GHL8OVD01CMQVT Freq 1 TTGATGTGCCAGTTTCCCGTCTAGCAGCACTACCAGGACCTTCGC-TA >GHL8OVD01CMQVW Freq 1... (1 Reply)
Discussion started by: Xterra
1 Reply
RNAEFFECTIVE(1)               General Commands Manual               RNAEFFECTIVE(1)

NAME
       RNAeffective - calculation of effective numbers of orthologous miRNA targets

SYNOPSIS
       RNAeffective [-h] [-d frequency_file] [-f from,to] [-k sample_size] [-l mean,std] [-m max_target_length] [-n max_query_length] [-u iloop_upper_limit] [-v bloop_upper_limit] [-s] [-t target_file] [-q query_file] [query]

DESCRIPTION
       RNAeffective is a tool for determining the effective number of orthologous miRNA targets. This number can be used for the calculation of more accurate joint p-values in multi-species analyses. RNAeffective searches a set of target sequences with random miRNAs that can be given on the command line or otherwise generates random sequences according to given sample size, length distribution parameters and dinucleotide frequencies. The empirical distribution of joint p-values is compared to the p-values themselves, and the effective number of independent targets is the one that reduces the deviation between the two distributions.

OPTIONS
       -h     Give a short summary of command line options.

       -d frequency_file
              Generate random sequences according to dinucleotide frequencies given in frequency_file. See example directory for example files.

       -f from,to
              Forces all structures to have a helix from position from to position to with respect to the query. The first base has position 1.

       -k sample_size
              Generate sample_size random sequences. Default value is 5000.

       -l mean,std
              Generate random sequences with a normal length distribution of mean mean and standard deviation std. Default values are 22 and 0, respectively.

       -m max_target_length
              The maximum allowed length of a target sequence. The default value is 2000. This option only has an effect if a target file is given with the -t option (see below).

       -n max_query_length
              The maximum allowed length of a query sequence. The default value is 30. This option only has an effect if a query file is given with the -q option (see below).

       -u iloop_upper_limit
              The maximally allowed number of unpaired nucleotides in either side of an internal loop.

       -v bloop_upper_limit
              The maximally allowed number of unpaired nucleotides in a bulge loop.

       -s     Generate random sequences according to the dinucleotide distribution of given queries (either with the -q option or on command line). If no -q is given, the last argument to RNAeffective is taken as a query. See -q option.

       -q query_file
              Without the -s option, each of the query sequences in query_file is subject to hybridisation with each of the targets (which are from the target_file; see -t below). The sequences in the query_file have to be in FASTA format, i.e. one line starting with a > and directly followed by a name, then one or more following lines with the sequence itself. Each individual sequence line must not have more than 1000 characters. With the -s option, the query (or query file) dinucleotide distribution is counted, and random sequences are generated according to this distribution. If no -q is given, random sequences are generated as described above (see -d option).

       -t target_file
              See -q option above.

REFERENCES
       The energy parameters are taken from: Mathews DH, Sabina J, Zuker M, Turner DH. "Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure" J Mol Biol., 288 (5), pp 911-940, 1999

VERSION
       This man page documents version 2.0 of RNAeffective.

AUTHORS
       Marc Rehmsmeier, Peter Steffen, Matthias Hoechsmann.

LIMITATIONS
       Character dependent energy values are only defined for [acgtuACGTU]. All other characters lead to values of zero in these cases.

SEE ALSO
       RNAhybrid, RNAcalibrate

                                                                     RNAEFFECTIVE(1)
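
The FASTA rules under -q and the dinucleotide counting mentioned under -s can be illustrated with a short Python sketch. This is not RNAeffective's code, and it does not reproduce the layout of its frequency_file, which the man page does not describe. The helper names `read_fasta` and `dinucleotide_frequencies` are hypothetical, and counting overlapping pairs is an assumption about what "dinucleotide distribution" means here.

```python
from collections import Counter

MAX_LINE = 1000  # the man page limits each sequence line to 1000 characters


def read_fasta(lines):
    """Yield (name, sequence) pairs from FASTA-formatted lines."""
    name, chunks = None, []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        else:
            if name is None:
                raise ValueError("sequence line before any '>' header")
            if len(line) > MAX_LINE:
                raise ValueError("sequence line longer than 1000 characters")
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def dinucleotide_frequencies(sequences):
    """Return relative frequencies of overlapping dinucleotides (an assumption)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


# Hypothetical query split over two sequence lines
records = list(read_fasta([">q1\n", "ACGU\n", "AC\n"]))
freqs = dinucleotide_frequencies(seq for _, seq in records)
```

For the sample query the joined sequence is ACGUAC, which contains five overlapping pairs, two of them AC.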