I have a big file with a repeating pattern (shown below) from which I need to parse out some numbers into a tabular format.
The format of the file is as follows, where '-' indicates text that doesn't need to be parsed:
The numbers I need are, for example, the 1003 from "Output of huzzle for sequence file 1003.Clade1.html" and the 0.0109 from "SCORE 7.951 0.0109 -0.32 0.0026", and likewise for every occurrence of those patterns.
The result should be a tab-delimited file like this:
I tried the following awk code, but it does not give the correct output:
Please let me know what the error might be.
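The sample file and the original awk attempt are not shown here, so the following is only a minimal sketch under stated assumptions: each record has a line containing "Output of huzzle for sequence file" whose last field is the file name, followed later by a line whose first field is SCORE, and the wanted value is the third field of that line. The function name parse_huzzle is made up for illustration.

```shell
# Hypothetical helper: reads the report on stdin, writes "ID<TAB>value" lines.
# Assumes the layout described above; adjust the patterns to the real file.
parse_huzzle() {
    awk '
        # Header line: the file name is the last field, e.g. 1003.Clade1.html
        /Output of huzzle for sequence file/ {
            id = $NF
            sub(/\..*/, "", id)      # keep only the part before the first dot
        }
        # SCORE line: take the third whitespace-separated field
        $1 == "SCORE" && id != "" {
            print id "\t" $3
            id = ""                  # wait for the next header line
        }
    '
}

# Usage with the two lines quoted in the question; "-" stands for text to skip.
printf '%s\n' \
    '-' \
    'Output of huzzle for sequence file 1003.Clade1.html' \
    '-' \
    'SCORE 7.951 0.0109 -0.32 0.0026' \
    | parse_huzzle
```

Every line that matches neither pattern is ignored, so the '-' text needs no special handling. If the SCORE line is indented or prefixed in the real file, the `$1 == "SCORE"` test would need to be loosened.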
I am trying to parse hundreds of shell scripts to determine how they relate to each other. Ideally, for every script, I would get an output of:
What other scripts it calls
What files it reads
Environment variables it accesses
Any ideas on how to do this?
TIA! (2 Replies)
Sorry for the duplicate thread; this one is similar to the one at
https://www.unix.com/shell-programming-scripting/88132-awk-sed-script-read-values-parameter-files.html#post302255121
Since there were no more responses on the parent thread after it was partially resolved, I thought to open the new... (4 Replies)
Hi,
I need an awk command that would parse the below expression
Input Format
1 'Stmt1 ............................'2 'Stmt2 ............................'3 'Stmt3 ............................'4 'Stmt4 ............................'5 'Stmt5 ............................'6 'Stmt6... (1 Reply)
Hi guys,
I want to parse a file using a public function. The file contains raw data in the format below, and I want to get output like this to load into an Oracle DB:
MARWA1,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00
MARWA2,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00
This is the raw file format:
Number of... (6 Replies)
Hi,
I am trying to perform a simple subtraction between two floating-point numbers and cannot get it done for some reason due to the use of the sub command.
The following is the straightforward result of the subtraction:
$ echo | gawk '{a=968;b=967.99;c=a-b;print c}' ... (2 Replies)
Hi,
I am trying to evaluate a numerical string after substitution.
++++++++++++++++==
What I have = "7.04+2.3Xlog(0.72e-6X1.0e6)X1.9596"
What I need = evaluate 7.04+2.3*log(0.72e-6*1.0e6)*1.9596 = 5.55941
What I am doing:
echo "7.04+2.3Xlog(0.72e-6X1.0e6)X1.9596" | awk... (2 Replies)
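The awk command in that excerpt is cut off, so the following is only a sketch of one way to do it, not the poster's method. awk has no eval, so the idea is to replace each X with * in the shell and then splice the resulting text into a BEGIN block. The function name eval_expr is made up for illustration, and the input must be trusted because it becomes part of the awk program.

```shell
# Hypothetical helper: turn the "X" placeholders into "*" and let awk do the math.
eval_expr() {
    expr_text=$(printf '%s' "$1" | tr 'X' '*')
    # The expression text is spliced into the awk program: trusted input only.
    awk "BEGIN { print $expr_text }"
}

eval_expr "7.04+2.3Xlog(0.72e-6X1.0e6)X1.9596"   # prints 5.55941
```

awk's log() is the natural logarithm, which is what gives the 5.55941 quoted in the post. print uses the default OFMT of %.6g, so a printf with an explicit format would be needed for more digits.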
AWK Command parse a file based on string.
I am trying to write a shell script to parse a file based on a string and move the content of the file to another file.
Here is the scenario.
File content below
Mime-Version: 1.0
Content-Type: multipart/mixed;
... (2 Replies)
I have a script that I am writing to parse the output of information from a command.
Here is one of the lines of the output exported into the variable OUTPUT:
export OUTPUT="name_of_system,0,5,9,55,ip_address,another_value,/PATH/OF/A/VALUE/I/NEED"
I can get the output I need... (5 Replies)
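The rest of that post is cut off, so this is only a sketch under the assumption that the wanted value is the last comma-separated field of OUTPUT. It shows two common ways to pull it out: awk with a comma separator, and plain parameter expansion. The variable names last_with_awk and last_with_shell are made up for illustration.

```shell
OUTPUT="name_of_system,0,5,9,55,ip_address,another_value,/PATH/OF/A/VALUE/I/NEED"

# Option 1: awk with a comma field separator; $NF is the last field.
last_with_awk=$(printf '%s\n' "$OUTPUT" | awk -F, '{ print $NF }')

# Option 2: POSIX parameter expansion; strips everything up to the last comma.
last_with_shell=${OUTPUT##*,}

printf '%s\n' "$last_with_awk" "$last_with_shell"
```

The parameter expansion avoids starting an extra process, which can matter inside a loop. The awk form is easier to adapt if a different field number turns out to be needed, for example $6 for ip_address.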
I am trying to add a line to a Bash shell script to print out a large, variable-length table on a web page. I am obviously very new to this, but I tried this with awk and it prints out every line twice. What am I doing wrong?
echo "1^2^3%4^5^6%7^8^9%" | awk 'BEGIN { RS="%"; FS="^"; } {for (i =... (6 Replies)
Hi All,
I am writing a script in which I need to gather 2 numbers for 'total' and 'successful'. The goal is to compare the two numbers and if they are not equal, rerun the task until all are successful. I'm thinking the best way will be with awk or sed, but I really don't know where to begin... (8 Replies)
Discussion started by: hburnswell
fasta_formatter
FASTA_FORMATTER(1) User Commands FASTA_FORMATTER(1)
NAME
fasta_formatter - changes the width of sequences line in a FASTA file
DESCRIPTION
usage: fasta_formatter [-h] [-i INFILE] [-o OUTFILE] [-w N] [-t] [-e]
Part of FASTX Toolkit 0.0.13.2 by gordon@cshl.edu
[-h] = This helpful help screen.
[-i INFILE] = FASTA/Q input file. default is STDIN.
[-o OUTFILE] = FASTA/Q output file. default is STDOUT.
[-w N] = max. sequence line width for output FASTA file.
    When ZERO (the default), sequence lines will NOT be wrapped - all nucleotides of each sequences will appear on a single line (good for scripting).
[-t] = Output tabulated format (instead of FASTA format).
    Sequence-Identifiers will be on first column, Nucleotides will appear on second column (as single line).
[-e] = Output empty sequences (default is to discard them).
    Empty sequences are ones who have only a sequence identifier, but not actual nucleotides.
Input Example:
>MY-ID
AAAAAGGGGG
CCCCCTTTTT
AGCTN
Output example with unlimited line width [-w 0]:
>MY-ID
AAAAAGGGGGCCCCCTTTTTAGCTN
Output example with max. line width=7 [-w 7]:
>MY-ID
AAAAAGG
GGGCCCC
CTTTTTA
GCTN
Output example with tabular output [-t]:
MY-ID	AAAAAGGGGGCCCCCTTTTTAGCTN
example of empty sequence: (will be discarded unless [-e] is used)
>REGULAR-SEQUENCE-1
AAAGGGTTTCCC
>EMPTY-SEQUENCE
>REGULAR-SEQUENCE-2
AAGTAGTAGTAGTAGT
GTATTTTATAT
SEE ALSO
The quality of this automatically generated manpage might be insufficient. It is suggested to visit
http://hannonlab.cshl.edu/fastx_toolkit/commandline.html
to get a better layout as well as an overview about connected FASTX tools.
fasta_formatter 0.0.13.2 May 2012 FASTA_FORMATTER(1)