A mistake in awk command I used to parse numbers Post: 302498190

Sponsored Content

Top Forums Shell Programming and Scripting A mistake in awk command I used to parse numbers Post 302498190 by Lucky Ali on Sunday 20th of February 2011 10:11:54 AM

02-20-2011

Registered User

A mistake in awk command I used to parse numbers

Hi

I have a big file with a certain pattern (shown below) from which I need to parse out some digits in tabular format.

The format of the file is: '-' indicates text which doesn't to be parsed

Code:

# Output of huzzle for sequence file 1000.Clade1.html
  - - - -- ------- -----------------------------------------
  ---------------------------------------------------------
 ------------------------------------------------------------
 ---------------------------------------------------------------
# SCORE     7.951      0.6909        -0.32       0.0026   
  ------------------------------------------------------------
 -------------------------------------------------------------
----------------------------------------------------------------
# Output of huzzle for sequence file 1001.Clade1.html
  - - - -- ------- -----------------------------------------
  ---------------------------------------------------------
 ------------------------------------------------------------
 ---------------------------------------------------------------
# SCORE     7.951      0.0909        -0.32       0.0026   
  ------------------------------------------------------------
 -------------------------------------------------------------
----------------------------------------------------------------
# Output of huzzle for sequence file 1002.Clade1.html
  - - - -- ------- -----------------------------------------
  ---------------------------------------------------------
 ------------------------------------------------------------
 ---------------------------------------------------------------
# SCORE     7.951      0.07909        -0.32       0.0026   
  ------------------------------------------------------------
 -------------------------------------------------------------
----------------------------------------------------------------
# Output of huzzle for sequence file 1003.Clade1.html
  - - - -- ------- -----------------------------------------
  ---------------------------------------------------------
 ------------------------------------------------------------
 ---------------------------------------------------------------
# SCORE     7.951      0.0109        -0.32       0.0026   
  ------------------------------------------------------------
 -------------------------------------------------------------
----------------------------------------------------------------

What numbers I need to parse out is; for example the 1003 from "Output of huzzle for sequence file 1003.Clade1.html" and 0.0109 from "SCORE 7.951 0.0109 -0.32 0.0026"..like wise from all those patterns

So that I would get the tab delimited file as:

Code:

1000 0.6909
1001 0.0909
1002 0.07909
1003 0.0109

I tried with the following awk code but I am not getting it correctly

Code:

awk '
/^Output of huzzle for sequence file/ {r1=$7;gsub(/\..*/,"",r1)}
/^SCORE/ {r2=$2;print r1,r2}' my file

Please let me know what might be the error.

LA

Lucky Ali

View Public Profile for Lucky Ali

Find all posts by Lucky Ali

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to parse large numbers of shell scripts

I am trying to parse hundreds of shell scripts to determine how they related to each other. Ideally for every script, I would get an output of: What other scripts it calls What files it reads Environment variables it accesses Any ideas on how to do this? TIA!

2. Shell Programming and Scripting

awk/sed Command : Parse parameter file / send the lines to the ksh export command

Sorry for the duplicate thread this one is similar to the one in https://www.unix.com/shell-programming-scripting/88132-awk-sed-script-read-values-parameter-files.html#post302255121 Since there were no responses on the parent thread since it got resolved partially i thought to open the new...

3. Shell Programming and Scripting

awk/sed Command: To Parse Stament between 2 numbers

Hi, I need an awk command that would parse the below expression Input Format 1 'Stmt1 ............................'2 'Stmt2 ............................'3 'Stmt3 ............................'4 'Stmt4 ............................'5 'Stmt5 ............................'6 'Stmt6...

4. Shell Programming and Scripting

Parse file using awk and work in awk output

hi guys, i want to parse a file using public function, the file contain raw data in the below format i want to get the output like this to load it to Oracle DB MARWA1,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00 MARWA2,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00 this the file raw format: Number of...

5. Shell Programming and Scripting

Problem with sub command (awk) and numbers

Hi, I am trying to perform a simple soustraction between two floating numbers and cannot get it done for some reason due to the use of the sub command. The following is the straight-forward result of the soustraction: $ echo | gawk '{a=968;b=967.99;c=a-b;print c}' ...

6. Shell Programming and Scripting

How to evaluate a string of numbers in the same command of AWK

Hi, I am trying to do evaluate one numerical string after substitution. ++++++++++++++++== What I have = "7.04+2.3Xlog(0.72e-6X1.0e6)X1.9596" What I need = evaluate 7.04+2.3*log(0.72e-6*1.0e6)*1.9596 = 5.55941 what I am doing; echo "7.04+2.3Xlog(0.72e-6X1.0e6)X1.9596" | awk...

7. Shell Programming and Scripting

AWK Command parse a file based on string.

AWK Command parse a file based on string. I am trying to write a shell script to parse a file based on a string and move the content of the file to another file. Here is scenario. File content below Mime-Version: 1.0 Content-Type: multipart/mixed; ...

8. Shell Programming and Scripting

Parse ouput within an AWK Command

I have a script that I am writing to parse the output of information from a command. Here is one of the lines of the output exported into the variable OUTPUT: export OUTPUT=”name_of_system,0,5,9,55,ip_address,another_value,/PATH/OF/A/VALUE/I/NEED" I can get the output I need...

9. Shell Programming and Scripting

awk syntax mistake doubles desired output

I am trying to add a line to a BASH shell script to print out a large variable length table on a web page. I am very new to this obviously, but I tried this with awk and it prints out every line twice. What I am doing wrong? echo "1^2^3%4^5^6%7^8^9%" | awk 'BEGIN { RS="%"; FS="^"; } {for (i =...

10. Shell Programming and Scripting

Parse for 2 numbers in large single line

Hi All, I am writing a script in which I need to gather 2 numbers for 'total' and 'successful'. The goal is to compare the two numbers and if they are not equal, rerun the task until all are successful. I'm thinking the best way will be with awk or sed, but I really don't know where to begin...

LEARN ABOUT DEBIAN

fasta_formatter

FASTA_FORMATTER(1)						   User Commands						FASTA_FORMATTER(1)

NAME

       fasta_formatter - changes the width of sequences line in a FASTA file

DESCRIPTION

       usage: fasta_formatter [-h] [-i INFILE] [-o OUTFILE] [-w N] [-t] [-e] Part of FASTX Toolkit 0.0.13.2 by gordon@cshl.edu

       [-h]   = This helpful help screen.

       [-i INFILE]
	      = FASTA/Q input file. default is STDIN.

	      [-o OUTFILE] = FASTA/Q output file. default is STDOUT.  [-w N]	   = max. sequence line width for output FASTA file.

	      When  ZERO  (the default), sequence lines will NOT be wrapped - all nucleotides of each sequences will appear on a single line (good
	      for scripting).

       [-t]   = Output tabulated format (instead of FASTA format).

	      Sequence-Identifiers will be on first column, Nucleotides will appear on second column (as single line).

       [-e]   = Output empty sequences (default is to discard them).

	      Empty sequences are ones who have only a sequence identifier, but not actual nucleotides.

   Input Example:
	      >MY-ID AAAAAGGGGG CCCCCTTTTT AGCTN

   Output example with unlimited line width [-w 0]:
	      >MY-ID AAAAAGGGGGCCCCCTTTTTAGCTN

   Output example with max. line width=7 [-w 7]:
	      >MY-ID AAAAAGG GGGTTTT TCCCCCA GCTN

   Output example with tabular output [-t]:
       MY-ID  AAAAAGGGGGCCCCCTTTTAGCTN

       example of empty sequence: (will be discarded unless [-e] is used)

	      >REGULAR-SEQUENCE-1 AAAGGGTTTCCC >EMPTY-SEQUENCE >REGULAR-SEQUENCE-2 AAGTAGTAGTAGTAGT GTATTTTATAT

SEE ALSO

       The quality of this automatically generated manpage might be insufficient.  It is suggested to visit

	      http://hannonlab.cshl.edu/fastx_toolkit/commandline.html

       to get a better layout as well as an overview about connected FASTX tools.

fasta_formatter 0.0.13.2					     May 2012							FASTA_FORMATTER(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to parse large numbers of shell scripts

Discussion started by: bliss

2. Shell Programming and Scripting

awk/sed Command : Parse parameter file / send the lines to the ksh export command

Discussion started by: rajan_san

3. Shell Programming and Scripting

awk/sed Command: To Parse Stament between 2 numbers

Discussion started by: rajan_san

4. Shell Programming and Scripting

Parse file using awk and work in awk output

Discussion started by: dagigg

5. Shell Programming and Scripting

Problem with sub command (awk) and numbers

Discussion started by: Indalecio

6. Shell Programming and Scripting

How to evaluate a string of numbers in the same command of AWK

Discussion started by: vivek_shm74

7. Shell Programming and Scripting

AWK Command parse a file based on string.

Discussion started by: aakishore

8. Shell Programming and Scripting

Parse ouput within an AWK Command

Discussion started by: jake0391S

9. Shell Programming and Scripting

awk syntax mistake doubles desired output

Discussion started by: awknewb123

10. Shell Programming and Scripting

Parse for 2 numbers in large single line

Discussion started by: hburnswell

LEARN ABOUT DEBIAN

fasta_formatter