Re-ordering data


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Re-ordering data
# 1  
Old 03-06-2011
Re-ordering data

input
Code:
Predictions for job: 1299399580
**********************************************

gg18_qqq10_100017878_100017978_-
==============================================================================

zzz Factor: XXX, ttt: crsmsgw, Cutoff: 0.6429
seqe Position	fff Coordinate	K-mer     	Score   
26               	qqq10:100017953   	cgcccgg   	0.753   
65               	qqq10:100017914   	cgcccgc   	0.736   
76               	qqq10:100017903   	ccccgga   	0.809   

Calculation parameters:
-----------------------------
Scoring function: COS(WR)
Significant p-value: 0.005
Suboptimal p-value: 0.05
Window size: 50

*****************************************************************************************

gg18_qqq10_100017878_100017978_-
==============================================================================

zzz Factor: YYY, ttt: crsmsgw, Cutoff: 0.6429
seqe Position	fff Coordinate	K-mer     	Score   
26               	qqq10:100017953   	cgcccgg   	0.753   
65               	qqq10:100017914   	cgcccgc   	0.736   
76               	qqq10:100017903   	ccccgga   	0.809   

zzz Factor: ZZZ, ttt: ugcug, Cutoff: 0.6000
seqe Position	fff Coordinate	K-mer     	Score   
49               	qqq10:100017930   	cgcug     	0.671   
52               	qqq10:100017927   	ugcuc     	0.671   
  
Calculation parameters:
-----------------------------
Scoring function: COS(WR)
Significant p-value: 0.005
Suboptimal p-value: 0.05
Window size: 50

*****************************************************************************************

output
Code:
qqq	start	end	strand	zzzFactor	ttt	cut-off	seq_poistion	kmer_location	K-mer	Score			
qqq10	100017878	100017978	-	XXX	crsmgw	0.6429	26	qqq10:100017953	cgccga	0.753
qqq10	100017878	100017978	-	YYY	crsmgw	0.6429	26	qqq10:100017953	cgccga	0.753
.........................................

# 2  
Old 03-07-2011
How about this:

Code:
awk '
BEGIN { OFS="\t" ; print "qqq", "start", "end", "strand", "zzFactor", "ttt", "cut-off", "seq_position", "kmer_location", "k-mer", "Score" }
/^gg18/ {split($0,f,"_"); s=f[3];strand=f[4];qqq=f[2]; zzz=f[5]; }
/^zzz Factor: /&&s {gsub(",","");zzz=$3;ttt=$5;cutoff=$7}
n { print qqq,s,"",strand,zzz,ttt,cutoff,$1,$2,$3,$4; s=n=0 }
/^seqe/&&s { n=1 }' infile

# 3  
Old 03-07-2011
Thanx Chubler. Only problem is that strand column is missing.
Code:
qqq	start	end	strand	zzFactor	ttt	cut-off	seq_position	kmer_location	k-mer	Score
qqq10	100017878		100017978	XXX	crsmsgw	0.6429	26	qqq10:100017953	cgcccgg	0.753
qqq10	100017878		100017978	YYY	crsmsgw	0.6429	26	qqq10:100017953	cgcccgg	0.753

# 4  
Old 03-07-2011
This should fix it:

Code:
awk '
BEGIN { OFS="\t" ; print "qqq", "start", "end", "strand", "zzFactor", "ttt", "cut-off", "seq_position", "kmer_location", "k-mer", "Score" }
/^gg18/ {split($0,f,"_"); s=f[3];e=f[4];qqq=f[2]; strand=f[5]; }
/^zzz Factor: /&&s {gsub(",","");zzz=$3;ttt=$5;cutoff=$7}
n { print qqq,s,e,strand,zzz,ttt,cutoff,$1,$2,$3,$4; s=n=0 }
/^seqe/&&s { n=1 }
' infile

Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Bash - re-ordering list of parameters

Hello. I have a script that writes parameters in alphabetic order. But I have a parameter which have 3 lines. There is no continuation character ( '\' ). Each of the three lines finish with 'cr'. But line 2 and 3 of the concerning parameter start with a tab char (but should be one or more... (7 Replies)
Discussion started by: jcdole
7 Replies

2. Shell Programming and Scripting

Ordering batch number

Hi, Could some one please help to order the batch number in sequence. I will be getting bunch of files with batch number in folder1 which are not in sequence. I need to move all files from folder1 to folder2 with batch number in sequence. Header record looks like PROCESS1... (8 Replies)
Discussion started by: zooby
8 Replies

3. Shell Programming and Scripting

Ordering Folders having Date as Names

Hi All, I have directories under /development/arun/weekly/ 20120421 20120414 . . . . I need to arrange these directories in descending order. folder name with recent date will be on top and then others. (1 Reply)
Discussion started by: Arun Mishra
1 Replies

4. Shell Programming and Scripting

Random ordering

1 2 4 5 3 I would like to use a script so that i can randomly rearrange these numbers such as 3 5 2 4 1 Thanks! (3 Replies)
Discussion started by: johnkim0806
3 Replies

5. Shell Programming and Scripting

ordering

file1 1 SNP2 3 1 SNP3 3 1 SNP5 4 2 SNP1 4 2 SNP4 4 file2 SNP1 1 1 1 SNP5 5 5 5 SNP4 4 4 4 SNP2 2 2 2 SNP3 1 1 1 desired output (1 Reply)
Discussion started by: johnkim0806
1 Replies

6. Shell Programming and Scripting

Re ordering lines - Awk

Is it possible to re-order certain rows as columns (of large files). Few lines from the file for reference. input Splicing Factor: Tra2beta, Motif: aaguguu, Cutoff: 0.5000 Sequence Position Genomic Coordinate K-mer Score 97 chr1:67052604 uacuguu 0.571 147... (3 Replies)
Discussion started by: quincyjones
3 Replies

7. Shell Programming and Scripting

ordering a data file

With an input file like this: How can I get an output like this? (In the quoted examples, the "_" sign represents an empty space) Note that there are some minus signs and no spaces, in the example above the first character of the first line is an empty space, so each number spans 10... (16 Replies)
Discussion started by: lego
16 Replies

8. Homework & Coursework Questions

word ordering problem HELP please (linux)

Hi guys I need you ,please help me i have to do this for tomorow and i don't understand how to do Q1 : Order the words of RADIO.txt by frequency Q2 : Order the words of RADIO.txt in alphabétique order Q3 : Order the words of RADIO.txt par ordre "rhymique" (exemple, put togeder words which are... (1 Reply)
Discussion started by: Lili
1 Replies

9. Shell Programming and Scripting

File ordering by portion of filename

Hi, Lets say I have a few xml files: 1234567894.xml abc_1234567895.xml abc_000_1234567890.xml abc_0000000_1234567893.xml 684_abc_000_1234567899.xml The naming convention of the files is: xxxxx_timestamp.xml OR timestamp.xml where x can be anything and I would like to order them by... (4 Replies)
Discussion started by: Leion
4 Replies

10. Shell Programming and Scripting

proper ordering of o/p values

Hi, Below is my script which creates a file: #!/bin/sh if then echo "Enter bill period " echo "Syntax: sh cpd.sh G08" exit fi sqlplus uname/pwd@dbname <<EOF set WRAP off set FEEDBACK off set PAGESIZE 0 set VERIFY off (14 Replies)
Discussion started by: ss_ss
14 Replies
Login or Register to Ask a Question