Convert columns to rows in a file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Convert columns to rows in a file
# 1  
Old 04-19-2012
Convert columns to rows in a file

Hello,

I have a huge tab delimited file with around 40,000 columns and 900 rows I want to convert columns to a row.

INPUT file look like this.
the first line is a headed of a file.

Code:
ID    marker1        marker2         marker3      marker4    
b1    A         G        A         C         A          T      G         G     
b2    A         A        C         C         G          G      A         A



OUTPUT

Code:
b1    marker1  A         G
b1    marker2  A         C
b1    marker3  A         T
b1    marker4  G         G
b2    marker1  A         A
b2    marker2  C         C
b2    marker3  G         G
b2    marker4  A         A

Please provide me with a solution.

Thanks in advance.

Regards

Last edited by Scrutinizer; 04-19-2012 at 08:50 AM.. Reason: code tags plus formatting
# 2  
Old 04-19-2012
like this
Code:
# awk 'NR==1{for(i=2;i<=NF;i++)a[i]=$i}
NR!=1{x=2;for(i=2;i<=NF;i+=2)print $1,a[x++],$i,$(i+1) }' file
b1 marker1 A G
b1 marker2 A C
b1 marker3 A T
b1 marker4 G G
b2 marker1 A A
b2 marker2 C C
b2 marker3 G G
b2 marker4 A A

This User Gave Thanks to ygemici For This Post:
# 3  
Old 04-19-2012
Code:
perl -e '
open DATA, "<", "$ENV{HOME}/tmp/tmp.dat";  #Or wherever your data file is
$header=readline(DATA);                    # grab the headings
@header=split(/\s+/,$header);              # and parse them into separate fields
$waste=shift(@header);                     # Drop the pointless heading
while(<DATA>){                             # step through the rest of the file
	chomp;                                   # lose new lines
	@record=split(/\s+/,$_);                 # parse lines into fields
	for ($index=0;$index<@header;$index++){  # for each header field
	                                         # print out the 2 related entries in the format specified
		print "$record[0] $header[$index] $record[($index * 2 ) + 1] $record[($index * 2 )+2]\n";
	}
}'
b1 marker1 A G
b1 marker2 A C
b1 marker3 A T
b1 marker4 G G
b2 marker1 A A
b2 marker2 C C
b2 marker3 G G
b2 marker4 A A


Last edited by Skrynesaver; 04-19-2012 at 07:28 AM.. Reason: fencepost error
This User Gave Thanks to Skrynesaver For This Post:
# 4  
Old 04-19-2012
Hi Ygemici,

Thanks for the code but please explain what does 2 means. Are they number of rows?

thanks
# 5  
Old 04-19-2012
Quote:
Originally Posted by ryan9011
Hi Ygemici,

Thanks for the code but please explain what does 2 means. Are they number of rows?

thanks
2 is the column start number for the first line (except ID where $1 )
This User Gave Thanks to ygemici For This Post:
# 6  
Old 04-19-2012
Code:
awk '{for(i=2;i<=NF;i+=2)$i=(i==2?FS"":$1)FS"marker"i/2FS$i}1' infile | xargs -n 4

This User Gave Thanks to complex.invoke For This Post:
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Convert rows to columns

hi folks, I have a sample data like what is shown below: 1,ID=1000 1,Org=CedarparkHospital 1,cn=john 1,sn=doe 1,uid=User001 2,uid=User002 2,ID=2000 2,cn=steve 2,sn=jobs 2,Org=Providence I would like to convert it into the below format: 1,1000,CedarparkHospital,john,doe,User001... (11 Replies)
Discussion started by: vskr72
11 Replies

2. Shell Programming and Scripting

Convert rows to columns

I am looking to print the data in columns and after every 3 words it should be a new row. cat example.out | awk 'END { for (i = 0; ++i < m;) print _;print _ }{ _ = _ x ? _ OFS $1 : $1}' m=1| grep -i INNER I am looking to print in a new line after every 3 words. ... (2 Replies)
Discussion started by: lazydev
2 Replies

3. Shell Programming and Scripting

Convert Rows to Columns

Hi Everyone, Could someone shed some lights on how to convert the records in rows form into column basis. 172.29.59.12 IBM,8255-E8B 102691P 8 65536 MB 6100-04-11-1140 172.29.59.15 IBM,8255-E8B 102698P 4 45056 MB 6100-04-11-1140 IP SYS MODEL ... (6 Replies)
Discussion started by: ckwan
6 Replies

4. Shell Programming and Scripting

How to Convert rows in to columns?

Hi Gurus, How to convert rows in to columns using linux shell scripting Input is like (sample.txt) ABC DEF GHI JKL MNO PQR STU VWX YZA BCD output should be (sampleoutput.csv) ABC,DEF,GHI,JKL,MNO PQR,STU,VWX,YZA,BCD (2 Replies)
Discussion started by: infasriniit
2 Replies

5. Shell Programming and Scripting

Convert few columns to rows

Hi! Does anybody help me in converting following data: INPUT looks like this: 20. 100. 30 200. 40. 400. 50. 100. 60. 200. 70. 400. 80. 200. 150. 210. 30. 100. OUTPUT should look like this: 20. 100. 30 200. 40. 400. 50. 100. 60. 200. 70.... (5 Replies)
Discussion started by: lovelinux
5 Replies

6. Shell Programming and Scripting

convert rows to columns

hi, i have the file as below: abc def ghi jkl i want the output as abc,def,ghi,jki please reply, Thanks (4 Replies)
Discussion started by: namitai
4 Replies

7. Shell Programming and Scripting

convert columns into rows

hi, Apologies if this has been covered. I have requirement where i have to convert a single column into multiple column. My data will be like this - 2 3 4 5 6 Output required - 2 3 4 5 6 (1 Reply)
Discussion started by: Nishithinfy
1 Replies

8. Shell Programming and Scripting

convert rows into columns

Hi guys Could anyone advise me how to convert my rows into columns from a file My file would be similar to this: A11 A12 A13 A14 A15 ... A1n A21 A22 A23 A31 A41 A51 ... Am1 Am2 Am3 Am4 Am5 ... Amn The number of rows is not the same to the number of columns Thanks in advance (2 Replies)
Discussion started by: loperam
2 Replies

9. Shell Programming and Scripting

how to convert columns to rows

Hi, I need a shell script for below requirement Input file P1 - 173310 P2 - 173476 P3 - 173230 P4 - 172737 P1 - 173546 P2 - 173765 P3 - 173876 P4 - 172989 Out put file P1 173310 173546 P2 173476 173765 P3 173230 173876 P4 172737 172989 Suresh (6 Replies)
Discussion started by: suresh3566
6 Replies

10. Shell Programming and Scripting

Convert Columns to Rows in a File

Hi I have a input file in the format ABC,111,2008Q2, 49K ABC,111,2008Q3, 0K ABC,111,2008Q4, 0K ABC,222,2008Q2, 49K ABC,222,2008Q3, 0K ABC,222,2008Q4, 0K XYZ,111,2008Q2, 49K XYZ,111,2008Q3, 0K XYZ,111,2008Q4, 0K XYZ,222,2008Q2, 49K XYZ,222,2008Q3, 0K XYZ,222,2008Q4, 0K The output file... (3 Replies)
Discussion started by: chrismt
3 Replies
Login or Register to Ask a Question