Hi,
I have a list of 70 files in a directory and I need to merge the content of each file into one big matrix file (71 columns x 3060 rows).
Each file has the following format, with only two columns per file:
Things to consider:
1) The first column, 'unique', holds the same unique keys in every file; only the second column differs, and that is what I need merged into one data matrix.
2) I would like to leave the randomtext line (the second line of each file) out of the final file.
3) Note that not all files are sorted like the example I posted above.
The final merged file should be:
So I need 71 columns in the final file: the first column is the 'unique' field and the other 70 columns hold the associated values from the 70 files in the common directory.
I think you've been around here long enough to have had a go at this yourself, labrazil. :-)
Try using awk to build an array indexed by the first field. As you process each file you can just append the new values to the existing values stored in the array. Then you can print out the entire array in your END { } clause. I'm presuming that the same set of unique indices appears in all of the files; otherwise it would become more complicated, because you'd have to cater for the ones that were missing.
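For what it's worth, the idea above might look roughly like the sketch below. The function name merge_columns is made up for illustration, and it assumes whitespace-separated fields and that every key appears in every file. Because "for (k in array)" visits keys in no guaranteed order, the sketch also records the order in which keys first appear.

```shell
# Sketch only: merge_columns is a hypothetical helper name.
# Pass the two-column input files as arguments.
merge_columns() {
    awk '
        # Ignore blank lines.
        NF == 0 { next }
        # First time a key is seen: remember its position and start its row.
        !($1 in row) { order[++n] = $1; row[$1] = $1 }
        # Append the value from the current file to the row for this key.
        { row[$1] = row[$1] OFS $2 }
        # After the last file, print one merged line per key.
        END { for (i = 1; i <= n; i++) print row[order[i]] }
    ' "$@"
}
```

Since the rows are looked up by key rather than by line position, the input files do not need to be sorted the same way.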
Thanks Anni,
I came across this code
But I have two problems with it.
First, how can I run it over all 70 files at once instead of one by one? Second, how can I remove the second line from each file as I merge them?
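One possible way to handle both points in a single awk run is sketched here. It is not the code referred to above; merge_dir and the *.txt pattern are assumptions, so adjust the glob to the real file names. awk accepts any number of files on its command line, and FNR restarts at 1 for each of them, so "FNR == 2" matches the second line of every file. The sketch also assumes line 1 of each file is a header with the same first field everywhere, so it merges like any other row.

```shell
# Sketch only: merge_dir and the *.txt glob are hypothetical.
# Usage: merge_dir /path/to/directory > merged.txt
merge_dir() {
    awk '
        # FNR is the line number within the current file: drop line 2 of each file.
        FNR == 2 { next }
        # Ignore blank lines.
        NF == 0 { next }
        # Keys are collected in order of first appearance.
        !($1 in row) { order[++n] = $1; row[$1] = $1 }
        # Append the second column of the current file to the row for this key.
        { row[$1] = row[$1] OFS $2 }
        END { for (i = 1; i <= n; i++) print row[order[i]] }
    ' "$1"/*.txt
}
```

The shell expands the glob in sorted name order, so the value columns come out in that order (file10.txt would sort before file2.txt, for example).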