I would like to know how can I merge files based on their coordinates, but mantaining the score of each file in the output file like:
Note: 1st column is for chromosome, 2nd for start, 3rd for end of segment, 4th for score
file1:
file2:
file3:
Desired output file:
(Note: 1st column is for chromosome, 2nd for start, 3rd for end of segment, 4th-to-last column is for score of each file)
Hello guys,
I gotta question, i have a lot of log files (simple text) and i need to merge them in group of 10 files, one next to the other, that have sense?
For example, i have the files:
File1
File2
File3
File4
.
.
File100
I need to merge the contents of each file into a new file... (3 Replies)
Hi,
i have the files f1 and f2 like:
files f1:
c1 a1
c2 a2
c3 a3
file f2:
c1 b1
c2 b2
c3 b3
i want merge the f1 and f2 file to f3 file like:
c1 a1 b1
c2 a2 b3
c3 a3 b3........
....
.
.
please help me onthis..... (5 Replies)
hi,
i am facing a problem in merging two files using awk,
the problem is as stated below,
file1:
A|B|C|D|E|F|G|H|I|1
M|N|O|P|Q|R|S|T|U|2
AA|BB|CC|DD|EE|FF|GG|HH|II|1
....
....
....
file2 :
1|Mn|op|qr (2 Replies)
Hello,
i'd like a bash script to merge 2 files without duplicate lines.
Example :
file1 :
toto
titi
file2 :
toto
tata
Expected result, file3 :
toto (5 Replies)
Given are File A and File B
File A has for example 5 lines:
AAA
BBB
CCC
DDD
EEE
File B has 3 lines:
111
222
333
How can i merge A and B into:
111
222
333
AAA (first line from A)
then a new file: (4 Replies)
Hi ,
This might be the stupidest question I am asking, but I am struck with this problem. I am trying to merge 2 files, file1 has header and file2 has contents. while I merge them , it merges from the 1st line of file1.
for ex: file1
col1|col2|col3|
file2
123|234|456|... (2 Replies)
Hi all!
How could I merge all the text files (in format xml) of a single folder, after having deleted from each of them all the text from its beginning up to a specific string: "<body>" ?
Thanks a lot!
mjomba (4 Replies)
Hi,
My requirement is,there is a directory location like:
:camp/current/
In this location there can be different flat files that are generated in a single day with same header and the data will be different, differentiated by timestamp, so i need to verify how many files are generated... (10 Replies)
Dear Gents,
Please I need your help... I need small script :) to do the following.
I have a thousand of files in a folder produced daily.
I need first to merge all files called. txt (0009.txt, 0010.txt, 0011.txt) and and to output a resume of all information on 2 separate files in csv... (14 Replies)
Dear Frens,
I have two files and need to merge into one file. Like
File_1:
Field1 Field2
1 4
File_2:
Field1 Field2
3 5
I need one single output as
File_1:
Field1 Field2
1 4
3 5
This means taking header from either file. (8 Replies)
Discussion started by: manisha_singh
8 Replies
LEARN ABOUT DEBIAN
fastx_quality_stats
FASTX_QUALITY_STATS(1) User Commands FASTX_QUALITY_STATS(1)NAME
fastx_quality_stats - FASTX Statistics
DESCRIPTION
usage: fastx_quality_stats [-h] [-N] [-i INFILE] [-o OUTFILE] Part of FASTX Toolkit 0.0.13.2 by A. Gordon (gordon@cshl.edu)
[-h] = This helpful help screen. [-i INFILE] = FASTQ input file. default is STDIN. [-o OUTFILE] = TEXT output file. default is
STDOUT. [-N] = New output format (with more information per nucleotide/cycle).
The *OLD* output TEXT file will have the following fields (one row per column):
column = column number (1 to 36 for a 36-cycles read solexa file)
count = number of bases found in this column.
min = Lowest quality score value found in this column.
max = Highest quality score value found in this column.
sum = Sum of quality score values for this column.
mean = Mean quality score value for this column.
Q1 = 1st quartile quality score.
med = Median quality score.
Q3 = 3rd quartile quality score.
IQR = Inter-Quartile range (Q3-Q1).
lW = 'Left-Whisker' value (for boxplotting).
rW = 'Right-Whisker' value (for boxplotting).
A_Count = Count of 'A' nucleotides found in this column. C_Count = Count of 'C' nucleotides found in this column. G_Count = Count
of 'G' nucleotides found in this column. T_Count = Count of 'T' nucleotides found in this column. N_Count = Count of 'N' nucleo-
tides found in this column. max-count = max. number of bases (in all cycles)
The *NEW* output format:
cycle (previously called 'column') = cycle number max-count For each nucleotide in the cycle (ALL/A/C/G/T/N):
count = number of bases found in this column.
min = Lowest quality score value found in this column.
max = Highest quality score value found in this column.
sum = Sum of quality score values for this column.
mean = Mean quality score value for this column.
Q1 = 1st quartile quality score.
med = Median quality score.
Q3 = 3rd quartile quality score.
IQR = Inter-Quartile range (Q3-Q1).
lW = 'Left-Whisker' value (for boxplotting).
rW = 'Right-Whisker' value (for boxplotting).
SEE ALSO
The quality of this automatically generated manpage might be insufficient. It is suggested to visit
http://hannonlab.cshl.edu/fastx_toolkit/commandline.html
to get a better layout as well as an overview about connected FASTX tools.
fastx_quality_stats 0.0.13.2 May 2012 FASTX_QUALITY_STATS(1)