Hi,
I've already posted elsewhere but am posting again here coz im a newbie. I hope you forgive me this time.
I want to know if its possible to delete or ignore columns in a large dataset using 'sed'. For example, I have the following dataset: -
... (0 Replies)
Hi,
I want to know if its possible to delete or ignore columns in a large dataset using 'sed'. For example, I have the following dataset: -
20060714,X.XX,1,043004,Q,T,24.0000,1,25.5000,4,
20060714,X.XX,1,081209,Q,T,24.0000,1,25.5000,5,
As you can see, there are 10 columns here and the... (4 Replies)
Hii I have a file which contains huge amounts of data.I just want to delete last 3 columns in the without changing its format.The file contains data as shown below
PDE 2001 10 29 202148.60 38.92 24.20 33 4.8 MLATH .F. .......
PDE 2001 10 29 203423.57 38.88 24.41 33 3.7 MLATH... (3 Replies)
how do I delete the first 3 lines and the first column and the tab?
infile:
Colorspace 0
SA-Sample 1 in 32
FTab-Chars 10
Sequence-1 SINE1_7SL 282
Sequence-2 sapieTTns 289
Sequence-3 7SL_Hopns 289outfile:
SINE1_7SL 282
sapieTTns 289
7SL_Hopns 289Thanks (4 Replies)
I have this space delimited large text file with more than 1,000,000+ columns and about 100 rows. I want to delete all the columns that start with NA such that:
File before modification
aa bb cc NA100 dd
aa b1 c2 NA101 de
File after modification
aa bb cc dd
aa b1 c2 de
How would I... (3 Replies)
Hello Guys
I have a flat file with few thousands of rows.
Now each rows have different number of columns
I want to delete the rows which has not equal to 749 columns
Can you guys please let me know how to do the same
Your help is much appreciated. (2 Replies)
Hi,
I have a file like this
a 1 2
b 2 2
c 2 3
d 4 5
f 5 6
output
a 1 2
c 2 3
d 4 5
f 5 6
Basically, I want to delete the whole line if $2 and $3 are the same. Thanks (5 Replies)
Dear all,
I have one file (see below) with more then 100 columns, and need only column which has GType in label with Alphabets, please help me to remove this columns with numbers.
input file is
n.201.GType n-201.Theta n-201.R n_1.GType n_1.Theta n_1.R n_7.GType ... (1 Reply)
Hi,
I'd like to ask for some help with the following task, please:
there is a big file with a header (this is file.in):
NAME A_1.X A_1.Y A_1.Z B_1.X B_1.Y B_1.Z
name1 AB 0.11 0.12 BB 0.45 0.67
name2 BB 0.34 0.56 AA 0.89 0.68
what I need is to recognize a pattern in the header of this... (10 Replies)
An extension from an earlier question. Now need a sed script to delete columns 7,15 and 16 from an example txt below..
Again, thanks in advance.
98M-01.WAV,98M,01,00:00:49,01:07:36:00,"MIX",,"BOOM-MKH50",,,,,,,,,,"",
98L-01.WAV,98L,01,00:00:51,01:01:45:00,"MIX",,"BOOM-MKH50",,,,,,,,,,"", (7 Replies)
Discussion started by: Vrc2250
7 Replies
LEARN ABOUT FREEBSD
ministat
MINISTAT(1) BSD General Commands Manual MINISTAT(1)NAME
ministat -- statistics utility
SYNOPSIS
ministat [-Ans] [-C column] [-c confidence_level] [-d delimiter] [-w [width]] [file ...]
DESCRIPTION
The ministat command calculates fundamental statistical properties of numeric data in the specified files or, if no file is specified, stan-
dard input.
The options are as follows:
-A Just report the statistics of the input and relative comparisons, suppress the ASCII-art plot.
-n Just report the raw statistics of the input, suppress the ASCII-art plot and the relative comparisons.
-s Print the average/median/stddev bars on separate lines in the ASCII-art plot, to avoid overlap.
-C column Specify which column of data to use. By default the first column in the input file(s) are used.
-c confidence_level
Specify desired confidence level for Student's T analysis. Possible values are 80, 90, 95, 98, 99 and 99.5 %
-d delimiter
Specifies the column delimiter characters, default is SPACE and TAB. See strtok(3) for details.
-w width Width of ASCII-art plot in characters, default is 74.
A sample output could look like this:
$ ministat -s -w 60 iguana chameleon
x iguana
+ chameleon
+------------------------------------------------------------+
|x * x * + + x +|
| |________M______A_______________| |
| |________________M__A___________________| |
+------------------------------------------------------------+
N Min Max Median Avg Stddev
x 7 50 750 200 300 238.04761
+ 5 150 930 500 540 299.08193
No difference proven at 95.0% confidence
If ministat tells you, as in the example above, that there is no difference proven at 95% confidence, the two data sets you gave it are for
all statistical purposes identical.
You have the option of lowering your standards by specifying a lower confidence level:
$ ministat -s -w 60 -c 80 iguana chameleon
x iguana
+ chameleon
+------------------------------------------------------------+
|x * x * + + x +|
| |________M______A_______________| |
| |________________M__A___________________| |
+------------------------------------------------------------+
N Min Max Median Avg Stddev
x 7 50 750 200 300 238.04761
+ 5 150 930 500 540 299.08193
Difference at 80.0% confidence
240 +/- 212.215
80% +/- 70.7384%
(Student's t, pooled s = 264.159)
But a lower standard does not make your data any better, and the example is only included here to show the format of the output when a sta-
tistical difference is proven according to Student's T method.
SEE ALSO
Any mathematics text on basic statistics, for instances Larry Gonicks excellent "Cartoon Guide to Statistics" which supplied the above exam-
ple.
HISTORY
The ministat command was written by Poul-Henning Kamp out of frustration over all the bogus benchmark claims made by people with no under-
standing of the importance of uncertainty and statistics.
From FreeBSD 5.2 it has lived in the source tree as a developer tool, graduating to the installed system from FreeBSD 8.0.
BSD November 10, 2012 BSD