Selecting random columns from large dataset in UNIX Post: 302948954

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Help with selecting specific lines in a large file

Hello, I need to select the 3 lines above as well as below a search string, including the search string. I have been trying various combinations using sed command without any success. Can anuone help please. Thanking

2. UNIX for Dummies Questions & Answers

Using 'sed' to delete or ignore columns in a dataset

Hi, I've already posted elsewhere but am posting again here coz im a newbie. I hope you forgive me this time. I want to know if its possible to delete or ignore columns in a large dataset using 'sed'. For example, I have the following dataset: - ...

3. UNIX for Dummies Questions & Answers

Using 'sed' to delete or ignore columns in a dataset

Hi, I want to know if its possible to delete or ignore columns in a large dataset using 'sed'. For example, I have the following dataset: - 20060714,X.XX,1,043004,Q,T,24.0000,1,25.5000,4, 20060714,X.XX,1,081209,Q,T,24.0000,1,25.5000,5, As you can see, there are 10 columns here and the...

4. Programming

Extracting differences between two columns dataset (SQL command)

Hi, I have a table in my sqlite, here is an example (tab separated) 585 name1 chr1 + 1872 3533 3533 3533 6 1872,2041,2475,2837,3083,3315, 1920,2090,2560,2915,3237,3533, name2 The 10th and 11th columns have information in a comma separated format (not tab)....

5. Programming

I have C++ exe file( no source code) and need to run many large dataset under unix, b

I have C++ exe file( no source code) and need to run many large dataset under unix, but how to know the memeroy usage for one dataset?http://www.codeproject.com/script/Forums/Images/New.gif I think "top" is not good and if using the profiler, it seems no free download, any ideas?

6. Shell Programming and Scripting

How to Pick Random records from a large file

Hi, I have a huge file say with 2000000 records. The file has 42 fields. I would like to pick randomly 1000 records from this huge file. Can anyone help me how to do this?

7. Solaris

flarecreate for zfs root dataset and ignore multiple dataset

Hi All, I want to write a script to create flar images on multiple servers. In non zfs filesystem I am using -X option to refer a file to exclude mounts on different servers. but on ZFS -X option is not working. I want multiple mounts to be ignore on ZFS base system during flarecreate. I...

8. Shell Programming and Scripting

Parse large file on line count (random lines)

I have a file that needs to be parsed into multiple files every time there line contains a number 1. the problem i face is the lines are random and the file size is random. an example is that on line 4, 65, 187, 202 & 209 are number 1's so there has to be file breaks between all those to create 4...

9. Shell Programming and Scripting

How to remove a subset of data from a large dataset based on values on one line

Hello. I was wondering if anyone could help. I have a file containing a large table in the format: marker1 marker2 marker3 marker4 position1 position2 position3 position4 genotype1 genotype2 genotype3 genotype4 with marker being a name, position a numeric...

10. Shell Programming and Scripting

Selecting lines having same values for first two columns

Hello to all. This is first post. Kindly excuse me if I do not adhere to any rules and regulations of this forum. I have a file containing some rows with three columns each per row(separeted by a space). There are certain rows for which first two columns have same value but the value in...

LEARN ABOUT DEBIAN

h5totxt

H5TOTXT(1)							      h5utils								H5TOTXT(1)

NAME

       h5totxt - generate comma-delimited text from 2d slices of HDF5 files

SYNOPSIS

       h5totxt [OPTION]... [HDF5FILE]...

DESCRIPTION

       h5totxt is a utility to generate comma-delimited text (and similar formats) from one-, two-, or more-dimensional slices of numeric datasets
       in HDF5 files.  This way, the data can easily be imported into spreadsheets and similar programs for analysis and visualization.

       HDF5 is a free, portable binary format and supporting library developed by the National Center for Supercomputing Applications at the  Uni-
       versity of Illinois in Urbana-Champaign.  A single h5 file can contain multiple data sets; by default, h5totxt takes the first dataset, but
       this can be changed via the -d option, or by using the syntax HDF5FILE:DATASET.

       By default, the entire dataset is dumped to the output.	in row-major order.  For 3d datasets, this corresponds to a sequence of yz slices,
       in order of increasing x, separated by blank lines.  If -T is specified, outputs in the transposed (column-major) order instead

       Often,  however,  you  want  only a one- or two-dimensional slice of multi-dimensional data.  To do this, you specify coordinates in one or
       more slice dimensions, via the -xyzt options.

       The most basic usage is something like 'h5totxt foo.h5', which will output comma-delimited text to stdout from the data in foo.h5.

OPTIONS

       -h     Display help on the command-line options and usage.

       -V     Print the version number and copyright info for h5totxt.

       -v     Verbose output.

       -o file
	      Send text output to file rather than to stdout (the default).

       -s sep Use the string sep to separate columns of the output rather than a comma (the default).

       -x ix, -y iy, -z iz, -t it
	      This tells h5totxt to use a particular slice of a multi-dimensional dataset.  e.g.  -x causes a yz plane (of a  3d  dataset)  to	be
	      used,  at an x index of ix (where the indices run from zero to one less than the maximum index in that direction).  Here, x/y/z cor-
	      respond to the first/second/third dimensions of the HDF5 dataset. The -t option specifies a slice in the last  dimension,  whichever
	      that might be.  See also the -0 option to shift the origin of the x/y/z slice coordinates to the dataset center.

       -0     Shift  the  origin  of  the x/y/z slice coordinates to the dataset center, so that e.g. -0 -x 0 (or more compactly -0x0) returns the
	      central x plane of the dataset instead of the edge x plane.  (-t coordinates are not affected.)

       -T     Transpose the data (interchange the dimension ordering).	By default, no transposition is done.

       -. numdigits
	      Output numdigits digits after the decimal point (defaults to 16).

       -d name
	      Use dataset name from the input files; otherwise, the first  dataset  from  each	file  is  used.   Alternatively,  use  the  syntax
	      HDF5FILE:DATASET,  which allows you to specify a different dataset for each file.  You can use the h5ls command (included with hdf5)
	      to find the names of datasets within a file.

BUGS

       Send bug reports to S. G. Johnson, stevenj@alum.mit.edu.

AUTHORS

       Written by Steven G. Johnson.  Copyright (c) 2005 by the Massachusetts Institute of Technology.

h5utils 							   March 9, 2002							H5TOTXT(1)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Help with selecting specific lines in a large file

Discussion started by: tansha

2. UNIX for Dummies Questions & Answers

Using 'sed' to delete or ignore columns in a dataset

Discussion started by: aarif

3. UNIX for Dummies Questions & Answers

Using 'sed' to delete or ignore columns in a dataset

Discussion started by: aarif

4. Programming

Extracting differences between two columns dataset (SQL command)

Discussion started by: labrazil

5. Programming

I have C++ exe file( no source code) and need to run many large dataset under unix, b

Discussion started by: Danielwang1986

6. Shell Programming and Scripting

How to Pick Random records from a large file

Discussion started by: ajithshankar@ho

7. Solaris

flarecreate for zfs root dataset and ignore multiple dataset

Discussion started by: uxravi

8. Shell Programming and Scripting

Parse large file on line count (random lines)

Discussion started by: darbs121

9. Shell Programming and Scripting

How to remove a subset of data from a large dataset based on values on one line

Discussion started by: davegen

10. Shell Programming and Scripting

Selecting lines having same values for first two columns

Discussion started by: manojmalhotra13

LEARN ABOUT DEBIAN

h5totxt