9 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
What is an efficient way of counting the number of unique values in a 400 column by 1000 row array and outputting the counts per column, assuming the unique values in the array are:
A, B, C, D
In other words the output should look like: Value COL1 COL2 COL3
A 50 51 52... (16 Replies)
Discussion started by: Geneanalyst
16 Replies
2. Shell Programming and Scripting
Hello Team,
I need your help on the following:
My input file a.txt is as below:
3330690|373846|108471
3330690|373846|108471
0640829|459725|100001
0640829|459725|100001
3330690|373847|108471
Here row 1 and row 2 of column 1 are identical but corresponding column 2 value are... (4 Replies)
Discussion started by: angshuman
4 Replies
3. Shell Programming and Scripting
Hi Folks,
I have the below feed file named abc1.txt in which you can see there is a title and below is the respective values in the rows and it is completely pipe delimited file ,.
... (4 Replies)
Discussion started by: punpun66
4 Replies
4. Linux
cat sample.csv
ID,Name,no
1,AAA,1
2,BBB,1
3,AAA,1
4,BBB,1
cut -d',' -f2 sample.csv | sort | uniq
this gives only the 2nd column values
Name
AAA
BBB
How to I get all the columns of CSV along with this? (1 Reply)
Discussion started by: sanvel
1 Replies
5. Shell Programming and Scripting
Hi, I have tab-deliminated data similar to the following:
dot is-big 2
dot is-round 3
dot is-gray 4
cat is-big 3
hot in-summer 5
I want to count the frequency of each individual "unique" value in the 1st column. Thus, the desired output would be as follows:
dot 3
cat 1
hot 1
is... (5 Replies)
Discussion started by: owwow14
5 Replies
6. Shell Programming and Scripting
Hello,
I need some sort of way to extract every date contained in a file, and count how many of those dates there are.
Here are the specifics:
The date format I'm looking for is mm/dd/yyyy
I only need to look after line 45 in the file (that's where the data begins)
The columns of... (2 Replies)
Discussion started by: ronan1219
2 Replies
7. Shell Programming and Scripting
I need to take the second column of a .csv file and count the number of instances of each unique value in that same second column. I'd like the output to be value,count sorted by most instances. Thanks for any guidance!
Data example:
317476,317756,0
816063,318861,0
313123,319091,0... (4 Replies)
Discussion started by: batcho
4 Replies
8. Shell Programming and Scripting
Hi
I have the following info in a file -
<Cell id="25D"/>
<Cell id="26A"/>
<Cell id="26B"/>
<Cell id="26C"/>
<Cell id="27A"/>
<Cell id="27B"/>
<Cell id="27C"/>
<Cell id="28A"/>
I would like to know how would you go about counting all... (4 Replies)
Discussion started by: Prega
4 Replies
9. Shell Programming and Scripting
Hi all,
I have a huge csv file with the following format of data,
Num SNPs, 549997
Total SNPs,555352
Num Samples, 157
SNP, SampleID, Allele1, Allele2
A001,AB1,A,A
A002,AB1,A,A
A003,AB1,A,A
...
...
...
I would like to write out a list of unique SNP (column 1). Could you... (3 Replies)
Discussion started by: phoeberunner
3 Replies
csv(n) CSV processing csv(n)
NAME
csv - Procedures to handle CSV data.
SYNOPSIS
package require Tcl 8.3
package require csv ?0.3?
::csv::join values {sepChar ,}
::csv::joinlist values {sepChar ,}
::csv::read2matrix chan m {sepChar ,} {expand none}
::csv::read2queue chan q {sepChar ,}
::csv::report cmd matrix ?chan?
::csv::split line {sepChar ,}
::csv::split2matrix m line {sepChar ,} {expand none}
::csv::split2queue q line {sepChar ,}
::csv::writematrix m chan {sepChar ,}
::csv::writequeue q chan {sepChar ,}
DESCRIPTION
The csv package provides commands to manipulate information in CSV FORMAT (CSV = Comma Separated Values).
COMMANDS
The following commands are available:
::csv::join values {sepChar ,}
Takes a list of values and returns a string in CSV format containing these values. The separator character can be defined by the
caller, but this is optional. The default is ",".
::csv::joinlist values {sepChar ,}
Takes a list of lists of values and returns a string in CSV format containing these values. The separator character can be defined
by the caller, but this is optional. The default is ",". Each element of the outer list is considered a record, these are separated
by newlines in the result. The elements of each record are formatted as usual (via ::csv::join).
::csv::read2matrix chan m {sepChar ,} {expand none}
A wrapper around ::csv::split2matrix (see below) reading CSV-formatted lines from the specified channel (until EOF) and adding them
to the given matrix. For an explanation of the expand argument see ::csv::split2matrix.
::csv::read2queue chan q {sepChar ,}
A wrapper around ::csv::split2queue (see below) reading CSV-formatted lines from the specified channel (until EOF) and adding them
to the given queue.
::csv::report cmd matrix ?chan?
A report command which can be used by the matrix methods format 2string and format 2chan. For the latter this command delegates the
work to ::csv::writematrix. cmd is expected to be either printmatrix or printmatrix2channel. The channel argument, chan, has to be
present for the latter and must not be present for the first.
::csv::split line {sepChar ,}
converts a line in CSV format into a list of the values contained in the line. The character used to separate the values from each
other can be defined by the caller, via sepChar, but this is optional. The default is ",".
::csv::split2matrix m line {sepChar ,} {expand none}
The same as ::csv::split, but appends the resulting list as a new row to the matrix m, using the method add row. The expansion mode
specified via expand determines how the command handles a matrix with less columns than contained in line. The allowed modes are:
none This is the default mode. In this mode it is the responsibility of the caller to ensure that the matrix has enough columns to
contain the full line. If there are not enough columns the list of values is silently truncated at the end to fit.
empty In this mode the command expands an empty matrix to hold all columns of the specified line, but goes no further. The overall
effect is that the first of a series of lines determines the number of columns in the matrix and all following lines are
truncated to that size, as if mode none was set.
auto In this mode the command expands the matrix as needed to hold all columns contained in line. The overall effect is that after
adding a series of lines the matrix will have enough columns to hold all columns of the longest line encountered so far.
::csv::split2queue q line {sepChar ,}
The same as ::csv::split, but appending the resulting list as a single item to the queue q, using the method put.
::csv::writematrix m chan {sepChar ,}
A wrapper around ::csv::join taking all rows in the matrix m and writing them CSV formatted into the channel chan.
::csv::writequeue q chan {sepChar ,}
A wrapper around ::csv::join taking all items in the queue q (assumes that they are lists) and writing them CSV formatted into the
channel chan.
FORMAT
Each record of a csv file (comma-separated values, as exported e.g. by Excel) is a set of ASCII values separated by ",". For other lan-
guages it may be ";" however, although this is not important for this case (The functions provided here allow any separator character).
If a value contains itself the separator ",", then it (the value) is put between "".
If a value contains ", it is replaced by "".
EXAMPLE
The record
123,"123,521.2","Mary says ""Hello, I am Mary"""
is parsed as follows:
a) 123
b) 123,521.2
c) Mary says "Hello, I am Mary"
SEE ALSO
matrix, queue
KEYWORDS
csv, matrix, queue, package, tcllib
csv 0.3 csv(n)