In what way did my suggestion fail? If I change the last line of your sample file as suggested in post #6, I get the following output from the code I suggested in post #4:
What output were you expecting?
PS: Note that the code I suggested prints out columns in the order in which input field 2 values were first seen and prints out rows in the order in which input field 1 values were first seen.
The simpler and faster code RudiC suggested prints rows and columns in an unspecified order that can vary between awk implementations (but the values in each column should be consistent).
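As a rough illustration of the first-seen ordering described above, here is a minimal awk sketch. It is not the code from post #4; the function name `pivot`, the three-field input layout (row key, column key, value), the `Item` header label, and the choice to sum repeated (row, column) pairs are all assumptions made for the example.

```shell
# pivot: hypothetical helper, not the code from post #4.
# Assumed input: whitespace-separated lines "row_key column_key value".
pivot() {
    awk '
    {
        # Remember each row key and column key the first time it appears.
        if (!($1 in rowseen)) { rowseen[$1] = 1; rows[++nr] = $1 }
        if (!($2 in colseen)) { colseen[$2] = 1; cols[++nc] = $2 }
        # Assumption: repeated (row, column) pairs are summed.
        val[$1, $2] += $3
    }
    END {
        # Walk the numerically indexed lists instead of using for (k in array),
        # so the output order does not depend on the awk implementation.
        printf "%s", "Item"
        for (c = 1; c <= nc; c++) printf " %s", cols[c]
        printf "\n"
        for (r = 1; r <= nr; r++) {
            printf "%s", rows[r]
            # Adding 0 turns a missing cell into 0.
            for (c = 1; c <= nc; c++) printf " %s", val[rows[r], cols[c]] + 0
            printf "\n"
        }
    }' "$@"
}
```

Fed the five lines `P1 106 1000`, `P1 206 1500`, `P1 106 2000`, `P2 256 5800`, `P2 650 7500` on standard input, this sketch prints the header `Item 106 206 256 650` followed by `P1 3000 1500 0 0` and `P2 0 0 5800 7500`, in that order on any POSIX awk.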
Booo,
I apologize... The output above was produced by changing the 6th row in your sample input file to the line you provided in post #6, instead of by adding that line as a new 7th row. When I add that new line to your original sample data, the output produced by my suggestion is:
With my suggested code and that input file, the output will always be in this order.
When I feed that input to the code RudiC provided, I get the output:
but the order of the last four columns and the order of the rows may vary with different versions of awk (because the order in which items in an array are processed by for(index in array_name) is not specified by the standards). Note that on RudiC's system, his version of awk printed the last four columns and the rows in a different order than I got on my system, but the numeric values in the table were identical using both my suggestion and his suggestion (on both of our systems). That may be hard to tell from the output he provided in post #8, since he didn't include the header line.
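When the exact order does not matter but it must be the same on every system, one common workaround is to let for (index in array_name) emit the items in whatever order awk chooses and then sort the result. The sketch below is only an illustration; the function name `count_keys` and the one-key-per-line input are assumptions, not anything from this thread.

```shell
# count_keys: hypothetical helper that counts how often each first field occurs.
count_keys() {
    # for (k in n) visits the keys in an unspecified order, so the
    # output is piped through sort to make it reproducible.
    awk '{ n[$1]++ } END { for (k in n) print k, n[k] }' "$@" | LC_ALL=C sort
}
```

For the input lines `b`, `a`, `b`, this prints `a 1` and then `b 2` regardless of which awk is installed. Sorting gives a stable order, but not the first-seen order; keeping a separate numerically indexed list of keys is needed for that.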
As RudiC said, just saying that "it fails" without specifying whose suggested code you're running and without specifying what is wrong with the output produced is not at all helpful.
Original file we are getting .......
Item Period Amt
P1 106 1000
P1 206 1500
P1 106 2000
P2 256 5800
P2 650 7500
My output should be like this
Item 106 206 256 650 ............
P1 1000 1500 0 ... (1 Reply)
Hi Guys,
Thank you all for helping me with my different queries and I continue to get better at scripting because of help from all of you!
I have a file that would look something like -
ID SUB ID VALUE
1 10 5
2 18 7
1 ... (1 Reply)
Hi,
I'm trying to assign a score to each row which will allow me to identify which rows differ. In the example file below, I've used "," to indicate column separators (my actual file has tab separators). In this example, I'd like to identify that row 1 and row 5 are the same, and row 2 and row... (4 Replies)
Could anybody help with this?
I have input below .....
david,39
david,39
emelie,40
clarissa,22
bob,42
bob,42
tim,32
bob,39
david,38
emelie,47
What I want to do is count how many different ages there are for each name, so the output would be like this ....
david,2
emelie,2
clarissa,1... (3 Replies)
Hi,
Please help me to convert the input file to a new one.
input file:
--------
1231231231 3 A
4561223343 0 D
1231231231 1 A
1231231231 2 A
1231231231 4 D
7654343444 2 A
4561223343 1 D
4561223343 2 D
the output should be:
--------------------
1231231231 3#1#2 A
4561223343 0 D... (3 Replies)
Here is the contents of an input file.
A,1,2,3,4
10,aaa,bbb,ccc,ddd
11,eee,fff,ggg,hhh
12,iii,jjj,lll,mmm
13,nnn,ooo,ppp
I wanted the output to be
A
10 1 aaa
10 2 bbb
10 3 ccc
10 4 ddd
11 1 eee
11 2 fff
11 3 ggg
11 4 hhh .....
and so on. How to do it in ksh... (9 Replies)
Hi All,
I have a big file with 232 columns and 9 million rows. I want to delete all rows with the same column values in col3 through col232. The output should also be sorted on the first 2 columns.
Here is a reduced example with 6 columns. I want to remove rows with duplicate values in col3... (9 Replies)
I have a .csv file:
A,B,0.6
C,D,-0.7
D,E,0.1
A,E,0.45
D,G, -0.4
I want to select rows where the value in the 3rd column is >= 0.5 or <= -0.5.
Thanks.
A,B,0.6
D,G, -0.7 (1 Reply)
Hi all ,
I have a file with the below content
Header Section
employee|employee name||Job description|Job code|Unitcode|Account|geography|C1|C2|C3|C4|C5|C6|C7|C8|C9|Csource|Oct|Nov|Dec|Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep
Data section ... (1 Reply)
I would like to compare the values in column 8 and pick out the ones where the difference is > 1; columns 1 and 2 are the key for the array.
The values in columns 1 and 2 change every 4 rows, so the comparison in column 8 needs to be done for each group of 4 rows, every time columns 1 and 2 change.
... (4 Replies)
LEARN ABOUT FREEBSD
rs
RS(1) BSD General Commands Manual RS(1)
NAME
rs -- reshape a data array
SYNOPSIS
rs [-[csCS][x] [kKgGw][N] tTeEnyjhHmz] [rows [cols]]
DESCRIPTION
The rs utility reads the standard input, interpreting each line as a row of blank-separated entries in an array, transforms the array
according to the options, and writes it on the standard output. With no arguments it transforms stream input into a columnar format
convenient for terminal viewing.
The shape of the input array is deduced from the number of lines and the number of columns on the first line. If that shape is inconvenient,
a more useful one might be obtained by skipping some of the input with the -k option. Other options control interpretation of the input
columns.
The shape of the output array is influenced by the rows and cols specifications, which should be positive integers. If only one of them is a
positive integer, rs computes a value for the other which will accommodate all of the data. When necessary, missing data are supplied in a
manner specified by the options and surplus data are deleted. There are options to control presentation of the output columns, including
transposition of the rows and columns.
The following options are available:
-cx Input columns are delimited by the single character x. A missing x is taken to be `^I'.
-sx Like -c, but maximal strings of x are delimiters.
-Cx Output columns are delimited by the single character x. A missing x is taken to be `^I'.
-Sx Like -C, but padded strings of x are delimiters.
-t Fill in the rows of the output array using the columns of the input array, that is, transpose the input while honoring any rows and
cols specifications.
-T Print the pure transpose of the input, ignoring any rows or cols specification.
-kN Ignore the first N lines of input.
-KN Like -k, but print the ignored lines.
-gN The gutter width (inter-column space), normally 2, is taken to be N.
-GN The gutter width has N percent of the maximum column width added to it.
-e Consider each line of input as an array entry.
-n On lines having fewer entries than the first line, use null entries to pad out the line. Normally, missing entries are taken from
the next line of input.
-y If there are too few entries to make up the output dimensions, pad the output by recycling the input from the beginning. Normally,
the output is padded with blanks.
-h Print the shape of the input array and do nothing else. The shape is just the number of lines and the number of entries on the first
line.
-H Like -h, but also print the length of each line.
-j Right adjust entries within columns.
-wN The width of the display, normally 80, is taken to be the positive integer N.
-m Do not trim excess delimiters from the ends of the output array.
-z Adapt column widths to fit the largest entries appearing in them.
With no arguments, rs transposes its input, and assumes one array entry per input line unless the first non-ignored line is longer than the
display width. Option letters which take numerical arguments interpret a missing number as zero unless otherwise indicated.
EXAMPLES
The rs utility can be used as a filter to convert the stream output of certain programs (e.g., spell(1), du(1), file(1), look(1), nm(1),
who(1), and wc(1)) into a convenient ``window'' format, as in
% who | rs
This function has been incorporated into the ls(1) program, though for most programs with similar output rs suffices.
To convert stream input into vector output and back again, use
% rs 1 0 | rs 0 1
A 10 by 10 array of random numbers from 1 to 100 and its transpose can be generated with
% jot -r 100 | rs 10 10 | tee array | rs -T > tarray
In the editor vi(1), a file consisting of a multi-line vector with 9 elements per line can undergo insertions and deletions, and then be
neatly reshaped into 9 columns with
:1,$!rs 0 9
Finally, to sort a database by the first line of each 4-line field, try
% rs -eC 0 4 | sort | rs -c 0 1
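On systems where rs is not installed, the pure transpose described under -T can be approximated with awk. The following is a minimal sketch, not part of rs; the function name `transpose` is hypothetical, it assumes blank-separated rectangular input, and it joins entries with single spaces rather than producing the padded columns rs would print.

```shell
# transpose: hypothetical awk stand-in for the pure transpose described under -T.
# Assumes every input line has the same number of blank-separated entries.
transpose() {
    awk '
    {
        # Store every entry under its (row, column) position.
        if (NF > maxnf) maxnf = NF
        for (i = 1; i <= NF; i++) cell[NR, i] = $i
    }
    END {
        # Each input column becomes one output line.
        for (i = 1; i <= maxnf; i++) {
            line = ""
            for (r = 1; r <= NR; r++)
                line = line (r > 1 ? " " : "") cell[r, i]
            print line
        }
    }' "$@"
}
```

Given the two lines `1 2 3` and `4 5 6`, this prints `1 4`, `2 5`, and `3 6`. Like rs, it holds the whole input in memory.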
SEE ALSO
jot(1), pr(1), sort(1), vi(1)
HISTORY
The rs utility first appeared in 4.2BSD.
BUGS
Handles only two dimensional arrays.
The algorithm currently reads the whole file into memory, so files that do not fit in memory will not be reshaped.
Fields cannot be defined yet on character positions.
Re-ordering of columns is not yet possible.
There are too many options.
Multibyte characters are not recognized.
Lines longer than LINE_MAX (2048) bytes are not processed and result in immediate termination of rs.
BSD February 25, 2011 BSD