Pivoting values from column to rows Post: 302995928

Sponsored Content

Top Forums UNIX for Beginners Questions & Answers Pivoting values from column to rows Post 302995928 by Don Cragun on Saturday 15th of April 2017 01:46:08 AM

04-15-2017

Registered User

Quote:

Originally Posted by Don Cragun

In what way did my suggestion fail? If I change the last line of your sample file as suggested in post #6, I get the following output from the code I suggested in post #4:

Code:

		a	b	c	d
2017/01/01	10	20	40	
2017/02/01	10	20		30

What output were you expecting?

PS: Note that the code I suggested prints out columns in the order in which input field 2 values were first seen and prints out rows in the order in which input field 1 values were first seen.

The simpler and faster code RudiC suggested prints rows and columns in random order (but the values in each column should be consistent).

Booo,
I apologize... The output above was produced by changing the 6th row in your sample input file to the line you provided in post #6, instead of by adding that line as a new 7th row. When I add that new line to your original sample data, the output produced by my suggestion is:

Code:

		a	b	c	d
2017/01/01	10	20	40	
2017/02/01	10	20	30	30

with my suggested code, with that input file, the output will always be in this order.

When I feed that input to the code RudiC provided, I get the output:

Code:

                   d         a         b         c
2017/01/01                  10        20        40
2017/02/01        30        10        20        30

but the order of the last four columns and the order of the rows may vary with different versions of awk (because the order in which items in an array are processed by for(index in array_name) is not specified by the standards). Note that the output RudiC got using the version of awk on his system produced output with the order of the last four columns AND the order of the rows printed was different than what I got on my system, but the numeric values in the table of output values displayed was identical using both my suggestion and his suggestion (on both of our systems). Although it may be hard to tell looking at the output he provided in post #8 since he didn't include the header line in the output.

As RudiC said, just saying that "it fails" without specifying whose suggested code you're running and without specifying what is wrong with the output produced is not at all helpful.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Pivoting Dynamic rows into columns

Original file we are getting ....... Item Period Amt P1 106 1000 P1 206 1500 P1 106 2000 P2 256 5800 P2 650 7500 My output should be like this Item 106 206 256 650 ............ P1 1000 1500 0 ...

2. Shell Programming and Scripting

How to consolidate values in one column from different rows into one?

Hi Guys, Thank you all for helping me with my different queries and I continue to get better at scripting because of help from all of you! I have a file that would look something like - ID SUB ID VALUE 1 10 5 2 18 7 1 ...

3. UNIX for Dummies Questions & Answers

How to assign scores to rows based on column values

Hi, I'm trying to assign a score to each row which will allow me to identify which rows differ. In the example file below, I've used "," to indicate column separators (my actual file has tab separators). In this example, I'd like to identify that row 1 and row 5 are the same, and row 2 and row...

4. UNIX for Dummies Questions & Answers

count number of rows based on other column values

Could anybody help with this? I have input below ..... david,39 david,39 emelie,40 clarissa,22 bob,42 bob,42 tim,32 bob,39 david,38 emelie,47 what i want to do is count how many names there are with different ages, so output would be like this .... david,2 emelie,2 clarissa,1...

5. Shell Programming and Scripting

join rows based on the column values

Hi, Please help me to convert the input file to a new one. input file: -------- 1231231231 3 A 4561223343 0 D 1231231231 1 A 1231231231 2 A 1231231231 4 D 7654343444 2 A 4561223343 1 D 4561223343 2 D the output should be: -------------------- 1231231231 3#1#2 A 4561223343 0 D...

6. Shell Programming and Scripting

Transposing rows and columns (pivoting) using shell scripting

Here is the contents of an input file. A,1,2,3,4 10,aaa,bbb,ccc,ddd 11,eee,fff,ggg,hhh 12,iii,jjj,lll,mmm 13,nnn,ooo,ppp I wanted the output to be A 10 1 aaa 10 2 bbb 10 3 ccc 10 4 ddd 11 1 eee 11 2 fff 11 3 ggg 11 4 hhh ..... and so on How to do it in ksh...

7. Shell Programming and Scripting

Remove rows with e column values

Hi All, I have a big file with 232 columns and 9 million rows, I want to delete all rows with same column values in col3 through col232. Also the output should be sorted based on first 2 columns. Here is a reduced example with 6 columns. I want to remove rows with duplicate values in col3...

8. Shell Programming and Scripting

Choosing rows based on column values

I have a .csv file: A,B,0.6 C,D,-0.7 D,E,0.1 A,E,0.45 D,G, -0.4 I want to select rows based on the values of the 3rd columns such that it is >=0.5 or <= -0.5 Thanks. A,B,0.6 D,G, -0.7

9. Shell Programming and Scripting

Convert Column data values to rows

Hi all , I have a file with the below content Header Section employee|employee name||Job description|Job code|Unitcode|Account|geography|C1|C2|C3|C4|C5|C6|C7|C8|C9|Csource|Oct|Nov|Dec|Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep Data section ...

10. UNIX for Beginners Questions & Answers

Compare values in multiple rows in one column using awk

I would like to compare values in column 8, and grep the ones where the different is > 1, columns 1 and 2 are the key for array. Every 4 rows the records values in columns 1 and 2 changed. Then, the comparison in the column 8 need to be done for the 4 rows everytime columns 1 and 2 changed ...

LEARN ABOUT FREEBSD

rs

RS(1)							    BSD General Commands Manual 						     RS(1)

NAME

     rs -- reshape a data array

SYNOPSIS

     rs [-[csCS][x] [kKgGw][N] tTeEnyjhHmz] [rows [cols]]

DESCRIPTION

     The rs utility reads the standard input, interpreting each line as a row of blank-separated entries in an array, transforms the array accord-
     ing to the options, and writes it on the standard output.	With no arguments it transforms stream input into a columnar format convenient for
     terminal viewing.

     The shape of the input array is deduced from the number of lines and the number of columns on the first line.  If that shape is inconvenient,
     a more useful one might be obtained by skipping some of the input with the -k option.  Other options control interpretation of the input col-
     umns.

     The shape of the output array is influenced by the rows and cols specifications, which should be positive integers.  If only one of them is a
     positive integer, rs computes a value for the other which will accommodate all of the data.  When necessary, missing data are supplied in a
     manner specified by the options and surplus data are deleted.  There are options to control presentation of the output columns, including
     transposition of the rows and columns.

     The following options are available:

     -cx     Input columns are delimited by the single character x.  A missing x is taken to be `^I'.

     -sx     Like -c, but maximal strings of x are delimiters.

     -Cx     Output columns are delimited by the single character x.  A missing x is taken to be `^I'.

     -Sx     Like -C, but padded strings of x are delimiters.

     -t      Fill in the rows of the output array using the columns of the input array, that is, transpose the input while honoring any rows and
	     cols specifications.

     -T      Print the pure transpose of the input, ignoring any rows or cols specification.

     -kN     Ignore the first N lines of input.

     -KN     Like -k, but print the ignored lines.

     -gN     The gutter width (inter-column space), normally 2, is taken to be N.

     -GN     The gutter width has N percent of the maximum column width added to it.

     -e      Consider each line of input as an array entry.

     -n      On lines having fewer entries than the first line, use null entries to pad out the line.  Normally, missing entries are taken from
	     the next line of input.

     -y      If there are too few entries to make up the output dimensions, pad the output by recycling the input from the beginning.  Normally,
	     the output is padded with blanks.

     -h      Print the shape of the input array and do nothing else.  The shape is just the number of lines and the number of entries on the first
	     line.

     -H      Like -h, but also print the length of each line.

     -j      Right adjust entries within columns.

     -wN     The width of the display, normally 80, is taken to be the positive integer N.

     -m      Do not trim excess delimiters from the ends of the output array.

     -z      Adapt column widths to fit the largest entries appearing in them.

     With no arguments, rs transposes its input, and assumes one array entry per input line unless the first non-ignored line is longer than the
     display width.  Option letters which take numerical arguments interpret a missing number as zero unless otherwise indicated.

EXAMPLES

     The rs utility can be used as a filter to convert the stream output of certain programs (e.g., spell(1), du(1), file(1), look(1), nm(1),
     who(1), and wc(1)) into a convenient ``window'' format, as in

	   % who | rs

     This function has been incorporated into the ls(1) program, though for most programs with similar output rs suffices.

     To convert stream input into vector output and back again, use

	   % rs 1 0 | rs 0 1

     A 10 by 10 array of random numbers from 1 to 100 and its transpose can be generated with

	   % jot -r 100 | rs 10 10 | tee array | rs -T > tarray

     In the editor vi(1), a file consisting of a multi-line vector with 9 elements per line can undergo insertions and deletions, and then be
     neatly reshaped into 9 columns with

	   :1,$!rs 0 9

     Finally, to sort a database by the first line of each 4-line field, try

	   % rs -eC 0 4 | sort | rs -c 0 1

SEE ALSO

     jot(1), pr(1), sort(1), vi(1)

HISTORY

     The rs utility first appeared in 4.2BSD.

BUGS

     Handles only two dimensional arrays.

     The algorithm currently reads the whole file into memory, so files that do not fit in memory will not be reshaped.

     Fields cannot be defined yet on character positions.

     Re-ordering of columns is not yet possible.

     There are too many options.

     Multibyte characters are not recognized.

     Lines longer than LINE_MAX (2048) bytes are not processed and result in immediate termination of rs.

BSD
								 February 25, 2011							       BSD

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Pivoting Dynamic rows into columns

Discussion started by: dprakash

2. Shell Programming and Scripting

How to consolidate values in one column from different rows into one?

Discussion started by: sncoupons

3. UNIX for Dummies Questions & Answers

How to assign scores to rows based on column values

Discussion started by: auburn

4. UNIX for Dummies Questions & Answers

count number of rows based on other column values

Discussion started by: itsme999

5. Shell Programming and Scripting

join rows based on the column values

Discussion started by: vsachan

6. Shell Programming and Scripting

Transposing rows and columns (pivoting) using shell scripting

Discussion started by: ksatish89

7. Shell Programming and Scripting

Remove rows with e column values

Discussion started by: alpesh

8. Shell Programming and Scripting

Choosing rows based on column values

Discussion started by: Sanchari

9. Shell Programming and Scripting

Convert Column data values to rows

Discussion started by: Hypesslearner

10. UNIX for Beginners Questions & Answers

Compare values in multiple rows in one column using awk

Discussion started by: jiam912

LEARN ABOUT FREEBSD

rs