awk split help

01-23-2015

Registered User

13, 0

Join Date: Dec 2014

Last Activity: 15 February 2015, 11:23 PM EST

Location: Canada

Posts: 13

Thanks Given: 7

Thanked 0 Times in 0 Posts

awk split help

Hello,
I have the following input file:

Code:

A=1;B=2;C=3;D=4
A=4;B=6;C=7;D=9

I wish to have the following output

Code:

1 2 3 4
4 6 7 9

Can awk split be used to do this?

I have done this without using split, but the process is quite tedious.

Any help is appreciated!

Rabu

View Public Profile for Rabu

Find all posts by Rabu

01-23-2015

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Try:

Code:

awk -F'(^|;)[^=]+=' '{print $2,$3,$4,$5}' file

With split() :

Code:

awk -F\; '{for(i=1; i<=NF; i++){split($i,F,/=/); $i=F[2]}}1' file

Code:

sed 's/[^=]*=\([^;]*\)/\1 /g; s/ $//' file

Last edited by Scrutinizer; 01-23-2015 at 05:27 PM..

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

01-23-2015

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

Not sure I understand what Scrutinizer is aiming at, but as a slight modification of his proposal try

Code:

awk -F'[;=]' '{print $2,$4,$6,$8}' file
1 2 3 4
4 6 7 9

---------- Post updated at 22:14 ---------- Previous update was at 22:12 ----------

As to your question, split would work as well:

Code:

awk -F';' '{for (i=1; i<=NF; i++) {split ($i, T, "="); printf "%s ", T[2]} printf "\n" }' file
1 2 3 4 
4 6 7 9

RudiC

View Public Profile for RudiC

Find all posts by RudiC

01-23-2015

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

In addition to what Scrutinizer and RudiC have already suggested, you could also try:

Code:

echo 'Just using FS...'
awk -F'[^[:digit:]]*' '{$1=$1}1' file

printf '\nUsing FS and a for loop...\n'
awk -F'[^[:digit:]]*' '{for(i = 2; i <= NF; i++) printf("%s%s", $i, (i == NF) ? "\n" : " ")}' file

printf '\nUsing aplit() and a for loop...\n'
awk '
{	n=split($0, fields, /[^[:digit:]]*/)
	for(i = 2; i <= n; i++)
		printf("%s%s", fields[i], (i == n) ? "\n" : " ")
}' file

which, with your input file, produces the output:

Code:

Just using FS...
 1 2 3 4
 4 6 7 9

Using FS and a for loop...
1 2 3 4
4 6 7 9

Using aplit() and a for loop...
1 2 3 4
4 6 7 9

If you want to try this on a Solaris/SunOS system, change awk to /usr/xpg4/bin/awk, /usr/xpg6/bin/awk, or nawk.

The first example is simple but outputs an unwanted leading space. The 2nd and 3rd produce the desired output, one using the field separator to split fields and one using split() to split fields. The last two then use a for loop to print the desired fields (note that with the ERE used for FS and the split(), field 1 is always an empty string.)

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

01-23-2015

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

I hadn't noticed that this thread had been closed when I posted my last suggestion. And, since I received a private message asking how my code worked, I'm going to reopen this thread. I agree that these two threads are related, but I feel that this thread is mostly about using field delimiters other than the default sequences of one or more blank (space and tab) characters, while the other thread is mostly about deleting selected fields from input lines.

From the private e-mail:

Quote:

Hi Don,
The code you provided:

Code:

awk -F'[^[:digit:]]*' '{for(i = 2; i <= NF; i++) printf("%s%s", $i, (i == NF) ? "\n" : " ")}' file

worked great, but I was wondering if you could help me understand what this code is saying...

What does the '[^[:digit:]]*' represent?

It is an extended regular expression (aka ERE) that matches any string of characters (specified by the asterisk at the end) that are not decimal digits (specified by the bracket expression [^[:digit:]] where [:digit:] inside square brackets refers to a single digit in the current locale and the circumflex as the first character in the bracket expression reverses the set of matched characters). Since this is an option-argument to the awk -F option, that ERE as the input field separator for lines being read by awk.

Quote:

and why are you adding two strings "%s%s"

The 1st string printed is the data in the field. The 2nd string printed is the field separator or the line terminator.

Quote:

lastly, what purpose does the semicolon ":" serve at the end of the code?

That is a colon; not a semicolon. In awk (as in C and C++) the expression:

Code:

logical_expression ? true_result : false_result

evaluates to true_result if the logical_expression evaluates to true and evaluates to false_result otherwise. In this case:

Code:

(i == NF) ? "\n" : " "

returns a <newline> character to be printed as the line terminator if i is the number of the last field on the input line; otherwise it returns a <space> character to be printed as a field separator in the current output line.

Quote:

Sorry for all these questions, I just want to know exactly how it works instead of just copying and pasting into my script.

Never apologize for asking questions. We want you to learn how this stuff works.

Quote:

Thanks again!
Rabu

I hope this helps. Let us know if it is still not clear.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

Shell Programming and Scripting

awk split help

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk split and awk calculation in the same command

Discussion started by: cmccabe

2. Shell Programming and Scripting

awk split numbers

Discussion started by: sdf

3. Shell Programming and Scripting

awk to split one field and print the last two fields within the split part.

Discussion started by: yifangt

4. UNIX for Dummies Questions & Answers

awk split

Discussion started by: heecha

5. UNIX for Dummies Questions & Answers

awk split

Discussion started by: jville

6. Shell Programming and Scripting

AWK split

Discussion started by: slarionoff

7. Shell Programming and Scripting

awk to split string

Discussion started by: EAGL�

8. Shell Programming and Scripting

split file with awk

Discussion started by: uwork72

9. Shell Programming and Scripting

awk - split function

Discussion started by: fusionX

10. UNIX for Dummies Questions & Answers

Split a file with no pattern -- Split, Csplit, Awk

Discussion started by: madhunk