Replace null values with dot using awk

08-17-2016


Using awk, I am trying to replace all blank or null values with a . in the tab-delimited input. I hope the awk is close. Thank you.

input

Code:

name    test
sam      1

liz         2
al
             1

awk

Code:

awk 'BEGIN{FS=OFS="\t"}{for(i=1;++i<NF;)$i=$i?$i:"."}1' input
awk 'BEGIN { FS = OFS = "\t" } { for(i=1; i<=NF; i++) if($i ~ /^ *$/) $i = 0 }; 1' input

desired output

Code:

name    test
sam      1
.            .
liz         2
al          .
.            1

---------- Post updated at 11:18 AM ---------- Previous update was at 09:48 AM ----------

This awk seems to work:

Code:

awk -F"\t" '{ for (i=1;i<=NF;i++) { if ($i=="") { $i="." } } OFS="\t";print }' file

Thank you

Last edited by cmccabe; 08-17-2016 at 12:36 PM.. Reason: added awk

cmccabe


08-17-2016


Hello cmccabe,

You haven't told us whether you want to preserve the spacing shown in your Input_file. Assuming you have only 2 fields and want to keep the spacing as it is (I tried hard to keep it the same), the following may help.

Code:

awk -F"[[:space:]]+" 'function quick(Q){match(Q,/[[:space:]]+/);W=substr(Q,RSTART,RLENGTH);return W} NF==1{print $1 quick(Q) ".";Q=$0;next} NF==0{print "." quick(Q) ".";Q=$0;next} NF==2{if($0 ~ /^[[:space:]]+/){print "." quick($0) $2} else {print}}{Q=$0}'  Input_file

Output will be as follows.

Code:

name    test
sam      1
.      .
liz         2
al         .
.             1

If your requirements differ from what is shown in your post, please post complete details and the expected output in your next post.
EDIT: Adding a multi-line form of the same solution.

Code:

awk -F"[[:space:]]+" 'function quick(Q){
                                        match(Q,/[[:space:]]+/);
                                        W=substr(Q,RSTART,RLENGTH);
                                        return W
                                       }
                      NF==1            {
                                        print $1 quick(Q) ".";
                                        Q=$0;
                                        next
                                       }
                      NF==0            {
                                        print "." quick(Q) ".";
                                        Q=$0;
                                        next
                                       }
                      NF==2            {
                                        if($0 ~ /^[[:space:]]+/){
                                                                        print "." quick($0) $2
                                                                }
                                        else                    {
                                                                        print
                                                                }
                                       }
                                       {
                                        Q=$0
                                       }
                      '   Input_file

Thanks,
R. Singh

Last edited by RavinderSingh13; 08-17-2016 at 02:24 PM.. Reason: Added a non-one liner form of solution successfully now.


RavinderSingh13


08-17-2016


Code:

sed 's/^ *\t/.\t/; s/\t *\t/\t.\t/g; s/\t$/\t./' input


rdrtx1


08-17-2016


Hello cmccabe,

It seems you have edited your post. If you want to take the number of fields from the very first line (which may be a heading) and fill in values on lines that have fewer fields than the heading, then the following could do it.
Suppose the following is Input_file:

Code:

name    test  test1  test2  test3  test4  test5
sam      1
liz         2
al
            1

Then the following is the code.

Code:

awk 'NR==1{Q=NF;print} NR>1{for(i=1;i<=Q;i++){if(!$i){$i="."}};print}'  Input_file

Output will be as follows.

Code:

name    test  test1  test2  test3  test4  test5
sam 1 . . . . .
. . . . . . .
liz 2 . . . . .
al . . . . . .
1 . . . . . .

As you can see, the number of fields is set to 7 by the very first line. You could also set the number of fields yourself if you don't want to take it from the heading (the very first line).
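Note that the command above uses the default field separator, so a leading empty field is dropped (the 1 in the last row moves to column one), and !$i also treats a field containing 0 as empty. A minimal sketch of a tab-aware variant follows, assuming the input really is tab-delimited as in the original question. The sample data and the names data, padded and n are made up for illustration.

```shell
# Hypothetical tab-delimited input: a 3-column header, then short and blank rows.
data=$(printf 'name\ttest\textra\nsam\t1\n\nal\n\t0\n')

padded=$(printf '%s\n' "$data" |
  awk 'BEGIN { FS = OFS = "\t" }
       NR == 1 { n = NF }    # take the column count from the header line
       { for (i = 1; i <= n; i++) if ($i ~ /^ *$/) $i = "." }
       1')

# Show tabs as "|" so the columns are easy to see.
printf '%s\n' "$padded" | tr '\t' '|'
```

With this sample, the last row keeps its 0 in the second column, because the regular expression test only matches fields that are empty or contain nothing but spaces.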

Thanks,
R. Singh


RavinderSingh13


08-17-2016


Thank you very much

cmccabe


08-17-2016


Quote:

Originally Posted by rdrtx1

Code:

sed 's/^ *\t/.\t/; s/\t *\t/\t.\t/g; s/\t$/\t./' input

That will work as long as:

you never have two adjacent fields only containing zero or more <space> characters (other than in a line only containing one <tab>),
you never have only one or more <space> characters in the last field on an input line,
you are using a version of sed that allows you to separate substitute commands with a <semicolon> character,
you are using a version of the sed utility that recognizes \t in an RE as a <tab> character, and
you are using a version of the sed utility that recognizes \t in a replacement string as a <tab> character

(all of the last three of which produce behavior that is not specified by the standards).

The first problem can be fixed by repeating the 2nd substitute command:

Code:

sed 's/^ *\t/.\t/; s/\t *\t/\t.\t/g; s/\t *\t/\t.\t/g; s/\t$/\t./' input

The second problem can be fixed by adding a * to the RE in the last substitute command:

Code:

sed 's/^ *\t/.\t/; s/\t *\t/\t.\t/g; s/\t *\t/\t.\t/g; s/\t *$/\t./' input

The third problem can be fixed by replacing the <semicolon> characters with <newline> characters:

Code:

sed 's/^ *\t/.\t/
 s/\t *\t/\t.\t/g
 s/\t *\t/\t.\t/g
 s/\t *$/\t./' input

And the last two problems can be fixed by replacing every occurrence of \t in the above sed command with a literal <tab> character.
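To illustrate the first problem, here is a small sketch on a made-up row with two adjacent empty fields. The variable names tab, row, one_pass and two_pass are chosen here for illustration only. With the g flag, sed resumes searching after the end of the previous match, so the <tab> that would start the second empty field has already been consumed.

```shell
# Literal <tab> obtained portably; no reliance on \t inside sed.
tab=$(printf '\t')

# Hypothetical row: "a", two adjacent empty fields, then "b".
row=$(printf 'a\t\t\tb')

# One pass: the second empty field is missed, because the <tab> that
# starts it was already part of the first match.
one_pass=$(printf '%s\n' "$row" | sed "s/$tab *$tab/$tab.$tab/g")

# Two passes: the repeated command picks up the field left behind.
two_pass=$(printf '%s\n' "$row" | sed "s/$tab *$tab/$tab.$tab/g
s/$tab *$tab/$tab.$tab/g")

# Show tabs as "|" so the result is easy to read.
printf '%s\n' "$one_pass" | tr '\t' '|'    # prints a|.||b
printf '%s\n' "$two_pass" | tr '\t' '|'    # prints a|.|.|b
```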

Don Cragun


08-17-2016


Your 2nd solution is better structured than your working solution.

Code:

awk 'BEGIN { FS=OFS="\t" } { for (i=1; i<=NF; i++) if ($i~/^ *$/) $i="." } 1' input

It treats fields that contain only space characters as empty, unlike your last solution, which checks only for truly empty fields:

Code:

awk 'BEGIN { FS=OFS="\t" } { for (i=1; i<=NF; i++) if ($i=="") $i="." } 1' input
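The difference between the two tests can be seen on a small made-up sample. This is only a sketch, and the names sample, blank_as_empty and strict_empty are invented for it. Field 2 of the first row holds a single space; field 2 of the second row is truly empty.

```shell
# Hypothetical sample: one field with a single space, one truly empty field.
sample=$(printf 'a\t \tb\nc\t\td\n')

# Regex test: fields that are empty or contain only spaces become "."
blank_as_empty=$(printf '%s\n' "$sample" |
  awk 'BEGIN { FS = OFS = "\t" } { for (i = 1; i <= NF; i++) if ($i ~ /^ *$/) $i = "." } 1')

# String test: only truly empty fields become "."
strict_empty=$(printf '%s\n' "$sample" |
  awk 'BEGIN { FS = OFS = "\t" } { for (i = 1; i <= NF; i++) if ($i == "") $i = "." } 1')

# Show tabs as "|" so the single space stays visible.
printf '%s\n' "$blank_as_empty" | tr '\t' '|'
printf '%s\n' "$strict_empty" | tr '\t' '|'
```

The first command replaces the middle field in both rows. The second leaves the single space in the first row untouched.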

---------- Post updated at 15:45 ---------- Previous update was at 15:12 ----------

Here is Don's sed solution after replacing \t with ${T} and putting the code in double quotes, so the shell can substitute each ${T}.
For general safety I added the braces and escaped the $ (neither is really required here).

Code:

T=$'\t'; sed "
s/^ *${T}/.${T}/
s/${T} *${T}/${T}.${T}/g
s/${T} *${T}/${T}.${T}/g
s/${T} *\$/${T}./
" input
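The $'\t' form is not available in every sh. A sketch of a variant that obtains the <tab> with printf instead follows, run on a made-up row. The sample row and the name fixed are assumptions for illustration; the sed commands are the same as above.

```shell
# Portable way to get a literal <tab> in shells without $'...' support.
T=$(printf '\t')

# Hypothetical row: a leading field with one space, "x", a field of
# two spaces, and an empty trailing field.
fixed=$(printf ' \tx\t  \t\n' | sed "
s/^ *${T}/.${T}/
s/${T} *${T}/${T}.${T}/g
s/${T} *${T}/${T}.${T}/g
s/${T} *\$/${T}./
")

# Show tabs as "|" so the result is easy to read.
printf '%s\n' "$fixed" | tr '\t' '|'    # prints .|x|.|.
```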

MadeInGermany

