Add Delimiter after 2 decimal point for a particular column

04-04-2017


Thanks RudiC. It would be great if you could explain the sed command you shared, as I don't really understand sed well.

ckwan123


04-04-2017


It's a substitute command, matching a not-too-complicated regex:

Code:

sed -r 's/(\.[0-9][0-9])([^�])/\1�\2/2' file

sed -r          # switching on ERE (extended regex) behaviour
'               # single quote to open the first parameter
s               # substitute command
/               # default char to open a regex
(               # opening the first parenthesized subexpression for back reference
\.[0-9][0-9]    # regex: escaped dot, double digit
)               # closing first par. subexp.
([^�])          # second par. subexp.: any char except �
/               # default char to switch from matching pattern to replacement string
\1              # use first back reference
�               # insert literal �
\2              # use second b.r.
/               # close replacement string
2               # option flag: apply to second occurrence only
'               # close first parameter
 file           # file to read / modify
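
To see the occurrence flag in action, here is a small sketch on a made-up line. It assumes `|` as a stand-in for the thread's delimiter character, and the function name `add_delim_nth` is hypothetical. It uses the basic-regex form of the same command, so it does not depend on `-r`.

```shell
#!/bin/sh
# Assumption: '|' stands in for the thread's delimiter character.
# add_delim_nth FLAG: insert '|' after a ".dd" that is followed by a
# character other than '|'. FLAG is the s command's trailing flag:
# a number N (only the Nth match) or 'g' (every match).
add_delim_nth() {
    sed 's/\(\.[0-9][0-9]\)\([^|]\)/\1|\2/'"$1"
}

# Made-up sample: .57 and .59 are already followed by '|', so they never match.
line='A|146713.57|00005.05000|762.59|301223.0528|1165'

printf '%s\n' "$line" | add_delim_nth 1   # A|146713.57|00005.05|000|762.59|301223.0528|1165
printf '%s\n' "$line" | add_delim_nth 2   # A|146713.57|00005.05000|762.59|301223.05|28|1165
printf '%s\n' "$line" | add_delim_nth g   # A|146713.57|00005.05|000|762.59|301223.05|28|1165
```

Only matches of the whole pattern are counted, which is why a value that already ends at a delimiter does not use up an occurrence number.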


RudiC


04-04-2017


Thanks RudiC for your clear explanation.

ckwan123


04-04-2017


Quote:

Originally Posted by ckwan123

...
...
I managed to resolve it with a minor change to the occurrence flag: use /2 instead of /g in the sed command.

Code:

  sed 's/\(\.[0-9][0-9]\)\([^�]\)/\1�\2/2'

  1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25

I hope we can also learn how to achieve this with a perl command.
...
...

I must say, that feature of sed is pretty impressive!
In Perl, you could use the "e" modifier with Perl code in the replacement that increments a counter and replaces conditionally. I haven't tried it, though.
Alternatively, here's the long version of the solution.
It loops through a line matching the pattern "\.\d\d", incrementing a counter on each match and inserting the delimiter where requested.
I've added some flexibility as well, so you can replace only the 2nd occurrence, or the 2nd, 4th and 6th, or the 2nd through 5th, or all of them, etc.

Code:

$ 
$ cat replace_occurrences.pl
#!/usr/bin/perl
# Usage: perl replace_occurrences.pl data.txt 2      # replace 2nd occurrence only
#        perl replace_occurrences.pl data.txt 2,4,5  # replace 2nd, 4th, 5th occurrences
#        perl replace_occurrences.pl data.txt 2-6    # replace 2nd through 6th occurrences
#        perl replace_occurrences.pl data.txt        # replace all occurrences
#        perl replace_occurrences.pl data.txt 3-     # replace the 3rd and all occurrences after it
use strict;
use warnings;
@ARGV or die "Usage: perl $0 file [range]\n";
my $infile = $ARGV[0];
my $rep_range = $ARGV[1] // "";   # no range given means "all occurrences"
my %rep;
my $till_the_end = 0;
my $start_index;

# Subroutine section
# Translate the range argument into %rep (explicit occurrence numbers)
# or into $start_index/$till_the_end (open-ended range).
sub set_rep_indexes {
    if ($rep_range =~ /^\d+$/) {
        $rep{$rep_range}++;
    } elsif ($rep_range =~ /,/) {
        %rep = map {$_ => 1} split(',', $rep_range);
    } elsif ($rep_range =~ /(\d+)-(\d+)/) {
        for (my $i=$1; $i<=$2; $i++) { $rep{$i}++ }
    } elsif ($rep_range eq "" or $rep_range =~ /(\d+)-/) {
        $start_index = $rep_range eq "" ? 1 : $1;
        $till_the_end = 1;
    }
}

# Walk the line match by match, counting occurrences and inserting the
# delimiter after the requested ones.
sub replace_occurrences {
    my $x = shift;
    my $str = "";
    my $count = 1;
    # A failed match returns an empty list, which ends the loop. This also
    # handles a match at the very start of the line (empty $pre).
    while (my ($pre, $token, $post) = $x =~ /^(.*?)(\.\d\d)([^�])/) {
        if (($till_the_end and $count >= $start_index) or defined $rep{$count}) {
            $str .= "${pre}${token}�${post}";
        } else {
            $str .= "${pre}${token}${post}";
        }
        # Drop the consumed prefix by length, so that "." and other
        # metacharacters in it are never re-interpreted as a regex.
        substr($x, 0, length("$pre$token$post"), "");
        $count++;
    }
    $str .= $x;
    return $str;
}

# Main section
set_rep_indexes();
open(my $fh, '<', $infile) or die "Can't open $infile: $!";
while (my $line = <$fh>) {
    chomp $line;
    print replace_occurrences($line), "\n";
}
close($fh) or die "Can't close $infile: $!";

$ 
$ cat data.txt
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.0528�1165�2017-03-31 20:34:25
0.556abc�def�0005.0500�ghi�jkl�123.45�678�mn�opq�5678.90123�rs�tuvw�0.1234�xyz�6.55
$ 
$ perl replace_occurrences.pl data.txt 2
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.556abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90123�rs�tuvw�0.1234�xyz�6.55
$ 
$ perl replace_occurrences.pl data.txt 1,2,4
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05�000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.55�6abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90123�rs�tuvw�0.12�34�xyz�6.55
$ 
$ perl replace_occurrences.pl data.txt 1-3
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05�000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.55�6abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90�123�rs�tuvw�0.1234�xyz�6.55
$ 
$ perl replace_occurrences.pl data.txt 2-
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.556abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90�123�rs�tuvw�0.12�34�xyz�6.55
$ 
$ perl replace_occurrences.pl data.txt
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05�000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.55�6abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90�123�rs�tuvw�0.12�34�xyz�6.55
$

If you are looking for a one-liner, this isn't it unfortunately.
And it doesn't have to be in Perl. It's just an algorithm that you can implement in any language you are comfortable with and have available: awk, Bash, Python, C, etc.
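
As a rough illustration of that point, the same counting loop could look like this in awk. This is only a sketch: `|` is a stand-in for the thread's delimiter character, the function name `nth_delim` is hypothetical, and it handles a single occurrence number rather than the ranges the Perl script accepts.

```shell
#!/bin/sh
# Assumption: '|' stands in for the thread's delimiter character.
# nth_delim N: read lines on stdin and insert '|' after the Nth ".dd"
# that is followed by a character other than '|'.
nth_delim() {
    awk -v want="$1" '
    {
        rest = $0; out = ""; count = 0
        # match() sets RSTART and RLENGTH for the leftmost match in rest
        while (match(rest, /\.[0-9][0-9][^|]/)) {
            count++
            head = substr(rest, 1, RSTART + 2)    # text up to and including ".dd"
            post = substr(rest, RSTART + 3, 1)    # the character after ".dd"
            if (count == want) head = head "|"
            out  = out head post
            rest = substr(rest, RSTART + RLENGTH) # continue after the match
        }
        print out rest
    }'
}

printf '%s\n' '0.556abc|def|0005.0500|ghi|jkl|123.45|678' | nth_delim 2
```

Like the sed command, it consumes the character after the two digits before searching again, so the occurrence numbering should line up with sed's.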
Cheers!

Last edited by durden_tyler; 04-05-2017 at 10:10 AM..


durden_tyler


04-04-2017


Thanks Durden for your effort in working out this solution in Perl. Indeed, Unix sed is a mature, powerful tool. Without it, we would probably need to write a lengthy script to achieve the same objective.

ckwan123


04-05-2017


Quote:

Originally Posted by ckwan123

...Indeed, Unix sed is a mature, powerful tool. Without it, we would probably need to write a lengthy script to achieve the same objective.

Agreed.

Code:

$ 
$ cat data.txt
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.0528�1165�2017-03-31 20:34:25
0.556abc�def�0005.0500�ghi�jkl�123.45�678�mn�opq�5678.90123�rs�tuvw�0.1234�xyz�6.55
$ 
$ perl -plne '$i=1; s/(\.\d\d)(?!�)/$i++ == 2 ? $1."�" : $1/eg' data.txt
1007937820�L�2016-12-19�000000002�2018-02-01�2050-12-01�00395�M�146713.57�00005.05000�762.59�00395�M�301223.05�28�1165�2017-03-31 20:34:25
0.556abc�def�0005.05�00�ghi�jkl�123.45�678�mn�opq�5678.90123�rs�tuvw�0.1234�xyz�6.55
$ 
$

Last edited by durden_tyler; 04-05-2017 at 12:27 AM..


durden_tyler

