Remove '.' from file for numbers ending in '.'


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Remove '.' from file for numbers ending in '.'
# 1  
Old 02-16-2015
Remove '.' from file for numbers ending in '.'

Hi,
I have numerous files which have data in the following format

Code:
A|B|123.|Mr.|45.66|33|zz
L|16.|33.45|AC.|45.

I want to remove decimal point only if it is last character in a number.
O/p should be

Code:
A|B|123|Mr.|45.66|33|zz
L|16|33.45|AC.|45

I tried this
Code:
sed -e 's/.|/|/g'

Problem with above is that it removes the '.' for Mr.
Also I want to remove '.' for last field in second line 45. should be 45
Basically any numeric filed which has a decimal but no number after decimal, should have decimal point removed

Thanks
# 2  
Old 02-16-2015
Code:
sed -e 's/\([0-9]\).|/\1|/g' -e 's/\([0-9]\).$/\1/'

This User Gave Thanks to agent.kgb For This Post:
# 3  
Old 02-16-2015
Works almost. Escaping the dot improves it:
Code:
sed -e 's/\([0-9]\)\.|/\1|/g' -e 's/\([0-9]\)\.$/\1/' file
A|B|123|Mr.|45.66|33|zz
L|16|33.45|AC.|45

---------- Post updated at 18:47 ---------- Previous update was at 18:43 ----------

And, if your sed allows for EREs, this might work as well:
Code:
sed -r 's/([0-9])\.(\||$)/\1\2/g' file

# 4  
Old 02-16-2015
Note that the suggestions test if the preceding character is a digit, not if a field that consists of a number ends with a dot. For example if one of the fields would be A1. then this approach would fail.

An alternative would be to split it into fields and test each field if it is numeric and if it ends in a dot. For example:
Code:
awk '{for(i=1; i<=NF; i++) if($i==$i+0) sub(/\.$/,x,$i)}1' FS=\| OFS=\| file

# 5  
Old 02-16-2015
Quote:
Originally Posted by Scrutinizer
Note that the suggestions test if the preceding character is a digit, not if a field that consists of a number ends with a dot. For example if one of the fields would be A1. then this approach would fail.
True. Here is a sed-script covering this:

Code:
sed 'start:;s/\(|*[0-9][0-9]*\)\.|/\1|/;t start' /path/to/file

Note that i need a loop (instead of the "g" option) because i have to match the leading AND the trailing field separator.

I hope this helps.

bakunin
# 6  
Old 02-17-2015
Works almost. The colon needs to go in front of the start label, and, due to the |*, it will remove A1.'s dot as well. Removing the star from the pipe, it will not catch the line start. Right now, I can't see a solution...

---------- Post updated at 10:42 ---------- Previous update was at 10:39 ----------

... unless EREs are possible:
Code:
sed -r 's/(^|\|)([0-9]+)\.(\||$)/\1\2\3/g' file

# 7  
Old 02-17-2015
Even with ERE, you need to run it twice, perhaps conditionally if the first hits, as you used both start and end field '|'. Otherwise, you miss the following adjacent fields on a line like: "123.|456."

You can add pipes to both ends for the substitute and then remove them:
Code:
sed '
  s/.*/|&|/
  s/\(|[0-9]\{1,99\}\)\.|/\1|/g
  t again
  b end
  :again
  s/\(|[0-9]\{1,99\}\)\.|/\1|/g
  :end
  s/^|//
  s/|$//
 ' in_file

I could have removed both added pipes with one substitute "s/^|\(.*\)|$/\1/" but these back references are a bit slower, in my experience, so I avoid them where possible.

Two passes can be avoided if a less careful pattern is sought, like "s/\([0-9]\).|/\1/", as it encompasses only one pipe. It mangles any non-numeric field with a trailing numer and dot, like "123|The field count of this line is 3.|xyz".
These 2 Users Gave Thanks to DGPickett For This Post:
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Remove lines ending with a certain character

I have a file of a content like this: abc_bla -def 800 abc_bla -def 802 abc_bla -def 804 abc_bla -def 806 abc_bla -def 808 abc_bla -def 810 abc_bla -def 812 abc_bla -def 814 ... abc_bla -def 898 abc_bla -def 900 abc_bla -def 902 abc_bla -def 904 ... abc_bla -def 990 abc_bla -def... (7 Replies)
Discussion started by: maya3
7 Replies

2. Shell Programming and Scripting

How to remove the numbers in a file in perl script?

Thanks (1 Reply)
Discussion started by: Raysf
1 Replies

3. UNIX for Advanced & Expert Users

Pring starting and ending numbers using UNIX

Hi all, I need to do scrip for printing starting and ending numbers along with count in given file.:wall: Input: a.txt 10000030 10000029 10000028 10000027 10000026 10000024 10000023 10000021 10000018 10000018 10000017 10000016 10000015 10000014 (2 Replies)
Discussion started by: jackbell2013
2 Replies

4. Shell Programming and Scripting

Remove certain lines from file based on start of line except beginning and ending

Hi, I have multiple large files which consist of the below format: I am trying to write an awk or sed script to remove all occurrences of the 00 record except the first and remove all of the 80 records except the last one. Any help would be greatly appreciated. (10 Replies)
Discussion started by: nwalsh88
10 Replies

5. Shell Programming and Scripting

Remove ending text

Hello, I am working with a list that contains a large number of files listed by their absolute path. I am trying to determine a way to delete the file name at the end of each line, therefore leaving just the directory path. For example, I'd like to go from: /home/something/file... (2 Replies)
Discussion started by: omnivir
2 Replies

6. Shell Programming and Scripting

Remove Numbers from file

I have a file that has some text that looks like this Some Text 1. More text 2. Different text Final Text I would like the remove the lines of text that start with the numbers. Some Text Final Text I have tried to use cat file.txt | grep -Ev 1. >... (9 Replies)
Discussion started by: icculus99
9 Replies

7. Shell Programming and Scripting

Help in retrieving the ending line numbers of the functions

Hi! I've a C file which consist of many function definitions with numbers at the beginning as shown below.. 10 void search() 11 { 12 /*body 14 * 15 * 17 * 18 * 40 * 42 * 60 } 90 void func_name() 95 { 99 /*body 100 * 105 * 111 * (7 Replies)
Discussion started by: abk07
7 Replies

8. UNIX for Dummies Questions & Answers

how to remove lines ending with '*'

I have a file where some lines end with '*'. I would like to remove those lines ending with '*'. inFile: a b* c d*outFile: a cThank you (7 Replies)
Discussion started by: jdhahbi
7 Replies

9. Shell Programming and Scripting

Remove Multiple numbers from file.

Hi, I am trying to cleanup 7 or 10 digits numeric from the file. So for example : Input : 3M Corporation 3M Inc. 888-356-8765 3M Inc. 356-8765 3M Inc. 3568765 3M Inc. 356-8765 3M 8883568765 Inc. Output : 3M Corporation 3M Inc. - - 3M Inc. - 3M Inc. 3M Inc. - (8 Replies)
Discussion started by: msalam65
8 Replies

10. Shell Programming and Scripting

how to remove files with only numbers as file names?

Hi all, I have a bunch of files that are named like 12543, 467249877, etc all over some directories.These files are named only with numbers, they dont have any letters or special characters in their file names. Could you please help me out and give me some command/script to remove only those... (6 Replies)
Discussion started by: praveen_indramo
6 Replies
Login or Register to Ask a Question