How to replace all but the first 3 characters with sed?


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting How to replace all but the first 3 characters with sed?
# 29  
Old 02-13-2015
Tons of compile time for JAVA, apparently! Smilie

Is ther a Python hacker in the house? Can Ruby do this?
# 30  
Old 02-13-2015
Here profiling data of run java:
Code:
Flat profile of 0,76 secs (58 total ticks): main

  Interpreted + native   Method                        
 17,2%     0  +    10    java.io.FileOutputStream.writeBytes
  6,9%     0  +     4    java.lang.System.arraycopy
  3,4%     2  +     0    java.io.BufferedOutputStream.flush
  3,4%     2  +     0    java.util.Arrays.copyOfRange
  3,4%     2  +     0    java.io.BufferedWriter.write
  1,7%     1  +     0    java.lang.String.indexOf
  1,7%     1  +     0    sun.nio.cs.StreamEncoder.isOpen
  1,7%     1  +     0    java.nio.CharBuffer.wrap
  1,7%     1  +     0    java.io.BufferedReader.ensureOpen
  1,7%     1  +     0    sun.nio.cs.StreamEncoder.flushBuffer
  1,7%     1  +     0    java.io.PrintStream.write
  1,7%     1  +     0    sun.nio.cs.UTF_8$Decoder.decodeArrayLoop
 46,6%    13  +    14    Total interpreted

         Stub + native   Method                        
 44,8%     0  +    26    java.io.FileOutputStream.writeBytes
  5,2%     0  +     3    java.lang.System.arraycopy
 50,0%     0  +    29    Total stub

  Thread-local ticks:
  3,4%     2             Class loader


Global summary of 0,77 seconds:
100,0%    58             Received ticks
120,7%    70             Compilation
  3,4%     2             Class loader

real	0m0.908s
user	0m0.757s
sys	0m0.220s

Regards.

---------- Post updated at 10:32 PM ---------- Previous update was at 10:03 PM ----------

A ruby version it's my first ruby line, so I not sure that this line is the best...
Code:
time ruby -pe 'print $_[0,3]+"*"*($_.length-4)+"\n"' file1 >/dev/null 

real	0m0.092s
user	0m0.087s
sys	0m0.003s

Regards.
# 31  
Old 02-14-2015
Quote:
Originally Posted by disedorgue
Ok, I understand better, so here:

Time reference:
Code:
time awk '{x=substr($0,N+1); gsub(".","*",x); print substr($0,1,N) x}' N=3 file1 > /dev/null
real	0m0.038s
user	0m0.037s
sys	0m0.000s

[..]
And a perl version seems faster that awk:
Code:
time perl -ne 'print substr($_,0,3)."*"x(length()-4)."\n"' file1 >/dev/null 
real	0m0.014s
user	0m0.010s
sys	0m0.000s

Regards.
Indeed the perl is faster because of the powerful repetition (x) feature, but it is not entirely correct, since it does not work correctly for string lengths <=2 (spurious newline)

So I used
Code:
$ time perl -nle 'print substr($_,0,3)."*"x(length()-3)' file > /dev/null

real	0m0.016s
user	0m0.012s
sys	0m0.003s

instead.

The only way I could get close, was with mawk :
Code:
$ time mawk -v n=3 '{m=length()-n+1; $0=substr($0,1,3)} m>1{$m=x}1' OFS=\* file > /dev/null

real	0m0.019s
user	0m0.016s
sys	0m0.002s

Which was marginally slower.

Quote:
Originally Posted by DGPickett
Try C:
Code:
#include <stdio.h>
#include <string.h>
 
int main(){
[..]

Interestingly on OSX 10.10.2 (with LLVM 6.0 compiler) I got:
Code:
$ time ./a.out < file > /dev/null

real	0m0.045s
user	0m0.039s
sys	0m0.002s

On CentOS 6.6 it was fastest though..
Code:
$ time mawk -v n=3 '{m=length()-n+1; $0=substr($0,1,3)} m>1{$m=x}1' OFS=\* file > /dev/null

real	0m0.035s
user	0m0.034s
sys	0m0.001s
$ time perl -nle 'print substr($_,0,3)."*"x(length()-3)' file >/dev/null

real	0m0.026s
user	0m0.022s
sys	0m0.004s
$ time ./a.out < file > /dev/null

real	0m0.021s
user	0m0.019s
sys	0m0.002s

(with gcc 4.4.7)

So which compiler is being used can make quite a difference

Last edited by Scrutinizer; 02-14-2015 at 02:32 AM..
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

sed command on AIX, replace specific characters

Hi, Im using sed on an AIX machine. I am trying to change the 137-139 characters if they are a ' 36'/'000' to a '036'. The positions that need to be changed are fixed. the source data that I have is$cat v.txt 4000422985400050462239065593606500000007422985707771046154054910075641MC0318AMWAY... (9 Replies)
Discussion started by: dsid
9 Replies

2. Shell Programming and Scripting

sed replace characters using a wildcard

Hello, I have some data that looks like the following, > <SALTDATA> (OVS0199262) HCl > <IDNUMBER> (OVS0199262) OVS0199262 > <SUPPLIER> (OVS0199262) TimTec > <EMAIL> (OVS0199262) info@timtec.net > <WEBSITE> (OVS0199262) http://www.timtec.net I need to remove the data in... (3 Replies)
Discussion started by: LMHmedchem
3 Replies

3. Shell Programming and Scripting

sed replace nth characters with string

Hi, I hope you can help me out please? I need to replace from character 8-16 with AAAAAAAA and the rest should stay the same after character 16 gtwrhtrd11111111rjytwyejtyjejetjyetgeaEHT wrehrhw22222222hytekutkyukrylryilruilrGEQTH hrwjyety33333333gtrhwrjrgkreglqeriugn;RUGNEURGU ... (4 Replies)
Discussion started by: stinkefisch
4 Replies

4. Shell Programming and Scripting

Replace characters infile with sed

I have several files in a directory that look like this: jacket-n r potential-n - outcome-n f reputation-n b I want to replace the characters in the second column with certain numbers. For instance, I want the letters 'f', 'r' and 'b' in the second column to replaced with 0 and I want the... (1 Reply)
Discussion started by: owwow14
1 Replies

5. Shell Programming and Scripting

sed replace range of characters in each line

Hi, I'm trying to replace a range of characters by their position in each line by spaces. I need to replace characters 95 to 145 by spaces in each line. i tried below but it doesn't work sed -r "s/^(.{94})(.{51})/\ /" inputfile.txt > outputfile.txt can someone please help me... (3 Replies)
Discussion started by: Kevin Tivoli
3 Replies

6. UNIX for Dummies Questions & Answers

Bash: using SED, trying to replace some characters except first or last line

Hi, I require to replace 2 items: 1. replace start of all lines in a file with ' except the first line 2. replace end of all lines in a file with '||chr( except last line I am able to do the entire file using sed -e s/^/\'/g -e s/$/\'\|\|chr\(/g "$file" > newfile.txt but am not yet... (3 Replies)
Discussion started by: Chella15
3 Replies

7. Shell Programming and Scripting

Bash: using SED, trying to replace some characters except first or last line

Hi, I require to replace 2 items: 1. replace start of all lines in a file with ' except the first line 2. replace end of all lines in a file with '||chr( except last line I am able to do the entire file using sed -e s/^/\'/g -e s/$/\'\|\|chr\(/g "$file" > newfile.txt but am not yet able... (0 Replies)
Discussion started by: Chella15
0 Replies

8. Shell Programming and Scripting

Using sed to replace special characters

Hi everyone I have file1 contains: '7832' ' 8765 6543 I want a sed command that will format as: '7832' , '8765' , '6543' I tried sed -e s/\'//g -e 's/^*//;s/*$//' file1 > file2 sed -e :a -e '$!N; s/\n/ /; ta' file2 which gives: 7832 8765 6543 I need some help to continue with... (5 Replies)
Discussion started by: nimo
5 Replies

9. Shell Programming and Scripting

Sed replace characters not equal to an expression

Hi all, Suppose I have a file with the contents below, and I only want to print words %S_ then | sort -u. ------------------------------ The %S_MSG that starts with '%.*s' is too long. Maximum length is %d. The %S_MSG name '%.*s' contains more than the maximum number of prefixes. The... (5 Replies)
Discussion started by: poldo
5 Replies

10. Shell Programming and Scripting

how to replace control characters using sed?

How can I use sed to replace a ctrl character such as 'new line' (\0a) to something else? Or any other good command can do this job? Thanks, Hillxy (5 Replies)
Discussion started by: hillxy
5 Replies
Login or Register to Ask a Question