Today (Saturday) We will make some minor tuning adjustments to MySQL.

You may experience 2 up to 10 seconds "glitch time" when we restart MySQL. We expect to make these adjustments around 1AM Eastern Daylight Saving Time (EDT) US.


How to replace matching words defined in one file on another file?


Login or Register to Reply

 
Thread Tools Search this Thread
# 1  
How to replace matching words defined in one file on another file?

I have file1 and file2 as shown below,
file1:
Code:
((org14/1-131541:0.11535,((org29/1-131541:0.00055,org7/1-131541:0.00055)1.000:0.10112,((org17/1-131541:0.07344,(org23/1-131541:0.07426,((org10/1-131541:0.00201,org22/1-131541:0.00243)1.000:0.02451,

file2:
Code:
org14=india
org29=america
org7=srilanka
org17=africa
org23=europe
org10=brazil
org22=china

I need to replace the words in file1, based on the matching words defined in file2.

The expected outcome is shown below,
Code:
((india/1-131541:0.11535,((america/1-131541:0.00055,srilanka/1-131541:0.00055)1.000:0.10112,((africa/1-131541:0.07344,(europe/1-131541:0.07426,((brazil/1-131541:0.00201,china/1-131541:0.00243)1.000:0.02451,

.

I could use replace option in gedit, but here I need to replace list of words. Hence, Please help me to do the same.

Thank you in advance.

Last edited by Scrutinizer; 4 Weeks Ago at 04:14 AM.. Reason: Quote tags -> code tags; removed some superfluous quote tags
# 2  
This problem has been solved umpteen times in these fora. Did you bother to search, or look into the proposals given below under "More UNIX and Linux Forum Topics You Might Find Helpful"?


Howsoever, try

Code:
awk 'FNR==NR{REP[$1]=$2; next} {for (r in REP) gsub(r, REP[r])}1' FS="=" file2 file1

This User Gave Thanks to RudiC For This Post:
# 3  
Hi, try:
Code:
awk '
  NR==FNR {
    A[$1]=$2
    next
  } 
  {
    for(i=1; i<=NF; i++)
      if($i in A)
        sub($i,A[$i])
    print
  }
'  FS="=" file2 FS='[(/,]' file1

This User Gave Thanks to Scrutinizer For This Post:
# 4  
Note that RudiC's and Scrutinizer's suggestions both depend on the fact that the orgX and orgXX strings in file2 are distinct. Had file2 also contained the line:
Code:
org2=japan

both of those suggestions might randomly have resulted in japan9 appearing in the output instead of america, japan3 appearing instead of europe, and japan2 appearing instead of china.

If this might be a problem for you, you would either need to be sure that all of your orgXX strings are the same length or sort your orgXX values by decreasing numerical value of XX and process the substitutions from beginning to end in sequence (like Scrutinizer did) instead of using for (r in REP) (like RudiC did).

And, if using Scrutinizer's code and a single orgXX string might occur more than once in a line of input (which does not happen in your sample), you would need to use gsub() instead of sub() to get the desired results.

Last edited by Don Cragun; 4 Weeks Ago at 06:17 PM.. Reason: Fix broken ICODE tag.
These 3 Users Gave Thanks to Don Cragun For This Post:
# 5  
In post #3, isn't
Code:
      if($i in A)
        $i=A[$i]

more correct?
--
I see now, awk will reformat the line, substituting the FS characters with spaces.

Last edited by MadeInGermany; 4 Weeks Ago at 05:37 AM..
This User Gave Thanks to MadeInGermany For This Post:
# 6  
Yes that is correct, #3 uses exact strings, so it correctly identifies the right field, and the sub() in itself isn't the problem either, since iteration occurs over the fields and not over the key value pairs (therefore it can substititute multiple occurrences on one line), but the problem is in the replacement part, it was attempting to use sub() on the record instead of a direct assignment to the field, to avoid losing the file separators.

This adaptation should fix that:
Code:
awk '
  NR==FNR {
    A[$1]=$2
    next
  } 
  {
    for(i=1; i<=NF; i++) {
      n=split($i, F, /[(,]/)
      org=F[n]
      if(org in A)
        sub(org, A[org], $i)
    } 
    print
  }
'  FS="=" file2 FS=/ OFS=/ file1


Last edited by Scrutinizer; 4 Weeks Ago at 06:37 AM..
This User Gave Thanks to Scrutinizer For This Post:
Login or Register to Reply

|
Thread Tools Search this Thread
Search this Thread:
Advanced Search

More UNIX and Linux Forum Topics You Might Find Helpful
Replace particular words in file based on if finds another words in that line
Rajib Podder
Hi All, I need one help to replace particular words in file based on if finds another words in that file . i.e. my self is peter@king. i am staying at north sydney. we all are peter@king. How to replace peter to sham if it finds @king in any line of that file. Please help me...... Shell Programming and Scripting
8
Shell Programming and Scripting
Replace the words in the file to the words that user type?
malfolozy
Hello, I would like to change my setting in a file to the setting that user input. For example, by default it is ONBOOT=ON When user key in "YES", it would be ONBOOT=YES -------------- This code only adds in the entire user input, but didn't replace it. How do i go about...... UNIX for Dummies Questions & Answers
5
UNIX for Dummies Questions & Answers
search the pattern in a file and replace with variable already defined previously in csh
nehashine
I want to replace a certain pattern with the variable already defined. e.g. set path_verilog = /home/priya/bin/verilogfile my file contents are : verilog new verilog is defined here verilog_path_comes I am using the below command sed 's/verilog_path_comes/'$path_verilog'/g' <filename>...... Shell Programming and Scripting
2
Shell Programming and Scripting
How to from grep command from a file which contains matching words?
johnl
Hi all I have a file with below content (content is variable whenever new product is launched). I need form a grep command like this egrep "Unknown product|Invalid symboland so on" How to do it using a script? Unknown product Invalid symbol No ILX exch found exceeds maximum size AFX...... Shell Programming and Scripting
4
Shell Programming and Scripting
getting file words as pattern matching
arunkumar_mca
Sir, I want to check for the repation of a user address in a file i used || as my delimiter and want to check repetaip0n of the address that is mailid and then i have to use IMAP and all. How can i do this... I am in linux ...and my file is linux file. ...... Programming
5
Programming

Featured Tech Videos