Decoding and pattern matching


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Decoding and pattern matching
# 1  
Old 08-22-2014
Decoding and pattern matching

Hello,
I have a huge file with over 700,00 SNPs with 18 columns. One column is in the format

Code:
--+-+
---++
?????
-????

Now i have another list which corresponds to this code in a particular order
Code:
A-1
B-7
C-11
D-3
E-100

Now I need to match the expression above to the pattern, where +/- corresponds to A-E and ? corresponds to 0 and as an output I need to get a number.

So the output should be
Code:
--+-+ = 122  (because 1+7+11+3+100)
---++= 122
?????= 0
-???+= 101

This is just a sample, the numbers are much bigger and the pattern runs upto 24 times. Is there any way I can do it with bash scripting ?
Thank you
# 2  
Old 08-22-2014
Something like
Code:
awk     'NR==FNR        {V[NR]=$2;next}
                        {sum=0
                         for (i=1; i<=NF; i++) sum+=($i=="?")?0:V[i]
                         print $0 " = " sum}
        ' FS="-" file1 FS="" file2
--+-+ = 122
---++ = 122
????? = 0
-???? = 1

(Doesn't match your sample output as that doesn't match your input file)
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Big pattern file matching within another pattern file in awk or shell

Hi I need to do a patten match between files . I am new to shell scripting and have come up with this so far. It take 50 seconds to process files of 2mb size . I need to tune this code as file size will be around 50mb and need to save time. Main issue is that I need to search the pattern from... (2 Replies)
Discussion started by: nitin_daharwal
2 Replies

2. UNIX for Dummies Questions & Answers

Grep -v lines starting with pattern 1 and not matching pattern 2

Hi all! Thanks for taking the time to view this! I want to grep out all lines of a file that starts with pattern 1 but also does not match with the second pattern. Example: Drink a soda Eat a banana Eat multiple bananas Drink an apple juice Eat an apple Eat multiple apples I... (8 Replies)
Discussion started by: demmel
8 Replies

3. Shell Programming and Scripting

PHP - Regex for matching string containing pattern but without pattern itself

The sample file: dept1: user1,user2,user3 dept2: user4,user5,user6 dept3: user7,user8,user9 I want to match by '/^dept2.*/' but don't want to have substring 'dept2:' in output. How to compose such regex? (8 Replies)
Discussion started by: urello
8 Replies

4. Shell Programming and Scripting

Sed: printing lines AFTER pattern matching EXCLUDING the line containing the pattern

'Hi I'm using the following code to extract the lines(and redirect them to a txt file) after the pattern match. But the output is inclusive of the line with pattern match. Which option is to be used to exclude the line containing the pattern? sed -n '/Conn.*User/,$p' > consumers.txt (11 Replies)
Discussion started by: essem
11 Replies

5. UNIX for Dummies Questions & Answers

Find pattern suffix matching pattern

Hi, I am trying to get a result out of this but fails please help. Have two files /tmp/1 & /tmp/hosts. /tmp/1 IP=123.456.789.01 WAS_HOSTNAME=abcdefgh.was.tb.dsdc /tmp/hosts 123.456.789.01 I want this result in /tmp/hosts if hostname is already there dont want duplicate entry. ... (5 Replies)
Discussion started by: rajeshwebspere
5 Replies

6. Shell Programming and Scripting

sed - matching pattern one but not pattern two

All, I have the following file: -------------------------------------- # # /etc/pam.d/common-password - password-related modules common to all services # # This file is included from other service-specific PAM config files, # and should contain a list of modules that define the services... (2 Replies)
Discussion started by: RobertBerrie
2 Replies

7. Shell Programming and Scripting

counting the lines matching a pattern, in between two pattern, and generate a tab

Hi all, I'm looking for some help. I have a file (very long) that is organized like below: >Cluster 0 0 283nt, >01_FRYJ6ZM12HMXZS... at +/99% 1 279nt, >01_FRYJ6ZM12HN12A... at +/99% 2 281nt, >01_FRYJ6ZM12HM4TS... at +/99% 3 283nt, >01_FRYJ6ZM12HM946... at +/99% 4 279nt,... (4 Replies)
Discussion started by: d.chauliac
4 Replies

8. Shell Programming and Scripting

Only matching each different pattern once

I have the following snippet of script: for i in `ps axuc | awk '{ print $11 }' | grep -c 1 awk '{ print $1 }'` ; do ps axuc | grep $i | awk '{ do { x+=1 ; y+=$4 } while ( x < NR )} END { print y }' done The idea is to: a) print the process list b) grab the last field ($11 ==... (3 Replies)
Discussion started by: fiori_musicali
3 Replies

9. Shell Programming and Scripting

comment/delete a particular pattern starting from second line of the matching pattern

Hi, I have file 1.txt with following entries as shown: 0152364|134444|10.20.30.40|015236433 0233654|122555|10.20.30.50|023365433 ** ** ** In file 2.txt I have the following entries as shown: 0152364|134444|10.20.30.40|015236433 0233654|122555|10.20.30.50|023365433... (4 Replies)
Discussion started by: imas
4 Replies

10. Shell Programming and Scripting

pattern matching in an if-then

I'm having trouble getting my syntax right here; I want to test the value of $1 and return true if it is between 0 and 9. I've tried many combinations including if ] but none have worked when passing, say, "5" into $1. Any help is appreciated. Thanks. (4 Replies)
Discussion started by: lumix
4 Replies
Login or Register to Ask a Question