Hi guys,
I am stuck in this problem. Please help.
I have two files.
FILE1 (with records starting from '>' )
>TC1723_3 similar to Scific_A7Q9Q3
EMSPSQDYCDDYFKLTYPCTAGAQYYGRGALPVYWNYNYGAIGEALKLDLLNHPEYIEQN
ATMAFQAAIWRWMNPMKKGQPSAHDAFVGNWKP
>TC214_2 similar to Quiet_Ref100_Q8W2B2 Cluster; Capsule catabar holesome, partial (58%)
S**ELSSCY*QRRKMRYSFLIFLTLALLLTTSSAQQCGKQAGGRVCANKLCCSQYGFCGS
SRNYCGAGCQSNCRSVASGNTESEAANAHRKNLPGHSN*SCYSF*FTMNIIMFHVC*LLR
TTNKN
FILE2 ( with 3 columns, col1 is ID col2 and col3 are the substring co-ordinates). It is a single space separated file but shown with '-' for clarity
TC1723_3 - 10 - 40
TC214_2 - 5 - 115
I need the OUTPUT FILE as -
>TC1723_3 similar to Scific_A7Q9Q3 (Region 10 - 40 of 95)
DYFKLTYPCTAGAQYYGRGALPVYWNYNYGA
>TC214_2 similar to Quiet_Ref100_Q8W2B2 Cluster; n=1; Capsule catabar holesome, partial (58%) (Region 5 - 115 of 125)
SSCY*QRRKMRYSFLIFLTLALLLTTSSAQQCGKQAGGRVCANKLCCSQYGFCGSSRNYC
GAGCQSNCRSVASGNTESEAANAHRKNLPGHSN*SCYSF*FTMNIIMFHV
where (Region 10 - 40 of 95) represents region of substring and 95 is the total length of the subsring following the line beginning with '>'
Thanks in advance.