Go Back   The UNIX and Linux Forums > Top Forums > Shell Programming and Scripting
.
google site



Shell Programming and Scripting Post questions about KSH, CSH, SH, BASH, PERL, PHP, SED, AWK and OTHER shell scripts and shell scripting languages here.

Closed Thread
English Japanese Spanish French German Portuguese Italian Powered by Powered by Google
 
Thread Tools Search this Thread Rate Thread Display Modes
  #1 (permalink)  
Old 07-29-2007
Registered User
 

Join Date: Jul 2007
Posts: 114
Post using shell script or awk......

suppose my one file has
>1|lol|hai i am cdfd
dafvshbejndjnudbnjdbyebjcb
bdhbdjndjndmdkmdkldhddjndj
dbdhbndjndjmdklm
sjskndknsknsknks
sbsjksnknsknknklsl
>2|lol|hai ia m cdfd123
jbsjbjbjbjbjbjbjbjbjbj
jkbkkkkkkkkkkkkk
khkkkkkkkkkkkkk
scbjbjbakbqlblabkla
>3lol|hai ia m cdfd3435
jshjhjhjhklbusbikjljsl
skhmsbcjkbkhk;sk=
bsjkhslskksjkljskjls
continue sure that it is a very large file
another file
>1|lol|hai i am cdfd
>2|lol|hai ia m cdfd123
>5lol|hai ia m cdfd6756

so output shud be
1|lol|hai i am cdfd
dafvshbejndjnudbnjdbyebjcb
bdhbdjndjndmdkmdkldhddjndj
dbdhbndjndjmdklm
sjskndknsknsknks
sbsjksnknsknknklsl
>2|lol|hai ia m cdfd123
jbsjbjbjbjbjbjbjbjbjbj
jkbkkkkkkkkkkkkk
khkkkkkkkkkkkkk
scbjbjbakbqlblabkla
>5|lol|i a cdfd7647
akjklsdjljslslkls
ksksjl;vlsjvkd;lsjls
sjsdkvnslnlsjls;lks
sjkdlsjdldsvlsj;;sk;;
Sponsored Links
  #2 (permalink)  
Old 07-29-2007
Shell_Life's Avatar
Registered User
 

Join Date: Mar 2007
Location: Bahia, Brazil
Posts: 695

Code:
csplit -f xyz input_file /^>/ {99}
for mFile in `egrep -l -f reference_file xyz*`
do
  cat ${mFile}
done


Last edited by Shell_Life; 07-29-2007 at 08:57 AM.. Reason: Shorter version.
  #3 (permalink)  
Old 07-29-2007
Registered User
 

Join Date: Jul 2007
Posts: 114
Angry no help for 2 days anything unix or shell command

but if u have one file like that
>gi|62859953|ref|NP_001017322.1| plexin B2 [Xenopus tropicalis]
MKEKERTKAITEIYLTRLLSVKGTLQQFVDNFFQSVLNSNQVVPPAVKYFFDFLDEQAEKYEIKDEDTVHIWKTNSLSLR
FWVNILKNPHFIFDVHVHQVVDASLSVIAQTFMDACSRTEHKLSRESPSNKLLYAKEISTYKKMVEDYYKGIRQMVQVSD
QDMNTHLAEISRAHTESLNTLVALHQLYQYTNKYYDEIINALEEDPAAQRMQLAYRLQQIAAALENKVTDL
>gi|62859377|ref|NP_001016112.1| O-sialoglycoprotein endopeptidase [Xenopus tropicalis]^Agi|111305744|gb|AAI21531.1| O-sialoglycoprotein endopeptidase [Xenopus tropicalis]

MTIVVGFEGSANKIGVGIIQDGKVLSNPRRTYITPPGQGFMPSDTARHHRSCILDVLQEALEEAKIKPQDVDCVAYTKGP
GMGAPLLSVAIVARTVAQLWKKPLLGVNHCIGHIEMGRLITGAENPSVLYVSGGNTQVIAYSERCYRIFGETIDIAVGNC
LDRFARVLKISNDPSPGYNIEQMAKKGKKFVELPYTVKGMDVSFSGILSYIEDMSHKMLSSGECTPEDLCFSLQETLFSM
LVEITERAMAHCGSQEVLIVGGVGCNVRLQEMMGVMCQERGAKLFATDERFCIDNGAMIAQAGWEMFRSGQVTNLQDSWI
TQRYRTDEVEVTWRD
continuing.....
As first file contains full list of all. proteins..............


and another 2nd file like that......
>gb|AAI30281.1| ABCA8 protein [Homo sapiens]
>ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]
>ref|XP_001082492.1| PREDICTED: similar to ATP-binding cassette, sub-family A member 8[Macaca mulatta]
>ref|XP_001165871.1| PREDICTED: similar to KIAA0822 protein isoform 2 [Pan troglodytes]
>ref|XP_001494133.1| PREDICTED: similar to ABCA8 protein [Equus caballus]
continuing.......


so two files have one in common >gb|AAI30281.1| ABCA8 protein [Homo sapiens]
>>gi|114670186|ref|XP_00........contnuing....

So, while taking second file to search for in the first file like this .
u shud have a output like this
>gb|AAI30281.1| ABCA8 protein [Homo sapiens]
MRKRKISVCQQTWALLCKNFLKKWRMKRESLMEWLNSLLLLLCLYIYPHSHQVNDFSSLL
TMDLGRVDTFNESRFSVVYTPVTNTTQQIMNKVASTPFLAGKEVLGLPDEESIKEFTANY
PEEIVRVTFTNTYSYHLKFLLGHGMPAKKEHKDHTAHCYETNEDVYCEVSVFWKEGFVAL
QAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCIISFSSFIYYA
SVNVTRERKRMKALMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIRSTQFIILSGFM
VVFSLFLLYGLSLVALAFLMSILVKKSFLTGLVVFLLTVFWGCLGFTSLYRHLPASLEWI
LSLLSPFAFMLGMAQLLHLDYDLNSNAFPHPSDGSNLIVATNFMLAFDTCLYLALAIYFE
KILPNEYGHRRPPLFFLKSSFWSQTQKTDHVALEDEMDADPSFHDSFEQAPPEFQGKEAI
RIRNVTKEYKGKPDKIEALKDLVFDIYEGQITAILGHSGAGKSTLLNILSGLSVPTKGSV
TIYNNKLSEMADLENLSKLTGVCPQSNVQFDFLTVRENLRLFAKIKGILPQEVDKEIFLL
DEPTAGLDPFSRHQVWNLLKERKTDRVILFSTQFMDEADILADRKVFLSQGKLKCAGSSL
FLKKKWGIGYHLSLQLNEICVEENITSLVKQHIPDAKLSAKSEGKLIYTLPLERTNKFPE
>gi|114670186|ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]
MRKRKISVCQQTWALLCKNFLKKWRMKRESLMEWLNSLLLLLCLYIYPHSHQVNDFSSLLTMDLGRVDTFNESRFSVVYT
PVTNTTQQIMNKVASIPFLAGKEVLGLPDEESIKEFTANYPEEIVRVTFTNTYSYHLKFLLGHGMPAKKEHKDHTAHCYE
TNEDVYCEVSVFWKEGFVALQAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCLISFSSFIYYA
SVNVTRERKRMKGLMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIKSTQFIILSGFMVVFSLFLLYGLSLVALAFLM
SILVKKSFLTGLVVFLLTVFWGCLGFTSLYRHLPASLEWILSLLSPFAFMLGMAQLVHLDYDLNSNAFPHPSDGSNLIVA
>gi|109117480|ref|XP_001082492.1| PREDICTED: similar to ATP-binding cassette, sub-family A member 8 isoform 2 [Macaca mulatta]
MRKREISVCQQTWALLCKNFLKKWRMKRESLMEWLNSFLLLLCLYIYPHSHQVNDFSSLLTTDLGRVDTFNESRFSVAYT
PVTNTTQQIMNKVASAPFLAGKEVLGLPDEESIKEFTANYPEEIVRVIFTNTYSYQLKFLLGHEMPAKKEHKDHTAHCYE
TNEDVYCEVSVFWKEGFVALQAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCIISFSPFIYYA
SVNVTRERKRMKGLMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIKSTLFIILSGFMVVFSLFLLYGLSLIALAFLM
SILVKKSFLTGLVVFLLTVFWGSLGFTALYRYLPASLEWILSLLSPFAFMLGMAQLLHLDYDLNSNAFPHPLDGSNIIVA
TNFMLAFDTCLYLALAIYFEKILPNEYGHRRSPLFFLKSSFWSQTQKADHVALEDEMDADPSSHDSFEPVPPEFHGKEAI
RIRNVTKEYKGKPDKIEALKDLVFDIYEGQITAILGHSGAGKSTLLNILSGLSTPTKGSVTIYNNKLSEVADLENLSKLT

continue


Thanks if u understood and reply soon.....
............

Last edited by cdfd123; 07-30-2007 at 09:03 AM..
Sponsored Links
Closed Thread

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
Accessing variables of one shell script in another shell script rsendhilmani Shell Programming and Scripting 2 03-17-2009 01:17 AM
To call/execute a shell script from a shell script konark UNIX for Dummies Questions & Answers 1 10-26-2007 06:16 PM
How to pass a parameter from one Shell-script to another Shell-script subodhbansal Shell Programming and Scripting 2 09-22-2007 06:19 AM
How to Run a shell script from Perl script in Parent shell? hifake Shell Programming and Scripting 16 08-28-2007 09:42 PM
Have a shell script call another shell script and exit heprox Shell Programming and Scripting 2 11-20-2006 08:17 AM



All times are GMT -4. The time now is 12:22 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited. Language Translations Powered by .
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios
The UNIX and Linux Forums Content Copyright ©1993-2010. All Rights Reserved.Ad Management by RedTyger

Content Relevant URLs by vBSEO 3.2.0