using shell script or awk......


 
# 1  
Old 07-29-2007

Suppose my first file contains:
Code:
>1|lol|hai i am cdfd
dafvshbejndjnudbnjdbyebjcb
bdhbdjndjndmdkmdkldhddjndj
dbdhbndjndjmdklm
sjskndknsknsknks
sbsjksnknsknknklsl
>2|lol|hai ia m cdfd123
jbsjbjbjbjbjbjbjbjbjbj
jkbkkkkkkkkkkkkk
khkkkkkkkkkkkkk
scbjbjbakbqlblabkla
>3lol|hai ia m cdfd3435
jshjhjhjhklbusbikjljsl
skhmsbcjkbkhk;sk=
bsjkhslskksjkljskjls
and so on; assume it is a very large file.
The second file contains:
>1|lol|hai i am cdfd
>2|lol|hai ia m cdfd123
>5lol|hai ia m cdfd6756

So the output should be:
>1|lol|hai i am cdfd
dafvshbejndjnudbnjdbyebjcb
bdhbdjndjndmdkmdkldhddjndj
dbdhbndjndjmdklm
sjskndknsknsknks
sbsjksnknsknknklsl
>2|lol|hai ia m cdfd123
jbsjbjbjbjbjbjbjbjbjbj
jkbkkkkkkkkkkkkk 
khkkkkkkkkkkkkk
scbjbjbakbqlblabkla 
>5|lol|i a cdfd7647
akjklsdjljslslkls
ksksjl;vlsjvkd;lsjls
sjsdkvnslnlsjls;lks
sjkdlsjdldsvlsj;;sk;;

# 2  
Old 07-29-2007
Code:
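# Split input_file into one piece per ">" record (xyz00, xyz01, ...).
# -k keeps the pieces when there are fewer than 99 further records
# (csplit then reports "match not found", which is expected here).
csplit -s -k -f xyz input_file '/^>/' '{99}'
# -F treats each reference line as a fixed string, so "|" is not read as regex alternation.
for mFile in $(grep -l -F -f reference_file xyz*)
do
  cat "${mFile}"
done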


Last edited by Shell_Life; 07-29-2007 at 08:57 AM.. Reason: Shorter version.
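
Since the first file is described as very large, a single pass with awk may be an alternative to splitting it into pieces. The following is only a sketch, assuming the header lines in the reference file are character-for-character identical to the ones in the input file. The function name `extract_records` and the shortened sample data are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print every ">" record of input_file whose header
# line appears verbatim in reference_file.
# usage: extract_records reference_file input_file
extract_records() {
  awk '
    FILENAME == ARGV[1] { wanted[$0] = 1; next }  # reference file: remember each header
    /^>/                { keep = ($0 in wanted) } # new record: keep it only if its header is wanted
    keep                                          # print header and sequence lines of kept records
  ' "$1" "$2"
}

# Small demo with shortened sample data (not the real files).
tmp=$(mktemp -d)
printf '%s\n' '>1|lol|hai i am cdfd' 'dafv' 'bdhb' \
              '>2|lol|hai ia m cdfd123' 'jbsj' \
              '>3lol|hai ia m cdfd3435' 'jshj' > "$tmp/input_file"
printf '%s\n' '>1|lol|hai i am cdfd' '>2|lol|hai ia m cdfd123' \
              '>5lol|hai ia m cdfd6756' > "$tmp/reference_file"
extract_records "$tmp/reference_file" "$tmp/input_file"
rm -rf "$tmp"
```

Only the headers from the reference file are held in memory, so the size of the input file should not matter much. A header that differs by even a trailing space will not match.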
# 3  
Old 07-29-2007
No help for 2 days; any Unix or shell command would do

But suppose you have one file like this:
Code:
>gi|62859953|ref|NP_001017322.1| plexin B2 [Xenopus tropicalis]
MKEKERTKAITEIYLTRLLSVKGTLQQFVDNFFQSVLNSNQVVPPAVKYFFDFLDEQAEKYEIKDEDTVHIWKTNSLSLR
FWVNILKNPHFIFDVHVHQVVDASLSVIAQTFMDACSRTEHKLSRESPSNKLLYAKEISTYKKMVEDYYKGIRQMVQVSD
QDMNTHLAEISRAHTESLNTLVALHQLYQYTNKYYDEIINALEEDPAAQRMQLAYRLQQIAAALENKVTDL
>gi|62859377|ref|NP_001016112.1| O-sialoglycoprotein endopeptidase [Xenopus tropicalis]^Agi|111305744|gb|AAI21531.1| O-sialoglycoprotein endopeptidase [Xenopus tropicalis]

MTIVVGFEGSANKIGVGIIQDGKVLSNPRRTYITPPGQGFMPSDTARHHRSCILDVLQEALEEAKIKPQDVDCVAYTKGP
GMGAPLLSVAIVARTVAQLWKKPLLGVNHCIGHIEMGRLITGAENPSVLYVSGGNTQVIAYSERCYRIFGETIDIAVGNC
LDRFARVLKISNDPSPGYNIEQMAKKGKKFVELPYTVKGMDVSFSGILSYIEDMSHKMLSSGECTPEDLCFSLQETLFSM
LVEITERAMAHCGSQEVLIVGGVGCNVRLQEMMGVMCQERGAKLFATDERFCIDNGAMIAQAGWEMFRSGQVTNLQDSWI
TQRYRTDEVEVTWRD
and so on. The first file contains the full list of all proteins.

And a second file like this:
>gb|AAI30281.1| ABCA8 protein [Homo sapiens]
>ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]
>ref|XP_001082492.1| PREDICTED: similar to ATP-binding cassette, sub-family A member 8[Macaca mulatta]
>ref|XP_001165871.1| PREDICTED: similar to KIAA0822 protein isoform 2 [Pan troglodytes]
>ref|XP_001494133.1| PREDICTED: similar to ABCA8 protein [Equus caballus]
and so on.


So the two files have entries in common, for example:
>gb|AAI30281.1| ABCA8 protein [Homo sapiens]
>gi|114670186|ref|XP_00........ and so on.

So, when the headers from the second file are searched for in the first file, the output should look like this:
>gb|AAI30281.1| ABCA8 protein [Homo sapiens]
MRKRKISVCQQTWALLCKNFLKKWRMKRESLMEWLNSLLLLLCLYIYPHSHQVNDFSSLL
TMDLGRVDTFNESRFSVVYTPVTNTTQQIMNKVASTPFLAGKEVLGLPDEESIKEFTANY
PEEIVRVTFTNTYSYHLKFLLGHGMPAKKEHKDHTAHCYETNEDVYCEVSVFWKEGFVAL
QAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCIISFSSFIYYA
SVNVTRERKRMKALMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIRSTQFIILSGFM
VVFSLFLLYGLSLVALAFLMSILVKKSFLTGLVVFLLTVFWGCLGFTSLYRHLPASLEWI
LSLLSPFAFMLGMAQLLHLDYDLNSNAFPHPSDGSNLIVATNFMLAFDTCLYLALAIYFE
KILPNEYGHRRPPLFFLKSSFWSQTQKTDHVALEDEMDADPSFHDSFEQAPPEFQGKEAI
RIRNVTKEYKGKPDKIEALKDLVFDIYEGQITAILGHSGAGKSTLLNILSGLSVPTKGSV
TIYNNKLSEMADLENLSKLTGVCPQSNVQFDFLTVRENLRLFAKIKGILPQEVDKEIFLL
DEPTAGLDPFSRHQVWNLLKERKTDRVILFSTQFMDEADILADRKVFLSQGKLKCAGSSL
FLKKKWGIGYHLSLQLNEICVEENITSLVKQHIPDAKLSAKSEGKLIYTLPLERTNKFPE
>gi|114670186|ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]
MRKRKISVCQQTWALLCKNFLKKWRMKRESLMEWLNSLLLLLCLYIYPHSHQVNDFSSLLTMDLGRVDTFNESRFSVVYT
PVTNTTQQIMNKVASIPFLAGKEVLGLPDEESIKEFTANYPEEIVRVTFTNTYSYHLKFLLGHGMPAKKEHKDHTAHCYE
TNEDVYCEVSVFWKEGFVALQAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCLISFSSFIYYA
SVNVTRERKRMKGLMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIKSTQFIILSGFMVVFSLFLLYGLSLVALAFLM
SILVKKSFLTGLVVFLLTVFWGCLGFTSLYRHLPASLEWILSLLSPFAFMLGMAQLVHLDYDLNSNAFPHPSDGSNLIVA
>gi|109117480|ref|XP_001082492.1| PREDICTED: similar to ATP-binding cassette, sub-family A member 8 isoform 2 [Macaca mulatta]
MRKREISVCQQTWALLCKNFLKKWRMKRESLMEWLNSFLLLLCLYIYPHSHQVNDFSSLLTTDLGRVDTFNESRFSVAYT
PVTNTTQQIMNKVASAPFLAGKEVLGLPDEESIKEFTANYPEEIVRVIFTNTYSYQLKFLLGHEMPAKKEHKDHTAHCYE
TNEDVYCEVSVFWKEGFVALQAAINAAIIEITTNHSVMEELMSVTGKNMKMHSFIGQSGVITDLYLFSCIISFSPFIYYA
SVNVTRERKRMKGLMTMMGLRDSAFWLSWGLLYAGFIFIMALFLALVIKSTLFIILSGFMVVFSLFLLYGLSLIALAFLM
SILVKKSFLTGLVVFLLTVFWGSLGFTALYRYLPASLEWILSLLSPFAFMLGMAQLLHLDYDLNSNAFPHPLDGSNIIVA
TNFMLAFDTCLYLALAIYFEKILPNEYGHRRSPLFFLKSSFWSQTQKADHVALEDEMDADPSSHDSFEPVPPEFHGKEAI
RIRNVTKEYKGKPDKIEALKDLVFDIYEGQITAILGHSGAGKSTLLNILSGLSTPTKGSVTIYNNKLSEVADLENLSKLT

and so on.


Thanks if you understood; please reply soon.

Last edited by cdfd123; 07-30-2007 at 09:03 AM..
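
In this second example the headers are not identical in the two files: the second file has `>ref|XP_001165971.1| ...` where the first has `>gi|114670186|ref|XP_001165971.1| ...`, and the descriptions can differ too. One possible approach is to match on the accession only. The sketch below assumes every header in the second file has the form `>db|ACCESSION| description`, so that the accession is the second `|`-separated field. The function name `extract_by_accession` and the shortened sample data are invented for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print every record of the protein file whose header
# contains an accession listed in the header-only file.
# usage: extract_by_accession header_file protein_file
extract_by_accession() {
  awk -F'|' '
    FILENAME == ARGV[1] {                  # header file: ">db|ACCESSION| description" (assumed layout)
      if (/^>/ && $2 != "") wanted[$2] = 1 # remember the accession
      next
    }
    /^>/ {                                 # protein header: keep if any field is a wanted accession
      keep = 0
      for (i = 1; i <= NF; i++)
        if ($i in wanted) { keep = 1; break }
    }
    keep                                   # print header and sequence lines of kept records
  ' "$1" "$2"
}

# Small demo with shortened sample data (not the real files).
tmp=$(mktemp -d)
printf '%s\n' \
  '>gi|62859953|ref|NP_001017322.1| plexin B2 [Xenopus tropicalis]' \
  'MKEKERTKAITEIYLTRLLSVKG' \
  '>gi|114670186|ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]' \
  'MRKRKISVCQQTWALLCKNF' > "$tmp/proteins"
printf '%s\n' \
  '>gb|AAI30281.1| ABCA8 protein [Homo sapiens]' \
  '>ref|XP_001165971.1| PREDICTED: similar to KIAA0822 protein isoform 5 [Pan troglodytes]' > "$tmp/headers"
extract_by_accession "$tmp/headers" "$tmp/proteins"
rm -rf "$tmp"
```

Because every `|`-separated field of a protein header is checked, a header that carries several joined entries should also match if any one of its accessions is wanted. The printed header is the one from the first file, not the one from the second.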