The UNIX and Linux Forums  

Go Back   The UNIX and Linux Forums > Top Forums > Shell Programming and Scripting
.
google unix.com



Shell Programming and Scripting Post questions about KSH, CSH, SH, BASH, PERL, PHP, SED, AWK and OTHER shell scripts and shell scripting languages here.

More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
extract the lines prvnrk Shell Programming and Scripting 2 05-25-2008 07:17 AM
logic needed to extract lines ganapati UNIX for Dummies Questions & Answers 2 01-10-2008 07:18 AM
Extract lines from a file automatically. Please a Help alexcol Shell Programming and Scripting 8 12-16-2006 05:25 PM
Extract known lines nhatch UNIX for Dummies Questions & Answers 2 09-16-2004 12:59 PM
how to extract a range of lines from a file beilstwh Shell Programming and Scripting 5 07-09-2004 09:20 AM

 
English Japanese Spanish French German Portuguese Italian Dutch Swedish Russian Norwegian Hungarian Hebrew Danish Bulgarian Greek Powered by Powered by Google
 
LinkBack Thread Tools Search this Thread Rate Thread Display Modes
Prev Previous Post   Next Post Next
  #1 (permalink)  
Old 04-10-2006
mskcc mskcc is offline
Registered User
  
 

Join Date: Jul 2005
Posts: 37
extract lines with a given list of identifiers

Hi All,

My question is if the simple but powerful shell scripts can extract data from a big data file by using a list of identifier. I used to put everything in the database and do joining, which sounds stupid but only way I knew. For example, my data file looks like,

GENE13810X GENE7798X 0.982666016
GENE4333X GENE487X 0.981506348
GENE7806X GENE3731X 0.981079102
GENE13020X GENE4755X 0.980102539
GENE7521X GENE3733X 0.979614258
GENE6499X GENE233X 0.979370117
GENE12708X GENE8435X 0.979064941
GENE4114X GENE4113X 0.978820801
GENE10919X GENE10568X 0.978820801
GENE5651X GENE1342X 0.978210449
GENE7657X GENE6004X 0.977905273
NODE9X GENE3712X 0.977783203
GENE12950X NODE22X 0.977783203
NODE19X GENE34X 0.977783203
GENE7642X GENE3768X 0.977539063
GENE10831X GENE8296X 0.977294922
GENE7952X NODE10X 0.977111816
GENE3807X GENE3806X 0.976501465
GENE12393X NODE23X 0.976501465
GENE2694X NODE29X 0.976501465
NODE30X GENE11332X 0.976501465
GENE3703X GENE3702X 0.976257324
GENE9709X GENE5625X 0.976013184
GENE3526X GENE2743X 0.975769043
GENE12776X NODE3X 0.975708008
GENE11770X NODE35X 0.975708008
GENE4542X NODE24X 0.975463867
GENE5074X GENE1267X 0.975280762
GENE14374X GENE8560X 0.975219727
GENE5872X NODE36X 0.974914551
GENE8550X NODE38X 0.974914551

The given list based on first column can be,
GENE12708X
GENE4114X
GENE10919X
GENE5651X
GENE7657X
NODE9X
GENE12950X
NODE19X
GENE7642X
GENE10831X
GENE7952X

Thanks
 

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On




All times are GMT -4. The time now is 11:08 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited. Language Translations Powered by .
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios
The UNIX and Linux Forums Content Copyright ©1993-2009. All Rights Reserved.Ad Management by RedTyger

Content Relevant URLs by vBSEO 3.2.0