Hi Rohon,
The list files are named as follows: List1.txt, List2.txt, ..., List4000.txt
And the corresponding data files are: DataFile1.txt, DataFile2.txt, ..., DataFile4000.txt
List1.txt will be used for DataFile1.txt, List2.txt for DataFile2.txt, and so on.
The contents of both files are unsorted. For example:
List1.txt
contig00002 length=653 numreads=34
contig00015 length=636 numreads=21
contig00005 length=662 numreads=51
contig00045 length=584 numreads=24
contig00033 length=539 numreads=19
DataFile1.txt
>contig00015 length=477 numreads=22
GGGGCTGACGTGGCCGCTAATACGACTCACTATAGGGAGAGTAAGTGAAT
GTCACATCGTTTGGATCAAGACCCATTTGCAGCACAAGCCCTGTTTTGTT
>contig00002 length=530 numreads=27
GGGCTGACGTGGCCGCTAATACGACTCACTATAGGGAGAGGAGGATAGGG
AGCTGAGCAGCCAGTGACAGGATCCAGCTCCAGGGGGTGAATGGGGATGG
>contig00005 length=670 numreads=22
GGGGCTGACGTGGCCGCTAATACGACTCACTATAGGGAGAGATTGTTGAA
GTGGAAAGCCATTTTGACTATTACCGCCCGGTGGCAGAAACCAAACCTGG
>contig00045 length=636 numreads=21
GGGCAGCTGCGGCCGCTAATACGACTCACTATAGGGAGAGATCGTGGCGA
TCGCCAATCACCCAGGTGCCGTTAGCCAGAGCTGGTTTGATGACCGTTTC
>contig00072 length=662 numreads=51
GGGCAGCTGCGGCCGCTAATACGACTCACTATAGGGAGAGAGCTCCAGCA
GAATGGACACGCCTCCTGAGCTGTGATAGGGAGAGCATAAACACGCCTCC
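One way would be to read one header from the list file, say 'contig00002 length=653 numreads=34', search for it in DataFile1.txt (matching on the contig name, since the length and numreads values differ between the two files), and retrieve the following record: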
'>contig00002 length=530 numreads=27
GGGCTGACGTGGCCGCTAATACGACTCACTATAGGGAGAGGAGGATAGGG
AGCTGAGCAGCCAGTGACAGGATCCAGCTCCAGGGGGTGAATGGGGATGG'
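A rough Python sketch of that idea, in case it helps. It assumes the contig name is the first whitespace-separated token on each list line and on each '>' header line, and that a record runs from its '>' line up to the next '>' line. The function names and the Extracted{i}.txt output name are made up for illustration. Instead of searching the data file once per header, it reads the data file a single time into a dictionary keyed by contig name, which should matter when repeating this over 4000 file pairs.

```python
def wanted_ids(list_lines):
    """Return contig names (first token of each non-empty line), in list order."""
    ids = []
    for line in list_lines:
        fields = line.split()
        if fields:
            ids.append(fields[0])
    return ids


def index_records(data_lines):
    """Map contig name -> record text (the '>' header line plus its sequence lines)."""
    records = {}
    current = None  # lines of the record currently being read
    for line in data_lines:
        line = line.rstrip("\r\n")
        if line.startswith(">"):
            current = [line]
            fields = line[1:].split()
            if fields:
                # Key on the contig name only; length/numreads are ignored.
                records[fields[0]] = current
        elif current is not None and line.strip():
            current.append(line)
    return {name: "\n".join(lines) for name, lines in records.items()}


def extract_records(list_lines, data_lines):
    """Return the records whose contig name appears in the list, in list order."""
    records = index_records(data_lines)
    return [records[name] for name in wanted_ids(list_lines) if name in records]


def process_pair(i):
    """Handle List{i}.txt / DataFile{i}.txt; the output file name is hypothetical."""
    with open(f"List{i}.txt") as list_file, open(f"DataFile{i}.txt") as data_file:
        found = extract_records(list_file, data_file)
    with open(f"Extracted{i}.txt", "w") as out:
        for record in found:
            out.write(record + "\n")
```

Calling process_pair(i) for i from 1 to 4000 would cover all the pairs. Headers in the list that have no match in the data file (contig00033 in the example above) are silently skipped here; whether they should be reported instead is a choice left open.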
Cheers.