Thanks rudic. Will give it a try.
If there are any other options, please let me know.
---------- Post updated 12-11-13 at 07:33 AM ---------- Previous update was 12-10-13 at 08:44 AM ----------
Rudic,
I have tried it out, but my flat file has one more complication: the records are separated by "|", not by " ".
example is
Please suggest a solution for this.
Your solution was effective, but since the file is separated by "|", what do I need to change?
Thanks
Sam
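The command being discussed is not shown in this post, so the following is only a minimal sketch of the general fix: tell awk that fields are split on "|" instead of whitespace by passing -F'|'. The function name and the choice of field 1 as the key are assumptions made for illustration.

```shell
# Hypothetical helper: keep the first record seen for each value of field 1,
# splitting fields on "|" instead of on whitespace.
dedupe_pipe_first_field() {
    # seen[$1]++ is 0 (false) the first time a key appears, so !seen[$1]++
    # is true only for the first record with that key.
    awk -F'|' '!seen[$1]++' "$@"
}
```

Usage would look like `dedupe_pipe_first_field input.txt > output.txt`, where the file names are placeholders. To key on a different field, change $1 inside the awk program.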
Last edited by Franklin52; 12-15-2013 at 08:48 AM..
Reason: Please use code tags
How can I remove the duplicate lines from a file? For example:
sample123456Sample
testing123456testing
XXXXX131323XXXXX
YYYYY423432YYYYY
fsdfdsf123456gsdfdsd
All the duplicates in columns 6-12 must be deleted. I want to keep the first row; if the same value appears again in the given range, I want to... (1 Reply)
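Since the question is cut off, this is only a sketch under one reading of it: "column 6-12" is taken literally as character positions 6 through 12 of each line, and the first line with a given value in that range is kept. The function name is hypothetical, and the offsets would need adjusting if the key sits at other positions.

```shell
# Hypothetical helper: keep only the first line for each distinct value
# found in character positions 6..12 of the line.
dedupe_by_char_range() {
    # substr($0, 6, 7) extracts 7 characters starting at position 6.
    awk '!seen[substr($0, 6, 7)]++' "$@"
}
```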
Hi, I have a file in the following format:
name-a
age -12
address-123
age-12
phone-22222
============
name-ab
age -11
address-123
age-11
phone-222223
=============
name-abc
age -12
address-1234
age-12
phone-2222223
============= (2 Replies)
Hi,
I want to remove the first line from a flat file using a Unix command, as simple as possible. Can anybody give me a hand?
Thanks in advance.
xli (21 Replies)
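One simple way to do this is sed's delete command applied to line 1. The wrapper function below is a hypothetical name used only to make the sketch easy to try.

```shell
# Hypothetical helper: print the input without its first line.
drop_first_line() {
    # '1d' deletes line 1; every other line is printed unchanged.
    sed '1d' "$@"
}
```

`tail -n +2 file` gives the same result by starting output at line 2. Either command writes to standard output, so redirect to a new file rather than to the input file itself.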
Hi,
I need to remove duplicates from a file. The file will be like this
0003 10101 20100120 abcdefghi
0003 10101 20100121 abcdefghi
0003 10101 20100122 abcdefghi
0003 10102 20100120 abcdefghi
0003 10103 20100120 abcdefghi
0003 10103 20100121 abcdefghi
Here if the first column and... (6 Replies)
All,
I have a file 1181CUSTOMER-L061411_003500.dat.Z having duplicate records in it.
bash-2.05$ zcat 1181CUSTOMER-L061411_003500.dat.Z|grep "90876251S"
90876251S|ABG, AN ADAYANA COMPANY|3550 DEPAUW BLVD|||US|IN|INDIANAPOLIS||DAL|46268||||||GEN|||||||USD|||ABG, AN ADAYANA... (3 Replies)
Hi,
Can anyone help me, please?
I have a flat file like this:
qwer123rt ass3242ccf jjk654
kjh838ppp nhdg453ok hdkk34
I want to remove the numeric characters from the flat file.
I want output like this:
qwerrt assccf jjk
kjhppp nhdgok hdkk
help me... (4 Replies)
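A minimal sketch for the digit-stripping request above, using sed to delete every character in the range 0-9. The function name is hypothetical.

```shell
# Hypothetical helper: remove all ASCII digits from each line.
strip_digits() {
    # s/[0-9]//g replaces every digit with nothing, everywhere on the line.
    sed 's/[0-9]//g' "$@"
}
```

For input read from a pipe, `tr -d '0-9'` does the same job.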
Hi,
I have a tab-separated file and I want to remove all the rows that have duplicates. The duplicates I need to check are in column 13.
I have tried to use awk, but I have no idea how to keep the duplicates in a file.
awk 'FNR==NR{a++;next}(a> 1)' tomodify.txt tomodify.txt > new.txt
... (4 Replies)
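A sketch of the usual two-pass awk approach, assuming the goal is to drop every row whose key column value occurs more than once. The file is read twice: the first pass counts each key, the second prints only rows whose key was counted once. The function name and the optional column argument are assumptions added for illustration; the column defaults to 13 as in the question.

```shell
# Hypothetical helper: print rows of a tab-separated file whose value in
# the key column (default 13) appears exactly once in the whole file.
rows_with_unique_key() {
    file=$1
    col=${2:-13}
    # FNR == NR is true only while reading the first copy of the file.
    awk -F'\t' -v c="$col" '
        FNR == NR { count[$c]++; next }   # pass 1: count each key
        count[$c] == 1                    # pass 2: print non-duplicated rows
    ' "$file" "$file"
}
```

Changing the second condition to `count[$c] > 1` prints the duplicated rows instead, which can be redirected to a separate file if they need to be kept.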
Hi, could someone please help me remove duplicates from a pipe-delimited file based on the first two columns?
123|asdf|sfsd|qwrer
431|yui|qwer|opws
123|asdf|pol|njio
Here my first record and last record are duplicates. As per my requirement, I want all the latest records in one file.
I want the... (12 Replies)
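The rest of this request is truncated, so the sketch below assumes that "latest" means the last record in the file for each combination of the first two fields. The function name is hypothetical.

```shell
# Hypothetical helper: for each (field 1, field 2) pair in a pipe-delimited
# file, keep only the last record seen, printed in first-seen key order.
keep_latest_by_first_two() {
    awk -F'|' '
        {
            key = $1 FS $2
            if (!(key in latest)) order[++n] = key   # remember first-seen order of keys
            latest[key] = $0                          # later records overwrite earlier ones
        }
        END {
            for (i = 1; i <= n; i++) print latest[order[i]]
        }
    ' "$@"
}
```

On the three sample records above, this keeps 123|asdf|pol|njio and 431|yui|qwer|opws. If the first occurrence should win instead, `awk -F'|' '!seen[$1 FS $2]++'` is the shorter alternative.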
Discussion started by: ginrkf
bp_bioflat_index
BP_BIOFLAT_INDEX(1p)    User Contributed Perl Documentation    BP_BIOFLAT_INDEX(1p)
NAME
bioflat_index.pl - index sequence files using Bio::DB::Flat
DESCRIPTION
Create or update a biological sequence database indexed with the
Bio::DB::Flat indexing scheme. The arguments are a list of flat files
containing the sequence information to be indexed.
USAGE
bioflat_index.pl <options> file1 file2 file3...
Options:
    --create            Create or reinitialize the index. If not specified,
                        the index must already exist.
    --format <format>   The format of the sequence files. Must be one
                        of "genbank", "swissprot", "embl" or "fasta".
    --location <path>   Path to the directory in which the index files
                        are stored.
    --dbname <name>     The symbolic name of the database to be created.
    --indextype <type>  Type of index to create. Either "bdb" or "flat".
                        "binarysearch" is the same as "flat".
Options can be abbreviated. For example, use -i for --indextype.
The following environment variables will be used as defaults if the corresponding options are not provided:
    OBDA_FORMAT     format of sequence file
    OBDA_LOCATION   path to directory in which index files are stored
    OBDA_DBNAME     name of database
    OBDA_INDEX      type of index to create
perl v5.14.2 2012-03-02 BP_BIOFLAT_INDEX(1p)