Following is the shell script which I am using for splitting the contents from a flat file to the multiple XMLs. The problem is that this script is working file when the input file is having 10000 lines. When the number of lines increases; the performance degrades drastically.
That's what happens when you call multiple external commands for every line of the file.
Quote:
To read a file line by line, use a while read loop:
However, with a large file, you would be better off using awk rather than a pure shell script.
Hi,
I'm experiencing difficulty in loading an XML file to an Oracle destination table.I keep running into a memory problem due to the large size of the file.
I want to split the XML file into several smaller files based on the keyword(s)/tags : '' and '' and would like to use a Unix shell... (2 Replies)
hi there
i write one awk script file in shell programing
the code is related to dd/mm/yy to month, day year format
but i get an error
please can anybody help me out in this problem ??????
i give my code here including error
awk `
# date-month -- convert mm/dd/yy to month day,... (2 Replies)
Hi,
How do I parse/split lines (strings) read from a file and display the individual tokens in a shell script? Given that the length of individual lines is not constant and number of tokens in each line is also not constant.
The input file could be as below:
... (3 Replies)
i need one help....
if i have a string like aaaaa,bbbbb,ccccc,aaaaa
How to to split the string and check howmany times aaaaa will be in that string?
Thanks (7 Replies)
Hi,
I need help to split lines from a file into multiple files.
my input look like this:
13
23 45 45 6 7
33 44 55 66 7
13
34 5 6 7 87
45 7 8 8 9
13
44 55 66 77 8
44 66 88 99 6
I want to split every 3 lines from this file to be written to individual files. (3 Replies)
could you please provide solution to split file and load into different files in shell script.
cat test.txt
"01"|""emp_name"":ram|""emp_sal"":600000|""emp_value"":""CREDITCARD""|""1410"":]
"02"|""emp_name"":syam|""emp_sal"":500000|""emp_value"":""voucher""|""1610"":]|""1510"":]... (7 Replies)
Hi Team,
I have a file a1.txt with data as follows.
dfjakjf...asdfkasj</EnableQuotedIDs><SQL><SelectStatement modified='1' type='string'><!
The delimiter string: <SelectStatement modified='1' type='string'><!
dlm="<SelectStatement modified='1' type='string'><!
The above command is... (7 Replies)
I have a large semicolon delimited file with thousands of columns and many thousands of line. It looks like:
ID1;ID2;ID3;ID4;A_1;B_1;C_1;A_2;B_2;C_2;A_3;B_3;C_3
AA;ax;ay;az;01;02;03;04;05;06;07;08;09
BB;bx;by;bz;03;05;33;44;15;26;27;08;09
I want to split this table in to multiple files:
... (1 Reply)
Discussion started by: trymega
1 Replies
LEARN ABOUT DEBIAN
insdseqget
INSDSEQGET(1) NCBI Tools User's Manual INSDSEQGET(1)NAME
insdseqget - format sequences from GenBank as an XML INSDSet
SYNOPSIS
insdseqget [-] [-d filename] [-i filename] [-m n/p/b] [-n] [-o filename] [-v]
DESCRIPTION
insdseqget retrieves biological sequence data from GenBank (according to an input list of GI accession numbers) and prints it out as an XML
INSDSet document.
OPTIONS
A summary of options is included below.
- Print usage message
-d filename
Input file name for date list (desired accessions, one per line, followed by a blank line and a list of allowed dates, also one per
line)
-i filename
Input file name for GI list (default = stdin)
-m n/p/b
Molecule type:
n Nucleotide (default)
p Protein
b Both
-n Return only new records (for which the given GI refers to an old version)
-o filename
Output file name for the XML INSDSet (default = stdout)
-v Fetch SNP Variations
AUTHOR
The National Center for Biotechnology Information.
SEE ALSO asn2gb(1), gbseqget(1).
NCBI 2006-04-14 INSDSEQGET(1)