hi all
im new to this forum..excuse me if anythng wrong.
I have a file containing 600 MB data in that. when i do parse the data in perl program im getting out of memory error.
so iam planning to split the file into smaller files and process one by one.
can any one tell me what is the code... (1 Reply)
I need to write a shell script for below scenario
My input file has data in format:
qwerty0101TWE 12345 01022005 01022005 datainala alanfernanded 26
qwerty0101mXZ 12349 01022005 06022008 datainalb johngalilo 28
qwerty0101TWE 12342 01022005 07022009 datainalc hitalbert 43
qwerty0101CFG 12345... (19 Replies)
hi Guys
i need some help here..
i have a file which has > 800,000 lines in it. I need to split this file into smaller files with 25000 lines each.
please help
thanks (1 Reply)
Hello, I am using awk to split a file into multiple files using command:
nawk '{
if ( $1 == "<process" )
{
n=split($2, arr, "\"");
file=arr
}
print > file }' processes.xml
<process name="Process1.process">
... (3 Replies)
I will simplify the explaination a bit, I need to parse through a 87m file -
I have a single text file in the form of :
<NAME>house........
SOMETEXT
SOMETEXT
SOMETEXT
.
.
.
.
</script>
MORETEXT
MORETEXT
.
.
. (6 Replies)
Hi,
I have a Huge 7 GB file which has around 1 million records, i want to split this file into 4 files to contain around 250k messages each.
Please help me as Split command cannot work here as it might miss tags..
Format of the file is as below
<!--###### ###### START-->... (6 Replies)
Hi all.
I've tried searching the web but could not find similar problem to mine.
I have one large file to be splitted into several files based on the matching pattern found in each row.
For example, let's say the file content:
... (13 Replies)
Dears,
Need you help with the below file manipulation. I want to split the file into 8 smaller files but without cutting/disturbing the entries (meaning every small file should start with a entry and end with an empty line). It will be helpful if you can provide a one liner command for this... (12 Replies)
Discussion started by: Kamesh G
12 Replies
LEARN ABOUT HPUX
diffmk
diffmk(1) General Commands Manual diffmk(1)NAME
diffmk - mark changes between two different versions of a file
SYNOPSIS
prevfile currfile markfile
DESCRIPTION
compares the previous version of a file with the current version and creates a file that includes ``change mark'' commands. prevfile is
the name of the previous version of the file and currfile is the name of the current version of the file. generates markfile which con-
tains all the lines of the currfile plus inserted formatter ``change mark'' requests. When markfile is formatted, changed or inserted text
is shown by a character at the right margin of each line. The position of deleted text is shown by a single
If the characters and are inappropriate, a copy of can be edited to change them because is a shell script.
EXTERNAL INFLUENCES
International Code Set Support
Single- and multi-byte character code sets are supported.
EXAMPLES
A typical command line for comparing two versions of an file and generating a file with the changes marked is:
can also be used to produce listings of C (or other) programs with changes marked. A typical command line for such use is:
where the file contains:
The request can specify a different line length, depending on the nature of the program being printed. The request is probably needed only
for C programs.
WARNINGS
Aesthetic considerations may dictate manual adjustment of some output.
does not differentiate between changes in text and changes in formatter request coding. Thus, file differences involving only formatting
changes (such as replacing with in a text source file) with no change in actual text can produce change marks.
Although unlikely, certain combinations of formatting requests can cause change marks to either disappear or to mark too much. Manual
intervention may be required because the subtleties of various formatting macro packages and preprocessors is beyond the scope of cannot
tolerate commands in its input (see tbl(1)), so any request that would appear inside a range is silently deleted. The script can be
changed if this action is inappropriate, or can be run on two files that have both been run through the preprocessor before any comparisons
are made.
uses and thus has the same limitations on file size and performance that may impose (see diff(1)). In particular the performance is non-
linear with the size of the file, and very large files (well over 1000 lines) may take extremely long to process. Breaking the file into
smaller pieces may be advisable.
also uses the ed(1) editor. If the file is too large for error messages may be embedded in the file. Again, breaking the file into
smaller pieces may be advisable.
SEE ALSO diff(1), nroff(1).
diffmk(1)