If the awk on your system only supports single character settings for RS, or if you'd like to base the output filenames on the input filenames, be able to specify more than one input file, and be able to specify the number of files to be produced before updating the value of the 1st numeric value in the output filename, you could try the following script:
It uses the Korn shell, but will also work with any other shell that accepts parameter expansions specified by the POSIX Standards (including bash).
Note that if the first line in an input file or two or more adjacent lines in an input file start with a >, empty files will not be created; the corresponding filename will just be skipped.
I have gone through all the threads in the forum and tested out different things. I am trying to split a 3GB file into multiple files. Some files are even larger than this.
For example:
split -l 3000000 filename.txt
This is very slow and it splits the file with 3 million records in each... (10 Replies)
dear all,
i have file with format like this
file_master.txt
20110212|231213|rio|apri|23112|222222
20110212|312311|jaka|dino|31223|543234
20110301|343322|alfan|budi|32131|333311
...
i want filter with output like this
index_nm.txt
rio|apri
jaka|dino
...
index_years.txt
20110212... (7 Replies)
Hey all,
I'm doing a project currently and want to index words in a webpage.
So there would be a file with webpage content and a file with list of words, I want an output file with true and false that would show which word exists in the webpage.
example:
Webpage content data.html
... (2 Replies)
hello guys,
I have a file like this:
input.dat
Push-to-talk
No
Coonection
IP support
Support for IP telephony
Yes
Built-in SIP stack
Yes
Support via software
Yes
Microsoft
Support for Microsoft Exchange
Yes
UMA (5 Replies)
cat filename|nawk ' { FS="="; if (!a++ == 0) print $0 } '
can anyone plz explain how does array inexing works,how it is evaluating if (!a++ == 0)?? (2 Replies)
Hi All
I think I might have bitten off more than I can chew here and I'm hoping some of you guys with advanced pattern matching skills can help me.
What I want to do is index the occurrence of variable names within a library of scripts that I have. Don't ask why, I'm just sad like that...
... (3 Replies)
Hello,
This is pretty simple, I`m looking for a faster and better method than brute force that I`m doing.
I have a 20GB file looks like
Name1,Var1,Val1
Name1,Var2,Val2
Name2,Var1,Val3
Name2,Var2,Val4
I want 3 files.
Nameindex
1 Name1
2 Name2
... (2 Replies)
Gents
Actually I have question and i need your support.
I have this NAS file system mounted as /coresys has size of 7 TB
I need to Split this file system into several file systems as mount points I mean how to can I Split it professionally to different NAS mount points how to can I decide... (2 Replies)
Gents
I have huge NAS File System as /sys with size 10 TB and I want to Split each 1TB in spirit File System to be mounted in the server.
How to can I do that without changing anything in the source.
Please your support. (1 Reply)
Discussion started by: AbuAliiiiiiiiii
1 Replies
LEARN ABOUT DEBIAN
rlam
RLAM(1) General Commands Manual RLAM(1)NAME
rlam - laminate records from multiple files
SYNOPSIS
rlam [ -tS ][ -u ][ -iaN | -ifN | -idN | -iiN | -iwN | -ibN ] input1 input2 ..
DESCRIPTION
Rlam simply joins records (or lines) from multiple inputs, separating them with the given string (TAB by default). Different separators
may be given for different files by specifying additional -t options in between each file name. Note that there is no space between this
option and its argument. If none of the input files uses an ASCII separator, then no end-of-line character will be printed, either.
An input is either a stream or a command. Commands are given in quotes, and begin with an exclamantion point ('!'). If the inputs do not
have the same number of lines, then shorter files will stop contributing to the output as they run out.
The -ia option may be used to specify ASCII input (the default), or the -if option may be used to indicated binary IEEE 32-bit floats on
input. Similarly, the -id and -ii options may be used to indicate binary 64-bit doubles or integer words, respectively. The -iw option
specifies 2-byte short words, and the -ib option specifies bytes. If a number is immediately follows any of these options, then it indi-
cates that multiple such values are expected for each record. For example, -if3 indicates three floats per input record for the next named
input. In the case of the -ia option, no number indicates one line per input record, and numbers greater than zero indicate that many
characters exactly per record. For binary input formts, no number implies one value per record. For anything other than EOL-separated
input, the default tab separator is reset to the empty string.
A hyphen ('-') by itself can be used to indicate the standard input, and may appear multiple times. The -u option forces output after each
record (i.e., one run through inputs).
EXAMPLE
To join files output1 and output2, separated by a comma:
rlam -t, output1 output2
To join a file with line numbers (starting at 0) and its reverse:
cnt `wc -l < lam.c` | rlam - -t: lam.c -t '!tail -r lam.c'
To join four data files, each having three doubles per record:
rlam -id3 file1.dbl file2.dbl file3.dbl file4.dbl > combined.dbl
AUTHOR
Greg Ward
SEE ALSO cnt(1), histo(1), neaten(1), rcalc(1), tabfunc(1), total(1)RADIANCE 7/8/97 RLAM(1)