06-11-2009
How to generate sample records from a file
i have a file having 30 million records.i want to generate a file having say 5% of total records in another file. the records in the new file shud be randomly generated.
9 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have a flat file and need to count no of records in the file less the header and the trailer record.
I would appreciate any and all asistance
Thanks
Hadi Lalani (2 Replies)
Discussion started by: guiguy
2 Replies
2. Virtualization and Cloud Computing
1. Can somebody tell me the log file location of HPVM where all the events of guest OS are reported ?
2. And if possible a log file with important events in it ? (1 Reply)
Discussion started by: thegeek
1 Replies
3. UNIX for Dummies Questions & Answers
Hi everyone.
I am a newbie to Linux stuff. I have this kind of problem which couldn't solve alone. I have a text file with records separated by empty lines like this:
ID: 20
Name: X
Age: 19
ID: 21
Name: Z
ID: 22
Email: xxx@yahoo.com
Name: Y
Age: 19
I want to grep records that... (4 Replies)
Discussion started by: Atrisa
4 Replies
4. Shell Programming and Scripting
I have 2 files
"File 1" is delimited by ";" and "File 2" is delimited by "|".
File 1 below (3 record shown):
Doc1;03/01/2012;New York;6 Main Street;Mr. Smith 1;Mr. Jones
Doc2;03/01/2012;Syracuse;876 Broadway;John Davis;Barbara Lull
Doc3;03/01/2012;Buffalo;779 Old Windy Road;Charles... (2 Replies)
Discussion started by: vestport
2 Replies
5. Shell Programming and Scripting
Dear All,
I have a template xml file like below.
....Some---Header.......
<SignalPreference>
...
<SignalName>STRING</SignalName>
...
</SignalPreference>
......Some formatting text.......
<SignalPreference>
.........
... (3 Replies)
Discussion started by: ks_reddy
3 Replies
6. UNIX for Dummies Questions & Answers
Hello
Could you please help me to find a code that can randomly select 1224 lines from a file of 12240 and make tn output with 1224 line each.
my input is txt file with 12240 lines like :
13474 999003507 0 0 2 -9
13475 999003508 0 0 2 -9
13476 999003509 0 0 1 -9
13477 999003510 0 0 1 -9
... (7 Replies)
Discussion started by: biopsy
7 Replies
7. Shell Programming and Scripting
hi all, I need some help in regards of how to process just a sample from a large .txt file
I have a large file from many new lines (say above 200.000 new lines), I need a script that process just a sample of it, say 10.000 bur a random sample (taking rows from top top to the the bottom)
... (4 Replies)
Discussion started by: c_lady
4 Replies
8. Shell Programming and Scripting
Hi I am new to shell programming in unix
Please if I can provide help.
I have a file structure of a header record and "N" detail records.
The header record will be the total number of detail records
I need to split the file in 2:
One for the header
Another for all detail records
Could... (1 Reply)
Discussion started by: jamcogar
1 Replies
9. Shell Programming and Scripting
Hello All,
I have need as below:
1--> I need to get all users(who submit jobs) and their details by using below command:
qstat -u \*
output of the above command looks line below:
job-ID prior name user-id state "submit/start at" queue jclass slots ja-task-ID... (5 Replies)
Discussion started by: VasuKukkapalli
5 Replies
LEARN ABOUT PLAN9
acctcms
acctcms(1M) System Administration Commands acctcms(1M)
NAME
acctcms - command summary from process accounting records
SYNOPSIS
/usr/lib/acct/acctcms [ -a [-o] [-p]] [-c] [-j] [-n] [-s] [-t] filename...
DESCRIPTION
acctcms reads one or more filenames, normally in the form described in acct.h(3HEAD). It adds all records for processes that executed iden-
tically named commands, sorts them, and writes them to the standard output, normally using an internal summary format.
OPTIONS
-a Print output in ASCII rather than in the internal summary format. The output includes command name, number of times executed,
total kcore-minutes, total CPU minutes, total real minutes, mean size (in K), mean CPU minutes per invocation, "hog factor," char-
acters transferred, and blocks read and written, as in acctcom(1). Output is normally sorted by total kcore-minutes.
Use the following options only with the -a option:
-o Output a (non-prime) offshift-time-only command summary.
-p Output a prime-time-only command summary.
When -o and -p are used together, a combination prime-time and non-prime-time report is produced. All the output summaries are
total usage except number of times executed, CPU minutes, and real minutes, which are split into prime and non-prime.
-c Sort by total CPU time, rather than total kcore-minutes.
-j Combine all commands invoked only once under "***other".
-n Sort by number of command invocations.
-s Any file names encountered hereafter are already in internal summary format.
-t Process all records as total accounting records. The default internal summary format splits each field into prime and non-prime-
time parts. This option combines the prime and non-prime time parts into a single field that is the total of both, and provides
upward compatibility with old style acctcms internal summary format records.
EXAMPLES
Example 1: Using the acctcms command.
A typical sequence for performing daily command accounting and for maintaining a running total is:
example% acctcms filename ... > today
example% cp total previoustotal
example% acctcms -s today previoustotal > total
example% acctcms -a -s today
ATTRIBUTES
See attributes(5) for descriptions of the following attributes:
+-----------------------------+-----------------------------+
| ATTRIBUTE TYPE | ATTRIBUTE VALUE |
+-----------------------------+-----------------------------+
|Availability |SUNWaccu |
+-----------------------------+-----------------------------+
SEE ALSO
acctcom(1), acct(1M), acctcon(1M), acctmerg(1M), acctprc(1M), acctsh(1M), fwtmp(1M), runacct(1M), acct(2), acct.h(3HEAD), utmpx(4),
attributes(5)
NOTES
Unpredictable output results if -t is used on new style internal summary format files, or if it is not used with old style internal summary
format files.
SunOS 5.10 22 Feb 1999 acctcms(1M)