Need optimized awk/perl/shell to give the statistics for the Large delimited file Post: 303023382

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Could someone give me an example of awk accessing array defined in Korn Shell?

As per title and much apprecieated!

2. UNIX for Dummies Questions & Answers

Trim String in 3rd Column in Tab Delimited File...SED/PERL/AWK?

Hey Everybody, I am having much trouble figuring this out, as I am not really a programmer..:mad: Datafile.txt Column0 Column1 Column2 ABC DEF xxxGHI I am running using WGET on a cronjob to grab a datafile, but I need to cut the first three characters from...

3. Shell Programming and Scripting

Large pipe delimited file that I need to add CR/LF every n fields

I have a large flat file with variable length fields that are pipe delimited. The file has no new line or CR/LF characters to indicate a new record. I need to parse the file and after some number of fields, I need to insert a CR/LF to start the next record. Input file ...

4. Shell Programming and Scripting

Extracting a portion of data from a very large tab delimited text file

Hi All I wanted to know how to effectively delete some columns in a large tab delimited file. I have a file that contains 5 columns and almost 100,000 rows 3456 f g t t 3456 g h 456 f h 4567 f g h z 345 f g 567 h j k lThis is a very large data file and tab delimited. I need...

5. Shell Programming and Scripting

Script Optimization - large delimited file, for loop with many greps

Since there are approximately 75K gsfiles and hundreds of stfiles per gsfile, this script can take hours. How can I rewrite this script, so that it's much faster? I'm not as familiar with perl but I'm open to all suggestions. ls file.list>$split for gsfile in `cat $split`; do csplit...

6. Shell Programming and Scripting

Awk getting statistics of a grid file,

Hi , I have the following file which is basically a grid (has more than 100000 rows) LLL1 PPP1 LLL1 PPP2 LLL1 PPP3 ............... LLL1 5500 ..... LLL2 PPP1 LLL2 PPP2 LLL2 PPP3 ............... LLL1 5500 ..... L100 PPP1 L100 PPP2 L100 PPP3 ............... 2100 5500...

7. Shell Programming and Scripting

awk read one delimited file, search another delimited file

Hello folks, I have another doozy. I have two files. The first file has four fields in it. These four fields map to different locations in my second file. What I want to do is read the master file (file 2 - 23 fields) and compare each line against each record in file 1. If I get a match in all four...

8. Shell Programming and Scripting

Removing dupes within 2 delimited areas in a large dictionary file

Hello, I have a very large dictionary file which is in text format and which contains a large number of sub-sections. Each sub-section starts with the following header : #DATA #VALID 1 and ends with a footer as shown below #END The data between the Header and the Footer consists of...

9. Shell Programming and Scripting

Perl script give answers by file

Hi, I am new in perl. I am running a perl installation script, its asking for paths and so many inputs. Can we provide that info by any file. so i can avoid the interactive installation.

LEARN ABOUT DEBIAN

pescetti

PESCETTI(1)						      General Commands Manual						       PESCETTI(1)

NAME

       pescetti -- Pseudo-Duplimate Generator

SYNOPSIS

       pescetti

DESCRIPTION

       This manual page documents briefly the pescetti command.

OPTIONS

       Here are a list of the available options and what they do. You must specify exactly one from --demo, --generate or --load.

       --help	 Prints the help text

       --demo	 Demonstration mode. Generates one hand with permutations and the tutorial for how to use them.

       --generate=N
		 Generate N random boards

       --load=boards.txt
		 Load boards+analysis from boards.txt

       --load-dds=boards.dds
		 Load boards from boards.dds in dds format

       --load-analysis=tricks.txt
		 Load analysis from tricks.txt

       --permutations=permutations.txt
		 Generate the permutations and save them to the given file

       --curtains=curtains.txt
		 Save curtain cards to file curtains.txt

       --save=boards.txt
		 Save the boards+analysis to boards.txt

       --save-dds=boards.dds
		 Save the boards to boards.dds in dds format

       --save-analysis=tricks.txt
		 Save the analysis to tricks.txt

       --format=html|txt|pdf
		 Set the output mode to the given format

       --title=title
		 Set the title for the output

       --output=hands.txt
		 Print the hands to hands.txt, rather than to standard output

       --stats	 Generate statistics about the set of boards; included in the hands output

       --analyze Run the dds analyzer on the boards and print the resulting numberof tricks (warning SLOW)

       --criteria=
		 A  list of criteria to apply to each generated hand to generate specific hand types.  The list should be space separated and each
		 item may be suffixed with a colon and a (fractional) probability value which can be used to weight the criteria.

		 E.g. --criteria="weaknt:0.8 strongnt:0.5"

		 Valid criteria are: unbalanced weaknt strongnt twont strongtwo weaktwo three twoclubs 4441  singlesuit  twosuits  partscore  game
		 slam game-invite slam-invite jumpshift jumpfit splinter bacon weird

       --probability=factor
		 Generate  hands matching the criteria with only the given probability. Factor is in the range 0 to 1. On each attempt to generate
		 a board it is rejected if it doesn't match the criteria with the given probability.  A factor of about  0.8  gives  roughly  half
		 matching boards

AUTHOR

       This manual page was written by Matthew Johnson <debian@matthew.ath.cx>. Permission is granted to copy, distribute and/or modify this docu-
       ment under the terms of the GNU General Public License, Version 2 as published by the Free Software Foundation.

       On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL.

																       PESCETTI(1)