Help to make awk script more efficient for large files Post: 302526358

8 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Is there a way to make this more efficient

2. Shell Programming and Scripting

Sed or awk script to remove text / or perform calculations from large CSV files

I have large CSV files (e.g. 2 million records) and am hoping to do one of two things. I have been trying to use awk and sed but am a newbie and can't figure out how to get it to work. Any help you could offer would be greatly appreciated - I'm stuck trying to remove the colon and wildcards in...

3. Shell Programming and Scripting

AWK Shell Program to Split Large Files

Hi, I need some help creating a tidy shell program with awk or other language that will split large length files efficiently. Here is an example dump: <A001_MAIL.DAT> 0001 Ronald McDonald 01 H81 0002 Elmo St. Elmo 02 H82 0003 Cookie Monster 01 H81 0004 Oscar ...

4. Shell Programming and Scripting

Running rename command on large files and make it faster

Hi All, I have some 80,000 files in a directory which I need to rename. Below is the command I am currently running, and it seems to be taking forever. This command seems too slow. Is there any way to speed it up? I have GNU Parallel installed on my...

5. Programming

Help with make this Fortran code more efficient (in HPC manner)

Hi there, I had run into some fortran code to modify. Obviously, it was written without thinking of high performance computing and not parallelized... Now I would like to make the code "on track" and parallel. After a whole afternoon thinking, I still cannot find where to start. Can any one...

6. Shell Programming and Scripting

Process multiple large files with awk

Hi there, I'm camor and I'm trying to process huge files with bash scripting and awk. I've got a dataset folder with 10 files (16 million rows each, 600MB), and I've got a sorted file with all keys inside. For example: a sample_1 200 a.b sample_2 10 a sample_3 10 a sample_1 10 a...

7. Shell Programming and Scripting

Combining awk command to make it more efficient

VARIABLE="jhovan 5259 5241 0 20:11 ? 00:00:00 /proc/self/exe --type=gpu-process --channel=5182.0.1597089149 --supports-dual-gpus=false --gpu-driver-bug-workarounds=2,45,57 --disable-accelerated-video-decode --gpu-vendor-id=0x80ee --gpu-device-id=0xbeef --gpu-driver-vendor...

8. Shell Programming and Scripting

How to make awk command faster for large amount of data?

I have nginx web server logs with all requests that were made and I'm filtering them by date and time. Each line has the following structure: 127.0.0.1 - xyz.com GET 123.ts HTTP/1.1 (200) 0.000 s 3182 CoreMedia/1.0.0.15F79 (iPhone; U; CPU OS 11_4 like Mac OS X; pt_br) These text files are...

LEARN ABOUT HPUX

comm

comm(1) 						      General Commands Manual							   comm(1)

NAME

       comm - select or reject lines common to two sorted files

SYNOPSIS

       comm [-[123]] file1 file2

DESCRIPTION

       comm  reads  file1  and	file2, which should be ordered in increasing collating sequence (see sort(1) and Environment Variables below), and
       produces a three-column output:

	      Column 1:   Lines that appear only in file1,
	      Column 2:   Lines that appear only in file2,
	      Column 3:   Lines that appear in both files.
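
       As an illustration of the three-column layout, here is a minimal sketch. The file names and contents are made up for the
       example, and it assumes the POSIX output convention in which column 2 is prefixed by one tab and column 3 by two tabs.

```shell
# Hypothetical sample data; both files are already in sorted order.
tmp=$(mktemp -d)
printf 'apple\nbanana\ncherry\n' > "$tmp/file1"
printf 'banana\ncherry\ndate\n'  > "$tmp/file2"

# No options: all three columns are printed.
# Column 1 has no prefix, column 2 has one leading tab, column 3 has two.
three_col=$(LC_ALL=C comm "$tmp/file1" "$tmp/file2")
printf '%s\n' "$three_col"
```

       Here "apple" lands in column 1, "date" in column 2, and "banana" and "cherry" in column 3.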

       If - is used for file1 or file2, the standard input is used.
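
       Reading from standard input makes it possible to sort one input on the fly. The following is a sketch only; the data
       and the temporary file are assumptions made for the example.

```shell
# Hypothetical data: file2 is already sorted, the other input is not.
tmp=$(mktemp -d)
printf 'b\nc\n' > "$tmp/file2"

# Sort the unsorted lines and hand them to comm as file1 via "-".
from_stdin=$(printf 'b\na\n' | LC_ALL=C sort | LC_ALL=C comm - "$tmp/file2")
printf '%s\n' "$from_stdin"
```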

       Options 1, 2, or 3 suppress printing of the corresponding column.  Thus comm -12 prints only the lines common to the two files; comm -23
       prints only lines in the first file but not in the second; comm -123 does nothing useful.
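
       The effect of the column-suppressing options can be checked with a short sketch like the one below. The sample files
       are hypothetical; when columns are suppressed, the remaining column is printed without leading tabs.

```shell
# Hypothetical sample data; both files are already in sorted order.
tmp=$(mktemp -d)
printf 'apple\nbanana\ncherry\n' > "$tmp/file1"
printf 'banana\ncherry\ndate\n'  > "$tmp/file2"

both=$(LC_ALL=C comm -12 "$tmp/file1" "$tmp/file2")        # only column 3 remains
only_first=$(LC_ALL=C comm -23 "$tmp/file1" "$tmp/file2")  # only column 1 remains
nothing=$(LC_ALL=C comm -123 "$tmp/file1" "$tmp/file2")    # every column suppressed

printf 'both:\n%s\nonly in file1:\n%s\n' "$both" "$only_first"
```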

EXTERNAL INFLUENCES

   Environment Variables
       LC_COLLATE determines the collating sequence comm expects from the input files.

       LC_MESSAGES determines the language in which messages are displayed.

       If LC_MESSAGES is not specified in the environment or is set to the empty string, the value of LANG determines the language in which
       messages are displayed.  If LC_COLLATE is not specified in the environment or is set to the empty string, the value of LANG is used as
       a default.  If LANG is not specified or is set to the empty string, a default of ``C'' (see lang(5)) is used instead of LANG.  If any
       internationalization variable contains an invalid setting, comm behaves as if all internationalization variables are set to ``C''.
       See environ(5).
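
       Because comm expects its input in the collating sequence of the current locale, one common way to avoid mismatches is to
       sort and compare under the same locale. The sketch below pins both steps to the ``C'' locale with LC_ALL, which
       overrides the other locale variables; the data is made up for the example.

```shell
tmp=$(mktemp -d)

# In the C locale, uppercase letters collate before lowercase ones.
printf 'b\nA\na\nB\n' | LC_ALL=C sort > "$tmp/file1"   # yields A B a b
printf 'a\nB\nc\n'    | LC_ALL=C sort > "$tmp/file2"   # yields B a c

# Compare under the same collating sequence that was used for sorting.
common_c=$(LC_ALL=C comm -12 "$tmp/file1" "$tmp/file2")
printf '%s\n' "$common_c"
```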

   International Code Set Support
       Single- and multi-byte character code sets are supported.

EXAMPLES

       The following examples assume that file1 and file2 have been ordered in the collating sequence defined by the LC_COLLATE or LANG
       environment variable.

       Print all lines common to file1 and file2 (in other words, print column 3):

	      comm -12 file1 file2

       Print all lines that appear in file1 but not in file2 (in other words, print column 1):

	      comm -23 file1 file2

       Print all lines that appear in file2 but not in file1 (in other words, print column 2):

	      comm -13 file1 file2
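
       The three cases above can be run end to end as follows. This is a sketch under stated assumptions: the raw input files are
       hypothetical and unsorted, so they are sorted first under the same locale that comm runs in.

```shell
# Hypothetical unsorted inputs.
tmp=$(mktemp -d)
printf 'cherry\napple\nbanana\n' > "$tmp/raw1"
printf 'date\ncherry\nbanana\n'  > "$tmp/raw2"

# Order both files under one collating sequence before comparing.
LC_ALL=C sort "$tmp/raw1" > "$tmp/file1"
LC_ALL=C sort "$tmp/raw2" > "$tmp/file2"

in_both=$(LC_ALL=C comm -12 "$tmp/file1" "$tmp/file2")    # column 3
only_in_1=$(LC_ALL=C comm -23 "$tmp/file1" "$tmp/file2")  # column 1
only_in_2=$(LC_ALL=C comm -13 "$tmp/file1" "$tmp/file2")  # column 2

printf 'common: %s\n' $in_both
printf 'only in file1: %s\n' "$only_in_1"
printf 'only in file2: %s\n' "$only_in_2"
```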

SEE ALSO

       cmp(1), diff(1), sdiff(1), sort(1), uniq(1).

STANDARDS CONFORMANCE
