Making a faster alternative to a slow awk command Post: 302666577

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Which is faster AWK or CUT

If I just wanted to get andred08 from the following ldap dn would I be best to use AWK or CUT? uid=andred08,ou=People,o=example,dc=com It doesn't make a difference if it's just one ldap search I am getting it from but when there's a couple of hundred people in the group that returns all...
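For the DN quoted in this question, both tools give the same answer. The following is a minimal sketch comparing them, plus a shell parameter-expansion variant that starts no extra process. The variable names (`dn`, `uid_cut`, `uid_awk`, `uid_sh`) are illustrative assumptions, not part of the original thread.

```shell
#!/bin/sh
# Sample DN taken from the question above.
dn='uid=andred08,ou=People,o=example,dc=com'

# cut: take the first comma-separated field, then the part after '='.
uid_cut=$(printf '%s\n' "$dn" | cut -d, -f1 | cut -d= -f2)

# awk: split on either '=' or ',' and print the second field.
uid_awk=$(printf '%s\n' "$dn" | awk -F'[=,]' '{print $2}')

# Pure shell: drop everything from the first comma on, then the "uid=" prefix.
uid_sh=${dn%%,*}
uid_sh=${uid_sh#uid=}

printf '%s %s %s\n' "$uid_cut" "$uid_awk" "$uid_sh"
```

With a few hundred DNs, the larger cost is usually starting one process per DN, so piping the whole search output through a single `cut` or `awk` is likely to matter more than the choice between them.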

2. UNIX for Advanced & Expert Users

Making things run faster

I am processing some terabytes of information on a computer having 8 processors (each with 4 cores) with 16GB of RAM and a 5TB hard drive implemented as a RAID. The processing doesn't seem to be blazingly fast, perhaps because of the IO limitation. I am basically running a perl script to read some...

3. UNIX for Dummies Questions & Answers

Which command will be faster? y?

i) wc -c /etc/passwd | awk '{print $1}' ii) ls -al /etc/passwd | awk '{print $5}'
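Both commands report the size of the file in bytes. A small sketch on a temporary file (a hypothetical stand-in for /etc/passwd, so the result is predictable) shows that they agree; the variable names are assumptions for illustration.

```shell
#!/bin/sh
# Hypothetical sample file so the sketch does not depend on /etc/passwd.
f=$(mktemp)
printf 'hello world\n' > "$f"

# Reading from stdin makes wc print only the count, so no awk is needed.
size_wc=$(wc -c < "$f" | tr -d ' ')

# ls -l takes the size from the file's metadata (fifth column).
size_ls=$(ls -l "$f" | awk '{print $5}')

echo "wc: $size_wc  ls: $size_ls"
rm -f "$f"
```

`ls` only needs the file's metadata, while `wc -c` may or may not read the file contents depending on the implementation, so timing both on the target system is the safest way to answer "which is faster".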

4. UNIX and Linux Applications

Alternative for slow SQL subquery

Hi -- I have the following SQL query in my UNIX shell script -- but the subquery in the second section is very slow. I know there must be a way to do this with a union or something which would be better. Can anyone offer an alternative to this query? Thanks. select count(*) from ...

5. Shell Programming and Scripting

Multi thread awk command for faster performance

Hi, I have a script below for extracting xml from a file: for i in *.txt; do echo $i; awk '/<.*/ , /.*<\/.*>/' "$i" | tr -d '\n'; echo -ne '\n'; done. I read about using multi threading to speed up the script. I do not know much about it but read it on this forum. Is it a...
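One way to run the per-file work concurrently is to let `xargs` start several processes at once. This is a sketch only: it assumes an `xargs` that supports `-P` and `-0` (GNU and BSD versions do, POSIX does not require them), and the sample files and the job count of 4 are hypothetical.

```shell
#!/bin/sh
# Hypothetical input files standing in for the *.txt files in the question.
workdir=$(mktemp -d)
printf 'junk\n<a>\n1\n</a>\n' > "$workdir/one.txt"
printf '<b>\n2\n</b>\ntrailer\n' > "$workdir/two.txt"

# Each job buffers its own output and prints it as one line, so results
# from parallel jobs do not interleave; sort restores a stable order.
result=$(
  cd "$workdir" &&
  printf '%s\0' *.txt |
  xargs -0 -n 1 -P 4 sh -c '
    out=$(awk "/<.*/ , /.*<\/.*>/" "$1" | tr -d "\n")
    printf "%s %s\n" "$1" "$out"
  ' sh | sort
)

printf '%s\n' "$result"
rm -rf "$workdir"
```

Whether this helps depends on where the time goes: several processes can use several cores, but they cannot make a saturated disk any faster.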

6. Shell Programming and Scripting

Making script run faster

Can someone help me edit the below script to make it run faster? Shell: bash OS: Linux Red Hat The point of the script is to grab entire chunks of information that concerns the service "MEMORY_CHECK". For each chunk, the beginning starts with "service {", and ends with "}". I should...
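The original script is not shown here, so the following is only a sketch of one single-pass approach: buffer each `service { ... }` block in awk and print it if it mentions MEMORY_CHECK. The config layout and the field names inside the blocks are hypothetical.

```shell
#!/bin/sh
# Hypothetical config layout: blocks open with "service {" and close with "}".
cfg=$(mktemp)
cat > "$cfg" <<'EOF'
service {
    name DISK_CHECK
}
service {
    name MEMORY_CHECK
    host web01
}
EOF

# Buffer each block and print it only if it mentions MEMORY_CHECK.
chunks=$(awk '
    /^service [{]/ { buf = ""; inblock = 1 }
    inblock        { buf = buf $0 "\n" }
    inblock && /^[}]/ {
        if (buf ~ /MEMORY_CHECK/) printf "%s", buf
        inblock = 0
    }
' "$cfg")

printf '%s\n' "$chunks"
rm -f "$cfg"
```

A single awk pass like this reads the file once and starts one process, which is often the main saving over a shell loop that calls grep or sed per line.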

7. Shell Programming and Scripting

Faster way to use this awk command

awk "/May 23, 2012 /,0" /var/tmp/datafile The above command pulls information out of the datafile, from the date specified to the end of the file. Now, how can I make this faster if the datafile is huge? Even if it wasn't huge, I feel there's a better/faster way to...
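One possible alternative, sketched under the assumption that the date string appears literally in the file: locate the first matching line once, then let `tail` copy everything from there on. The sample file and variable names are hypothetical.

```shell
#!/bin/sh
# Hypothetical data file with one dated entry per line.
f=$(mktemp)
printf '%s\n' 'May 22, 2012 a' 'May 23, 2012 b' 'May 24, 2012 c' > "$f"
pattern='May 23, 2012 '

# Output of the original range expression, kept for comparison.
reference=$(awk "/May 23, 2012 /,0" "$f")

# Find the line number of the first match and stop reading at that point.
start=$(awk -v pat="$pattern" 'index($0, pat) { print NR; exit }' "$f")

# tail copies the remainder without testing each line against a pattern.
from_date=""
if [ -n "$start" ]; then
    from_date=$(tail -n +"$start" "$f")
fi

printf '%s\n' "$from_date"
rm -f "$f"
```

This reads the leading part of the file twice, so it is most likely to pay off when the matching date is near the start of a large file; measuring on real data is the only reliable check.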

8. Shell Programming and Scripting

How to make awk command faster?

I have the below command, which reads a large file and takes 3 hours to run. Can something be done to make this command faster? awk -F ',' '{OFS=","}{ if ($13 == "9999") print $1,$2,$3,$4,$5,$6,$7,$8,$9,$10,$11,$12 }' ${NLAP_TEMP}/hist1.out|sort -T ${NLAP_TEMP} |uniq>...
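A few low-risk adjustments to the command in this question can be sketched as follows: set OFS once in a BEGIN block instead of on every line, let `sort -u` replace the separate `uniq`, and sort in the C locale so comparisons are plain byte comparisons. The sample data and the variable `src` are hypothetical, and the `-T` option from the original can be kept as is.

```shell
#!/bin/sh
# Hypothetical 13-field input; the last field selects the rows to keep.
src=$(mktemp)
cat > "$src" <<'EOF'
a,b,c,d,e,f,g,h,i,j,k,l,9999
a,b,c,d,e,f,g,h,i,j,k,l,9999
a,b,c,d,e,f,g,h,i,j,k,l,1
A,b,c,d,e,f,g,h,i,j,k,l,9999
EOF

filtered=$(
  awk -F',' 'BEGIN { OFS = "," }
      $13 == "9999" { print $1,$2,$3,$4,$5,$6,$7,$8,$9,$10,$11,$12 }' "$src" |
  LC_ALL=C sort -u   # byte-order sort with duplicates removed in one step
)

printf '%s\n' "$filtered"
rm -f "$src"
```

Note that `LC_ALL=C` changes the output order to byte order, which is harmless if the sort exists only to remove duplicates but matters if a later step expects locale ordering.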

9. Shell Programming and Scripting

How to make awk command faster for large amount of data?

I have nginx web server logs with all requests that were made and I'm filtering them by date and time. Each line has the following structure: 127.0.0.1 - xyz.com GET 123.ts HTTP/1.1 (200) 0.000 s 3182 CoreMedia/1.0.0.15F79 (iPhone; U; CPU OS 11_4 like Mac OS X; pt_br) These text files are...

egrep

GREP(1) 						      General Commands Manual							   GREP(1)

NAME

       grep, egrep, fgrep - search a file for a pattern

SYNOPSIS

       grep [ option ] ...  expression [ file ] ...

       egrep [ option ] ...  [ expression ] [ file ] ...

       fgrep [ option ] ...  [ strings ] [ file ]

DESCRIPTION

       Commands  of  the  grep	family search the input files (standard input default) for lines matching a pattern.  Normally, each line found is
       copied to the standard output.  Grep patterns are limited regular expressions in the style of ex(1); it	uses  a  compact  nondeterministic
       algorithm.   Egrep  patterns  are  full regular expressions; it uses a fast deterministic algorithm that sometimes needs exponential space.
       Fgrep patterns are fixed strings; it is fast and compact.  The following options are recognized.
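The difference between the three pattern types can be illustrated with a short sketch. It uses `grep -E` and `grep -F`, the POSIX spellings of egrep and fgrep on current systems; the sample file and variable names are hypothetical.

```shell
#!/bin/sh
# Hypothetical sample input.
f=$(mktemp)
printf '%s\n' 'cat' 'dog' 'a.c' 'abc' > "$f"

# grep: limited regular expression; '.' matches any character.
basic=$(grep 'a.c' "$f")

# grep -E (egrep): full regular expression, here with alternation.
extended=$(grep -E 'cat|dog' "$f")

# grep -F (fgrep): fixed string; '.' is an ordinary character.
fixed=$(grep -F 'a.c' "$f")

printf 'basic:\n%s\nextended:\n%s\nfixed:\n%s\n' "$basic" "$extended" "$fixed"
rm -f "$f"
```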

       -v     All lines but those matching are printed.

       -x     (Exact) only lines matched in their entirety are printed (fgrep only).

       -c     Only a count of matching lines is printed.

       -l     The names of files with matching lines are listed (once) separated by newlines.

       -n     Each line is preceded by its relative line number in the file.

       -b     Each line is preceded by the block number on which it was found.  This is sometimes useful in locating disk block numbers by
	      context.

       -i     The  case  of  letters  is ignored in making comparisons -- that is, upper and lower case are considered identical.  This applies to
	      grep and fgrep only.

       -s     Silent mode.  Nothing is printed (except error messages).  This is useful for checking the error status.

       -w     The expression is searched for as a word (as if surrounded by `\<' and `\>', see ex(1).)	(grep only)

       -e expression
	      Same as a simple expression argument, but useful when the expression begins with a -.

       -f file
	      The regular expression (egrep) or string list (fgrep) is taken from the file.

       In all cases the file name is shown if there is more than one input file.  Care should be taken when using the characters $ * [ ^ | ( ) and
       \ in the expression as they are also meaningful to the Shell.  It is safest to enclose the entire expression argument in single quotes ' '.
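A few of the options above, and the advice about single quotes, can be tried with a sketch like the following. It is run against a modern POSIX grep rather than the 1985 version described here, and the file names and contents are hypothetical.

```shell
#!/bin/sh
# Hypothetical sample files.
d=$(mktemp -d)
printf '%s\n' 'alpha' '-beta' 'gamma $1' > "$d/a.txt"
printf '%s\n' 'delta' > "$d/b.txt"
printf '%s\n' 'alpha' 'delta' > "$d/pats"

count=$(grep -c 'a' "$d/a.txt")              # -c: count of matching lines
numbered=$(grep -n 'gamma' "$d/a.txt")       # -n: prefix the line number
inverted=$(grep -v 'a$' "$d/a.txt")          # -v: lines that do NOT match
dash=$(grep -e '-beta' "$d/a.txt")           # -e: expression starting with -
listed=$(cd "$d" && grep -l 'delta' a.txt b.txt)   # -l: names of matching files
fromfile=$(grep -F -f "$d/pats" "$d/a.txt")  # -f: strings read from a file

# Single quotes stop the shell from expanding $1 or removing the backslash.
dollar=$(grep 'a \$1' "$d/a.txt")

printf '%s\n' "$count" "$numbered" "$inverted" "$dash" "$listed" "$fromfile" "$dollar"
rm -rf "$d"
```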

       Fgrep searches for lines that contain one of the (newline-separated) strings.

       Egrep accepts extended regular expressions.  In the following description `character' excludes newline:

	      A \ followed by a single character other than newline matches that character.

	      The character ^ matches the beginning of a line.

	      The character $ matches the end of a line.

	      A .  (period) matches any character.

	      A single character not otherwise endowed with special meaning matches that character.

	      A  string  enclosed in brackets [] matches any single character from the string.	Ranges of ASCII character codes may be abbreviated
	      as in `a-z0-9'.  A ] may occur only as the first character of the string.  A literal - must be placed where it can't be mistaken	as
	      a range indicator.

	      A  regular  expression  followed	by  an	* (asterisk) matches a sequence of 0 or more matches of the regular expression.  A regular
	      expression followed by a + (plus) matches a sequence of 1 or more matches of the regular expression.  A regular expression  followed
	      by a ? (question mark) matches a sequence of 0 or 1 matches of the regular expression.

	      Two regular expressions concatenated match a match of the first followed by a match of the second.

	      Two regular expressions separated by | or newline match either a match for the first or a match for the second.

	      A regular expression enclosed in parentheses matches a match for the regular expression.

       The order of precedence of operators at the same parenthesis level is [] then *+? then concatenation then | and newline.
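The precedence rules can be checked with a small sketch using `grep -E`, the modern spelling of egrep. Because concatenation binds tighter than |, an unparenthesized `ab|cd` groups as `(ab)|(cd)`; because + binds tighter than concatenation, `ab+` repeats only the `b`. The input words are hypothetical.

```shell
#!/bin/sh
# Hypothetical word list, one word per line.
words=$(printf '%s\n' ab cd abd acd abab abb)

# Alternation of two concatenations: whole line is "ab" or "cd".
alt_outer=$(printf '%s\n' "$words" | grep -E '^(ab|cd)$')

# Parentheses force the alternation inside: "a", then b or c, then "d".
alt_inner=$(printf '%s\n' "$words" | grep -E '^a(b|c)d$')

# + applies to the single preceding character: "a" then one or more "b".
plus_char=$(printf '%s\n' "$words" | grep -E '^ab+$')

# Parentheses make + apply to the whole group "ab".
plus_group=$(printf '%s\n' "$words" | grep -E '^(ab)+$')

printf '%s\n---\n%s\n---\n%s\n---\n%s\n' \
    "$alt_outer" "$alt_inner" "$plus_char" "$plus_group"
```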

       Ideally there should be only one grep, but we don't know a single algorithm that spans a wide enough range of space-time tradeoffs.

SEE ALSO

       ex(1), sed(1), sh(1)

DIAGNOSTICS

       Exit status is 0 if any matches are found, 1 if none, 2 for syntax errors or inaccessible files.
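The exit status makes grep usable as a test in scripts. A minimal sketch, with a hypothetical sample file, that captures each of the three cases:

```shell
#!/bin/sh
# Hypothetical sample file.
f=$(mktemp)
printf 'needle\n' > "$f"

# A match was found: status 0.
found=0
grep 'needle' "$f" > /dev/null || found=$?

# No match: status 1.
missing=0
grep 'absent' "$f" > /dev/null || missing=$?

# Inaccessible file: status 2 here (POSIX only requires a value above 1).
failed=0
grep 'needle' "$f.does-not-exist" > /dev/null 2>&1 || failed=$?

echo "found=$found missing=$missing failed=$failed"
rm -f "$f"
```

In practice this is usually written directly as `if grep 'pattern' file > /dev/null; then ...; fi`, with the status checked explicitly only when "no match" and "error" need different handling.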

BUGS

       Lines are limited to 256 characters; longer lines are truncated.

4th Berkeley Distribution					  April 29, 1985							   GREP(1)