05-09-2006
faster way to loop?
Sample Log file
IP.address Date&TimeStamp GET/POST URL ETC
123.45.67.89 MMDDYYYYHHMM GET myURL
http://ABC.com
123.45.67.90 MMDDYYYYHHMM GET myURL
http://XYZ.com
I have a very huge web server log file (about 1.3GB) that contains entries like the one above. I need to get the last entries of all the different IPs that has myURL in it? Is there a quick way of looping? My idea was
# Get all the Unique IP addresses and then proceed to check each
cat weblog | awk '{print $1} > ip.list
for i in `cat ip.list`
do
cat weblog | grep $i | grep myURL > lastpages.lis
done
each day has around 3000+ unique IP entries and a day's log is about 48MB. with this process, it takes around 30 mins to process a days worth of data. is there a faster way to do this?
9 More Discussions You Might Find Interesting
1. IP Networking
For some reason 8.1 Mandrake Linux seems much slower than Windows 2000 with my cable modem. DSL reports test says they conferable speed with Windows2 though.
This is consistant slow with both of my boxes, at the same time. Linux used to be faster, but not with Mandrake. Any way to fix this? (17 Replies)
Discussion started by: lancest
17 Replies
2. Shell Programming and Scripting
Hi ,
I need to copy every day about 35GB of files from one file system to another.
Im using the cp command and its toke me about 25 min.
I also tried to use dd command but its toke much more.
Is there better option ?
Regards. (6 Replies)
Discussion started by: yoavbe
6 Replies
3. UNIX for Dummies Questions & Answers
Hi I have to grep for 2000 strings in a file one after the other.Say the file name is Snxx.out which has these strings.
I have to search for all the strings in the file Snxx.out one after the other.
What is the fastest way to do it ??
Note:The current grep process is taking lot of time per... (7 Replies)
Discussion started by: preethgideon
7 Replies
4. UNIX for Dummies Questions & Answers
i)wc -c/etc/passwd|awk'{print $1}'
ii)ls -al/etc/passwd|awk'{print $5}' (4 Replies)
Discussion started by: karthi_g
4 Replies
5. UNIX for Dummies Questions & Answers
I have read anecdotes about people installing RAID0 (RAID - Wikipedia, the free encyclopedia) on some of their machines because it gives a performance boost. Because bandwidth on the motherboard is limited, can someone explain exactly why it should be faster? (7 Replies)
Discussion started by: figaro
7 Replies
6. Shell Programming and Scripting
Hi all,
In bash scripting, I use to read files:
cat $file | while read line; do
...
doneHowever, it's a very slow way to read file line by line.
E.g. In a file that has 3 columns, and less than 400 rows, like this:
I run next script:
cat $line | while read line; do ## Reads each... (10 Replies)
Discussion started by: AlbertGM
10 Replies
7. Shell Programming and Scripting
I have the following code running against a file. The file can have upwards of 10000 lines.
problem is, the for loop takes a while to go through all those lines. is there a faster way to go about it?
for line in `grep -P "${MONTH} ${DAY}," file | ${AWK} -F" " '{print $4}' | awk -F":"... (2 Replies)
Discussion started by: SkySmart
2 Replies
8. UNIX for Dummies Questions & Answers
i'm trying to decide if to move operations from one of these hosts to the other. but i cant decide which one of them is the most powerful.
each host has 8 cpus.
HOSTA
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 44
model name : Intel(R) Xeon(R) CPU ... (6 Replies)
Discussion started by: SkySmart
6 Replies
9. Shell Programming and Scripting
Hello,
I am under Ubuntu 18.04 Bionic.
I have one shell script run.sh (which is out of my topic) to run files under multiple directories and one file to control all processes running under those directories (control.sh).
I set a cronjob task to check each of them with two minutes of intervals.... (3 Replies)
Discussion started by: baris35
3 Replies
LEARN ABOUT DEBIAN
pactester
pactester(1) pactester(1)
NAME
pactester - Tool to test proxy auto-config (pac) files.
SYNOPSIS
pactester <-p pacfile> <-u url> [-h host] [-c client_ip] [-e]
pactester <-p pacfile> <-f urlslist> [-c client_ip] [-e]
DESCRIPTION
pactester is a tool to test proxy auto-config (pac) files. It returns the proxy config string for the given URL and the pac file. pactester
uses pacparser C library for most of its functionality.
OPTIONS
-p pacfile
PAC file to test. Specify "-" to read from the standard input.
-u url URL to test the PAC file for.
-h host
Host part of the URL. If not specified, it's determined from the URL.
-c client_ip
Client's IP address (as returned by the function myIpAddress() in PAC files). If not specified, it defaults to the IP address of
the machine on which this tool is running.
-e Enable Microsoft PAC extensions (dnsResolveEx, myIpAddressEx, isResolvableEx).
-f urlslist
A file containing the list of URLs to be tested. This is good for testing a PAC file against a set of URLs.
EXAMPLES
To find out the proxy config string for the pac file "wpad.dat" and the URL "http://www.google.com":
$ pactester -p wpad.dat -u http://www.google.com
For a client with IP address 10.0.12.123:
$ pactester -p wpad.dat -c 10.0.12.123 -u http://www.google.com
For a pac file hosted at http://wpad/wpad.dat:
$ curl -s http://wpad/wpad.dat | pactester -p - -u http://google.com
BUGS
If you have come across a bug in pactester, please submit a bug report at http://code.google.com/p/pacparser/issues/list.
AUTHOR
Written by Manu Garg (http://www.manugarg.com).
RESOURCES
Homepage: http://code.google.com/p/pacparser.
pactester(1)