A Novel Traffic Analysis for Identifying Search Fields in the Long Tail of Web Sites

 
Posted 02-22-2010

HPL-2010-27 A Novel Traffic Analysis for Identifying Search Fields in the Long Tail of Web Sites - Forman, George; Kirshenbaum, Evan; Rajaram, Shyamsundar
Keyword(s): web data mining, clickstream analysis, machine learning classification, active learning
Abstract: Using a clickstream sample of 2 billion URLs from many thousand volunteer Web users, we wish to analyze typical usage of keyword searches across the Web. In order to do this, we need to be able to determine whether a given URL represents a keyword search and, if so, which field contains the query. A ...
Full Report
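
The abstract is cut off before it describes the report's method, so the following is not the authors' technique. It is only a minimal Python sketch of the raw material such an analysis starts from: splitting each URL into its query fields and counting how often each field name appears per site. The function names `query_fields` and `count_fields_by_site` and the example URLs are hypothetical.

```python
from collections import Counter
from urllib.parse import parse_qsl, urlsplit


def query_fields(url):
    """Return (site, field, value) triples for every query field in a URL."""
    parts = urlsplit(url)
    site = parts.netloc.lower()
    # keep_blank_values=True so that empty fields (e.g. "q=") are still listed
    return [(site, field, value)
            for field, value in parse_qsl(parts.query, keep_blank_values=True)]


def count_fields_by_site(urls):
    """Count how many URLs mention each (site, field) pair."""
    counts = Counter()
    for url in urls:
        for site, field, _value in query_fields(url):
            counts[(site, field)] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "http://example.com/search?q=long+tail&page=2",
        "http://example.com/search?q=clickstream",
    ]
    for (site, field), n in sorted(count_fields_by_site(sample).items()):
        print(site, field, n)
```

Deciding which of these fields actually holds a keyword query is the hard part the report addresses; the counts above would at most be one input to such a decision.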


7 More Discussions You Might Find Interesting

1. What is on Your Mind?

Your Favorite Tech Support Web Sites and Why?

Where do you go to participate in technical discussions besides UNIX.COM and why? Personally, I do not really participate in other forums and discussion boards, but I do ask questions from time to time on Stack sites. The problem I have with Stack is that my questions are never answered on any... (30 Replies)
Discussion started by: Neo
30 Replies

2. Red Hat

Web sites

Hi, I can't view the web portal on my intranet from Linux RHEL, nor the web application. My network configuration in /etc/sysconfig/network-scripts/ifcfg-eth0 is OK. What is happening? Can you help me, please? (2 Replies)
Discussion started by: xochitl
2 Replies

3. Shell Programming and Scripting

Identifying entries based on 2 fields in a string.

Hi Guys, I'm struggling to use two fields to produce duplicate/unique output. I want to look for IP addresses assigned to more than one account during a given period in the logs: if an IP is duplicated and its account count is > 1, print all the logs for that IP. I have been using AWK (just as it's installed... (3 Replies)
Discussion started by: wabbit02
3 Replies

4. Shell Programming and Scripting

Identifying specific fields in a Row

Hi, I am new to UNIX. Can someone help me solve the problem below? I have a requirement to identify specific fields in a row, and also some part of a field. In my file I have a record such as sundra;10.44.48.65;10thstreet TCP packet out of state: First packet isn't SYN;telno:... (3 Replies)
Discussion started by: suneel.mekala
3 Replies

5. Web Development

How do you make web sites?

I've read how on some websites but I still don't get it. I need specific details. I want to make a website for my photography. Please help! (3 Replies)
Discussion started by: animelibara123
3 Replies

6. OS X (Apple)

Use UNIX to track web sites viewed?

I'm on OSX 10.4. I was wondering if you can use UNIX terminal to track what web sites have been viewed on this Mac... Thank you! (1 Reply)
Discussion started by: tracymanusa
1 Reply

7. Solaris

Identifying new fields of data

I have hundreds of lines of formatted data with 10 different fields per line. The data is refreshed every few minutes, and some fields in some lines may reflect new data. I'm looking for a sample of code that helps me identify those new fields so that I can write them to a file to indicate that... (0 Replies)
Discussion started by: davels
0 Replies
ap-mrtg(8)                  System Manager's Manual                  ap-mrtg(8)

NAME
       ap-mrtg, ap-rrd - AP statistics translators

SYNOPSIS
       ap-mrtg -i ip -c community -t type [-b bssid] [-v] [-r] [-h]

       ap-rrd -i ip -c community -t type -d db_file [-b bssid] [-r] [-h]

DESCRIPTION
       ap-mrtg gets stats from an AP and returns them in MRTG-parsable format. The Multi Router Traffic Grapher (MRTG) is a tool to monitor the traffic load on network links. MRTG generates HTML pages containing graphical images which provide a live visual representation of this traffic. You may get it from http://people.ee.ethz.ch/~oetiker/webtools/mrtg/.

       ap-rrd gets stats from an AP and writes them to an RRDtool database. RRDtool is a data logging and graphing system for time series data.

OPTIONS
       -i ip
              AP IP address.

       -c community
              SNMP community string.

       -t type
              Statistics type: <w>ireless, <e>thernet, associated <s>tations or <l>ink quality in client mode.

       -b bssid
              MAC address of the AP for which to get link quality.

       -d db_file
              Name of the RRDtool database to write to.

       -v     Report problems connecting to the AP to MRTG. Enable this if you want reports about when the AP stops working.

       -r     Reset the AP when getting LinkQuality stats.

       -h     Print the help screen.

MRTG Configuration Example
       A simple mrtg.cfg for getting stats from an Atmel-based AP:

           Target[ap_e]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t ethernet -v`
           Options[ap_e]: noinfo, growright, bits
           MaxBytes[ap_e]: 12500
           Title[ap_e]: Traffic Analysis for AP Ethernet port
           PageTop[ap_e]: <H1>Traffic Analysis for AP Ethernet port</H1>

           Target[ap_w]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t wireless -v`
           Options[ap_w]: noinfo, growright, perminute
           ShortLegend[ap_w]: pkt/min
           MaxBytes[ap_w]: 100000000
           Legend1[ap_w]: Packets In
           Legend2[ap_w]: Packets Out
           YLegend[ap_w]: packets/min
           Title[ap_w]: Traffic Analysis for AP Wireless port
           PageTop[ap_w]: <H1>Traffic Analysis for AP Wireless port</H1>

           Target[ap_s]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t stations -v`
           Options[ap_s]: growright, gauge, noinfo
           ShortLegend[ap_s]: stations
           MaxBytes[ap_s]: 10
           YTics[ap_s]: 10
           Legend2[ap_s]: none
           Legend1[ap_s]: Associated Stations
           YLegend[ap_s]: Associated Stations
           Title[ap_s]: Traffic Analysis for AP Associated Stations
           PageTop[ap_s]: <H1>Traffic Analysis for AP Associated Stations</H1>
           LegendO[ap_s]: none
           LegendI[ap_s]: Associated Stations

Link Quality stats
       If the AP is not in client mode, the command errors out. Grabbing the signal strength resets the ethernet and wireless stats and the AP, so it is safer not to use this type of stat. Optionally, you may set the MAC address (bssid) of the AP for which to get link quality stats; by default, the first returned AP is used.

       A simple mrtg.cfg for getting stats from an Atmel-based AP with Link Quality and reset stats:

           Target[ap_e]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t ethernet -v`
           Options[ap_e]: noinfo, growright, bits, gauge
           MaxBytes[ap_e]: 12500
           Title[ap_e]: Traffic Analysis for AP Ethernet port
           PageTop[ap_e]: <H1>Traffic Analysis for AP Ethernet port</H1>

           Target[ap_w]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t wireless -v`
           Options[ap_w]: noinfo, growright, perminute, gauge
           ShortLegend[ap_w]: pkt/min
           MaxBytes[ap_w]: 100000000
           Legend1[ap_w]: Packets In
           Legend2[ap_w]: Packets Out
           YLegend[ap_w]: packets/min
           Title[ap_w]: Traffic Analysis for AP Wireless port
           PageTop[ap_w]: <H1>Traffic Analysis for AP Wireless port</H1>

           Target[EdAP_s]: `/usr/local/bin/ap-mrtg -i 192.168.0.1 -c "public" -t link -b FFFFFFFF -v -r`
           MaxBytes[EdAP_s]: 100
           Options[EdAP_s]: noinfo,growright,integer,unknaszero,nopercent,gauge
           Title[EdAP_s]: Signal Analysis for EdAP Wireless port
           YLegend[EdAP_s]: Signal Strength dBm/Quality
           ShortLegend[EdAP_s]: dBm
           PageTop[EdAP_s]: <H1>Signal/Quality Strength for EdAP</H1>
           LegendI[EdAP_s]: Signal Quality
           LegendO[EdAP_s]: Signal Strength in dBm

AUTHOR
       Roman Festchook <roma@polesye.net>
       http://ap-utils.polesye.net

SEE ALSO
       ap-config(8), ap-trapd(8)

Wireless Access Point Utilities for Unix                             ap-mrtg(8)
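
The man page says ap-mrtg returns stats "in MRTG parsable format" but does not spell that format out. MRTG's documented convention for an external (backtick) Target command is four lines of output: the first value, the second value, an uptime string, and the target name. The Python sketch below parses text in that shape; that ap-mrtg emits exactly these four lines is an assumption, and `MrtgSample` and `parse_mrtg_output` are hypothetical names used only for illustration.

```python
from typing import NamedTuple, Optional


class MrtgSample(NamedTuple):
    """One reading in the four-line layout MRTG expects from an external command."""
    value_in: Optional[int]   # line 1: first variable; None if not an integer
    value_out: Optional[int]  # line 2: second variable; None if not an integer
    uptime: str               # line 3: free-form uptime string
    target_name: str          # line 4: name of the target


def _to_int(text: str) -> Optional[int]:
    """Convert a counter line to int, returning None for non-numeric text."""
    try:
        return int(text.strip())
    except ValueError:
        return None


def parse_mrtg_output(text: str) -> MrtgSample:
    """Parse the four-line output of an MRTG external target command."""
    lines = text.splitlines()
    if len(lines) < 4:
        raise ValueError(f"expected 4 lines, got {len(lines)}")
    return MrtgSample(
        value_in=_to_int(lines[0]),
        value_out=_to_int(lines[1]),
        uptime=lines[2].strip(),
        target_name=lines[3].strip(),
    )


if __name__ == "__main__":
    # Made-up sample text, not captured from a real access point.
    print(parse_mrtg_output("1200\n3400\n1 day\nap_e\n"))
```

A parser like this can be handy for checking a command's output by hand before wiring it into mrtg.cfg.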