09-14-2012
Well, you'll need
A spider to traverse the web
A way of extracting the text of a link and the url it points to
A way of ranking the value of pages for any keyword, perhaps occurrences of the keyword in links to the url as well as occurences within the page?
A database to store your keywords and urls.
That would give you a basic platform - now all you have to do is research the above and design the ranking algorithm.
6 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
How do you write a search engline to search offline scripts? (3 Replies)
Discussion started by: hawaiifiver
3 Replies
2. Programming
Hello everybody,
I need help with this,
I need to design a CGI search engine in C but i have no idea on what or how to do it.
Do i have to open all the html files one by one and search for the given strings? i think this process will be slow, and will take too much of the server processing... (11 Replies)
Discussion started by: semash!
11 Replies
3. Web Development
Hey guys. I have a quick question. My friends and I are working on a search engine project that will hopefully be up and running by December of 2011. Here's my concern. What programs should I use to create the search engine. Thanks guys! :b: (9 Replies)
Discussion started by: OussenkoSearch
9 Replies
4. What is on Your Mind?
Hi Unix Gurus,
In my Co. we have intranet site hosted on Unix box. In Explorer there is a text box for searching information on internet. By default it is using Google Custom Search. This search engine is little old one. Now I want to patch this search engine with latest patch. If any one know... (0 Replies)
Discussion started by: sriramis4u
0 Replies
5. Shell Programming and Scripting
Hello,
How to create a search engine in unix using commands?...Atleast guidelines to craete this search engine...Thank you in advance. (10 Replies)
Discussion started by: Sindhu R
10 Replies
6. What is on Your Mind?
Getting a bit more comfortable making quick YT videos in 4K, here is:
Search Engine Optimization | How To Fix Soft 404 Errors and A.I. Tales from Google Search Console
https://youtu.be/I6b9T2qcqFo (0 Replies)
Discussion started by: Neo
0 Replies
LEARN ABOUT DEBIAN
urn-scheme
uri_urn(3tcl) Tcl Uniform Resource Identifier Management uri_urn(3tcl)
__________________________________________________________________________________________________________________________________________________
NAME
uri_urn - URI utilities, URN scheme
SYNOPSIS
package require Tcl 8.2
package require uri::urn ?1.1.2?
uri::urn::quote url
uri::urn::unquote url
_________________________________________________________________
DESCRIPTION
This package provides two commands to quote and unquote the disallowed characters for url using the urn scheme, registers the scheme with
the package uri, and provides internal helpers which will be automatically used by the commands uri::split and uri::join of package uri to
handle urls using the urn scheme.
COMMANDS
uri::urn::quote url
This command quotes the characters disallowed by the urn scheme (per RFC 2141 sec2.2) in the url and returns the modified url as its
result.
uri::urn::unquote url
This commands performs the reverse of ::uri::urn::quote. It takes an urn url, removes the quoting from all disallowed characters,
and returns the modified urls as its result.
BUGS, IDEAS, FEEDBACK
This document, and the package it describes, will undoubtedly contain bugs and other problems. Please report such in the category uri of
the Tcllib SF Trackers [http://sourceforge.net/tracker/?group_id=12883]. Please also report any ideas for enhancements you may have for
either package and/or documentation.
KEYWORDS
rfc 2141, uri, url, urn
CATEGORY
Networking
uri 1.1.2 uri_urn(3tcl)