by Mike Hodgkinson, THE INDEPENDENT…or at least that’s what Ray Kurzweil thinks. He has spent his life inventing machines that help people, from the blind to dyslexics. Now, he believes we’re on the brink of a new age - the ’singularity’ - when mind-boggling technology will allow us to email each other toast, run as [...]
A global group of scientists are spending the next ten years and a billion dollars to try and develop a computer simulation of the brain:
I always found it fascinating that the brain can understand itself. This almost sounds like in a few years the computer... (0 Replies)
To the Brain Teaser, if I add another condition, say the executable should not be altered, how the program should be altered? (no perl please, purely C). I forgot to mention this condition my staff had mentioned. ( forgot then and got now :D )
The program executed the first time... (4 Replies)
I have encountered a C question, which I thought of sharing with you.
This question was asked by one of my technical training staff...Though my training was over I'm still thinking of a solution for this..
Write a C program to do a small task(lets say just simply printing a "Hello... (34 Replies)
Can anybody provide me Pointers to Practice tests or any Material to prepare for Brainbench certification in Unix Shell Scripting? Also how good is this Certification for UNIX programmers. Is it worth it? I'm planning to take this certification in 2 weeks. Kindly let me know all the pros... (0 Replies)
KinoSearch1::Analysis::Token(3pm) User Contributed Perl Documentation KinoSearch1::Analysis::Token(3pm)NAME
KinoSearch1::Analysis::Token - unit of text
# private class - no public API
You can't actually instantiate a Token object at the Perl level -- however, you can affect individual Tokens within a TokenBatch by way of
TokenBatch's (experimental) API.
Token is the fundamental unit used by KinoSearch1's Analyzer subclasses. Each Token has 4 attributes: text, start_offset, end_offset, and
pos_inc (for position increment).
The text of a token is a string.
A Token's start_offset and end_offset locate it within a larger text, even if the Token's text attribute gets modified -- by stemming, for
instance. The Token for "beating" in the text "beating a dead horse" begins life with a start_offset of 0 and an end_offset of 7; after
stemming, the text is "beat", but the end_offset is still 7.
The position increment, which defaults to 1, is a an advanced tool for manipulating phrase matching. Ordinarily, Tokens are assigned
consecutive position numbers: 0, 1, and 2 for "three blind mice". However, if you set the position increment for "blind" to, say, 1000,
then the three tokens will end up assigned to positions 0, 1, and 1001 -- and will no longer produce a phrase match for the query '"three
Copyright 2006-2010 Marvin Humphrey
LICENSE, DISCLAIMER, BUGS, etc.
See KinoSearch1 version 1.00.
perl v5.14.2 2011-11-15 KinoSearch1::Analysis::Token(3pm)