Efficiency Network Security Analysis for Data Mining Post: 303044620

Sponsored Content

Special Forums Cybersecurity Efficiency Network Security Analysis for Data Mining Post 303044620 by Neo on Friday 28th of February 2020 03:24:31 AM

02-28-2020

Administrator

Efficiency Network Security Analysis for Data Mining

I see this all the time, and so I'm used to it; but sometimes it is fun to post it anyway, since this is dated just a week ago, and today flew into my inbox;

Efficiency Network Security Analysis for Data Mining, February 21, 2020

Code:

http://scholar.google.com/scholar_url?url=https://login.easychair.org/publications/preprint_download/cMcG&hl=en&sa=X&d=872921198596594327&scisig=AAGBfm23CJODjEb5smjz3KBMiWXh-i-wbg&nossl=1&oi=scholaralrt&hist=grVHePAAAAAJ:4043656301718464717:AAGBfm2_NcrjsJBVFjrKOQP7nmLhnQem7Q

Efficiency Network Security Analysis for Data Mining-screen-shot-2020-02-28-31950-pmjpg

These 2 Users Gave Thanks to Neo For This Post:

Neo

View Public Profile for Neo

Visit Neo's homepage!

Find all posts by Neo

We Also Found This Discussion For You

1. Shell Programming and Scripting

Data mining a text file.

I'm auditing UID consistency across our hosts, and have created the following datafile, consisting of four fields. I would like to get a count of the combination of the last two fields. ie: I would like to find out how many instances there are of "root 0" and how many of "uucp 5", for every line...

LEARN ABOUT DEBIAN

kinosearch1::analysis::tokenizer

KinoSearch1::Analysis::Tokenizer(3pm)			User Contributed Perl Documentation		     KinoSearch1::Analysis::Tokenizer(3pm)

NAME

       KinoSearch1::Analysis::Tokenizer - customizable tokenizing

SYNOPSIS

	   my $whitespace_tokenizer
	       = KinoSearch1::Analysis::Tokenizer->new( token_re => qr/S+/, );

	   # or...
	   my $word_char_tokenizer
	       = KinoSearch1::Analysis::Tokenizer->new( token_re => qr/w+/, );

	   # or...
	   my $apostrophising_tokenizer = KinoSearch1::Analysis::Tokenizer->new;

	   # then... once you have a tokenizer, put it into a PolyAnalyzer
	   my $polyanalyzer = KinoSearch1::Analysis::PolyAnalyzer->new(
	       analyzers => [ $lc_normalizer, $word_char_tokenizer, $stemmer ], );

DESCRIPTION

       Generically, "tokenizing" is a process of breaking up a string into an array of "tokens".

	   # before:
	   my $string = "three blind mice";

	   # after:
	   @tokens = qw( three blind mice );

       KinoSearch1::Analysis::Tokenizer decides where it should break up the text based on the value of "token_re".

	   # before:
	   my $string = "Eats, Shoots and Leaves.";

	   # tokenized by $whitespace_tokenizer
	   @tokens = qw( Eats, Shoots and Leaves. );

	   # tokenized by $word_char_tokenizer
	   @tokens = qw( Eats Shoots and Leaves   );

METHODS

   new
	   # match "O'Henry" as well as "Henry" and "it's" as well as "it"
	   my $token_re = qr/
		   	     # start with a word boundary
		   w+	     # Match word chars.
		   (?:	     # Group, but don't capture...
		      'w+   # ... an apostrophe plus word chars.
		   )?	     # Matching the apostrophe group is optional.
		   	     # end with a word boundary
	       /xsm;
	   my $tokenizer = KinoSearch1::Analysis::Tokenizer->new(
	       token_re => $token_re, # default: what you see above
	   );

       Constructor.  Takes one hash style parameter.

       o   token_re - must be a pre-compiled regular expression matching one token.

COPYRIGHT

       Copyright 2005-2010 Marvin Humphrey

LICENSE, DISCLAIMER, BUGS, etc.
       See KinoSearch1 version 1.00.

perl v5.14.2							    2011-11-15				     KinoSearch1::Analysis::Tokenizer(3pm)

We Also Found This Discussion For You

1. Shell Programming and Scripting

Data mining a text file.

Discussion started by: akbar

LEARN ABOUT DEBIAN

kinosearch1::analysis::tokenizer