BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification


 
Thread Tools Search this Thread
Special Forums News, Links, Events and Announcements UNIX and Linux RSS News BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification
# 1  
Old 08-07-2008
BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification

HPL-2007-32 (R.1) BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification - Forman, George
Keyword(s): text classification; topic identification; machine learning; feature selection; Support Vector Machine; TF*IDF text representation
Abstract: In the realm of machine learning for text classification, TF.IDF is the most widely used representation for real-valued feature vectors. Unfortunately, it is oblivious to the training class labels, and naturally scales some features inappropriately. We replace IDF with Bi-Normal Separation (BNS), wh ...
Full Report

More...
Login or Register to Ask a Question

Previous Thread | Next Thread

5 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Data imputation with scaling

Hello masters, this is difficult to explain and maybe complicated to implement...looks beyond what I taught myself (from this forum), some help is greatly appreciated. I have a base file a1 10 a2 15 a3 20 a4 21 I have a non-base file a1 170 b12 175 c12 180 d12 190 a2 ... (3 Replies)
Discussion started by: senhia83
3 Replies

2. Shell Programming and Scripting

Re-scaling values - perl

Hey folks I have a big tab delimited file with 3 columns looks like this: chr2L 552 0.85 chr2R 135 1.06 chr3L 820 2.89 chr3R 581 3.93 chr4 585 0.94 chrX 605 1.93 All I want to do is re-scaling the third column to be between 0-1. Which means that the highest valu in 3rd column will... (5 Replies)
Discussion started by: @man
5 Replies

3. UNIX for Dummies Questions & Answers

Reading a file and Classification

Hello Everyone, I am new to UNIX. I have got a requirement. Thought of posting it in this forum so that someone might help me. Please have a look at the scenario. The Objective is to "classify books into four seperate files and then print a summary report". Specifications are as... (3 Replies)
Discussion started by: yarlagadda999
3 Replies

4. Shell Programming and Scripting

scripting for classification

hi i am very new to scripting. i am learning by myself. i found this example. can any one help in writing script for this example, so that i can have an idea how to analyse and script. example: overview: The aim of this exercise is to classify books into four seperate files and then print a... (1 Reply)
Discussion started by: yonex
1 Replies

5. UNIX Desktop Questions & Answers

CDE Classification Banner

Hello; I need to place a classification banner at the top of my Solaris 2.6 CDE Desktop. The Banner must be displayed in all CDE desktop sessions available on the CDE dashboard (1,2,3,4). The Banner must remain anchored at the top of the CDE desktop and must not be able to be closed or hidden... (0 Replies)
Discussion started by: rambo15
0 Replies
Login or Register to Ask a Question
LINCLASS(1)						      General Commands Manual						       LINCLASS(1)

NAME
linclass - predict labels by a linear classification rule SYNOPSIS
linclass [options] example_file model_file DESCRIPTION
linclass is a program that predicts labels by a linear classification rule. example_file is a file with testing examples in SVM^light format, and model_file is the file which contains either a binary (two-class) rule f(x)=w'*x+w0 or a multi-class rule f(x)=W'*x. These are produced svmocas(1) and msvmocas(1), respectively. OPTIONS
A summary of options is included below. -h Show summary of options. -v (0|1) Set the verbosity level (default: 1) -e Print the classification error computed from predicted labels and labels contained in example_file. -o out_file Save predictions to the file out_file. -t (0|1) Output type: 0 ... predicted labels (default) 1 ... discriminant values EXAMPLES
Train the multi-class SVM classifier from example file fiply_trn.light, using svmocas(1) with the regularization constant C=10, verbosity switched off, and save model to svmocas.model: svmocas -c 10 -b 1 -v 0 riply_trn.light svmocas.model Compute the testing error of the classifier stored in svmocas.model using testing examples from riply_tst.light and save the predicted labels to riply_tst.pred: linclass -e -o riply_tst.pred riply_tst.light svmocas.model SEE ALSO
svmocas(1), msvmocas(1). AUTHORS
linclass was written by Vojtech Franc <xfrancv@cmp.felk.cvut.cz> and Soeren Sonnenburg <Soeren.Sonnenburg@tu-berlin.de>. This manual page was written by Christian Kastner <debian@kvr.at>, for the Debian project (and may be used by others). June 16, 2010 LINCLASS(1)