Sponsored Content
Top Forums Shell Programming and Scripting Regular Expression to match repeated characters Post 302346398 by romanhr on Friday 21st of August 2009 07:30:27 PM
Old 08-21-2009
Como eliminar datos repetidos en un archivo de texto en perl

I have a file called texto.txt

like this

sil
a
b
c
s
sil
b
a

and i wold like to have this output, a new file like this

sil
a
b
c
s

I want to erase all repeated words, i tried to do it but i couldn't, if somebody can help me i will apreciate (Sorry for my bad english)

Last edited by romanhr; 08-21-2009 at 10:41 PM..
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Exact match with regular expression

Hi I have a file with data arranged into columns. The first column is the chromosome name. When I use grep to subset only rows with chr1, I get chr1 but also chr10, chr11,.. How do I get only rows with chr1? grep chr1 filein > fileout head fileout chr1 59757841 chr11 108258691 ... (2 Replies)
Discussion started by: jdhahbi
2 Replies

2. UNIX for Dummies Questions & Answers

Regular Expression - match 'b' that follows 'a' and is at the end of a string

Hi, I'm struggling with a regex that would match a 'b' that follows an 'a' and is at the end of a string of non-white characters. For example: Line 1: aba abab b abb aab bab baa I can find the right strings but I'm lacking knowledge of how to "discard" the bits that precede bs.... (2 Replies)
Discussion started by: machinogodzilla
2 Replies

3. Shell Programming and Scripting

Regular expression match

Hi all, any idea how to match the following: char*<no or any string or space> buf and char *<no or any string or space> buf i need to capture the buf characters too. currently i need two checks to cover this: #search char* <any string> buf or char *<any string> buf @noarray =... (2 Replies)
Discussion started by: ChaMeN
2 Replies

4. Shell Programming and Scripting

regular expression to match repeated appearance

Hi all, I am looking for a regex syntax to match repeated appearance. Likes, ']+]+' matches for string '65A SOME MORE AND 78B' Now, this gets messy if I need to extract all such repeated appearance. I don't want to write ] four or five times for matching repeated appearance. Thanks in... (2 Replies)
Discussion started by: guruparan18
2 Replies

5. Shell Programming and Scripting

regular expression match

I am trying to match a similar line using grep with regular expression the line is /remote/mac/pbbbb/abc/def/hij/hop/include/abc/tif/element/test/testfiles/Office.cpp:57: const OfficeType& getType().get() const; I just need to extract the bold characters using grep with regular expression.... (5 Replies)
Discussion started by: prasbala
5 Replies

6. Shell Programming and Scripting

regular expression exact match

hi everyone suppose we have two scenario echo ABCD | grep \{4\} DATE echo SYSDATE | grep \{4\} SYSDATE i want to match the string of four length only please help (5 Replies)
Discussion started by: aishsimplesweet
5 Replies

7. Shell Programming and Scripting

matched characters - regular expression

Hi, I read the book of <<unix shell programming>>. The regular expression ^\(.\)\1 matches the first character on the line and stores it in register 1. Then the expression matches whatever is stored in the register 1, as specified by the \1. The net effect of this regular expression is to match... (2 Replies)
Discussion started by: jianma
2 Replies

8. Homework & Coursework Questions

Regular Expression to match files in Perl

Hi Everybody! I need some help with a regular expression in Perl that will match files named messages, but also files named message.1, message.2 and so on. So really I need one that will find messages and messages that might be followed by a period and a digit without matching other files like... (2 Replies)
Discussion started by: Hax0rc1ph3r
2 Replies

9. Shell Programming and Scripting

Perl split match regular expression with or

I cannot seem to get this to work correct: my ($k, $v) = split(/F/, $fc{$DIR}{symbolic}, 2); Below is the input (the $fc{$DIR}{symbolic} variable): QMH2562 FW:v5.06.03 DVR:v8.03.07.15.05.09-kbut i also need it to break on FV: Emulex NC553i FV4.2.401.6 DV8.3.5.86.2pthe code above... (2 Replies)
Discussion started by: rusted_planet
2 Replies

10. Shell Programming and Scripting

Regular expression match

echo 20110101 | awk '{ print match($0,/^((17||18||19||20)|)-*(|0|1)-*(|0||3)$/)) I am getting a match for the above, where as it shouldn't, as there is no hyphen in the echoed date. Another question is what is the difference between || and | in the above statement (4 Replies)
Discussion started by: tostay2003
4 Replies
svm-train(1)							   User Manuals 						      svm-train(1)

NAME
svm-train - train one or more SVM instance(s) on a given data set to produce a model file SYNOPSIS
svm-train [-s svm_type ] [ -t kernel_type ] [ -d degree ] [ -g gamma ] [ -r coef0 ] [ -c cost ] [ -n nu ] [ -p epsilon ] [ -m cachesize ] [ -e epsilon ] [ -h shrinking ] [ -b probability_estimates ] ] [ -wi weight ] [ -v n ] [ -q ] training_set_file [ model_file ] DESCRIPTION
svm-train trains a Support Vector Machine to learn the data indicated in the training_set_file and produce a model_file to save the results of the learning optimization. This model can be used later with svm_predict(1) or other LIBSVM enabled software. OPTIONS
-s svm_type svm_type defaults to 0 and can be any value between 0 and 4 as follows: 0 -- C-SVC 1 -- nu-SVC 2 -- one-class SVM 3 -- epsilon-SVR 4 -- nu-SVR -t kernel_type kernel_type defaults to 2 (Radial Basis Function (RBF) kernel) and can be any value between 0 and 4 as follows: 0 -- linear: u.v 1 -- polynomial: (gamma*u.v + coef0)^degree 2 -- radial basis function: exp(-gamma*|u-v|^2) 3 -- sigmoid: tanh(gamma*u.v + coef0) 4 -- precomputed kernel (kernel values in training_set_file) -- -d degree Sets the degree of the kernel function, defaulting to 3 -g gamma Adjusts the gamma in the kernel function (default 1/k) -r coef0 Sets the coef0 (constant offset) in the kernel function (default 0) -c cost Sets the parameter C ( cost ) of C-SVC, epsilon-SVR, and nu-SVR (default 1) -n nu Sets the parameter nu of nu-SVC, one-class SVM, and nu-SVR (default 0.5) -p epsilon Set the epsilon in the loss function of epsilon-SVR (default 0.1) -m cachesize Set the cache memory size to cachesize in MB (default 100) -e epsilon Set the tolerance of termination criterion to epsilon (default 0.001) -h shrinking Whether to use the shrinking heuristics, 0 or 1 (default 1) -b probability-estimates probability_estimates is a binary value indicating whether to calculate probability estimates when training the SVC or SVR model. Values are 0 or 1 and defaults to 0 for speed. -wi weight Set the parameter C (cost) of class i to weight*C, for C-SVC (default 1) -v n Set n for n -fold cross validation mode -q quiet mode; suppress messages to stdout. FILES
training_set_file must be prepared in the following simple sparse training vector format: <label> <index1>:<value1> <index2>:<value2> . . . . . . There is one sample per line. Each sample consists of a target value (label or regression target) followed by a sparse representation of the input vector. All unmentioned coordinates are assumed to be 0. For classification, <label> is an integer indicating the class label (multi-class is supported). For regression, <label> is the target value which can be any real number. For one-class SVM, it's not used so can be any number. Except using precomputed kernels (explained in another section), <index>:<value> gives a feature (attribute) value. <index> is an integer starting from 1 and <value> is a real number. Indices must be in an ASCENDING order. ENVIRONMENT
No environment variables. DIAGNOSTICS
None documented; see Vapnik et al. BUGS
Please report bugs to the Debian BTS. AUTHOR
Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging) SEE ALSO
svm-predict(1), svm-scale(1) Linux MAY 2006 svm-train(1)
All times are GMT -4. The time now is 09:18 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy