08-21-2009
How to remove repeated data from a text file in Perl
I have a file called texto.txt that looks like this:
sil
a
b
c
s
sil
b
a
and I would like to get this output in a new file:
sil
a
b
c
s
I want to remove all repeated words, keeping only the first occurrence of each. I tried to do it but couldn't. If somebody can help me, I would appreciate it. (Sorry for my bad English.)
Last edited by romanhr; 08-21-2009 at 10:41 PM.