03-24-2019
Remove Special Characters and Numbers From a Wordlist
I'm bad at this type of stuff. I have a huge wordlist and I want to get rid of everything in each word except the letters, so numbers and all special characters should be removed. And since this list was created using cewl, I somehow picked up some Latin characters and would like to remove them as well. If there is a way to do this and someone gives me the command string to use, could you also explain how that string works? I would love to learn how to do things like this myself.
Thanks in advance.
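One possible approach, as a minimal sketch: it assumes the wordlist has one word per line, is UTF-8 or plain ASCII, and that "letters" means the unaccented ASCII letters A-Z and a-z. The function name clean_wordlist and the file names are made up for illustration.

```shell
# Hypothetical helper: keep only ASCII letters, one word per line.
clean_wordlist() {
    # LC_ALL=C makes tr work byte by byte, so accented letters stored as
    # multi-byte UTF-8 sequences are dropped along with digits and punctuation.
    #   -c  use the complement of the set (everything NOT listed)
    #   -d  delete those characters
    # '\n' is part of the kept set so every word stays on its own line.
    LC_ALL=C tr -cd 'A-Za-z\n' |
        grep . |            # drop lines that became empty (e.g. "1234")
        LC_ALL=C sort -u    # sort and remove duplicates created by the stripping
}

# Usage (placeholder file names):
# clean_wordlist < wordlist.txt > wordlist.clean.txt
```

Note that accented letters are deleted, not converted, so "café" ends up as "caf". If the original order of the list matters, the sort -u stage can be left out.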
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi,
How do I remove the lines where special characters or Unicode characters appear?
The following query does work but I wonder if there is a better way.
cat test.txt | egrep -v '\)|#|,|&|-|\(|\\|\/|\.'
The following lines show that my query is incomplete.
Warning: The word "*Khan" is... (1 Reply)
Discussion started by: shantanuo
2. UNIX for Dummies Questions & Answers
Hi All,
I have a script that creates a new directory within the shell program, and if a parameter isn't passed in, it creates a strange directory name by mistake. So I have a directory like "-_12" and I am unable to remove it. I tried removing it using double quotes and many other ways. I have... (12 Replies)
Discussion started by: datherriault
3. Shell Programming and Scripting
Hi there,
I'd like to write a script that removes any set of characters from any string. The first argument would be the string, the second argument would be the characters to remove. For example:
$ myscript "My name's Santiago. What's yours?" "atu"
My nme's Snigo. Wh's yors?
I wrote the... (11 Replies)
Discussion started by: chebarbudo
4. UNIX for Dummies Questions & Answers
Dear Members,
We have a file which contains some special characters. I need to replace these special characters with a newline character (\n).
The special character is \x85.
I am not sure what this character means and how we can remove it.
Any inputs are greatly appreciated.
Thanks... (5 Replies)
Discussion started by: sandeep_1105
5. UNIX for Dummies Questions & Answers
Hi,
I have a directory that has a file which contained special characters in the filename. Can someone please advise how to remove the file, preferably with a rm -i ?
Thanks in advance.
Listing is as below:
{oracle}> ls -1b
bplog.bkup.001
bplog.bkup.002
bplog.bkup.003
bplog.bkup.004... (1 Reply)
Discussion started by: newbie_01
6. Shell Programming and Scripting
hello all
I am writing Perl code and I wish to remove the special characters from text.
I wish to remove all extended ASCII characters. If the list of special characters is huge, how can I do this using the substitute command?
s/specialcharacters/null/g
I really want to code like... (3 Replies)
Discussion started by: vasuarjula
7. Shell Programming and Scripting
Hi All,
I have a variable like
AVAIL="\
BACK:bkpstg:testdb3.iad.expertcity.com:backtest|\
#AUTH:authstg:testdb3.iad.expertcity.com:authiapd|\
TEST:authstg:testdb3.iad.expertcity.com:authiapd|\
"
What I want to do here is this: if I find # before any entry, remove the entire string... (5 Replies)
Discussion started by: engineermayur
8. Shell Programming and Scripting
Hi,
In the source data, a few of the columns have special characters (like *). Because of this I am not able to write the data to a flat file; some junk data shows up in the flat file instead.
Source data example:
Address1="XDERFTG * HYJUYTG"
How do I remove the special characters in a string? (2 Replies)
Discussion started by: koti_rama
9. Shell Programming and Scripting
Hi,
I have a file with this line, it's always in the first line:
I want to remove these special characters: ´╗┐
file1
´╗┐\\bar\c$\test2\;3.348.118 Bytes;160 ;3
\\bar\c$\test\;35 Bytes;2 ;1
I want the same file to be only
\\bar\c$\test2\;3.348.118 Bytes;160 ;3
\\bar\c$\test\;35... (4 Replies)
Discussion started by: nakaedu
10. Shell Programming and Scripting
Hi Gurus,
I have a file which contains some Unicode characters like "ü". I want to replace it with some other characters. I searched the internet and found the command sed "s/ü/-/g", but I don't know how to type ü on the Unix command line.
Please help me with this one.
Thanks in advance (7 Replies)
Discussion started by: ken6503
prezip-bin
PREZIP-BIN(1) Aspell Abbreviated User's Manual PREZIP-BIN(1)
NAME
prezip-bin - prefix zip delta word list compressor/decompressor
SYNOPSIS
prezip-bin [ -V | -d | -z ]
DESCRIPTION
prezip-bin compresses/decompresses sorted word lists from standard input to standard output.
Prezip-bin is similar to word-list-compress(1) but it allows a larger character set of {0x00...0x09, 0x0B, 0x0C, 0x0E...0xFF} and
multi-words larger than 255 characters in length. It can also decompress word-list-compress(1) compatible files.
COMMANDS
Prezip-bin accepts only one of these commands.
-V Display prezip-bin version number to standard output.
-d Read a compressed word list from standard input and decompress it to standard output. This can be a word-list-compress(1) or a
prezip-bin compressed file.
-z Read a binary word list from standard input and compress it to standard output.
EXAMPLES
prezip-bin -d <wordlist.cwl >wordlist.txt
Decompress file wordlist.cwl to text file wordlist.txt
prezip-bin -z <wordlist.txt >wordlist.pz 2>errors.txt
Compress wordlist.txt to binary file wordlist.pz and send any error messages to a text file named errors.txt
LC_COLLATE=C sort -u <wordlist.txt | prezip-bin -z >wordlist.pz
Sort a word list, then pipe it to prezip-bin to create a compressed binary wordlist.pz file.
prezip-bin -d <words.pz | aspell create master ./words.rws
Decompress a wordlist, then pipe it to aspell(1) to create a spelling list. Please check the aspell(1) info manual for proper usage
and options.
TIPS
Prezip-bin is best used with sorted word list type files. It is not a general purpose compression program since resulting files may
actually increase in size.
Unlike word-list-compress(1), if your word list has leading or trailing blank spaces for formatting purposes, you should remove them
before you compress your list using prezip-bin -z, otherwise those spaces will be included in the compressed binary output.
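The whitespace tip above could be applied with a small filter like the following. This is only a sketch: prepare_wordlist is a hypothetical helper name, the file names are placeholders, and it strips every kind of leading and trailing whitespace (tabs and carriage returns included), which is slightly broader than the blank spaces the tip mentions.

```shell
# Hypothetical helper: trim each line, then sort and de-duplicate so the
# result is the sorted word list that prezip-bin expects on standard input.
prepare_wordlist() {
    # First expression removes leading whitespace, second removes trailing.
    sed 's/^[[:space:]]*//; s/[[:space:]]*$//' |
        LC_COLLATE=C sort -u
}

# Usage (placeholder file names):
# prepare_wordlist < wordlist.txt | prezip-bin -z > wordlist.pz
```

Trimming before sorting matters here: lines that differ only in their padding collapse into one entry once the padding is gone.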
DIAGNOSTICS
Prezip-bin normally exits with a return code of 0. If it encounters an error, a message is sent to standard error output (stderr), and
prezip-bin exits with a non-zero return value. Error messages are listed below:
(display help/usage message)
Unknown command given on the command line so prezip-bin displays a usage message to standard error output.
unknown format
The input file appears not to be an expected format, or may possibly be a more advanced format. The output file will be empty.
corrupt input
This is only for the decompression command -d. The input file appeared to be of a correct format, but something appears wrong now.
There may be some valid data in output, but due to input corruption, the rest of the file can not be completed.
unexpected EOF
The input file appeared okay but ended sooner than expected, therefore the output file is not complete.
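Since every error case above ends in a non-zero exit status plus a message on stderr, a script can check the status before trusting the output. The wrapper below is a rough sketch, not part of aspell: decompress_wordlist and the PREZIP variable are hypothetical names, and the only prezip-bin behavior it relies on is the -d command and the exit status described in this section.

```shell
# PREZIP is a hypothetical override so the wrapper can be pointed at
# a different binary; by default it runs prezip-bin from PATH.
PREZIP=${PREZIP:-prezip-bin}

# decompress_wordlist SRC DST
# Decompress SRC into DST and report failures with the captured stderr text.
decompress_wordlist() {
    status=0
    "$PREZIP" -d < "$1" > "$2" 2> "$2.err" || status=$?
    if [ "$status" -ne 0 ]; then
        # Per DIAGNOSTICS, DST may be empty or only partly written here.
        echo "prezip-bin failed (exit $status); $2 may be empty or incomplete:" >&2
        cat "$2.err" >&2
    fi
    rm -f "$2.err"
    return "$status"
}

# Usage (placeholder file names):
# decompress_wordlist wordlist.cwl wordlist.txt || echo "do not use wordlist.txt"
```

The partial output is left in place on failure, because the corrupt input case may still yield some valid data.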
SEE ALSO
aspell(1), aspell-import(1), run-with-aspell(1), word-list-compress(1)
Aspell is fully documented in its Texinfo manual. See the `aspell' entry in info for more complete documentation.
REPORTING BUGS
For help, see the Aspell homepage at <http://aspell.net>. Send bug reports/comments to the Aspell user list at the above address.
AUTHOR
This info page was written by Jose Da Silva <digital@joescat.com>.
prezip-bin-0.1.2 2005-09-30 PREZIP-BIN(1)