08-31-2011
Removing Non-printable characters in unix file
Thanks for your reply ....
as the above link mentioned "sed -e 's/\"®\"/ /g' -e 's/\"™\"/ /g' < file" i followed still i am not able to convert it. please put your suggestions.
thanks in advance
Sue
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I need to check ftp'd incoming files for characters that are not alphanumeric,<tab>, <cr>, or <lf> characters. Each file would have 10-20,000 line with up to 3,000 characters per line. Should I use awk, sed, or grep and what would the command look like to do such a search? Thanks much to anyone... (2 Replies)
Discussion started by: jvander
2 Replies
2. Shell Programming and Scripting
Sometimes obvious things... are not so obvious. I always thought that it was possible to grep non printable characters but not with my GNU grep (5.2.1) version.
printf "Hello\tWorld" | grep -l '\t'
printf "Hello\tWorld" | grep -l '\x09'
printf "Hello\tWorld" | grep -l '\x{09}'
None of them... (3 Replies)
Discussion started by: ripat
3 Replies
3. UNIX for Dummies Questions & Answers
i have a file which contains non printable characters
like enter,escape etc
i want to delete them from the file (2 Replies)
Discussion started by: alokjyotibal
2 Replies
4. Shell Programming and Scripting
How do I remove non-printable characters from all txt files and output the results to one file?
I've tried the following:
tr -cd '\n' < *.txt > out.txt
and it gives ambiguous redirect error.
How can I get it to operate on all txt files in the current directory and append the output to... (1 Reply)
Discussion started by: revax
1 Replies
5. Shell Programming and Scripting
Hi,
I want to removing ^M characters from a file and combine the line with the next line.
ex:
issue i have:
ABC^M^M
DEF
solution i need:
ABCDEF
I found that you by using the following command you can remove new line characters.
tr -d '\r' < infile.csv > outfile.csv
still... (10 Replies)
Discussion started by: mwrg
10 Replies
6. HP-UX
I have been using OKI data Microline printers; models 590 and 591 to print a bar code using the following escape sequence:
\E^PA^H^C00^D^C^A^A^A\E^PB^H
The escape sequence is stored in a unix file which is edited using vi.
Now, we are considering Microline printer model 395C and the bar code... (3 Replies)
Discussion started by: Joy Conner
3 Replies
7. UNIX for Dummies Questions & Answers
Hi,
in a file, i have records as below:
123|62|absnb|267629
123|267|28728|uiuip
123|567|26761|2676
i want to remove the non printable characters after the end of each record.
I guess there are certain charcters but not visible.
i don't know what character that is exactly.
I used... (2 Replies)
Discussion started by: pandeesh
2 Replies
8. Shell Programming and Scripting
Unable to grep:
Able to grep: (11 Replies)
Discussion started by: proactiveaditya
11 Replies
9. Shell Programming and Scripting
Hi All,
I am trying to find non-printable characters in a string. The sting could have alphanumeric, puntuations and characters like (*&%$#.') but not non-printable (or that is what I think they are called) which are introduced when you copy any text from DOS to unix box.
Input string1:... (10 Replies)
Discussion started by: dips_ag
10 Replies
10. Shell Programming and Scripting
Hi,
I have a huge file (50 Mil rows) which has certain non-printable ASCII characters in it. I am cleaning the file by deleting those characters using the following command -
tr -cd '\11\12\15\40-\176' < unclean_file > clean_file
Please note that I am excluding the following -
tab,... (6 Replies)
Discussion started by: rishigc
6 Replies
LEARN ABOUT PHP
pspell_new
PSPELL_NEW(3) 1 PSPELL_NEW(3)
pspell_new - Load a new dictionary
SYNOPSIS
int pspell_new (string $language, [string $spelling], [string $jargon], [string $encoding], [int $mode])
DESCRIPTION
pspell_new(3) opens up a new dictionary and returns the dictionary link identifier for use in other pspell functions.
For more information and examples, check out inline manual pspell website:http://aspell.net/.
PARAMETERS
o $language
- The language parameter is the language code which consists of the two letter ISO 639 language code and an optional two letter
ISO 3166 country code after a dash or underscore.
o $spelling
- The spelling parameter is the requested spelling for languages with more than one spelling such as English. Known values are
'american', 'british', and 'canadian'.
o $jargon
- The jargon parameter contains extra information to distinguish two different words lists that have the same language and spell-
ing parameters.
o $encoding
- The encoding parameter is the encoding that words are expected to be in. Valid values are 'utf-8', 'iso8859-*', 'koi8-r',
'viscii', 'cp1252', 'machine unsigned 16', 'machine unsigned 32'. This parameter is largely untested, so be careful when using.
o $mode
- The mode parameter is the mode in which spellchecker will work. There are several modes available:
o PSPELL_FAST - Fast mode (least number of suggestions)
o PSPELL_NORMAL - Normal mode (more suggestions)
o PSPELL_BAD_SPELLERS - Slow mode (a lot of suggestions)
o PSPELL_RUN_TOGETHER - Consider run-together words as legal compounds. That is, "thecat" will be a legal compound, although
there should be a space between the two words. Changing this setting only affects the results returned by pspell_check(3);
pspell_suggest(3) will still return suggestions.
Mode is a bitmask constructed from different constants listed above. However, PSPELL_FAST, PSPELL_NORMAL and PSPELL_BAD_SPELLERS
are mutually exclusive, so you should select only one of them.
RETURN VALUES
Returns the dictionary link identifier on success or FALSE on failure.
EXAMPLES
Example #1
pspell_new(3)
<?php
$pspell_link = pspell_new("en", "", "", "",
(PSPELL_FAST|PSPELL_RUN_TOGETHER));
?>
PHP Documentation Group PSPELL_NEW(3)