How to Remove Special Characters
Post 302349753 by methyl on Tuesday 1st of September 2009 05:58:54 PM
I have a confession. I used the M$ Windows "calc" program to convert from hex to octal. We could of course have used unix "bc".
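For anyone who wants to do the conversion on the command line, here is a minimal sketch. The function names are made up for illustration, and the value 85 is only assumed from the variable name used in this post.

```shell
# Hypothetical helper names; neither function appears in the original post.

# Convert a hex value to octal with bc. obase is set first so that its
# value is not read in base 16 once ibase=16 takes effect (harmless for 8,
# but a common pitfall for other bases such as 10).
hex_to_octal_bc() {
    # bc only accepts uppercase hex digits.
    digits=$(printf '%s' "$1" | tr 'a-f' 'A-F')
    echo "obase=8; ibase=16; $digits" | bc
}

# The same conversion using the shell's own printf, for systems without bc.
hex_to_octal_printf() {
    printf '%o\n' "0x$1"
}

hex_to_octal_printf 85   # prints 205
```

Either function should give the same answer; the printf form just avoids depending on bc being installed.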

I've called the environment variable "X85". It's just a name.

The special syntax to "echo" just allows you to generate any character from its octal representation. The "\c" at the end of the string stops "echo" adding a newline.
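As a rough sketch of the idea: the exact command is not repeated in this post, so the octal value 205 (hex 85) is an assumption based on the variable name. Not every shell's built-in "echo" interprets backslash escapes (bash's does not by default), so the sketch uses "printf", which handles octal escapes consistently. On a shell whose "echo" does interpret them, something like echo "\0205\c" would serve the same purpose.

```shell
# Build a one-character variable from an octal escape.
# Octal 205 (hex 85) is assumed here from the variable name X85.
# printf adds no trailing newline, so no "\c" is needed.
X85=$(printf '\205')

# Inspect the result: od should show a single byte with hex value 85.
printf '%s' "$X85" | od -An -tx1
```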

The translate program "tr" used in this way translates all occurrences of one character to another character. Program "tr" and many others use "\n" to represent a newline.
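A small self-contained sketch of that translation follows. The sample input and the function name x85_to_newline are invented for illustration, and the byte value (octal 205) is assumed from the variable name.

```shell
# Assumed value: octal 205 (hex 85), taken from the variable name X85.
X85=$(printf '\205')

# Replace every occurrence of the X85 byte with a newline.
# LC_ALL=C asks tr to treat its input as raw bytes, which avoids
# complaints about invalid multibyte sequences in UTF-8 locales.
x85_to_newline() {
    LC_ALL=C tr "$X85" '\n'
}

# Made-up sample data: two fields, each terminated by the special byte.
printf 'first\205second\205' | x85_to_newline
```

Since "tr" understands octal escapes itself, tr '\205' '\n' should behave the same way without the intermediate variable.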
 

big5(5)                         File Formats Manual                         big5(5)

NAME
       big5 - A character encoding system (codeset) for Traditional Chinese

DESCRIPTION
       The big5 codeset is one of several codesets that support the Traditional Chinese language. This codeset includes the following character sets:

              ASCII
              Big-5

       The big5 codeset uses a combination of single-byte data and two-byte data to represent ASCII characters, symbols, and Chinese ideographic characters.

   ASCII Characters
       All ASCII characters are represented in the form of single-byte, 7-bit data in the big5 codeset; that is, the most significant bit (MSB) of a byte that represents an ASCII character is always set off. For more information, see ascii(5).

   Big-5 Character Groups
       The Big-5 character set defines the following character groups:

              Special symbols (408)
              Level 1 characters (5401)
              Level 2 characters (7652)
              Level 1 user-defined space (785)
              Level 2 user-defined space (2983)
              Level 3 user-defined space (2041)

   Code Values for Big-5 Characters
       Each Big-5 character is represented by a two-byte code that complies with the Big-5 standard. The MSB of the first byte is always set on while that of the second byte can be on or off. Code ranges for characters in the different character groups are as follows:

              Special symbols:               A140 to A3BF
              Level 1 characters:            A440 to C67E
              Level 2 characters:            C940 to F9D5
              Level 1 user-defined space:    FA40 to FEFE
              Level 2 user-defined space:    8E40 to A0FE
              Level 3 user-defined space:    8140 to 8DFE

       In this space, the valid code range for the first byte is 81 to FE, while that for the second byte is 40 to 7E and A1 to FE.

   Codeset Conversion
       The following codeset converter pairs are available for converting Traditional Chinese characters between big5 and other encoding formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which big5 is the input or output, see the reference page specified in the list item.

       dechanyu_big5, big5_dechanyu
              Converting from and to DEC Hanyu: dechanyu(5)

       dechanzi_big5, big5_dechanzi
              Converting from and to DEC Hanzi: dechanzi(5)

       eucTW_big5, big5_eucTW
              Converting from and to Taiwanese Extended UNIX Code: eucTW(5)

       sbig5_big5, big5_sbig5
              Converting from and to Shift Big-5: sbig5(5)

       telecode_big5, big5_telecode
              Converting from and to Telecode: telecode(5)

       UCS-2_big5, big5_UCS-2
              Converting from and to UCS-2: Unicode(5)

       UCS-4_big5, big5_UCS-4
              Converting from and to UCS-4: Unicode(5)

       UTF-8_big5, big5_UTF-8
              Converting from and to UTF-8: Unicode(5)

       Note

       The big5 encoding format is identical to the encoding format used in PC code pages that support Traditional Chinese. Therefore, you can use codeset converters that convert between big5 and UCS-2, UCS-4, or UTF-8 to convert Traditional Chinese data between PC code-page and Unicode encoding formats. Refer to code_page(5) for a discussion of how the operating system supports PC code pages.

   Fonts for Big-5 Characters
       The operating system supports Big-5 code by internally converting characters to DEC Hanyu. Therefore, DEC Hanyu fonts are used for Big-5 characters. Both display and printer fonts are provided for DEC Hanyu and these are listed in the dechanyu(5) reference page. For general information about printer support for and codeset conversion of Asian text, refer to i18n_printing(5).

SEE ALSO
       Commands: locale(1)

       Others: ascii(5), Chinese(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5), GBK(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), l10n_intro(5), sbig5(5), telecode(5), Unicode(5)
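
The byte ranges quoted in the big5(5) page can be turned into a quick validity check. The following is only a sketch: the function name is_big5_pair is hypothetical, it takes the two bytes as hex strings, and it tests the overall first-byte and second-byte ranges only, not membership in a particular character group.

```shell
# is_big5_pair HI LO
# Succeeds if HI and LO (hex byte values such as A4 and 40) fall inside the
# two-byte ranges quoted from big5(5): first byte 81 to FE, second byte
# 40 to 7E or A1 to FE. Hypothetical helper, not part of any system tool.
is_big5_pair() {
    b5_hi=$((0x$1))
    b5_lo=$((0x$2))

    # First byte must lie in 81..FE.
    { [ "$b5_hi" -ge $((0x81)) ] && [ "$b5_hi" -le $((0xFE)) ]; } || return 1

    # Second byte must lie in 40..7E or A1..FE.
    { [ "$b5_lo" -ge $((0x40)) ] && [ "$b5_lo" -le $((0x7E)) ]; } ||
    { [ "$b5_lo" -ge $((0xA1)) ] && [ "$b5_lo" -le $((0xFE)) ]; }
}

# A440 is the first code in the Level 1 range, so this check succeeds.
is_big5_pair A4 40 && echo "A440 is in range"
```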