Shell Programming and Scripting: Help with \u0401 codes ? unicode or something
Post 302461151 by fpmurphy on Friday 8th of October 2010 04:17:48 PM
Looks like CYRILLIC SMALL LETTER I (i.e., the reversed small N) and LATIN SMALL LETTER E WITH STROKE (i.e., a small e with a forward stroke through it), but my bet is that they are something else as far as your application is concerned.

Can you provide an example of the "casual characters" you need to convert?
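One way to check what a `\uXXXX` code actually refers to is to look up its Unicode name. The following is a minimal sketch using Python's standard `unicodedata` module. The helper name `describe_escapes` and the sample string are assumptions made for illustration (the sample combines the escape from the thread title with the two characters named above) and are not the original poster's data.

```python
import re
import unicodedata

def describe_escapes(text):
    """Return (escape, character, Unicode name) for each \\uXXXX escape in text."""
    results = []
    for match in re.finditer(r"\\u([0-9A-Fa-f]{4})", text):
        # Turn the four hex digits into the corresponding code point.
        char = chr(int(match.group(1), 16))
        # Unassigned or unnamed code points fall back to a placeholder.
        name = unicodedata.name(char, "<unnamed>")
        results.append((match.group(0), char, name))
    return results

# Hypothetical sample input, written as a raw string so the escapes stay literal.
sample = r"\u0401 \u0438 \u0247"
for escape, char, name in describe_escapes(sample):
    print(escape, char, name)
```

Looking up the name first makes it easier to tell whether the codes really are the characters they appear to be, or bytes from another encoding that were misread.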
 

ISO_8859-10(7)						     Linux Programmer's Manual						    ISO_8859-10(7)

NAME
       iso_8859-10 - ISO 8859-10 character set encoded in octal, decimal, and hexadecimal

DESCRIPTION
       The ISO 8859 standard includes several 8-bit extensions to the ASCII character set (also known as ISO 646-IRV). ISO 8859-10 encodes the characters used in Nordic languages.

   ISO 8859 alphabets
       The full set of ISO 8859 alphabets includes:

       ISO 8859-1    West European languages (Latin-1)
       ISO 8859-2    Central and East European languages (Latin-2)
       ISO 8859-3    Southeast European and miscellaneous languages (Latin-3)
       ISO 8859-4    Scandinavian/Baltic languages (Latin-4)
       ISO 8859-5    Latin/Cyrillic
       ISO 8859-6    Latin/Arabic
       ISO 8859-7    Latin/Greek
       ISO 8859-8    Latin/Hebrew
       ISO 8859-9    Latin-1 modification for Turkish (Latin-5)
       ISO 8859-10   Lappish/Nordic/Eskimo languages (Latin-6)
       ISO 8859-11   Latin/Thai
       ISO 8859-13   Baltic Rim languages (Latin-7)
       ISO 8859-14   Celtic (Latin-8)
       ISO 8859-15   West European languages (Latin-9)
       ISO 8859-16   Romanian (Latin-10)

   ISO 8859-10 characters
       The following table displays the characters in ISO 8859-10, which are printable and unlisted in the ascii(7) manual page. The fourth column will only show the proper glyphs in an environment configured for ISO 8859-10.

       Oct   Dec   Hex   Char   Description
       ----------------------------------------------------------------
       240   160   A0           NO-BREAK SPACE
       241   161   A1    Ą      LATIN CAPITAL LETTER A WITH OGONEK
       242   162   A2    Ē      LATIN CAPITAL LETTER E WITH MACRON
       243   163   A3    Ģ      LATIN CAPITAL LETTER G WITH CEDILLA
       244   164   A4    Ī      LATIN CAPITAL LETTER I WITH MACRON
       245   165   A5    Ĩ      LATIN CAPITAL LETTER I WITH TILDE
       246   166   A6    Ķ      LATIN CAPITAL LETTER K WITH CEDILLA
       247   167   A7    §      SECTION SIGN
       250   168   A8    Ļ      LATIN CAPITAL LETTER L WITH CEDILLA
       251   169   A9    Đ      LATIN CAPITAL LETTER D WITH STROKE
       252   170   AA    Š      LATIN CAPITAL LETTER S WITH CARON
       253   171   AB    Ŧ      LATIN CAPITAL LETTER T WITH STROKE
       254   172   AC    Ž      LATIN CAPITAL LETTER Z WITH CARON
       255   173   AD           SOFT HYPHEN
       256   174   AE    Ū      LATIN CAPITAL LETTER U WITH MACRON
       257   175   AF    Ŋ      LATIN CAPITAL LETTER ENG (Sami)
       260   176   B0    °      DEGREE SIGN
       261   177   B1    ą      LATIN SMALL LETTER A WITH OGONEK
       262   178   B2    ē      LATIN SMALL LETTER E WITH MACRON
       263   179   B3    ģ      LATIN SMALL LETTER G WITH CEDILLA
       264   180   B4    ī      LATIN SMALL LETTER I WITH MACRON
       265   181   B5    ĩ      LATIN SMALL LETTER I WITH TILDE
       266   182   B6    ķ      LATIN SMALL LETTER K WITH CEDILLA
       267   183   B7    ·      MIDDLE DOT
       270   184   B8    ļ      LATIN SMALL LETTER L WITH CEDILLA
       271   185   B9    đ      LATIN SMALL LETTER D WITH STROKE
       272   186   BA    š      LATIN SMALL LETTER S WITH CARON
       273   187   BB    ŧ      LATIN SMALL LETTER T WITH STROKE
       274   188   BC    ž      LATIN SMALL LETTER Z WITH CARON
       275   189   BD    ―      HORIZONTAL BAR
       276   190   BE    ū      LATIN SMALL LETTER U WITH MACRON
       277   191   BF    ŋ      LATIN SMALL LETTER ENG (Sami)
       300   192   C0    Ā      LATIN CAPITAL LETTER A WITH MACRON
       301   193   C1    Á      LATIN CAPITAL LETTER A WITH ACUTE
       302   194   C2    Â      LATIN CAPITAL LETTER A WITH CIRCUMFLEX
       303   195   C3    Ã      LATIN CAPITAL LETTER A WITH TILDE
       304   196   C4    Ä      LATIN CAPITAL LETTER A WITH DIAERESIS
       305   197   C5    Å      LATIN CAPITAL LETTER A WITH RING ABOVE
       306   198   C6    Æ      LATIN CAPITAL LETTER AE
       307   199   C7    Į      LATIN CAPITAL LETTER I WITH OGONEK
       310   200   C8    Č      LATIN CAPITAL LETTER C WITH CARON
       311   201   C9    É      LATIN CAPITAL LETTER E WITH ACUTE
       312   202   CA    Ę      LATIN CAPITAL LETTER E WITH OGONEK
       313   203   CB    Ë      LATIN CAPITAL LETTER E WITH DIAERESIS
       314   204   CC    Ė      LATIN CAPITAL LETTER E WITH DOT ABOVE
       315   205   CD    Í      LATIN CAPITAL LETTER I WITH ACUTE
       316   206   CE    Î      LATIN CAPITAL LETTER I WITH CIRCUMFLEX
       317   207   CF    Ï      LATIN CAPITAL LETTER I WITH DIAERESIS
       320   208   D0    Ð      LATIN CAPITAL LETTER ETH (Icelandic)
       321   209   D1    Ņ      LATIN CAPITAL LETTER N WITH CEDILLA
       322   210   D2    Ō      LATIN CAPITAL LETTER O WITH MACRON
       323   211   D3    Ó      LATIN CAPITAL LETTER O WITH ACUTE
       324   212   D4    Ô      LATIN CAPITAL LETTER O WITH CIRCUMFLEX
       325   213   D5    Õ      LATIN CAPITAL LETTER O WITH TILDE
       326   214   D6    Ö      LATIN CAPITAL LETTER O WITH DIAERESIS
       327   215   D7    Ũ      LATIN CAPITAL LETTER U WITH TILDE
       330   216   D8    Ø      LATIN CAPITAL LETTER O WITH STROKE
       331   217   D9    Ų      LATIN CAPITAL LETTER U WITH OGONEK
       332   218   DA    Ú      LATIN CAPITAL LETTER U WITH ACUTE
       333   219   DB    Û      LATIN CAPITAL LETTER U WITH CIRCUMFLEX
       334   220   DC    Ü      LATIN CAPITAL LETTER U WITH DIAERESIS
       335   221   DD    Ý      LATIN CAPITAL LETTER Y WITH ACUTE
       336   222   DE    Þ      LATIN CAPITAL LETTER THORN (Icelandic)
       337   223   DF    ß      LATIN SMALL LETTER SHARP S (German)
       340   224   E0    ā      LATIN SMALL LETTER A WITH MACRON
       341   225   E1    á      LATIN SMALL LETTER A WITH ACUTE
       342   226   E2    â      LATIN SMALL LETTER A WITH CIRCUMFLEX
       343   227   E3    ã      LATIN SMALL LETTER A WITH TILDE
       344   228   E4    ä      LATIN SMALL LETTER A WITH DIAERESIS
       345   229   E5    å      LATIN SMALL LETTER A WITH RING ABOVE
       346   230   E6    æ      LATIN SMALL LETTER AE
       347   231   E7    į      LATIN SMALL LETTER I WITH OGONEK
       350   232   E8    č      LATIN SMALL LETTER C WITH CARON
       351   233   E9    é      LATIN SMALL LETTER E WITH ACUTE
       352   234   EA    ę      LATIN SMALL LETTER E WITH OGONEK
       353   235   EB    ë      LATIN SMALL LETTER E WITH DIAERESIS
       354   236   EC    ė      LATIN SMALL LETTER E WITH DOT ABOVE
       355   237   ED    í      LATIN SMALL LETTER I WITH ACUTE
       356   238   EE    î      LATIN SMALL LETTER I WITH CIRCUMFLEX
       357   239   EF    ï      LATIN SMALL LETTER I WITH DIAERESIS
       360   240   F0    ð      LATIN SMALL LETTER ETH (Icelandic)
       361   241   F1    ņ      LATIN SMALL LETTER N WITH CEDILLA
       362   242   F2    ō      LATIN SMALL LETTER O WITH MACRON
       363   243   F3    ó      LATIN SMALL LETTER O WITH ACUTE
       364   244   F4    ô      LATIN SMALL LETTER O WITH CIRCUMFLEX
       365   245   F5    õ      LATIN SMALL LETTER O WITH TILDE
       366   246   F6    ö      LATIN SMALL LETTER O WITH DIAERESIS
       367   247   F7    ũ      LATIN SMALL LETTER U WITH TILDE
       370   248   F8    ø      LATIN SMALL LETTER O WITH STROKE
       371   249   F9    ų      LATIN SMALL LETTER U WITH OGONEK
       372   250   FA    ú      LATIN SMALL LETTER U WITH ACUTE
       373   251   FB    û      LATIN SMALL LETTER U WITH CIRCUMFLEX
       374   252   FC    ü      LATIN SMALL LETTER U WITH DIAERESIS
       375   253   FD    ý      LATIN SMALL LETTER Y WITH ACUTE
       376   254   FE    þ      LATIN SMALL LETTER THORN (Icelandic)
       377   255   FF    ĸ      LATIN SMALL LETTER KRA (Greenlandic)

NOTES
       ISO 8859-10 is also known as Latin-6.

SEE ALSO
       ascii(7)

COLOPHON
       This page is part of release 3.53 of the Linux man-pages project. A description of the project, and information about reporting bugs, can be found at http://www.kernel.org/doc/man-pages/.

Linux                                2010-09-20                         ISO_8859-10(7)
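
The byte-to-character mapping in the table above can also be checked programmatically. This is a minimal sketch, assuming Python's standard `iso8859_10` codec is an acceptable stand-in for the table. The function name `describe_latin6` and the sample bytes are hypothetical and chosen only for illustration.

```python
import unicodedata

def describe_latin6(raw):
    """Decode ISO 8859-10 bytes and pair each byte with its Unicode name.

    Each row holds (octal, decimal, hex, character, name), mirroring the
    columns of the man page table.
    """
    # ISO 8859-10 is a single-byte encoding, so bytes and decoded
    # characters line up one to one.
    text = raw.decode("iso8859_10")
    return [
        (f"{byte:03o}", byte, f"{byte:02X}", char,
         unicodedata.name(char, "<unnamed>"))
        for byte, char in zip(raw, text)
    ]

# Hypothetical sample: three bytes taken from the table (A1, BD, FF).
for row in describe_latin6(bytes([0xA1, 0xBD, 0xFF])):
    print(*row)
```

To convert such data for a UTF-8 environment, the decoded string can be re-encoded with `text.encode("utf-8")`.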
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.