07-01-2012
Quote:
The file is getting dynamicaaly generated through a .net application.The name of the file is kept as per the entry made by the users from the interface.
These all files have french names and hence have the accent character.
Imho this is a system design error. By all means hold the user's reference to the file within the file, but don't let them decide filenames on your system. Apart from your current issues with cross-platform character sets, you lay yourself open to code injection.
If you want to maintain cross-platform compatibility, stick to alphanumeric filenames within the 7-bit ASCII character set. When Windows and FTP is involved, the MSDOS 8.3 format (not the contracted Windows version of 8.3 which includes a tilde character) is preferred.
i.e. Filenames like XXXXXXXX.XXX (i.e 8-dot-3).
Btew. UTF-8 makes the stuation worse because it uses more than one character to express one special character.
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hello,
I try to insert charcheters with accent with emacs.
It doesn't work.
How can I do ?
Thank you in advance (2 Replies)
Discussion started by: annemar
2 Replies
2. Shell Programming and Scripting
Hi everyone !
I'd like to write a unix command for correcting all european accent errors in a document (spanish, german, french, danisch, etc )!
i need to do this for correcting my document :
sed -e 's/%2B/\ /g' -e 's/%25C9/É/g' doc1 > doc2
The first command is ok and change "%2B" into... (3 Replies)
Discussion started by: Tomat75
3 Replies
3. Shell Programming and Scripting
I am new to shell scripting, and need a script to randomly distribute each character from a file into one of three new files. I also need each character to maintain it's position from the original file in the new file (such that if a character is written to File 1, Files 2 and 3 have spaces... (10 Replies)
Discussion started by: foxcastle
10 Replies
4. Shell Programming and Scripting
Hello All
in a text file I have to replace some numeric code by a string.
This is an exemple of the file:
000000001 LDR L ^^^^^nam^^2200169Ia^45e0
000000001 008 L 100604s9999^^^^xx^^^^^^^^^^^^000^0^und^d
000000001 022 L $$a0365-6675
000000001 090 L $$aBMA 1934-1937.
000000001 245... (1 Reply)
Discussion started by: ldiaz2106
1 Replies
5. Shell Programming and Scripting
Hi,
I am creating a script that will:
1. Connect to SFTP server
2. Get the file content's first 6 character (ddmmyy) and compare it to today's date (also in ddmmyy format). This is the header of the file. However, the header contains a few more information. So i need to only get the first 6... (0 Replies)
Discussion started by: cherriesh
0 Replies
6. Shell Programming and Scripting
Hello All
hope all fine,
I have a question about spanish accents...
I have in a redhat server, a lot of files with Ñ or Ú accent into the NAME of the file.
So my question is: Is it possible to change this name but with a script...to change all the occurence in one shot.
Exemple:
cd... (2 Replies)
Discussion started by: ldiaz2106
2 Replies
7. Shell Programming and Scripting
Hello all,
After searching through the similar topics and not finding a working solution, I decided to join and post my question (and maybe kill time and help other users).
Essentially I am trying to get a file from the sftp and then delete it after the file is pulled. all the sftp commands... (3 Replies)
Discussion started by: MarcMaiden
3 Replies
8. Shell Programming and Scripting
Hi friend,
i have the following problem:
when i am writting the below command on the command prompt , its working.
while read -n 1 ch; do echo "$ch" ; echo "$ch" ; done<file_name.out.
but when i am executing it after saving it in a ksh file, its not working.
Please helppppppppp .. thankss... (18 Replies)
Discussion started by: neelmani
18 Replies
9. Red Hat
Hello,
I have a weird issue, I have RHEL 5.7 running with openssh5.2 where sftpgroup OS group is chroot. I see the difference difference in timestamp on files, when I login via ssh and SFTP, I see four hour difference, is something missing in my configuration.
#pwd... (8 Replies)
Discussion started by: bobby320
8 Replies
10. AIX
Hi guys, I'm having a issue with a windows file from, at first the file is readed like one big and extense line and the famous "Ctrl+V Ctrl+R" or "^M return carriage"... fixed with:
perl -pe 'if ( s/\r\n?/\n/g ) { $f=1 }; if ( $f || ! $m ) { s/()\z/$1\n/ }; $m=1' $file_input > file_output
... (5 Replies)
Discussion started by: jockx
5 Replies
LEARN ABOUT OSF1
iso-2022
iso2022(5) File Formats Manual iso2022(5)
NAME
iso2022, iso-2022, ISO-2022 - A character encoding mechanism standardized by the International Standards Organization (ISO)
DESCRIPTION
The ISO-2022 standard defines a mechanism for handling single-byte and multibyte characters. The standard specifies four classes of charac-
ter sets: The 94-charset class, which contains character sets with 94 positions (single-byte characters). Examples are the ASCII and JIS
X0201 character sets. The 96-charset class, which contains character sets with 96 positions (single-byte characters). Examples are the ISO
Latin series of character sets. The 94x94-charset class, which contains character sets with 94x94 positions (2-byte characters). Examples
are the GB 2312 and the CNS 11643 character sets. The 96x96-charset class, which contains character sets with 96x96 positions (2-byte
characters).
In the ISO-2022 standard, four registers, called G0, G1, G2 and G3, are used to reference a character set. Before a character set can be
used, the character set must be assigned, or designated, to one of these registers. The designation of a character set is done by using an
escape sequence in the following format:
ESC [I] F
In this format: Is an intermediate character that is used to designate a character set to one of the registers (G0, G1, G2, oR G3). Is a
unique final character of a particular character set.
The designation of a character set, whose final character is F, to different registers is as follows: Designates a multibyte character set
(94x94 or 96x96) to G0. Designates a character set in the 94-charset class to G0. Designates a character set in the 94-charset class to
G1. Designates a character set in the 94-charset class to G2. Designates a character set in the 94-charset class to G3. Designates a
character set in the 96-charset class to G1. Designates a character set in the 96-charset class to G2. Designates a character set in the
96-charset class to G3.
SEE ALSO
Commands: locale(1)
Others: ascii(5), i18n_intro(5), iso2022jp(5), l10n_intro(5)
iso2022(5)