Sponsored Content
Full Discussion: Unicode file validation
Top Forums Shell Programming and Scripting Unicode file validation Post 302618029 by Corona688 on Tuesday 3rd of April 2012 01:06:08 PM
Old 04-03-2012
Bumping up posts or double posting is not permitted in these forums.

Please read the rules, which you agreed to when you registered, if you have not already done so.

You may receive an infraction for this. If so, don't worry, just try to follow the rules more carefully. The infraction will expire in the near future

Thank You.

The UNIX and Linux Forums.

Also, your attachment doesn't seem to be working.
 

9 More Discussions You Might Find Interesting

1. Programming

How to display unicode characters / unicode string

I have a stream of characters like "\u8BBE\u5907\u7BA1" and i want to display it. I tried following things already without any luck. 1) printf("%s",L("\u8BBE\u5907\u7BA1")); 2) printf("%lc",0x8BBE); 3) setlocale followed by fwide followed by wprintf 4) also changed the local manually... (3 Replies)
Discussion started by: jackdorso
3 Replies

2. UNIX for Dummies Questions & Answers

grep and UNICODE (utf-16) file

I'm using shell scripting in Applescript. When searching a file with the ANSEL character set (for GEDCOM files) using (grep '1 CHAR ANSEL' filepath) gives the expected result. When searching a UNICODE formatted file (utf-16), searching for text known to exist in the file using (grep '1 CHAR... (4 Replies)
Discussion started by: Whiterock
4 Replies

3. Shell Programming and Scripting

Find Unicode Character in File

I have a very large file in Unix that I would like to search for all instances of the unicode character 0x17. I need to remove these characters because the character is causing my SAX Parser to throw an exception. Does anyone know how to find a unicode character in a file? Thank you for your... (1 Reply)
Discussion started by: azelinsk
1 Replies

4. AIX

Dont want to change the codepage of a unicode file

I have a unicode file which needs to be modifed in an AIX environment from within a shell (ksh). I am concerned that the modification may involve a change in the file's codepage. Is my concern correct ? If so, whats the way around ? Thanks in advance. (0 Replies)
Discussion started by: shibajighosh
0 Replies

5. Shell Programming and Scripting

sed replacement in unicode file

Hi there, I have a file generated by a windows registry (it's unicode) and can't get to do some replacements on it. I want to join lines that end with backslash with the next one. santiago@ks354286:~$ cat win.reg ÿþWindows Registry Editor Version 5.00 ... (10 Replies)
Discussion started by: chebarbudo
10 Replies

6. Shell Programming and Scripting

Converting Unicode file to UTF8 format

Hi, I have a file in my desktop which is a unicode format. After this file is transferred to Unix using FTP, we are seeing some special character (like rectangle box type) at the first line. The same file is saved as UTF8 (using textpad tool, selecting encode to UTF-8 option) on my desktopand... (7 Replies)
Discussion started by: vfrg
7 Replies

7. Shell Programming and Scripting

How to remove Unicode <feff> from top of file?

Experts, this has been dumped on me at the last minute.... i am having issue on few files where im getting files from source with BOM mark at the top of every file and i need to check for its existence and remove it. <feff> header Coulmn1|column2......n i know i can simply do sed on... (5 Replies)
Discussion started by: biztank
5 Replies

8. Shell Programming and Scripting

Reading/Viewing an Unicode file

WE have a file coming from a server that has characters for 4-5 languages. If I download the file to my windows PC and open in Notepad ++, I can clearly see the text in different languages. Notepad++ is able to reder text that is in Portugese, French, Thai etc. My objective it to do the following:... (2 Replies)
Discussion started by: vskr72
2 Replies

9. Shell Programming and Scripting

Wget download file content in unicode

Hi All, I am trying to download a XML from a URL through wget and successful in that but the problem is that I have to check for some special characters inside that XML. But when I download through wget it transfers the content of the XML in plain text and I'm not able to search for those... (2 Replies)
Discussion started by: dips_ag
2 Replies
nomarch(1)							Archive Extraction							nomarch(1)

NAME
nomarch - extract `.arc' archives SYNOPSIS
nomarch [-hlptUv] [archive.arc] [match1 [match2 ... ]] DESCRIPTION
nomarch lists, extracts, or tests `.arc' archives. (An alternate extension sometimes used was `.ark'; these work too.) This is a very out- dated file format which should certainly not be used for anything new, but you may still need an extraction utility, and here it is. :-) The default action is to extract all files in the specified archive; see OPTIONS below for how to do other things instead. OPTIONS
-h give terse usage help. -l list files in archive. If verbose listings are enabled, it shows the filename, compression method, compressed/uncompressed size, date/time, and CRC; but by default, it just shows the filename, uncompressed size, and date/time. -p extract to standard output, rather than to separate files. -t test files in archive (more precisely, check file CRCs). -U use uppercase filenames; more precisely, preserve original case from archive. -v give verbose output (when used with `-l'). archive.arc the archive to operate on. match1 etc. optionally specify which archive members to list/extract/test. Those which match any of these filenames/wildcards are processed. Wildcard operators supported are shell-like `*' and `?', but don't forget to quote arguments which use these (e.g. `nomarch foo.arc '*.bar''). EXTRACTING MULTIPLE ARCHIVES
nomarch follows the `unzip'-like practice of working on only one archive per run, with further `filenames' given on the command-line actu- ally specifying files to extract (or whatever). The easiest way to work on multiple files with nomarch is simply to run it multiple times using for; for example: for i in *.arc; do nomarch $i; done The above would extract all archives in the current directory. USING THE PROGRAM FROM EMACS
Emacs's arc-mode facility lets you work with various kinds of archive file directly from the editor. Making it use nomarch for extracting `.arc' files isn't too hard. Just add the following to your ~/.emacs file: (setq archive-arc-extract '("nomarch" "-U")) BUGS
The CRC used by the format is only 16-bit, so `-t' is a less-than-perfect test. One compression method, obsolete even by `.arc' standards :-), isn't supported yet. This is partly because I've yet to find a single file which uses it, despite testing an awful lot of files. Subdirectories in Spark archives are extracted as the `.arc'-format files they really are, which may not be terribly convenient. SEE ALSO
tar(1), gzip(1), bzip2(1), lbrate(1) AUTHOR
Russell Marks (rus@svgalib.org). Version 1.4 18th June, 2006 nomarch(1)
All times are GMT -4. The time now is 07:01 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy