Sponsored Content
Top Forums Shell Programming and Scripting Convert directory of text files to Unix/Linux Line Ending Post 302548570 by chipperuga on Thursday 18th of August 2011 06:30:13 PM
Old 08-18-2011
Quote:
Originally Posted by alister
That question doesn't really make much sense. Unicode is not an encoding; it is a character set. UTF-8 is one of many ways to encode the Unicode character set. This means you have yet to state the source's encoding.

In any case, you probably want to take a look at iconv for the transcoding aspect of the task.

Regards,
Alister
Interesting. When I open Notepad in Windows and click on save as, there are 4 options for encoding: ANSI, Unicode, Unicode big endian, UTF-8.

The logs I am referring to are encoded, at least according to Notepad, in Unicode. I'm able to change them to UTF-8 and they look right with the cat command in cygwin.
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How do I convert unix text to to win text?

How do I convert unix text files into readable text for windows. Dave (1 Reply)
Discussion started by: nucca
1 Replies

2. UNIX for Dummies Questions & Answers

How to convert binary Unix file to text

Hi all, I have a print control file (dflt) for Oracle which is in binary. As I am going to develope an application in Window environment, I would like to reference the dflt file. But it is in binary format and I cannot access it. Anyone can suggest me how to convert the file into text or... (5 Replies)
Discussion started by: user12345
5 Replies

3. UNIX for Dummies Questions & Answers

convert all *. c files to *.cpp files in a directory

Hiiiii.... how to convert all *. c files to *.cpp files , in a directory given using shell script. :pThnaking u.:p (10 Replies)
Discussion started by: krishnampkkm
10 Replies

4. Shell Programming and Scripting

Stress testing php files at Unix/Linux Command line

Hi, Your great help is very appreciated. I am looking for any Unix command or tool for doing Stress/Load test of php files at command prompt. I tried torture.pl but it is not working after20 concurrent threads/users. as it is very urgent for me..please suggest ur ideas asap. thanks (5 Replies)
Discussion started by: Malleswari
5 Replies

5. UNIX for Dummies Questions & Answers

How to get last word from a line ending with "/" in Unix

Hello All, I have a scenario to read a file containing text like this:(say file name is Dummy.txt) /home/abc/test1/ | file1 /home/abc/test2/ | file2 I used a variable to store the content from file like this (say for line1): File=`head -1 Dummy.txt | cut -f1 -d "|"` Dir=`head -1 Dummy.txt... (2 Replies)
Discussion started by: Quesemail
2 Replies

6. UNIX for Dummies Questions & Answers

Listing files in a Unix Directory ending with .....

Hi Unix Gurus, I need to list all files in a Unix Directory which either end with a .pdf or .rtf and they should be case insensitive ie .Pdf , .pDF , .RtF etc are also possible. How can i accomplish this with with a ls command ? If not then a find command. (6 Replies)
Discussion started by: pchegoor
6 Replies

7. Windows & DOS: Issues & Discussions

Convert UNIX text file in Windows to recognize line breaks

Hmmm I think I found the correct subforum to ask my question... I have some text files that I prepared in vi some time ago, and now I want to open and edit them with Windows Notepad. I don't have a Unix terminal at the moment so I need to do the conversion in Windows. Is there a way to do this?... (1 Reply)
Discussion started by: frys_hp
1 Replies

8. UNIX for Dummies Questions & Answers

Convert UNIX text file in Windows to recognize line breaks

Hi all, I have some text files that I prepared in vi some time ago, and now I want to open and edit them with Windows Notepad. I don't have a Unix terminal at the moment so I need to do the conversion in Windows. Is there a way to do this? Or just reinsert thousands of line breaks again :eek: ? (2 Replies)
Discussion started by: frys_hp
2 Replies

9. Shell Programming and Scripting

Extract specific line in an html file starting and ending with specific pattern to a text file

Hi This is my first post and I'm just a beginner. So please be nice to me. I have a couple of html files where a pattern beginning with "http://www.site.com" and ending with "/resource.dat" is present on every 241st line. How do I extract this to a new text file? I have tried sed -n 241,241p... (13 Replies)
Discussion started by: dejavo
13 Replies

10. Shell Programming and Scripting

Add text at start and ending of every line

Hi all, Is there other way to Add text at start and ending of every line? here my script: cat file.txt |awk '{print "<p align=\"justify\">"$0"</p>"}' but the problem they put including white spaces, I only need those line have a sentence or text not an skip all have empty string or have... (7 Replies)
Discussion started by: lxdorney
7 Replies
uniname(1)						      General Commands Manual							uniname(1)

NAME
uniname - Name the characters in a Unicode text file SYNOPSIS
uniname ([option flags]) (<file name>) If no input file name is supplied, uniname reads from the standard input. DESCRIPTION
uniname names the characters in a Unicode text file. For each character, uniname defaults to printing the character offset, the byte off- set, the hexadecimal UTF-32 character code, the encoding as a sequence of hex byte values, the glyph, and the character's Unicode name. Command line flags allow undesired information to be suppressed. Glyphs that do not display nicely, such as control characters and spaces, are not displayed. For the Latin-1 control characters, whose official Unicode name is "control", the real name is given. Character and byte offsets both start from 0. Where a character does not have a unique Unicode name, as is the case with Chinese characters, the character is identified as "character in such-and-such a range". However, if the character is a Chinese character listed in Nelson's dictionary, the Nelson number is supplied. By default, input is expected to be UTF-8. Native order UTF-32 may be specified via the command line flag If invalid UTF8 is encountered, an explanation is printed as to why it is invalid. -q. COMMAND LINE FLAGS
-A Skip ASCII whitespace characters. -a Skip ASCII characters. -B Skip characters within the Basic Multilingual Plane. -b Suppress printing of byte offset. -c Suppress printing of character offset. -e Suppress printing of encoding. -g Suppress printing of glyph. -h Print usage information. -l Print line number. -n Suppress printing of Unicode name. -p Suppress printing of headers every screenfull. -q Input is native order UTF-32. -r Print Unicode range. The ranges reported include both official Unicode ranges and the constructed language ranges within the Pri- vate Use Areas registered with the Conscript Unicode Registry (http://www.evertype.com/standards/csur/). -s <character offset> Skip to specified character offset. -S <byte offset> Skip to specified byte offset. Note that even if the file consists of well-formed Unicode there is no guarantee that the byte sequence beginning at an arbitrary byte will be valid Unicode. This option is provided for use where other programs generate only byte offsets or where it is necessary to skip over damaged Unicode. In most circumstances use of a character offset will be more apprpriate. If a byte offset is used, the character offsets shown are with respect to the beginning of the section of the file exam- ined rather than the beginning of the file. -u Suppress printing of UTF32 code. -V Validate the input. In this case, nothing is done other than determine whether the input is valid UTF-8 Unicode. If it is, no output is produced and the program exits with status 0. If invalid UTF-8 is encountered, the program reports the location of the first invalid UTF-8 encountered, explains why it is invalid, and exits with status 1. -v Print version information. SEE ALSO
unidesc REFERENCES
Unicode Standard, version 5.1 AUTHOR
Bill Poser billposer@alum.mit.edu LICENSE
GNU General Public License February, 2009 uniname(1)
All times are GMT -4. The time now is 04:27 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy