Detect lines beginning with double-byte characters (Japanese) and delete Post: 302371593

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

delete lines from file2 beginning w/file1

I've been searching around here and other places, but can't put this together... I've got a unique list of words in file 1 (one word on each line). I need to delete each line in file2 that begins with the word in file1. I started this way, but want to know how to use file1 words instead...

2. Shell Programming and Scripting

delete zero byte file

Hello I have a requirement where i need to find the zero byte size file in the directory and need to delete that zero byte file. Thanks

3. Shell Programming and Scripting

Email a File from UNIX which has Japanese characters in it

Hi, I'm trying to email from UNIX, a file which has Japanese characters in it (i,e. in the contents -- not the filename). The file gets emailed, but the Japanese characters do not show up properly when I open the file on Windows in my Outlook mailbox. I searched a lot of forums but still...

4. Shell Programming and Scripting

How to delete all lines with less then 32 characters from a textfile?

I need to delete all lines with less then 32 characters from a textfile. :)

5. Shell Programming and Scripting

Removing one or more blank characters from beginning of a line

Hi, I was trying to remove the blank from beginning of a line. when I try: sed 's/^ +//' filename it does not work but when I try sed 's/^ *//' filename it works But I think the first command should have also replaced any line with one or more blanks. Kindly help me in understanding...

6. Red Hat

How to display Chinese and Japanese Characters on Rhel 6?

Hello, I'm trying to figure out how to display Chinese and Japanese Characters on my RHEL 6 Console. There is no more "bogl-bterm" for RHEL6, that is not supported anymore. Is there any way that I could display them? Thank you.

7. SuSE

Display Chinese and Japanese characters on my SLES console.

Hello, I'm trying to figure out how to display Chinese and Japanese Characters on my SLES 11 Console. Is there any way that I could display those characters on my console? Thank you.

8. UNIX for Beginners Questions & Answers

Removing characters from beginning of multiple files

Hi, I have been searching how to do this but I can't seem to find how to do it. Hopefully someone can help. I have multiplr files, 100's example 12345-zxys.213423.zyz.txt. I want to be able to take all these files and remove the first '12345-' from each of the files. '12345-' these characters...

9. UNIX for Beginners Questions & Answers

Inserting n characters to beginning of line if match

I would like to insert n number of characters at the beginning of each line that starts with a given character. If possible, I would be most appreciative for a sed or awk solution. Given the data below, I would like to be able to insert either 125 spaces or 125 "-" at the beginning of every line...

LEARN ABOUT REDHAT

encoding

encoding(n)						       Tcl Built-In Commands						       encoding(n)

__________________________________________________________________________________________________________________________________________________

NAME

       encoding - Manipulate encodings

SYNOPSIS

       encoding option ?arg arg ...?
_________________________________________________________________

INTRODUCTION

       Strings	in Tcl are encoded using 16-bit Unicode characters.  Different operating system interfaces or applications may generate strings in
       other encodings such as Shift-JIS.  The encoding command helps to bridge the gap between Unicode and these other formats.

DESCRIPTION

       Performs one of several encoding related operations, depending on option.  The legal options are:

       encoding convertfrom ?encoding? data
	      Convert data to Unicode from the specified encoding.  The characters in data are treated as binary data where the  lower	8-bits	of
	      each  character  is  taken  as a single byte.  The resulting sequence of bytes is treated as a string in the specified encoding.	If
	      encoding is not specified, the current system encoding is used.

       encoding convertto ?encoding? string
	      Convert string from Unicode to the specified encoding.  The result is a sequence of bytes  that  represents  the	converted  string.
	      Each byte is stored in the lower 8-bits of a Unicode character.  If encoding is not specified, the current system encoding is used.

       encoding names
	      Returns a list containing the names of all of the encodings that are currently available.

       encoding system ?encoding?
	      Set the system encoding to encoding. If encoding is omitted then the command returns the current system encoding.  The system encod-
	      ing is used whenever Tcl passes strings to system calls.

EXAMPLE

       It is common practice to write script files using a text editor that produces output in the euc-jp encoding,  which  represents	the  ASCII
       characters  as  singe bytes and Japanese characters as two bytes.  This makes it easy to embed literal strings that correspond to non-ASCII
       characters by simply typing the strings in place in the script.	However, because the source command always reads files using the ISO8859-1
       encoding, Tcl will treat each byte in the file as a separate character that maps to the 00 page in Unicode.  The resulting Tcl strings will
       not contain the expected Japanese characters.  Instead, they will contain a sequence of Latin-1 characters that correspond to the bytes	of
       the original string.  The encoding command can be used to convert this string to the expected Japanese Unicode characters.  For example,
		set s [encoding convertfrom euc-jp "xA4xCF"]
       would return the Unicode string "u306F", which is the Hiragana letter HA.

SEE ALSO

       Tcl_GetEncoding(3)

KEYWORDS

       encoding

Tcl									8.1							       encoding(n)