Sponsored Content
Operating Systems Linux How to create new empty utf8 file for appending? Post 302907715 by umen on Tuesday 1st of July 2014 08:33:52 AM
Old 07-01-2014
well i did touch test.txt
and when i do file -bi output_gt/test.txt
im getting :
Code:
application/x-empty; charset=binary

and then i trying to write data in utf8 encoding to it with
$data >> test.txt
and all in gibberish
when i do on the file file -bi output_gt/test.txt:
im getting :
Code:
text/plain; charset=unknown-8bit

what can i do ?
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

simultaneously create three empty files?

I can't get touch to simultaneously create three empty files file1, file2, file3. I tried:$ touch filebut all I got was one file:$ fileWhat did I do wrong? (4 Replies)
Discussion started by: na5m
4 Replies

2. UNIX for Dummies Questions & Answers

Getting same exit status for empty and non empty file

Hi All, I am checking for a empty input file to do some further action , but I am getting exit status 0 in both the cases , for empty and non empty file both. The value of $? is coming 0 in if part also and else part too. #!/bin/ksh if ]; then echo "data" # exit 0 echo "$?" else... (4 Replies)
Discussion started by: mavesum
4 Replies

3. Shell Programming and Scripting

Converting Unicode file to UTF8 format

Hi, I have a file in my desktop which is a unicode format. After this file is transferred to Unix using FTP, we are seeing some special character (like rectangle box type) at the first line. The same file is saved as UTF8 (using textpad tool, selecting encode to UTF-8 option) on my desktopand... (7 Replies)
Discussion started by: vfrg
7 Replies

4. Shell Programming and Scripting

How to replace comma by slash using sed in an UTF8 file

Hello all, I'd like to replace "," by "/" in a utf8 file from postion X to Y. Comma "," is also defined as delimiter. 12345678901234567890,123456789012345,12345678901234567890, aaaa,aaaa,aaaaa ,bbb,bbbb,bbbbb ,cccccc,cc , Result should be... (1 Reply)
Discussion started by: fmofmo
1 Replies

5. Shell Programming and Scripting

cowardly refusing to create an empty archive

Hi, I am trying to write an Archive script that should look for files older than x days, zip them and move the zip to the archive directory and delete the files that have been zipped. I am not sure how i can handle this error: "cowardly refusing to create an empty archieve". Just wanted to know... (9 Replies)
Discussion started by: stunnerz_84
9 Replies

6. Shell Programming and Scripting

Convert to UTF8 File - Unix

All, I have several *.dat files which is created in windows (ANSI Endoing) Or PC File format, once I copy those files to unix. How can I convert those file to utf8 encoding ? I tired iconv, it says not supported Please help Thanks - S (5 Replies)
Discussion started by: Shanks
5 Replies

7. UNIX Desktop Questions & Answers

how to create empty wav file

Dear All, Kindly explain me a command in unix to create a empty wav file with example. Thanks in Advance! (1 Reply)
Discussion started by: thillai_selvan
1 Replies

8. Shell Programming and Scripting

Single command to create multiple empty files(no trailing lines as well).

Hi, i need a single command to create multiple empty files(no trailing lines as well) and empty the files if already existing. please let me know or if this has been ansered, if some ocan share the link please, thanks > newfile.txt or :> newfile.txt do not work (4 Replies)
Discussion started by: Onkar Banerjee
4 Replies

9. Shell Programming and Scripting

Create empty files from a list on file

Hello Guys. Please I would like to create empty files from a list In file1 will be the followin values, so i will like to create for each name a empty file. file1 2191off-r0.sps 2192off-r0.sps 2193off-r0.sps 2194off-r0.sps 2195off-r0.sps So I need to get 5 empty files. Thanks for... (7 Replies)
Discussion started by: jiam912
7 Replies

10. Shell Programming and Scripting

Need UNIX command to create a empty dataset

Hi all, Is Der any unix command which will create a empty dataset ..?(in .ds ).I know we can use orchadmin dump command to view dataset data.. but how to create a dataset..? like sampledataset.ds thanks in advance hemanthsaikumar (1 Reply)
Discussion started by: hemanthsaikumar
1 Replies
Test::utf8(3pm) 					User Contributed Perl Documentation					   Test::utf8(3pm)

NAME
Test::utf8 - handy utf8 tests SYNOPSIS
# check the string is good is_valid_string($string); # check the string is valid is_sane_utf8($string); # check not double encoded # check the string has certain attributes is_flagged_utf8($string1); # has utf8 flag set is_within_ascii($string2); # only has ascii chars in it isnt_within_ascii($string3); # has chars outside the ascii range is_within_latin_1($string4); # only has latin-1 chars in it isnt_within_ascii($string5); # has chars outside the latin-1 range DESCRIPTION
This module is a collection of tests useful for dealing with utf8 strings in Perl. This module has two types of tests: The validity tests check if a string is valid and not corrupt, whereas the characteristics tests will check that string has a given set of characteristics. Validity Tests is_valid_string($string, $testname) Checks if the string is "valid", i.e. this passes and returns true unless the internal utf8 flag hasn't been set on scalar that isn't made up of a valid utf-8 byte sequence. This should never happen and, in theory, this test should always pass. Unless you (or a module you use) goes monkeying around inside a scalar using Encode's private functions or XS code you shouldn't ever end up in a situation where you've got a corrupt scalar. But if you do, and you do, then this function should help you detect the problem. To be clear, here's an example of the error case this can detect: my $mark = "Mark"; my $leon = "Lx{e9}on"; is_valid_string($mark); # passes, not utf-8 is_valid_string($leon); # passes, not utf-8 my $iloveny = "I x{2665} NY"; is_valid_string($iloveny); # passes, proper utf-8 my $acme = "Lx{c3}x{a9}on"; Encode::_utf8_on($acme); # (please don't do things like this) is_valid_string($acme); # passes, proper utf-8 byte sequence upgraded Encode::_utf8_on($leon); # (this is why you don't do things like this) is_valid_string($leon); # fails! the byte x{e9} isn't valid utf-8 is_sane_utf8($string, $name) This test fails if the string contains something that looks like it might be dodgy utf8, i.e. containing something that looks like the multi-byte sequence for a latin-1 character but perl hasn't been instructed to treat as such. Strings that are not utf8 always automatically pass. Some examples may help: # This will pass as it's a normal latin-1 string is_sane_utf8("Hello Lx{e9}eon"); # this will fail because the x{c3}x{a9} looks like the # utf8 byte sequence for e-acute my $string = "Hello Lx{c3}x{a9}on"; is_sane_utf8($string); # this will pass because the utf8 is correctly interpreted as utf8 Encode::_utf8_on($string) is_sane_utf8($string); Obviously this isn't a hundred percent reliable. The edge case where this will fail is where you have "x{c2}" (which is "LATIN CAPITAL LETTER WITH CIRCUMFLEX") or "x{c3}" (which is "LATIN CAPITAL LETTER WITH TILDE") followed by one of the latin-1 punctuation symbols. # a capital letter A with tilde surrounded by smart quotes # this will fail because it'll see the "x{c2}x{94}" and think # it's actually the utf8 sequence for the end smart quote is_sane_utf8("x{93}x{c2}x{94}"); However, since this hardly comes up this test is reasonably reliable in most cases. Still, care should be applied in cases where dynamic data is placed next to latin-1 punctuation to avoid false negatives. There exists two situations to cause this test to fail; The string contains utf8 byte sequences and the string hasn't been flagged as utf8 (this normally means that you got it from an external source like a C library; When Perl needs to store a string internally as utf8 it does it's own encoding and flagging transparently) or a utf8 flagged string contains byte sequences that when translated to characters themselves look like a utf8 byte sequence. The test diagnostics tells you which is the case. String Characteristic Tests These routines allow you to check the range of characters in a string. Note that these routines are blind to the actual encoding perl internally uses to store the characters, they just check if the string contains only characters that can be represented in the named encoding: is_within_ascii Tests that a string only contains characters that are in the ASCII charecter set. is_within_latin_1 Tests that a string only contains characters that are in latin-1. Simply check if a scalar is or isn't flagged as utf8 by perl's internals: is_flagged_utf8($string, $name) Passes if the string is flagged by perl's internals as utf8, fails if it's not. isnt_flagged_utf8($string,$name) The opposite of "is_flagged_utf8", passes if and only if the string isn't flagged as utf8 by perl's internals. Note: you can refer to this function as "isn't_flagged_utf8" if you really want to. AUTHOR
Written by Mark Fowler mark@twoshortplanks.com COPYRIGHT
Copyright Mark Fowler 2004,2012. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself. BUGS
None known. Please report any to me via the CPAN RT system. See http://rt.cpan.org/ for more details. SEE ALSO
Test::DoubleEncodedEntities for testing for double encoded HTML entities. perl v5.14.2 2012-02-18 Test::utf8(3pm)
All times are GMT -4. The time now is 04:20 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy