PPI::Token::BOM(3) User Contributed Perl Documentation PPI::Token::BOM(3)
NAME
PPI::Token::BOM - Tokens representing Unicode byte order marks
INHERITANCE
PPI::Token::BOM
isa PPI::Token
isa PPI::Element
DESCRIPTION
This is a special token in that it can only occur at the beginning of a document. If a byte order mark occurs elsewhere in a file, it should
be treated as PPI::Token::Whitespace. We recognize the byte order marks identified at this URL:
<http://www.unicode.org/faq/utf_bom.html#BOM>
    UTF-32, big-endian     00 00 FE FF
    UTF-32, little-endian  FF FE 00 00
    UTF-16, big-endian     FE FF
    UTF-16, little-endian  FF FE
    UTF-8                  EF BB BF
Note that as of this writing, PPI only has support for UTF-8 (namely, in POD and strings) and no support for UTF-16 or UTF-32. We support
the BOMs of the latter two for completeness only.
The BOM is considered non-significant, like white space.
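The table above can be read as a prefix match against the first bytes of a file. The following is a minimal sketch of that idea in plain Perl; it is not PPI's own implementation, and detect_bom is a hypothetical helper name introduced only for illustration. It assumes the input is a string of raw, undecoded bytes.

```perl
use strict;
use warnings;

# Byte order marks from the table above. Longer marks come first so that
# the UTF-32 little-endian mark (FF FE 00 00) is not mistaken for the
# UTF-16 little-endian mark (FF FE), which is a prefix of it.
my @BOMS = (
    [ "\x00\x00\xFE\xFF", 'UTF-32, big-endian'    ],
    [ "\xFF\xFE\x00\x00", 'UTF-32, little-endian' ],
    [ "\xEF\xBB\xBF",     'UTF-8'                 ],
    [ "\xFE\xFF",         'UTF-16, big-endian'    ],
    [ "\xFF\xFE",         'UTF-16, little-endian' ],
);

# Hypothetical helper (not a PPI method): takes a string of raw bytes and
# returns the name of the leading byte order mark, or undef if there is none.
sub detect_bom {
    my ($bytes) = @_;
    for my $entry (@BOMS) {
        my ($mark, $name) = @$entry;
        return $name if substr($bytes, 0, length $mark) eq $mark;
    }
    return;
}

# A source string that starts with the UTF-8 mark.
my $label = detect_bom("\xEF\xBB\xBFprint 1;\n");
print defined $label ? $label : 'no BOM', "\n";
```

Ordering the list longest-first is a design choice of this sketch: a UTF-16 little-endian file whose first character is U+0000 would still be reported as UTF-32 little-endian, an ambiguity inherent in the marks themselves.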
METHODS
There are no additional methods beyond those provided by the parent PPI::Token and PPI::Element classes.
SUPPORT
See the support section in the main module.
AUTHOR
Chris Dolan <cdolan@cpan.org>
COPYRIGHT
Copyright 2001 - 2011 Adam Kennedy.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
The full text of the license can be found in the LICENSE file included with this module.
perl v5.16.2 2011-02-25 PPI::Token::BOM(3)
PPI::Token::Operator(3) User Contributed Perl Documentation PPI::Token::Operator(3)
NAME
PPI::Token::Operator - Token class for operators
INHERITANCE
PPI::Token::Operator
isa PPI::Token
isa PPI::Element
SYNOPSIS
# This is the list of valid operators
++ -- ** ! ~ + -
=~ !~ * / % x
<< >> lt gt le ge cmp ~~
== != <=> . .. ... ,
& | ^ && || //
? : = += -= *= .= //=
< > <= >= <> => ->
and or dor not eq ne
DESCRIPTION
All operators in PPI are created as "PPI::Token::Operator" objects, including the ones that may superficially look like a PPI::Token::Word
object.
METHODS
There are no additional methods beyond those provided by the parent PPI::Token and PPI::Element classes.
Got any ideas for methods? Submit a report to rt.cpan.org!
SUPPORT
See the support section in the main module.
AUTHOR
Adam Kennedy <adamk@cpan.org>
COPYRIGHT
Copyright 2001 - 2011 Adam Kennedy.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
The full text of the license can be found in the LICENSE file included with this module.
perl v5.18.2 2011-02-25 PPI::Token::Operator(3)