character encodings in Perl

The 'Encode' module provides the interface between Perl strings and the
rest of the system. Perl strings are sequences of _characters_.

The repertoire of characters that Perl can represent is a superset of those
defined by the Unicode Consortium. On most platforms the ordinal values of
a character as returned by 'ord(_S_)' is the _Unicode codepoint_ for that
character. The exceptions are platforms where the legacy encoding is some
variant of EBCDIC rather than a superset of ASCII; see the perlebcdic
manpage.

During recent history, data is moved around a computer in 8-bit chunks,
often called "bytes" but also known as "octets" in standards documents.
Perl is widely used to manipulate data of many types: not only strings of
characters representing human or computer languages, but also "binary"
data, being the machine's representation of numbers, pixels in an image, or
just about anything.

When Perl is processing "binary data", the programmer wants Perl to process
"sequences of bytes". This is not a problem for Perl: because a byte has
256 possible values, it easily fits in Perl's much larger "logical
character".

Download package
Checkout Package
osc -A https://api.opensuse.org checkout home:illuusio:perl/perl-Encode && cd $_
Create Badge

Build Results
RPM Lint

Refresh

Source Files

Filename	Size	Changed
Encode-2.87.tar.gz	0002027712 1.93 MB	almost 9 years ago
_link	0000000128 128 Bytes	almost 9 years ago
cpanspec.yml	0000000669 669 Bytes	over 9 years ago
perl-Encode-rpmlintrc	0000000186 186 Bytes	about 14 years ago
perl-Encode.changes	0000003727 3.64 KB	almost 9 years ago
perl-Encode.spec	0000002955 2.89 KB	almost 9 years ago

Comments 0

No comments available

Places

Actions on this page