Dual-license your content for inclusion in The Perl 5 Wiki using this HOWTO, or join us for a chat on irc.freenode.net#PerlNet.

Talk:Unicode

From PerlNet

Jump to: navigation, search

Using Unicode in PerlNet

PerlNet joyfully supports unicode. Simply use &#dddd; to insert a unicode character, the same way you would in HTML. For example, こにちわ produces こにちわ (Konichiwa). Note that you must use the decimal representation of the character.

When in doubt about how to do something in PerlNet, try HTML. In the great majority of cases it will work.

All the best,

--PJF (talk) 15:54, 24 February 2006 (EST)

Using combiners?

Paul, what about combiners - how do you wiki-ify an 'A' with "combining umlaut" so that you end up with this => Ä

I had to use the code point for "diareses", because code point 776 ("combining diaresis" wasn't displaying properly - probably because I wasn't combining it correctly. Looks like this => ̈ )

I know there is already a A-umlaut at code point 0xC4 => Ä but what about using the combiner properties to link an A at codepoint 65 => "latin capital letter A" and the character at codepoint 776 => "combining diaresis" ?

Oh and should we mention that sometimes there are two or more code points that have the same glyph ? Like "A with ring" (Å) and Angstrom (Å) - visually the same ( or should be - my firefox seems to display one in Arial and one in Courier) , but with different code points and different properties ?

-- Leriksen 10:41, 25 February 2006 (EST)

Combiners work, but they're more likely to 'look ugly', as many displays will map them by overlaying two characters. As such, ä (simply written as 'ä') looks fine in my browser, but Ä (capital 'A' with the same combiner) looks awful; the umlaut appears too low and intersects with one of the strokes of the 'A'.
So you can quite happily use combiners by writing their encoding after the glyph you wish to combine; I believe the same holds true for vanilla HTML. However using dedicated code-points appears to often have a better visual effect.
--PJF (talk) 14:00, 26 February 2006 (EST)

Remove mention of locale mixing with Unicode and Regex ?

I'm not sure how accurate the statement "Properties of a locale and Unicode mix in interesting ways..." is - I dont thinks the section is wrong, but I feel that _perhaps_ locale has nothing to do with it - guess I'm looking for someone to expand the article in the right direction

Leriksen 10:06, 25 February 2006 (EST)

Personal tools