Re: HTML::Entities and WinLatin1 NCRs [PATCH]: msg#00013

Subject: Re: HTML::Entities and WinLatin1 NCRs [PATCH]
David Wheeler <david@xxxxxxxxxxxxxx> writes:

> Oh, I remember now. You can use Encode to convert from CP1252 to
> UTF-8. At least I found that, in my tests, it worked properly:
> 
> 
>    use Encode;
>    $utf8_text = decode('cp1252', $cp1252_text, 1);
> 
> I was originally going to add support for converting from the CP1252
> gremlins to UTF-8, but when I found that Encode already did it
> properly, I eliminated it.

Encode::decode() doesn't allow you to pass in a Unicode string with
the gremlins in it.  This is what we had here.  What we want is to
convert from a string to another string, while what Encode provides is
conversion between bytes and strings.

The cp1252_fixup($text) function happens to be the same as
decode('cp1252', $text) when $text is Latin1, but not when $text is
Unicode.
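
To make the string-to-string idea concrete, here is a rough sketch. It is
not the cp1252_fixup() code from the patch; fixup_c1_gremlins() and
%C1_TO_UNICODE are names made up for this illustration. It assumes the
gremlins are exactly the characters U+0080..U+009F and remaps each one to
whatever Encode's cp1252 table gives for the same byte value, leaving
every other character (including those above U+00FF) alone.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical lookup table: for each code point 0x80..0x9F, ask Encode
# what the byte with that value means in CP1252. A result of U+FFFD means
# the byte had no mapping, so we skip it and leave that character alone.
my %C1_TO_UNICODE = map {
    my $char = decode('cp1252', pack('C', $_));
    $char eq "\x{FFFD}" ? () : (chr($_) => $char);
} 0x80 .. 0x9F;

# Hypothetical helper (not the real cp1252_fixup): takes a character
# string and returns a character string, so wide characters are fine.
sub fixup_c1_gremlins {
    my ($text) = @_;
    $text =~ s{([\x{80}-\x{9F}])}{ $C1_TO_UNICODE{$1} // $1 }ge;
    return $text;
}

# A string that mixes a real Unicode character with CP1252 gremlins.
my $fixed = fixup_c1_gremlins("\x{263A} \x{93}hi\x{94}");
printf "U+%04X ", ord($_) for split //, $fixed;
print "\n";
# prints: U+263A U+0020 U+201C U+0068 U+0069 U+201D
```

The point of going through a table is that decode() only ever sees
single bytes, so it never has to cope with a string containing
characters above 255.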

--Gisle


