|
Re: HTML::Parser modifies unicode characters: msg#00019lang.perl.modules.lwp
Moshe Kaminsky wrote: It appears that HTML::Parser modifies some unicode characters while parsing. The following program gives an example: It looks like HTML::Parser is losing the UTF-8 flag. XS modules have a nasty tendency to do this. :( Thankfully the workaround is fairly simple. Add "use Encode" to the top of the script, and change the callback slightly: sub { print TEST decode_utf8(shift) } seems to work ok here. -Dom |
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: HTML::Parser modifies unicode characters: 00019, Moshe Kaminsky |
|---|---|
| Next by Date: | Re: How to simulate https secured login using lwp: 00019, John J Lee |
| Previous by Thread: | HTML::Parser modifies unicode charactersi: 00019, Moshe Kaminsky |
| Next by Thread: | Re: HTML::Parser modifies unicode characters: 00019, Moshe Kaminsky |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |