nrss can't parse non-UTF-8 encoded feed that contains non-ASCII characters
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nrss (Debian) |
Fix Released
|
Unknown
|
|||
nrss (Ubuntu) |
Triaged
|
Low
|
Unassigned |
Bug Description
Binary package hint: nrss
nrss 0.3.9-1 gets an error parsing a feed encoded in ISO 8859-1 that contains international symbols. Sometimes only the first item get displayed if it contains no accented characters.
I've tried this feed:
http://
But Atom feed from the same site works flawlessly (it is encoded in UTF-8)
http://
The problem seems to be that XML_ParserCreate is called in parse.c with encoding set to "UTF-8". When called without explicitly set encoding, Expat honors the document encoding declaration.
I've tested it with XML_ParserCreat
ProblemType: Bug
Architecture: i386
DistroRelease: Ubuntu 9.04
Package: nrss 0.3.9-1
ProcEnviron:
PATH=(custom, user)
LANG=en_US.UTF-8
SHELL=/bin/bash
SourcePackage: nrss
Uname: Linux 2.6.28-4-generic i586
Changed in nrss: | |
status: | New → In Progress |
Changed in nrss: | |
status: | Unknown → New |
tags: | added: patch-forwarded-debian |
Changed in nrss (Debian): | |
status: | New → Fix Released |
This debdiff should close the bug for jaunty. Please sponsor the upload.