Changelog notes do not preserve non-UTF8 characters from original changelog entries
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
git-ubuntu |
Triaged
|
Low
|
Unassigned |
Bug Description
Some old changelog entries contain non-UTF8 characters, such as:
autoconf2.13 2.13-55
autogen 1:5.8.3-2
dbconfig-common 1.8.17
dpatch 2.0.10
dput 0.9.2.16ubuntu1
emacs-
evolution 2.10.1-0ubuntu2
gmp 2:4.2.2+
gnome-session 2.17.92-0ubuntu2
gnupg 1.4.3-2ubuntu1
iptables 1.3.5.0debian1-
jadetex 3.13-2.1ubuntu2
llvm-
git-ubuntu will soon start to successfully import such changelog entries into changelog notes, but will do so by losing fidelity. The original characters will be lost. This is unfortunate but is due to pygit2 currently not supporting any way to "pass through" the non-unicode code points. We can't for example use "errors=
This bug tracks the loss in fidelity in git-ubuntu changelog notes.
Related branches
- Bryce Harrington: Approve
- Server Team CI bot: Approve (continuous-integration)
-
Diff: 109 lines (+78/-1)2 files modifiedgitubuntu/importer.py (+29/-1)
gitubuntu/importer_test.py (+49/-0)