UTF-8 error blocking import of Mercurial (hg) repository
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Bazaar Fast Import |
Fix Released
|
Medium
|
Jelmer Vernooij |
Bug Description
I have a Mercurial repository that exported successfully to the fast-import format. However, when I try to import, it fails with a 'utf8' error:
...
16:02:09 800/2100 commits processed at 199/minute (800)
16:03:52 900/2100 commits processed at 157/minute (900)
ABORT: exception occurred processing commit :901
bzr: ERROR: exceptions.
Traceback (most recent call last):
File "/usr/lib/
return the_callable(*args, **kwargs)
File "/usr/lib/
ret = run(*run_argv)
File "/usr/lib/
return self.run(
File "/usr/lib/
return self._operation
File "/usr/lib/
self.cleanups, self.func, *args, **kwargs)
File "/usr/lib/
result = func(*args, **kwargs)
File "/usr/lib/
user_
File "/usr/lib/
return proc.process(
File "/usr/lib/
super(
File "/usr/lib/
handler(self, cmd)
File "/usr/lib/
handler.
File "/usr/lib/
handler(self, fc)
File "/usr/lib/
self.
File "/usr/lib/
return codecs.
UnicodeDecodeError: 'utf8' codec can't decode byte 0xb9 in position 14: unexpected code byte
I'm running this on Ubuntu. bzr version "2.4.0-
Maybe this problem is because older Mercurial commits didn't always contain valid UTF-8 data??
Note: I was able to work around this issue by changing
"path.decode(
on lines 890 and 895 of /usr/lib/
Related branches
Changed in bzr-fastimport: | |
status: | Triaged → Fix Committed |
assignee: | nobody → Jelmer Vernooij (jelmer) |
milestone: | none → 0.12.0 |
Changed in bzr-fastimport: | |
status: | Fix Committed → Fix Released |
we should print a saner error (without a traceback) and allow the user to specify some way to ignore utf8-invalid data.