unicode unsupported
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
svn2bzr |
Confirmed
|
Medium
|
Unassigned |
Bug Description
If you create a SVN repository with files whose names are non-ASCII, svn2bzr fails to convert the repositories:
: tfheen@golem /tmp/trunk > ./svn2bzr.py ../d/dump blah
Revision 0 read
/usr/lib/
if head == base:
Traceback (most recent call last):
File "./svn2bzr.py", line 1073, in <module>
main()
File "./svn2bzr.py", line 1066, in main
opts.prefix, opts.filter)
File "./svn2bzr.py", line 999, in svn2bzr
creator.run()
File "./svn2bzr.py", line 471, in run
self.
File "./svn2bzr.py", line 214, in add_file
abspath = brt.tree.
File "/usr/lib/
return pathjoin(
File "/usr/lib/
path += '/' + b
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 1: ordinal not in range(128)
: tfheen@golem /tmp/trunk >
I'll attach the repository dump, it was generated by the following sequence:
: tfheen@golem /tmp/d > svnadmin create dir
: tfheen@golem /tmp/d > mkdir tmp
: tfheen@golem /tmp/d > cd tmp
: tfheen@golem /tmp/d/tmp > svn co file:///tmp/d/dir
Sjekket ut revisjon 0.
: tfheen@golem /tmp/d/tmp > cd dir
: tfheen@golem /tmp/d/tmp/dir > touch æøå
: tfheen@golem /tmp/d/tmp/dir > svn add æøå
A æøå
: tfheen@golem /tmp/d/tmp/dir > svn ci æøå -m'Blah'
Related branches
summary: |
- does not handle non-ascii file names + unicode unsupported |
I wrote a small patch. It worked with your dump file and my own branches which have French characters in the filenames.
But I currently don't know enough about the internals of bzrlib to say if it has some side effects.