diff is confused by Japanese filenames
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
diffutils (Ubuntu) | Confirmed | Undecided | Unassigned |
Bug Description
Binary package hint: diff
Using diff 2.8.1-11ubuntu
Here's some output from a diff -r
root@vivo:
File ./log/BackupPC.sock is a socket while file /media/
Files ./pc/lap.
Files ./pc/lap.
Files ./pc/lap.
Files ./pc/lap.
In 4 lines it says the files differ. They do, because they're different files. For some reason diff -r has decided to compare
"f産休関係/f産休引き継ぎ セント・ラファエラ LC2.doc" with
"f産休関係/f産休引き継ぎ ホーリーファミリー LC2.doc"
I have been able to reproduce this on a second run of diff.
The whole directory tree that I'm diffing is 20G, so it takes a while. I'm also going on vacation tomorrow, so I don't have time to try to produce a minimal test case.
By the way, comparing the files individually shows there are no diffs.
diff -qr on just the directory with those files gives the same problem.
Creating dummy files with just those names in another directory does not reproduce the problem. Nor does recreating the whole directory with dummy files.
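For anyone who wants to repeat the dummy-file attempt, here is a minimal sketch of it in Python. It is not the exact set of commands used for this report. The helper names make_tree and run_diff are made up for illustration, and the only file names used are the two quoted above. Each dummy file contains its own name, so if diff -qr pairs a file with the wrong partner it should report that they differ.

```python
import os
import shutil
import subprocess
import tempfile

# The two file names quoted in this report.
NAMES = [
    "f産休引き継ぎ セント・ラファエラ LC2.doc",
    "f産休引き継ぎ ホーリーファミリー LC2.doc",
]


def make_tree(root, subdir="f産休関係"):
    """Create root/subdir holding one dummy file per name (hypothetical helper)."""
    directory = os.path.join(root, subdir)
    os.makedirs(directory)
    for name in NAMES:
        # Each file holds its own name, so a mispaired comparison shows up as "differ".
        with open(os.path.join(directory, name), "w", encoding="utf-8") as fh:
            fh.write(name + "\n")
    return directory


def run_diff(left, right):
    """Run diff -qr on two trees and return its stdout, or None if diff is not installed."""
    if shutil.which("diff") is None:
        return None
    proc = subprocess.run(
        ["diff", "-qr", left, right],
        capture_output=True,
        encoding="utf-8",
        errors="replace",
    )
    return proc.stdout


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        left = os.path.join(tmp, "left")
        right = os.path.join(tmp, "right")
        make_tree(left)
        make_tree(right)
        output = run_diff(left, right)
        if output is None:
            print("diff not found on PATH")
        else:
            print(output if output else "(diff reported no differences)")
```

Since the dummy files did not trigger the problem here, an empty result from this script does not rule the bug out. Running it under the same locale settings as the failing diff -r run may make the comparison more meaningful.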
I can't do anything for about a week, but if you have any tests you'd like me to run, let me know.