Xapian should have all the Unicode support you need for this built in, so you shouldn't need to add a dependency on libunicode, icu, or glib.
Does SC use Xapian::QueryParser and u-p-a use Xapian::TermGenerator? If not, that could be fun...
Also, Xapian is taking part in GSoC this year, and "CJK support" is one of the potential projects. We've had promising interest in it, though it's too soon to know if that'll happen, and it wouldn't be done until August anyway. It might also be just Chinese support or just Japanese (or possibly students working on each separately). So a patch with a more generic approach may still be useful (probably would be for Korean at least).
Xapian should have all the Unicode support you need for this built in, so you shouldn't need to add a dependency on libunicode, icu, or glib.
Does SC use Xapian::QueryParser and u-p-a use Xapian: :TermGenerator? If not, that could be fun...
Also, Xapian is taking part in GSoC this year, and "CJK support" is one of the potential projects. We've had promising interest in it, though it's too soon to know if that'll happen, and it wouldn't be done until August anyway. It might also be just Chinese support or just Japanese (or possibly students working on each separately). So a patch with a more generic approach may still be useful (probably would be for Korean at least).