@Robert: assuming the list is complete yes, but the drupal list is not
sufficient since it does not tell us if letters are capitals or not
(of course we could derive that again from the "translation" being
capital or not but the list was not made with that intent).
I found a more official list here http://www.unicode.org/Public/5.1.0/ucd/UCD.html which includes
information about letters being capitals or not. Will try to compile
that into a big regular expression. Not sure about the performance
though if we need to check each word with such a self-made regex.
@Robert: assuming the list is complete yes, but the drupal list is not
sufficient since it does not tell us if letters are capitals or not
(of course we could derive that again from the "translation" being
capital or not but the list was not made with that intent).
I found a more official list here www.unicode. org/Public/ 5.1.0/ucd/ UCD.html which includes
http://
information about letters being capitals or not. Will try to compile
that into a big regular expression. Not sure about the performance
though if we need to check each word with such a self-made regex.