Use re.UNICODE
Bug #545452 reported by Max Rabkin
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ibid | Triaged | Low | Unassigned |
Bug Description
The regex flag re.U (re.UNICODE) makes \b, \w, \s, etc. match all appropriate Unicode characters, not just the ASCII ones. Combined with re.I, it also gives correct (or at least better) case-insensitive matching for non-ASCII characters. We should probably use it everywhere.
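A minimal sketch of the difference, not taken from the Ibid code base. It assumes Python 3, where str patterns are already Unicode-aware by default, so re.ASCII is used here to stand in for the ASCII-only behaviour you get without re.U on Python 2. The sample text and variable names are made up for illustration.

```python
import re

# "naïve café", written with escapes so the source file stays pure ASCII.
text = "na\u00efve caf\u00e9"

# ASCII-only matching: \w stops at every accented letter.
ascii_words = re.findall(r"\w+", text, re.ASCII)

# Unicode-aware matching: \w accepts any Unicode letter or digit.
unicode_words = re.findall(r"\w+", text, re.UNICODE)

# Case-insensitive match of U+00C9 (E acute, upper) against U+00E9 (lower).
ascii_ci = re.search("\u00c9", "\u00e9", re.IGNORECASE | re.ASCII)
unicode_ci = re.search("\u00c9", "\u00e9", re.IGNORECASE | re.UNICODE)

print(ascii_words)    # words split at the accented letters
print(unicode_words)  # both words intact
print(ascii_ci is not None, unicode_ci is not None)
```

With the ASCII flag the words are broken apart at the accented letters and the case-insensitive search fails; with re.UNICODE both words survive intact and the search succeeds.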