strip accent characters in solr index
Bug #540866 reported by Anand Chitipothu
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Open Library | New | High | Edward Betts |
Bug Description
Most international records from the Library of Congress have titles and author names in a romanized form with accented characters. People cannot find these records through search unless we also add the accent-less versions of the names and titles to the solr index.
For example: http://
This author's name, when written in English, becomes "Sri Sri". I knew that an entry for this author existed in OL, and it still took me more than an hour to find the record.
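A minimal sketch of what producing the accent-less version could look like, assuming the values are prepared in Python before being sent to solr. The function names `strip_accents` and `index_variants` are hypothetical and not part of the Open Library code, and the sample string is only an illustrative romanization. The approach is Unicode decomposition followed by removal of combining marks, which is one possible way to do this and not necessarily how it would be done in the indexer.

```python
import unicodedata


def strip_accents(text):
    """Return text with combining diacritical marks removed."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks (Unicode category "Mn") and keep everything else.
    stripped = "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
    # Recompose whatever is left so the result is in a single canonical form.
    return unicodedata.normalize("NFC", stripped)


def index_variants(name):
    """Hypothetical helper: the values to index for one name or title."""
    plain = strip_accents(name)
    # Index the accent-less form only when it differs from the original.
    return [name] if plain == name else [name, plain]


print(strip_accents("Śrī Śrī"))   # → Sri Sri
print(index_variants("Śrī Śrī"))  # → ['Śrī Śrī', 'Sri Sri']
```

Indexing both forms keeps exact matches on the romanized spelling while letting a plain query such as "Sri Sri" match as well. Letters that have no decomposition (for example "ø" or "ß") pass through unchanged with this approach, so they would need separate handling.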
Changed in openlibrary:
milestone: none → upstream
assignee: nobody → Edward Betts (edwardbetts)
importance: Undecided → High
summary:
- strip ascent characters in solr index
+ strip accent characters in solr index
description: updated
http://openlibrary.org/authors/OL617A has "Sri Sri" as an alternative name, so it matches. We need another example.