DeeTextAnalyzer feature checklist
Bug #885600 reported by
Mikkel Kamstrup Erlandsen
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Unity |
Triaged
|
Undecided
|
Unassigned | ||
dee |
Triaged
|
High
|
Mikkel Kamstrup Erlandsen | ||
dee (Ubuntu) |
Triaged
|
Undecided
|
Unassigned | ||
unity (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
This is a tracker bug to help me remember which features I want in DeeTextAnalyzer:
- Detect numeric sub sequences. Fx "Foo125" -> "foo", "125"
- Split on "CamelCase" -> "camel", "case"
- Detect and create CJK n-grams (and tokenize CJK subsequences when embedded in non-CJK text)
Changed in dee: | |
status: | New → Triaged |
importance: | Undecided → High |
assignee: | nobody → Mikkel Kamstrup Erlandsen (kamstrup) |
milestone: | none → 1.0.0 |
Changed in unity: | |
status: | New → Triaged |
Changed in dee (Ubuntu): | |
status: | New → Triaged |
Changed in dee: | |
milestone: | 1.0.0 → none |
Changed in unity (Ubuntu): | |
status: | New → Confirmed |
To post a comment you must log in.