Many islands, atolls etc are mislabelled in Time Zones

Bug #344310 reported by Matthew Paul Thomas
Affects            Status        Importance  Assigned to  Milestone
CLDR               New           Undecided   Unassigned
icu (Ubuntu)       New           Undecided   Unassigned
ubiquity (Ubuntu)  Fix Released  Medium      Unassigned

Bug Description

Binary package hint: ubiquity

Some islands and island groups, such as Fiji, the Maldives, Mauritius, and Niue, are commonly referred to by a single word. Many others are not, but Ubiquity refers to them as if they were: it omits the essential "Island", "Islands", "Archipelago", etc. Several island names are also missing diacritics or use out-of-date spellings.

In particular:
* "Lindeman" should be either "Lindeman Island" or "Lindeman Islands" <http://en.wikipedia.org/wiki/Lindeman_Island>
* "Chagos" should be "Chagos Archipelago" <http://en.wikipedia.org/wiki/Chagos_Archipelago>
* "Lord Howe" should be "Lord Howe Island" <http://en.wikipedia.org/wiki/Lord_Howe_Island>
* "Christmas" should be "Christmas Island" <http://en.wikipedia.org/wiki/Christmas_Island>
* "Cocos" should be "Cocos Islands" <http://en.wikipedia.org/wiki/Cocos_(Keeling)_Islands>
* "Comoro" should be either "Comoro Islands" <http://en.wikipedia.org/wiki/Comoro_Islands> or "Comoros" <http://en.wikipedia.org/wiki/Comoros>
* "Kerguelen" should be "Kerguelen Islands" <http://en.wikipedia.org/wiki/Kerguelen_Islands>
* "Mahe" should be "Mahé" <http://en.wikipedia.org/wiki/Mah%C3%A9,_Seychelles>
* "Reunion" should be "Réunion" <http://en.wikipedia.org/wiki/R%C3%A9union>
* "Canary" should be "Canary Islands" <http://en.wikipedia.org/wiki/Canary_Islands>
* "Faroe" should be either "Faroe Islands" or "Faroes" <http://en.wikipedia.org/wiki/Faroe_Islands>
* "St Helena" should be "Saint Helena" <http://en.wikipedia.org/wiki/Saint_Helena>
* "Chatham" should be "Chatham Islands" <http://en.wikipedia.org/wiki/Chatham_Islands>
* "Easter" should be "Easter Island" <http://en.wikipedia.org/wiki/Easter_Island>
* "Enderbury" should be "Enderbury Island" <http://en.wikipedia.org/wiki/Enderbury_Island>
* "Galapagos" should be "Galápagos Islands" <http://en.wikipedia.org/wiki/Gal%C3%A1pagos_Islands>
* "Gambier" should be "Gambier Islands" <http://en.wikipedia.org/wiki/Gambier_Islands>
* "Johnston" should be either "Johnston Atoll" or "Johnston Island" <http://en.wikipedia.org/wiki/Johnston_Atoll>
* "Marquesas" should be "Marquesas Islands" <http://en.wikipedia.org/wiki/Marquesas_Islands>
* "Norfolk" should be "Norfolk Island" <http://en.wikipedia.org/wiki/Norfolk_Island>
* "Pitcairn" should be "Pitcairn Island" <http://en.wikipedia.org/wiki/Pitcairn_Islands>
* "Ponape" should be "Pohnpei" <http://en.wikipedia.org/wiki/Pohnpei>
* "Truk" should be "Chuuk" <http://en.wikipedia.org/wiki/Truk>
* "Wake" should be "Wake Island" <http://en.wikipedia.org/wiki/Wake_Island>
* "Wallis" should be either "Wallis Island" or "Wallis and Futuna" <http://en.wikipedia.org/wiki/Wallis_and_Futuna>

There are probably similar problems in the "America" region, but I have not checked that yet.

In some of these cases, the missing term makes the name ambiguous; for example "Norfolk" and "Chatham" are localities in England, but there is only one "Norfolk Island" and one "Chatham Islands".
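One way to picture the proposed corrections is as a small override table. The sketch below is only an illustration, not how Ubiquity or CLDR stores names: the table name `PROPOSED_NAMES` and the function `display_name` are hypothetical, and only entries from the list above that have a single proposed replacement are included.

```python
# Hypothetical override table: bare label -> fuller name proposed in this report.
# Entries offering two alternatives (e.g. "Lindeman", "Johnston") are left out
# because the report does not pick one.
PROPOSED_NAMES = {
    "Chagos": "Chagos Archipelago",
    "Lord Howe": "Lord Howe Island",
    "Christmas": "Christmas Island",
    "Cocos": "Cocos Islands",
    "Kerguelen": "Kerguelen Islands",
    "Mahe": "Mahé",
    "Reunion": "Réunion",
    "Canary": "Canary Islands",
    "St Helena": "Saint Helena",
    "Chatham": "Chatham Islands",
    "Easter": "Easter Island",
    "Galapagos": "Galápagos Islands",
    "Norfolk": "Norfolk Island",
    "Ponape": "Pohnpei",
    "Truk": "Chuuk",
    "Wake": "Wake Island",
}


def display_name(label: str) -> str:
    """Return the proposed fuller name, or the label unchanged if none is listed."""
    return PROPOSED_NAMES.get(label, label)


print(display_name("Norfolk"))  # a label with a proposed replacement
print(display_name("Fiji"))     # a single-word name that needs no change
```

Names that are already acceptable as a single word, such as Fiji, pass through unchanged.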

Colin Watson (cjwatson) wrote :

This should be fixed along with bug 344253.

Changed in ubiquity (Ubuntu):
importance: Undecided → Medium
status: New → Triaged

Michael Terry (mterry) wrote :

Names that have been changed in Karmic in English (seemingly satisfactorily):
Chagos -> British Indian Ocean Territory
Christmas -> Christmas Island
Cocos -> Cocos Islands
Comoro -> Comoros
Kerguelen -> French Southern Territories
Mahe -> Seychelles
Reunion -> France
Faroe -> Faroe Islands
St Helena -> Saint Helena
Norfolk -> Norfolk Island
Wallis -> Wallis and Futuna

These have been changed as a result of pulling timezone translations from libicu.

Names that still seem to be a problem:
Australia (Lindeman)
Australia (Lord Howe)
Spain (Canary)
New Zealand (Chatham)
Chile (Easter)
Kiribati (Enderbury)
Ecuador (Galapagos)
French Polynesia (Gambier)
United States Minor Outlying Islands (Johnston)
French Polynesia (Marquesas)
Pitcairn
Micronesia (Ponape)
Micronesia (Truk)
United States Minor Outlying Islands (Wake)

You'll notice that all the fixed names are top-level names, whereas most of the still-problematic names are parenthetical. The extra context may make some of the previous 'problem names' suitable, but I left them there.

At this point, the bug is really in the translation provided by libicu, so I'm opening a task against it, as soon as I can figure out which package they come from (which doesn't on first look appear to be icu).

Michael Terry (mterry) wrote :

Marking the ubiquity side of this Fix Released. The remaining work is in translations for ICU. In my last comment, I indicated the translations weren't in ICU, but that's not true as it turns out.

The translations are just not in gettext and only conveniently live in ICU's source control. The tarball that ICU ships has the locale data all stuck in source/data/in/icudt40l.dat.

Changed in ubiquity (Ubuntu):
status: Triaged → Fix Released

Michael Terry (mterry) wrote :

The ICU translation problems have been filed as bug 421917.

Steven R. Loomis (srl-icu-project) wrote :

Actually, they don't live in ICU. The data comes from CLDR. What you want to do is to verify and/or correct the data in CLDR. http://unicode.org/cldr

Steven R. Loomis (srl-icu-project) wrote :

Should this bug be closed, with a note that folks should contribute corrections to http://unicode.org/cldr ?

Matthew Paul Thomas (mpt) wrote :

Contributing corrections there may be the best way of fixing this bug, but that does not mean the bug is already fixed.

Steven R. Loomis (srl-icu-project) wrote :

Matthew, you are quite right. What I meant to say was, can this bug be filed upstream (CLDR) with some more precision?

The bug title should note that these are time zone names, not "names of Islands".
Christmas, for example, refers to the time zone "Indian/Christmas".
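
To illustrate why a bare word like "Christmas" can show up, here is a minimal sketch that derives a label from the last component of a tz database ID. This is an assumption about how such labels might be produced, not a description of what Ubiquity or ICU actually does; `label_from_tz_id` is a hypothetical helper, and the IDs other than "Indian/Christmas" are added for illustration.

```python
def label_from_tz_id(tz_id: str) -> str:
    """Take the last path component of a tz ID and turn underscores into spaces."""
    return tz_id.rsplit("/", 1)[-1].replace("_", " ")


# "Indian/Christmas" is the ID mentioned in this comment; the other two are
# further tz database IDs included only as examples.
for tz_id in ("Indian/Christmas", "Pacific/Norfolk", "Australia/Lord_Howe"):
    print(tz_id, "->", label_from_tz_id(tz_id))
```

A label built this way carries no "Island" or "Islands" suffix, because the ID itself does not contain one.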

Can you describe the context a bit more clearly as to where you see these names?

You shouldn't see "Norfolk" for example, but "Norfolk Islands Time". But, it may depend on the application and use.

Matthew Paul Thomas (mpt) wrote :

It looks like the closest thing CLDR has to a bug tracker is <http://unicode.org/cldr/apps/survey>, which is currently offline.

The names appear both in the Ubuntu installer, and in "System" > "Administration" > "Time and Date" > "Time zone".

Steven R. Loomis (srl-icu-project) wrote :

CLDR has a trac at <http://unicode.org/cldr/trac>

I'm not the maintainer of the CLDR launchpad project, so I can't update the links.

summary: - Many islands, atolls etc are mislabelled
+ Many islands, atolls etc are mislabelled in Time Zones