WiktionaryZ will be able to host the content that can be found in the
Wiktionaries. It is therefore of an extreme importance that the data in those wiktionaries conforms to certain specifications. These specifications are simple;
- All words must belong to a language
- All words must be spelled as they are used in that language
- Only data that is more or less structured can be parsed and thereby become a candidate for inclusion in the database
When there are dictionary conventions for a particular language that does not conform to these specifications we have a problem. I have a problem with the
Latin wiktionary. The word "
os" for instance is not Os and is not ŏs. The word "
avia" is not Avia nor
ăvĭa. The reason given why it is written like that is to conform to standards of some dead wood dictionaries. When there is a need to give additional information to the way a word appears normally in a text, we can provide that information in addition to the normal appearance.
If this means that some data cannot be converted I find this disturbing. I do however not see how it can be done in a different way.
Thanks,
GerardM