Specification of XDXF Layer 1, Version 1, Draft 001 The purpose of the layer 1 is to specify a format into which can be converted all the dictionaries we currently have. XDXF Layer 1 specifies the base format for all XDXF dictionaries. Later, more layers will be defined to extend the functionality, but all subsequent layers will comply with all the previous layers specifications. That is all rules of a certain layer are valid for that layer and all the layers above it. The version number represents the release version of the layer specification and ideally should always stay at 1. The only case when the version number of a certain layer is incremented is when a serious bug is found in the specification after its release to the public. If we make a change before the public release we increment the draft number. The intent is to keep XDXF format simple, because if it will be too complicated nobody will use it. We will not add any feature, unless we really need it. Don't not say: "Let's add such and such tag, because it might be useful in the future". Instead, if you have a dictionary with certain features that cannot be converted into current XDXF format, then we will think of something. Each dictionary is located in its own folder, which has the same name as the dictionary. So, if the dictionary name is "Webster1913" then the folder name and the base name of dictionary files will also be "Webster1913". Each dictionary must have an icon in PNG format with size of 32x32 pixels. For example: ./xdxf_dictionaries/Webster1913/Webster1913.xdxf ./xdxf_dictionaries/Webster1913/Webster1913.png Note that the names are case sensitive and the extensions are always lower case. All XDXF dictionary text files (those with .xdxf extension) are in XML format with UTF-8 encoding. ------------------------------------------------------------------------------------ Tags: ------------------------------------------------------------------------------------ The root element. Information about the dictionary. Information necessary for dictionary parsing. XDXF format layer number. Version of the layer. Ideally it should never change. Information for the user about the dictionary contents. Can be either "meaning" or "pronunciation_only". A dictionary of type "meaning" must have a tag in each
. It also may provide pronunciation. A dictionary of type "pronunciation_only" should not have any tags at all. For example a transcription dictionary will have type "pronunciation_only" because it does not provide any information but pronunciation. ISO 639 three letter code of the language of the key-phrases. ISO 639 three letter code of the language of the meanings. Copyright. License. Description of the dictionary in free words. The origin of the dictionary file. From where this file can be downloaded. From where can be downloaded the unformatted file, i.e. the original dictionary file before the conversion into XDXF format. From where the original unformatted dictionary file was downloaded. Link to the script which was used to convert the original unformatted dictionary file into XDXF format. The version number of the script mentioned above. List of abbreviations used in the dictionary. Definition of an abbreviation. The abbreviated text. The full text.
This tag groups together all the stuff related to one key-phrase.
The key-phrase.
This tag marks definition or a group of definitions which fall into a certain category. For English language those categories are parts of speech: noun, verb, adverb, etc.
Attribute "pos" specifies the Part Of Speech like: noun, verb, adverb, etc.
Marks the text of an example.
Marks reference to another key-phrase in the same dictionary.
Marks an abbreviation that is listed in section.
Marks transcription ------------------------------------------------------------------------------------ Example: ------------------------------------------------------------------------------------ 1 1 meaning ENG ENG C. G. Merriam Co. (C)1913 Public Domain Webster's Unabridged Dictionary published 1913 by the C. G. Merriam Co. http://xdxf.sourceforge.net/dicts/Webster1913.xdxf ="http://xdxf.sourceforge.net/unformatted/Webster1913.ptk" http://www.dict.com/Webster1913.ptk? http://xdxf.sourceforge.net/scripts/ptk2xdxf.pl 1 n. noun v. verb
record n. [re'kord] Anything written down and preserved. v. [reko'rd] To write down for future use.
home [ho:um] n. One's own dwelling place; the house in which one lives. One's native land; the place or country in which one dwells. The abiding place of the affections. For without hearts there is no home. See also: home-made
------------------------------------------------------------------------------------ Ideally XDXF would have only the tags that specifies the logical structure of the text and not the appearance. It would be up to the program to decide how a certain element would look like, and that appearance would be customizable by the user. That is why we use tags like and not However, in the real world we have dictionaries which already have some visual markings, like and , and often it is impossible for a converting script to determine with which logical markings they should be substituted. Therefor we use several XHTML tags for visual markings:
,

...

..., ..., ..., ..., ..., ..., ... ... Each XDXF dictionary is stored in a ZIP file at http://xdxf.sourceforge.net/TBD. This will allow programs to download and install dictionaries on the fly.