We welcome anyone who has made a wordnet that is available under an open license to join the Open Multilingual Wordnet (OMW). This links your wordnet to wordnets in other languages and makes it more easily accessible. Wordnets in OMW are used by other projects such as the Natural Language Toolkit. We also provide a restful interface.
To join the Open Multilingual Wordnet you must:
wordnet http://opendefinition.org/licenses/cc-by/ http://opendefinition.org/licenses/cc-by/3.0 http://opendefinition.org/licenses/cc-by/4.0 http://opendefinition.org/licenses/odc-by/ http://www.cecill.info/licences/Licence_CeCILL-C_V1-en.html http://opendefinition.org/licenses/cc-by-sa/ http://opendefinition.org/licenses/cc-by-sa/3.0 http://opendefinition.org/licenses/cc-by-sa/4.0 https://creativecommons.org/publicdomain/zero/1.0/ https://creativecommons.org/licenses/by/ https://creativecommons.org/licenses/by-sa/ https://creativecommons.org/licenses/by/3.0/ https://creativecommons.org/licenses/by-sa/3.0/ https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by-sa/4.0/ http://www.cecill.info/licences/Licence_CeCILL-C_V1-en.html https://opensource.org/licenses/MIT/ https://opensource.org/licenses/Apache-2.0
There is extensive documentation with the schemas. Here we include a few tips that are not covered there.
If you want to include a definition from somewhere else (such as the Princeton wordnet), or in a language other than that of the wordnet, please note it explicitly:
<Definition language="ja">辞書の編集者または筆者</Definition> <Definition dc:source="pwn-3.0" language="en">a compiler or writer of a dictionary</Definition>
If you have a relation type not included in the list we have, please use other and give your more explicit type as dc:type. Or, if your type is a more specific subclass of an existing type, you can use the supertype and mark the specific type with dc:type.
<SynsetRelation relType="other" dc:type="emotion" target="example-en-1234-n"/> <SynsetRelation relType="antonym" dc:type="gradable antonym" target="example-en-1234-n"/>
You can add variations of lemmas, including orthographic variations and transliterations, as shown below. You can have various classes of transliteration, and if they are automatically generated, you can give them a confidence score.
<LexicalEntry id="w613347"> <Lemma writtenForm="动物沟通" partOfSpeech="n" script="Hans"/> <Form writtenForm="dòngwùgōutōng" script="Latn-pinyin"> <Tag category="transliteration">pīnyīn</Tag> <Tag category="confidence">0.77</Tag> </Form> <Form writtenForm="dong4wu4gou1tong1" script="Latn-pinyin"> <Tag category="transliteration">pin1yin1</Tag> <Tag category="confidence">0.77</Tag> </Form> <Form writtenForm="dongwugoutong" script="Latn-pinyin"> <Tag category="transliteration">pinyin</Tag> <Tag category="confidence">0.77</Tag> </Form> </LexicalEntry>
The basic structure of the OMW and CILI is described here (this web page is more up-to-date):
The Open Multilingual Wordnet may not contain all of the information in the component wordnets. It is the (large and we hope useful) subset of information we know how to represent.