|
|
|
|
|
|
|
<HTML> |
|
<HEAD> |
|
<TITLE>WNINPUT(5WN) manual page</TITLE> |
|
</HEAD> |
|
<BODY> |
|
<A HREF="#toc">Table of Contents</A><P> |
|
|
|
<H2><A NAME="sect0" HREF="#toc0">NAME </A></H2> |
|
noun.<I>suffix </I>, verb.<I>suffix </I>, adj.<I>suffix </I>, adv.<I>suffix </I> - WordNet lexicographer |
|
files that are input to <B><A HREF="grind.1WN.html">grind</B>(1WN)</A> |
|
|
|
<H2><A NAME="sect1" HREF="#toc1">DESCRIPTION </A></H2> |
|
WordNet's source files |
|
are written by lexicographers. They are the product of a detailed relational |
|
analysis of lexical semantics: a variety of lexical and semantic relations |
|
are used to represent the organization of lexical knowledge. Two kinds |
|
of building blocks are distinguished in the source files: word forms and |
|
word meanings. Word forms are represented in their familiar orthography; |
|
word meanings are represented by synonym sets (<I>synset </I>s) - lists of synonymous |
|
word forms that are interchangeable in some context. Two kinds of relations |
|
are recognized: lexical and semantic. Lexical relations hold between word |
|
forms; semantic relations hold between word meanings. <P> |
|
Lexicographer files |
|
correspond to the syntactic categories implemented in WordNet - noun, verb, |
|
adjective and adverb. All of the synsets in a lexicographer file are in |
|
the same syntactic category. Each synset consists of a list of synonymous |
|
words or collocations (eg. <B>"fountain pen" </B>, <B>"take in" </B>), and pointers that |
|
describe the relations between this synset and other synsets. These relations |
|
include (but are not limited to) hypernymy/hyponymy, antonymy, entailment, |
|
and meronymy/holonymy. A word or collocation may appear in more than one |
|
synset, and in more than one part of speech. Each use of a word in a synset |
|
represents a sense of that word in the part of speech corresponding to |
|
the synset. <P> |
|
Adjectives may be organized into clusters containing head |
|
synsets and satellite synsets. Adverbs generally point to the adjectives |
|
from which they are derived. <P> |
|
See <B><A HREF="wngloss.7WN.html">wngloss</B>(7WN)</A> |
|
for a glossary of WordNet |
|
terminology and a discussion of the database's content and logical organization. |
|
|
|
<H3><A NAME="sect2" HREF="#toc2">Lexicographer File Names </A></H3> |
|
The names of the lexicographer files are of |
|
the form: <P> |
|
<blockquote><I>pos</I>.<I>suffix</I> </blockquote> |
|
<P> |
|
where <I>pos </I> is either <B>noun </B>, <B>verb </B>, <B>adj </B> or <B>adv |
|
</B>. <I>suffix </I> may be used to organize groups of synsets into different files, |
|
for example <B>noun.animal </B> and <B>noun.plant </B>. See <B><A HREF="lexnames.5WN.html">lexnames</B>(5WN)</A> |
|
for a list of |
|
lexicographer file names that are used in building WordNet. |
|
<H3><A NAME="sect3" HREF="#toc3">Pointers </A></H3> |
|
Pointers |
|
are used to represent the relations between the words in one synset and |
|
another. Semantic pointers represent relations between word meanings, |
|
and therefore pertain to all of the words in the source and target synsets. |
|
Lexical pointers represent relations between word forms, and pertain |
|
only to specific words in the source and target synsets. The following |
|
pointer types are usually used to indicate lexical relations: Antonym, |
|
Pertainym, Participle, Also See, Derivationally Related. The remaining |
|
pointer types are generally used to represent semantic relations. <P> |
|
A relation |
|
from a source to a target synset is formed by specifying a word from the |
|
target synset in the source synset, followed by the <I>pointer_symbol </I> indicating |
|
the pointer type. The location of a pointer within a synset defines it |
|
as either lexical or semantic. The <FONT SIZE=-1><B>Lexicographer File Format </B></FONT> |
|
section |
|
describes the syntax for entering a semantic pointer, and <FONT SIZE=-1><B>Word Syntax |
|
</B></FONT> |
|
describes the syntax for entering a lexical pointer. <P> |
|
Although there |
|
are many pointer types, only certain types of relations are permitted |
|
between synsets of each syntactic category. <P> |
|
The <I>pointer_symbol </I>s for nouns |
|
are: <blockquote><B>! </B> <tt> </tt> <tt> </tt> Antonym <BR> |
|
<B>@ </B><tt> </tt> <tt> </tt> Hypernym <BR> |
|
<B>@i </B><tt> </tt> <tt> </tt> Instance Hypernym <BR> |
|
<B> </B><tt> </tt> <tt> </tt> Hyponym <BR> |
|
<B> i </B><tt> </tt> <tt> </tt> Instance |
|
Hyponym <BR> |
|
<B>#m </B><tt> </tt> <tt> </tt> Member holonym <BR> |
|
<B>#s </B><tt> </tt> <tt> </tt> Substance holonym <BR> |
|
<B>#p </B><tt> </tt> <tt> </tt> Part holonym <BR> |
|
<B>%m |
|
</B><tt> </tt> <tt> </tt> Member meronym <BR> |
|
<B>%s </B><tt> </tt> <tt> </tt> Substance meronym <BR> |
|
<B>%p </B><tt> </tt> <tt> </tt> Part meronym <BR> |
|
<B>= </B><tt> </tt> <tt> </tt> Attribute <BR> |
|
<B>+ |
|
</B><tt> </tt> <tt> </tt> Derivationally related form<tt> </tt> <tt> </tt> <tt> </tt> <tt> </tt> <BR> |
|
<B>;c </B><tt> </tt> <tt> </tt> Domain of synset - TOPIC <BR> |
|
<B>-c </B><tt> </tt> <tt> </tt> Member of this |
|
domain - TOPIC <BR> |
|
<B>;r </B><tt> </tt> <tt> </tt> Domain of synset - REGION <BR> |
|
<B>-r </B><tt> </tt> <tt> </tt> Member of this domain - REGION |
|
<BR> |
|
<B>;u </B><tt> </tt> <tt> </tt> Domain of synset - USAGE <BR> |
|
<B>-u </B><tt> </tt> <tt> </tt> Member of this domain - USAGE <BR> |
|
</blockquote> |
|
<P> |
|
The <I>pointer_symbol |
|
</I>s for verbs are: <blockquote><B>! </B> <tt> </tt> <tt> </tt> Antonym <BR> |
|
<B>@ </B><tt> </tt> <tt> </tt> Hypernym <BR> |
|
<B> </B><tt> </tt> <tt> </tt> Hyponym <BR> |
|
<B>* </B><tt> </tt> <tt> </tt> Entailment <BR> |
|
<B>> </B><tt> </tt> <tt> </tt> Cause |
|
<BR> |
|
<B>^ </B><tt> </tt> <tt> </tt> Also see <BR> |
|
<B>$ </B><tt> </tt> <tt> </tt> Verb Group <BR> |
|
<B>+ </B><tt> </tt> <tt> </tt> Derivationally related form<tt> </tt> <tt> </tt> <tt> </tt> <tt> </tt> <BR> |
|
<B>;c </B><tt> </tt> <tt> </tt> Domain of |
|
synset - TOPIC <BR> |
|
<B>;r </B><tt> </tt> <tt> </tt> Domain of synset - REGION <BR> |
|
<B>;u </B><tt> </tt> <tt> </tt> Domain of synset - USAGE |
|
<BR> |
|
</blockquote> |
|
<P> |
|
The <I>pointer_symbol </I>s for adjectives are: <blockquote><B>! </B><tt> </tt> <tt> </tt> Antonym <BR> |
|
<B>& </B><tt> </tt> <tt> </tt> Similar to <BR> |
|
<B>< |
|
</B><tt> </tt> <tt> </tt> Participle of verb <BR> |
|
<B>\ </B><tt> </tt> <tt> </tt> Pertainym (pertains to noun) <BR> |
|
<B>= </B><tt> </tt> <tt> </tt> Attribute <BR> |
|
<B>^ </B><tt> </tt> <tt> </tt> Also |
|
see <BR> |
|
<B>;c </B><tt> </tt> <tt> </tt> Domain of synset - TOPIC <BR> |
|
<B>;r </B><tt> </tt> <tt> </tt> Domain of synset - REGION <BR> |
|
<B>;u </B><tt> </tt> <tt> </tt> Domain |
|
of synset - USAGE <BR> |
|
</blockquote> |
|
<P> |
|
The <I>pointer_symbol </I>s for adverbs are: <blockquote><B>! </B><tt> </tt> <tt> </tt> Antonym <BR> |
|
|
|
<B>\ </B><tt> </tt> <tt> </tt> Derived from adjective <BR> |
|
<B>;c </B><tt> </tt> <tt> </tt> Domain of synset - TOPIC <BR> |
|
<B>;r </B><tt> </tt> <tt> </tt> Domain of synset |
|
- REGION <BR> |
|
<B>;u </B><tt> </tt> <tt> </tt> Domain of synset - USAGE <BR> |
|
</blockquote> |
|
<P> |
|
Many pointer types are reflexive, |
|
meaning that if a synset contains a pointer to another synset, the other |
|
synset should contain a corresponding reflexive pointer. <B><A HREF="grind.1WN.html">grind</B>(1WN)</A> |
|
automatically |
|
inserts missing reflexive pointers for the following pointer types: <P> |
|
|
|
<TABLE BORDER=0> |
|
<TR> <TD ALIGN=CENTER><B>Pointer </B> </TD> <TD ALIGN=CENTER><B>Reflect </B> </TD> </TR> |
|
<TR> <TR> <TD ALIGN=LEFT>Antonym </TD> <TD ALIGN=LEFT>Antonym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Hyponym </TD> <TD ALIGN=LEFT>Hypernym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Hypernym |
|
</TD> <TD ALIGN=LEFT>Hyponym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Instance Hyponym </TD> <TD ALIGN=LEFT>Instance Hypernym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Instance Hypernym </TD> |
|
<TD ALIGN=LEFT>Instance Hyponym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Holonym </TD> <TD ALIGN=LEFT>Meronym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Meronym </TD> <TD ALIGN=LEFT>Holonym </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Similar to |
|
</TD> <TD ALIGN=LEFT>Similar to </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Attribute </TD> <TD ALIGN=LEFT>Attribute </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Verb Group </TD> <TD ALIGN=LEFT>Verb Group </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Derivationally |
|
Related </TD> <TD ALIGN=LEFT>Derivationally Related </TD> </TR> |
|
<TR> <TD ALIGN=LEFT>Domain of synset </TD> <TD ALIGN=LEFT>Member of Doman </TD> |
|
</TR> |
|
</TABLE> |
|
|
|
<H3><A NAME="sect4" HREF="#toc4">Verb Frames </A></H3> |
|
Each verb synset contains a list of generic sentence frames |
|
illustrating the types of simple sentences in which the verbs in the synset |
|
can be used. For some verb senses, example sentences illustrating actual |
|
uses of the verb are provided. (See <FONT SIZE=-1><B>Verb Example Sentences </B></FONT> |
|
in <B><A HREF="wndb.5WN.html">wndb</B>(5WN)</A> |
|
.) |
|
Whenever there is no example sentence, the generic sentence frames specified |
|
by the lexicographer are used. The generic sentence frames are entered |
|
in a synset as a comma-separated list of integer frame numbers. The following |
|
list is the text of the generic frames, preceded by their frame numbers: |
|
<P> |
|
<blockquote>1<tt> </tt> <tt> </tt> Something ----s <BR> |
|
2<tt> </tt> <tt> </tt> Somebody ----s <BR> |
|
3<tt> </tt> <tt> </tt> It is ----ing <BR> |
|
4<tt> </tt> <tt> </tt> Something is ----ing PP <BR> |
|
5<tt> </tt> <tt> </tt> Something |
|
----s something Adjective/Noun <BR> |
|
6<tt> </tt> <tt> </tt> Something ----s Adjective/Noun <BR> |
|
7<tt> </tt> <tt> </tt> Somebody ----s Adjective |
|
<BR> |
|
8<tt> </tt> <tt> </tt> Somebody ----s something <BR> |
|
9<tt> </tt> <tt> </tt> Somebody ----s somebody <BR> |
|
10<tt> </tt> <tt> </tt> Something ----s somebody <BR> |
|
|
|
11<tt> </tt> <tt> </tt> Something ----s something <BR> |
|
12<tt> </tt> <tt> </tt> Something ----s to somebody <BR> |
|
13<tt> </tt> <tt> </tt> Somebody ----s on something |
|
<BR> |
|
14<tt> </tt> <tt> </tt> Somebody ----s somebody something <BR> |
|
15<tt> </tt> <tt> </tt> Somebody ----s something to somebody <BR> |
|
|
|
16<tt> </tt> <tt> </tt> Somebody ----s something from somebody <BR> |
|
17<tt> </tt> <tt> </tt> Somebody ----s somebody with something |
|
<BR> |
|
18<tt> </tt> <tt> </tt> Somebody ----s somebody of something <BR> |
|
19<tt> </tt> <tt> </tt> Somebody ----s something on somebody |
|
<BR> |
|
20<tt> </tt> <tt> </tt> Somebody ----s somebody PP <BR> |
|
21<tt> </tt> <tt> </tt> Somebody ----s something PP <BR> |
|
22<tt> </tt> <tt> </tt> Somebody ----s PP |
|
<BR> |
|
23<tt> </tt> <tt> </tt> Somebody's (body part) ----s <BR> |
|
24<tt> </tt> <tt> </tt> Somebody ----s somebody to INFINITIVE <BR> |
|
25<tt> </tt> <tt> </tt> Somebody |
|
----s somebody INFINITIVE <BR> |
|
26<tt> </tt> <tt> </tt> Somebody ----s that CLAUSE <BR> |
|
27<tt> </tt> <tt> </tt> Somebody ----s to somebody |
|
<BR> |
|
28<tt> </tt> <tt> </tt> Somebody ----s to INFINITIVE <BR> |
|
29<tt> </tt> <tt> </tt> Somebody ----s whether INFINITIVE <BR> |
|
30<tt> </tt> <tt> </tt> Somebody |
|
----s somebody into V-ing something <BR> |
|
31<tt> </tt> <tt> </tt> Somebody ----s something with something |
|
<BR> |
|
32<tt> </tt> <tt> </tt> Somebody ----s INFINITIVE <BR> |
|
33<tt> </tt> <tt> </tt> Somebody ----s VERB-ing <BR> |
|
34<tt> </tt> <tt> </tt> It ----s that CLAUSE <BR> |
|
35<tt> </tt> <tt> </tt> Something |
|
----s INFINITIVE <BR> |
|
</blockquote> |
|
|
|
<H3><A NAME="sect5" HREF="#toc5">Lexicographer File Format </A></H3> |
|
Synsets are entered one per |
|
line, and each line is terminated with a newline character. A line containing |
|
a synset may be as long as necessary, but no newlines can be entered within |
|
a synset. Within a synset, spaces or tabs may be used to separate entities. |
|
Items enclosed in italicized square brackets may not be present. <P> |
|
The |
|
general synset syntax is: <P> |
|
<blockquote><B>{ </B> <I> words pointers </I> <B>( </B> <I> gloss </I> <B>) } </B> |
|
<BR> |
|
</blockquote> |
|
<P> |
|
Synsets of this form are valid for all syntactic categories except |
|
verb, and are referred to as basic synsets. At least one <I>word </I> and a <I>gloss |
|
</I> are required to form a valid synset. Pointers entered following all the |
|
<I>words </I> in a synset represent semantic relations between all the words |
|
in the source and target synsets. <P> |
|
For verbs, the basic synset syntax is |
|
defined as follows: <P> |
|
<blockquote><B>{ </B> <I> words pointers frames </I> <B>( </B> <I>gloss </I> <B>) } |
|
</B> <BR> |
|
</blockquote> |
|
<P> |
|
Adjective may be organized into clusters containing one or more head |
|
synsets and optional satellite synsets. Adjective clusters are of the |
|
form: <P> |
|
<blockquote><B>[ </B><BR> |
|
<I>head synset </I><BR> |
|
[satellite synsets] <BR> |
|
[-] <BR> |
|
[additional head/satellite |
|
synsets] <BR> |
|
<B>] </B> <BR> |
|
</blockquote> |
|
<P> |
|
Each adjective cluster is enclosed in square brackets, |
|
and may have one or more parts. Each part consists of a head synset and |
|
optional satellite synsets that are conceptually similar to the head synset's |
|
meaning. Parts of a cluster are separated by one or more hyphens (<B>- </B>) on |
|
a line by themselves, with the terminating square bracket following the |
|
last synset. Head and satellite synsets follow the syntax of basic synsets, |
|
however a "Similar to" pointer must be specified in a head synset for |
|
each of its satellite synsets. Most adjective clusters contain two antonymous |
|
parts. See <B><A HREF="wngloss.7WN.html">wngloss</B>(7WN)</A> |
|
for a discussion of adjective clusters, and <FONT SIZE=-1><B>Special |
|
Adjective Syntax </B></FONT> |
|
for more information on adjective cluster syntax. <P> |
|
Synsets |
|
for relational adjectives (pertainyms) and participial adjectives do not |
|
adhere to the cluster structure. They use the basic synset syntax. <P> |
|
Comments |
|
can be entered in a lexicographer file by enclosing the text of the comment |
|
in parentheses. Note that comments <B>cannot </B> appear within a synset, as |
|
parentheses within a synset have an entirely different meaning (see <FONT SIZE=-1><B>Gloss |
|
Syntax </B></FONT> |
|
). However, entire synsets (or adjective clusters) can be "commented |
|
out" by enclosing them in parentheses. This is often used by the lexicographers |
|
to verify the syntax of files under development or to leave a note to |
|
oneself while working on entries. |
|
<H3><A NAME="sect6" HREF="#toc6">Word Syntax </A></H3> |
|
A synset must have at least |
|
one word, and the words of a synset must appear after the opening brace |
|
and before any other synset constructs. A word may be entered in either |
|
the simple word or word/pointer syntax. <P> |
|
A simple word is of the form: |
|
<P> |
|
<blockquote><I>word[ </I> <B>( </B> <I>marker </I> <B>) </B> <I>][lex_id] </I> <B>, </B> <BR> |
|
</blockquote> |
|
<P> |
|
<I>word </I> may be entered in any combination |
|
of upper and lower case unless it is in an adjective cluster. A collocation |
|
is entered by joining the individual words with an underscore character |
|
(<B>_ </B>). Numbers (integer or real) may be entered, either by themselves or |
|
as part of a word string, by following the number with a double quote |
|
(<B>" </B>). <P> |
|
See <FONT SIZE=-1><B>Special Adjective Syntax </B></FONT> |
|
for a description of adjective clusters |
|
and markers. <P> |
|
<I>word </I> may be followed by an integer <I>lex_id </I> from <B>1 </B> to <B>15 |
|
</B>. The <I>lex_id </I> is used to distinguish different senses of the same word |
|
within a lexicographer file. The lexicographer assigns <I>lex_id </I> values, |
|
usually in ascending order, although there is no requirement that the |
|
numbers be consecutive. The default is <B>0 </B>, and does not have to be specified. |
|
A <I>lex_id </I> must be used on pointers if the desired sense has a non-zero |
|
<I>lex_id </I> in its synset specification. <P> |
|
Word/pointer syntax is of the form: |
|
<P> |
|
<blockquote><B>[ </B> <I>word[ </I> <B>( </B> <I>marker </I> <B>) </B> <I>][lex_id] </I> <B>, </B> <I> pointers </I> <B>] </B> <BR> |
|
</blockquote> |
|
<P> |
|
This syntax |
|
is used when one or more pointers correspond only to the specific word |
|
in the word/pointer set, rather than all the words in the synset, and |
|
represents a lexical relation. Note that a word/pointer set appears within |
|
a synset, therefore the square brackets used to enclose it are treated |
|
differently from those used to define an adjective cluster. Only one word |
|
can be specified in each word/pointer set, and any number of pointers |
|
may be included. A synset can have any number of word/pointer sets. Each |
|
is treated by <B><A HREF="grind.1WN.html">grind</B>(1WN)<B></B></A> |
|
essentially as a <I>word </I>, so they all must appear |
|
before any synset <I>pointers </I> representing semantic relations. <P> |
|
For verbs, |
|
the word/pointer syntax is extended in the following manner to allow the |
|
user to specify generic sentence frames that, like pointers, correspond |
|
only to a specific word, rather than all the words in the synset. In this |
|
case, <I>pointers </I> are optional. <P> |
|
<blockquote><B>[ </B> <I>word </I> <B>, </B> <I>[pointers] frames </I> <B>] |
|
</B> <BR> |
|
</blockquote> |
|
|
|
<H3><A NAME="sect7" HREF="#toc7">Pointer Syntax </A></H3> |
|
Pointers are optional in synsets. If a pointer is specified |
|
outside of a word/pointer set, the relation is applied to all of the words |
|
in the synset, including any words specified using the word/pointer syntax. |
|
This indicates a semantic relation between the meanings of the words |
|
in the synsets. If specified within a word/pointer set, the relation corresponds |
|
only to the word in the set and represents a lexical relation. <P> |
|
A pointer |
|
is of the form: <P> |
|
<blockquote><I>[lex_filename </I><B>: </B> <I>]word[lex_id] </I><B>, </B><I>pointer_symbol </I> <BR> |
|
</blockquote> |
|
<P> |
|
|
|
or: <P> |
|
<blockquote><I>[lex_filename </I><B>: </B> <I>]word[lex_id] </I><B>^ </B><I>word[lex_id] </I><B>, </B><I>pointer_symbol </I> <BR> |
|
|
|
</blockquote> |
|
<P> |
|
For pointers, <I>word </I> indicates a word in another synset. When the second |
|
form of a pointer is used, the first <I>word </I> indicates a word in a head |
|
synset, and the second is a word in a satellite of that cluster. <I>word |
|
</I> may be followed by a <I>lex_id </I> that is used to match the pointer to the |
|
correct target synset. The synset containing <I>word </I> may reside in another |
|
lexicographer file. In this case, <I>word </I> is preceded by <I>lex_filename </I> as |
|
shown. <P> |
|
See <FONT SIZE=-1><B>Pointers </B></FONT> |
|
for a list of <I>pointer_symbol </I>s and their meanings. |
|
|
|
<H3><A NAME="sect8" HREF="#toc8">Verb Frame List Syntax </A></H3> |
|
Frame numbers corresponding to generic sentence |
|
frames must be entered in each verb synset. If a frame list is specified |
|
outside of a word/pointer set, the verb frames in the list apply to all |
|
of the words in the synset, including any words specified using the word/pointer |
|
syntax. If specified within a word/pointer set, the verb frames in the |
|
list correspond only to the word in the set. <P> |
|
A frame number list is entered |
|
as follows: <P> |
|
<blockquote><B>frames: </B> <I>f_num </I>[<B>, </B><I>f_num...] </I> </blockquote> |
|
<P> |
|
Where <I>f_num </I> specifies a generic |
|
frame number. See <FONT SIZE=-1><B>Verb Frames </B></FONT> |
|
for a list of generic sentences and their |
|
corresponding frame numbers. |
|
<H3><A NAME="sect9" HREF="#toc9">Gloss Syntax </A></H3> |
|
A gloss is included in all synsets. |
|
The lexicographer may enter a text string of any length desired. A gloss |
|
is simply a string enclosed in parentheses with no embedded carriage returns. |
|
It provides a definition of what the synset represents and/or example |
|
sentences. |
|
<H3><A NAME="sect10" HREF="#toc10">Special Adjective Syntax </A></H3> |
|
The syntax for representing antonymous |
|
adjective synsets requires several additional conditions. <P> |
|
The first word |
|
of a head synset <B>must </B> be entered in upper case, and can be thought of |
|
as the head word of the head synset. The <I>word </I> part of a pointer from |
|
one head synset to another head synset within the same cluster (usually |
|
an antonym) must also be entered in upper case. Usually antonymous adjectives |
|
are entered using the word/pointer syntax described in <FONT SIZE=-1><B>Word Syntax </B></FONT> |
|
to |
|
indicate a lexical relation. There is no restriction on the number of |
|
parts that a cluster may have, and some clusters have three parts, representing |
|
antonymous triplets, such as <B>solid </B>, <B>liquid </B>, and <B>gas </B>. <P> |
|
A cross-cluster |
|
pointer may be specified, allowing a head or satellite synset to point |
|
to a head synset in a different cluster. A cross-cluster pointer is indicated |
|
by entering the <I>word </I> part of the pointer in upper case. <P> |
|
An adjective |
|
may be annotated with a syntactic marker indicating a limitation on the |
|
syntactic position the adjective may have in relation to noun that it |
|
modifies. If so marked, the marker appears between the word and its following |
|
comma. If a <I>lex_id </I> is specified, the marker immediately follows it. The |
|
syntactic markers are: <blockquote><B>(p) </B><tt> </tt> <tt> </tt> predicate position <BR> |
|
<B>(a) </B><tt> </tt> <tt> </tt> prenominal (attributive) |
|
position <BR> |
|
<B>(ip) </B><tt> </tt> <tt> </tt> immediately postnominal position<tt> </tt> <tt> </tt> <tt> </tt> <tt> </tt> <BR> |
|
</blockquote> |
|
|
|
<H2><A NAME="sect11" HREF="#toc11">EXAMPLES </A></H2> |
|
<I>(Note that |
|
these are hypothetical examples not found in the WordNet lexicographer |
|
files.) </I> <P> |
|
Sample noun synsets: <blockquote>{ canine, [ dog1, cat,! ] pooch, canid,@ |
|
} <BR> |
|
{ collie, dog1,@ (large multi-colored dog with pointy nose) } <BR> |
|
{ hound, |
|
hunting_dog, pack,#m dog1,@ } <BR> |
|
{ dog, } <BR> |
|
</blockquote> |
|
<P> |
|
Sample verb synsets: <blockquote>{ [ confuse, |
|
clarify,! frames: 1 ] blur, obscure, frames: 8, 10 } <BR> |
|
{ [ clarify, confuse,! |
|
] make_clear, interpret,@ frames: 8 } <BR> |
|
{ interpret, construe, understand,@ |
|
frames: 8 } <BR> |
|
</blockquote> |
|
<P> |
|
Sample adjective clusters: <blockquote>[ <BR> |
|
{ [ HOT, COLD,! ] lukewarm(a), |
|
TEPID,^ (hot to the touch) } <BR> |
|
{ warm, } <BR> |
|
- <BR> |
|
{ [ COLD, HOT,! ] frigid, (cold |
|
to the touch) } <BR> |
|
{ freezing, } <BR> |
|
] <BR> |
|
</blockquote> |
|
<P> |
|
Sample adverb synsets: <blockquote>{ [ basically, |
|
adj.all:essential^basic,\ ] [ essentially, adj.all:basic^fundamental,\ ] ( by |
|
one's very nature )} <BR> |
|
{ pointedly, adj.all:pungent^pointed,\ } <BR> |
|
{ [ badly, |
|
adj.all:bad,\ well,! ] ill, ("He was badly prepared") } <BR> |
|
</blockquote> |
|
|
|
<H2><A NAME="sect12" HREF="#toc12">SEE ALSO </A></H2> |
|
<B><A HREF="grind.1WN.html">grind</B>(1WN)</A> |
|
, |
|
<B><A HREF="wnintro.5WN.html">wnintro</B>(5WN)</A> |
|
, <B><A HREF="lexnames.5WN.html">lexnames</B>(5WN)</A> |
|
, <B><A HREF="wndb.5WN.html">wndb</B>(5WN)</A> |
|
, <B><A HREF="uniqbeg.7WN.html">uniqbeg</B>(7WN)</A> |
|
, <B><A HREF="wngloss.7WN.html">wngloss</B>(7WN)</A> |
|
. <P> |
|
Fellbaum, |
|
C. (1998), ed. <I>"WordNet: An Electronic Lexical Database" </I>. MIT Press, Cambridge, |
|
MA. <P> |
|
<P> |
|
|
|
<HR><P> |
|
<A NAME="toc"><B>Table of Contents</B></A><P> |
|
<UL> |
|
<LI><A NAME="toc0" HREF="#sect0">NAME</A></LI> |
|
<LI><A NAME="toc1" HREF="#sect1">DESCRIPTION</A></LI> |
|
<UL> |
|
<LI><A NAME="toc2" HREF="#sect2">Lexicographer File Names</A></LI> |
|
<LI><A NAME="toc3" HREF="#sect3">Pointers</A></LI> |
|
<LI><A NAME="toc4" HREF="#sect4">Verb Frames</A></LI> |
|
<LI><A NAME="toc5" HREF="#sect5">Lexicographer File Format</A></LI> |
|
<LI><A NAME="toc6" HREF="#sect6">Word Syntax</A></LI> |
|
<LI><A NAME="toc7" HREF="#sect7">Pointer Syntax</A></LI> |
|
<LI><A NAME="toc8" HREF="#sect8">Verb Frame List Syntax</A></LI> |
|
<LI><A NAME="toc9" HREF="#sect9">Gloss Syntax</A></LI> |
|
<LI><A NAME="toc10" HREF="#sect10">Special Adjective Syntax</A></LI> |
|
</UL> |
|
<LI><A NAME="toc11" HREF="#sect11">EXAMPLES</A></LI> |
|
<LI><A NAME="toc12" HREF="#sect12">SEE ALSO</A></LI> |
|
</UL> |
|
</BODY></HTML> |
|
|