- ...
task1.1
- A task in which the vector components are discrete values rather than
continuous values.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... fnTBL2.1
- fnTBL is to be pronounced as funtible <
>
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...
stop2.2
is a threshold that needs be set in advance; it can
be
, in which case the algorithm learns until there are no
more rules to be learned.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...3.1
- In general, there should be
features, where
is the set of features associated with the
sample and
is the set of classifications. For POS tagging
and
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... word3.2
- The gold-standard POS, to be more precise.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...3.3
- The presented class is the Wordnet label of one of the senses of the
word use.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... rule4.1
- A rule whose output classification is part of the decision - such
as the example above - the classified class - NN - is part of the
predicate.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...5.1
- The argument to the -t parameter of the most_likely_tag.prl script
is capital o (O) and not the number 0.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... constraints5.2
- One might want to not impose constraints on low occurrence words,
because the statistics on them might not be reliable.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... distribution5.3
- The file is presented here in double column, to save some space, but
it's single column in the actual file.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... types5.4
- We are describing here the basic predicate types - the rules' predicates
are made out of a conjunction of these predicate types.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... time5.5
- One might want to do this, as the distribution on the first part of
the training data might be different than the distribution on the
second part. However, one should exclude the words that appear only
on the second part, as it will create conditions similar to the testing
time situation, i.e. do not use the true tag for the unknown words,
but rather the guessed tag.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... memory5.6
- Everything is relative nowadays, though. It is debatable whether is
better to invest time in making the program run in smaller amounts
of memory or to wait until computers with a large amount of memory
become standard.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... properties5.7
- But it should be mentioned that the fact they applied only one on
the data also says that they did not apply on the rest of the samples,
which means that they should not make a big negative impact on the
test data, as they should not apply too often. However, if the training
and test data are not drawn from the same distribution, they can hurt
the performance.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... classifications5.8
- This is true if the instances are not part of the same document, where
the ``one sense per discourse'' [11]; as the task
was presented in Senseval2 [2], the samples are independent.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... necessary5.9
- This character was selected because it still keeps the sample line
readable, but you can replace it with any character you like.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... time5.10
- Actually, we observed a significant decrease in running time when
using this feature; its use is highly recommended in cases such as
this one.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...strike%2:35:03::|strike_a_chord%2:31:00::5.11
- The values presented here represent the wordnet 1.7 tags [3]associated
with the senses.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ...
count5.12
- In this case defined as would convert the value to correct, if applied,
regardless of the actual value of the classification - usually, rules
are not considered to apply if the classification is already correct.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
- ... tiesA.1
- Tie = a case where 2 rules have the same score
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.