An annotation tool for coreference

Coref Annotator
Annotation Concepts
Import and Export

Annotation Concepts

Difficult cases

While annotating, one typically stumbles upon difficult cases that need to be discussed with peers or supervisors. To support these discussions, difficult mentions can be tagged with the exclamation mark symbol.


Sometimes, anaphoric expressions can appear in plural without a plural antecedent. In these cases, entities can be grouped into entity groups.\   (1) Mary was eating ice cream with John. They had a lot of fun.\ “They” in the previous example refers to both Mary and John, but they are not present as an antecedent.

In CorefAnnotator, any number of entities can be selected and formed as a group entity. Group entities behave like regular entities (i.e., one can drag and drop mentions onto them, assign shortcuts and colors), but they appear separately at the bottom of the tree view. If they are expanded, entity groups reveal their members (which are entities), as well as the mentions that are used to refer to the group.


Genericity expresses whether an entity is a kind, instead of an instance. The prototypical example is shown in (1) and (2):
  (1) The elephant met a rabbit. He asked him to be his friends.
  (2) When under water, the elephant uses its trunk as a snorkel.
The underlined strings are exactly the same, but the first refers to a specific instance or individual of a class, while the second refers to the class as a whole.

In CorefAnnotator, entities can be marked as generic via the context or top menu. Generic entities are marked with the cloud symbol.