Diff for "SRL"

Differences between revisions 3 and 5 (spanning 2 versions)
Revision 3 as of 2013-01-26 15:43:21 (size: 837)
Revision 5 as of 2013-01-26 16:08:13 (size: 863)
Line 5 (revision 3):
The main objective of the project was to provide a corpus for evaluation of various unsupervised techniques of semantic roles annotation. The corpus consist of parts of multilingual parallel corpus PELCRA annotated manually with FrameNet semantic roles. PELCRA corpus was chosen as a source corpus in order to be able to evaluate techniques of cross-language projection of semantic roles. This is the also the reason why English frames from FrameNet were chosen instead of creating their own version designed for Polish.

Line 5 (revision 5):
The main objective of the project was to provide a corpus for evaluation of various unsupervised techniques of semantic roles annotation. The corpus consist of parts of multilingual parallel corpus PELCRA annotated manually with FrameNet-based semantic roles. PELCRA corpus was chosen as a source corpus in order to be able to evaluate techniques of cross-language projection of semantic roles. This is the also the reason why English frames from the FrameNet project were chosen instead of creating their own version designed for Polish.

Line 9 (revision 3):
|| '''# of sentences''' || ? ||
|| '''# of annotated frames''' || ? ||
|| '''# of annotated frame elements''' || ? ||

Line 9 (revision 5):
|| '''# of sentences''' || 266 ||
|| '''# of annotated frames''' || 1327 ||
|| '''# of annotated frame elements''' || 2518 ||

SRL

The main objective of the project was to provide a corpus for evaluating various unsupervised techniques of semantic role annotation. The corpus consists of parts of the multilingual parallel corpus PELCRA, manually annotated with FrameNet-based semantic roles. PELCRA was chosen as the source corpus to make it possible to evaluate techniques for cross-language projection of semantic roles. This is also the reason why English frames from the FrameNet project were used instead of creating a separate set of frames designed for Polish.

Corpus statistics

|| '''# of sentences''' || 266 ||
|| '''# of annotated frames''' || 1327 ||
|| '''# of annotated frame elements''' || 2518 ||
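
The three counts above also give a rough idea of annotation density: about 5 annotated frames per sentence and about 1.9 frame elements per frame. The short Python sketch below only redoes that arithmetic. The constant and function names are illustrative and are not part of the project.

```python
# Counts copied from the corpus statistics above.
SENTENCES = 266
FRAMES = 1327
FRAME_ELEMENTS = 2518


def density(total: int, units: int) -> float:
    """Return the average number of items per unit."""
    return total / units


frames_per_sentence = density(FRAMES, SENTENCES)
elements_per_frame = density(FRAME_ELEMENTS, FRAMES)

print(f"{frames_per_sentence:.2f} annotated frames per sentence")
print(f"{elements_per_frame:.2f} frame elements per annotated frame")
```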

Resources

Available resources:

  • annotated sentences in NEGRA format: <tgz>
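
As a starting point for working with the annotated sentences, the following Python sketch splits NEGRA export text into sentences. It assumes the files follow the usual NEGRA export conventions, where each sentence is delimited by `#BOS` and `#EOS` lines and fields are tab-separated. How the frames and frame elements are encoded inside each sentence is not described here, so the sketch does not try to interpret them. The function name and the sample lines are hypothetical.

```python
from typing import Iterable, Iterator, List


def iter_sentences(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield the raw lines of each sentence found between #BOS and #EOS."""
    current = None
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("#BOS"):
            current = []  # start collecting a new sentence
        elif line.startswith("#EOS"):
            if current is not None:
                yield current
            current = None
        elif current is not None and line.strip():
            current.append(line)  # token or node line inside a sentence


# Hypothetical sample, not taken from the corpus.
sample = [
    "%% comment lines outside sentences are skipped",
    "#BOS 1 0 0 0",
    "She\tPRP\t--\tSB\t500",
    "sleeps\tVBZ\t--\tHD\t500",
    "#500\tS\t--\t--\t0",
    "#EOS 1",
    "#BOS 2 0 0 0",
    "Yes\tUH\t--\t--\t0",
    "#EOS 2",
]

sentences = list(iter_sentences(sample))
print(len(sentences))
```

Run against the unpacked archive, the same function could be fed an open file object instead of `sample`. The file names and character encoding inside the archive are not stated on this page and would need to be checked first.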