GATE track 1 session
Jump to navigation
Jump to search
A full week of learning GATE text mining/information extraction language processing and talks. Session wiki
GATE is written in Java and very Java centric. This makes it portable, fast, and heavyweight. A programming library is available. It's 14 years old and has many users and contributors.
Using GATE developer
GATE developer is used to process sets of Language Resources in Corpus using Processing Resources. They are typically saved to a serialized Datastore.
ANNIE, VG (verb group) processors.
Save with formatting embeds tags in HTML or XML.
To investigate
- markupAware for HTML/XML
- AnnotationStack
- Advanced Options