GATE track 1 session

From ZooidWiki


A full week of learning GATE: text mining, information extraction, language processing, and talks. Session wiki

GATE developer screenshot

GATE is written in Java and is very Java-centric. This makes it portable, fast, and heavyweight. A programming library is available. It is 14 years old and has many users and contributors.


Using GATE developer

Information Extraction

Old Bailey IE project - 17th century English (Online)

Evaluation / Metrics

To investigate

JAPE

Phase: MatchingStyles
Input: Lookup
Options: control = appelt

// Match an optional location Lookup followed by a loc_key Lookup.
Rule: Test1
(
  ({Lookup.majorType == location})?
  {Lookup.majorType == loc_key}
):match
-->
  :match.Location = {rule=Test1}
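
To make the behaviour of rule Test1 easier to reason about, here is a minimal Python sketch of what it should do under the appelt control style, where the longest match at a given position is preferred. This is not GATE code and does not use any GATE API: the function name `apply_test1` and the dictionary layout for annotations are hypothetical, chosen only for illustration. It also assumes the Lookup annotations are sorted by offset and do not overlap, and it treats consecutive Lookups as adjacent because the phase declares `Input: Lookup`.

```python
from typing import Dict, List


def apply_test1(lookups: List[Dict]) -> List[Dict]:
    """Simulate rule Test1 over Lookup annotations sorted by start offset.

    Assumption: annotations are plain dicts with start, end, majorType and
    minorType keys. This layout is hypothetical, not a GATE data structure.
    """
    locations = []
    i = 0
    while i < len(lookups):
        current = lookups[i]
        following = lookups[i + 1] if i + 1 < len(lookups) else None

        if (current["majorType"] == "location"
                and following is not None
                and following["majorType"] == "loc_key"):
            # Longest match: optional location Lookup plus the loc_key Lookup.
            matched = [current, following]
        elif current["majorType"] == "loc_key":
            # The optional element is absent; the loc_key Lookup matches alone.
            matched = [current]
        else:
            # No match can start at this annotation; move on.
            i += 1
            continue

        # Right-hand side of the rule: create a Location over the whole match.
        locations.append({
            "type": "Location",
            "start": matched[0]["start"],
            "end": matched[-1]["end"],
            "features": {"rule": "Test1"},
        })
        # Resume after the matched region, so matched Lookups are not reused.
        i += len(matched)
    return locations


# Made-up example input: "London Road", a lone loc_key, and a lone location.
lookups = [
    {"start": 0, "end": 6, "majorType": "location", "minorType": "city"},
    {"start": 7, "end": 11, "majorType": "loc_key", "minorType": "post"},
    {"start": 20, "end": 25, "majorType": "loc_key", "minorType": "pre"},
    {"start": 30, "end": 36, "majorType": "location", "minorType": "country"},
]

for location in apply_test1(lookups):
    print(location["start"], location["end"], location["features"])
```

In this sketch a location Lookup on its own produces nothing, because the rule requires a loc_key Lookup; only the location part is optional.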

Copying features: :match.Location = { type = :match.Lookup.minorType}

To review, gotchas

Matching types

Matching styles for JAPE

To follow up

Other notes

Lucene data store and ANNIC

GATE-lucene-person-money.png

Demos

Conclusions

While it can do a lot out of the box and benefits from long development time and breadth of connectivity, it needs usability testing before it will be useful to more than patient specialists. Many things are not obvious or are too domain-specific, yet with a bit of work they could be more broadly useful. Interaction could include many more immediate, useful, and interesting-looking displays, and a web-based version could have these features. However, the team seems somewhat ambivalent about development. :)

Looking forward to learning about programming using GATE libraries.




Blikied on Aug 30, 2010



