• Home
  • CLT seminar: Christian Chiarcos – Linking annotations: Use Cases of Linked Data for NLP

CLT seminar: Christian Chiarcos – Linking annotations: Use Cases of Linked Data for NLP

SEMINAR

Within the NLP community, interoperability has been a major issue in the last 25 years, it has been subject of several standardization efforts, but nevertheless remains a problem partially solved at best.

Interoperability of linguistic resources involves two major aspects: Structural interoperability (annotations of different origin are represented using the same formalism) and conceptual interoperability (annotations of different origin are linked to a common vocabulary). Recently, it has been argued that both aspects can be addressed by representing linguistic resources using Semantic Web formalisms and in accordance with the Linked Data paradigm (Chiarcos et al., 2013).

In particular, the RDF data model (labeled directed multi-graphs) allows to generalize over the concept of feature structures which is underlying existing efforts to standardize corpora (ISO TC37/SC4:LAF, TEI), linguistic annotations (EAGLES, ISO TC37/SC4:ISOcat), and lexical resources (ISO TC37/SC4:LMF, TEI), thereby contributing to the interoperability between these standardization efforts.

This talk provides a general introduction into the topic and elaborates on two selected use cases:
– exploiting structural interoperability: combining annotated corpora and lexical resources
– exploiting conceptual interoperability: dealing with heterogeneous annotations in NLP pipelines

References:

Christian Chiarcos (2010), Towards Robust Multi-Tool Tagging. An OWL/DL-Based Approach. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala, Sweden, July 2010, 659--670.

Christian Chiarcos (2012), POWLA: Modeling linguistic corpora in OWL/DL. In: E. Simperl et al. (eds.), Proceedings of the 9th Extended Semantic Web Conference (ESWC 2012). Springer, Heidelberg, Heraklion, Crete, May 2012 (LNCS 7295), 225--239.

Christian Chiarcos, John McCrae, Philipp Cimiano, and Christiane Fellbaum (2013), Towards open data for linguistics: Lexical Linked Data. In: Alessandro Oltramari, Piek Vossen, Lu Qin, and Eduard Hovy (eds.), New Trends of Research in Ontologies and Lexical Resources. Springer, Heidelberg.

Date: 2015-02-12 10:30 - 12:00

Location: L308, Lennart Torstenssonsgatan 8

Permalink

add to Outlook/iCal

To the top

Page updated: 2015-02-08 16:39

Send as email
Print page
Show as pdf

X
Loading