• Home
  • About
  • Repositories
  • Search
  • Web API
  • Feedback
<< Go Back

Metadata

Name
AnCora Catalan 2.0.0
Repository
ZENODO
Identifier
doi:10.5281/zenodo.4762031
Description
AnCora Catalan 2.0.0&nbsp;consists of 500,000 words. The corpus&nbsp;is annotated at different levels:


Lemma and Part of Speech
Syntactic constituents and functions
Argument structure and thematic roles
Semantic classes of the verb
Nouns related to WordNet synsets
Named Entities
Coreference relations


AnCora Catalan 2.0.0 is mainly based on journalist texts. For more information, click&nbsp;AnCora-corpus.

The annotators of AnCora Catalan 2.0.0 are:

Oriol Borrega, Isabel Briz, N&uacute;ria Buf&iacute;, Montserrat Civit, Mar&iacute;a Jes&uacute;s D&iacute;az, Silvia Garcia, Raquel Hern&aacute;ndez, Marina Lloberes, Raquel Marcos, Difda Monterde, Montserrat Nofre, Aina Peris, Lourdes Puiggr&ograve;s, Marta Recasens, B&agrave;rbara Soriano, Rita Zaragoza.
Data or Study Types
multiple
Source Organization
Unknown
Access Conditions
available
Year
2021
Access Hyperlink
https://doi.org/10.5281/zenodo.4762031

Distributions

  • Encoding Format: HTML ; URL: https://doi.org/10.5281/zenodo.4762031
This project was funded in part by grant U24AI117966 from the NIH National Institute of Allergy and Infectious Diseases as part of the Big Data to Knowledge program. We thank all members of the bioCADDIE community for their valuable input on the overall project.