Resources

CLARIN’s repository contains datasets, models and software. They are mostly products of the language technology program, but also various other data submitted by institutions and individuals. On www.clarin.is/en/resources most of what is found in the repository is listed in a structured way, which gives a good overview of the content of the repository.

News

October 9, 2024

The Árni Magnússon Institute's language processing website is up again, enhanced and improved. There you can use the following tools, both by pasting text into a form and by using an API:

  • Tokenizer - Tokenizer from Miðeind ehf
  • PoS Tagger - POS from The Language and Voice Technology Lab at the University of Reykjavík
  • Lemmatizer - Nefnir by Jón Friðrik Daðason
  • Hyphenation Tool - Skiptir from The Árni Magnússon Institute
 
August 23, 2023

The Árna Magnússon Institute of Icelandic Studies has now been upgraded from being a Metadata Providing Centre (C-center) to a Service Providing Centre (B-center) for CLARIN in Iceland. To become a validated service-providing center, strict conditions must be met in terms of exterior maintenance and technical infrastructure. The service centers are in fact the backbone of CLARIN and are responsible for providing the scientific community with access to data and services as well as sharing knowledge.

News Archive