Mission

PORTULAN CLARIN is the Research Infrastructure for the Science and Technology of Language.

Its mission is to support researchers, innovators, citizen scientists, students, language professionals and users in general whose activities resort to research results from the Science and Technology of Language by means of the distribution of scientific resources, the supplying of technological support, the provision of consultancy, and the fostering of scientific dissemination.

It supports activities in all scientific and cultural domains with special relevance to those that are more directly concerned with language — whether as their immediate subject, or as an instrumental mean to address their topics —, including among others, the areas of Humanities, Arts and Social Sciences, Artificial Intelligence, Computation and Cognitive Sciences, Healthcare, Language teaching and promotion, Cultural creativity, Cultural heritage, etc.

It serves all those whose activity requires the handling and exploration of language resources, including language data and services:

  • in all sorts of modalities – spoken, written, sign, multimodal, etc.
  • in all types of representations – audio, text, video, records of brain activity, etc.
  • and in all types of functions – instrument for communication, symbolic object, cognitive ability to be stimulated through formal education in native language, knowledge vehicle, ability to be exercised in the acquisition of a second language, reflection of mental activity, natural form of interaction with artificial agents and devices, etc.
  • etc.

The infrastructure is used when it is necessary, for example:

  • to use a language processing tool – e.g. conjugators, terminology extractors, concordancers, part-of-speech taggers, deep linguistic processing grammars, etc.
  • to access data sets – e.g. linguistically interpreted corpora, terminology data bases, EEG records of neurolinguistic experiments, collections of literary texts, etc.
  • to obtain a data sample – e.g. video recording of deaf children sign language, words for concepts in the Organization subontology, etc.
  • to use specific research support applications – e.g. lemma frequency extractors, treebank annotators, etc.
  • to use an appropriately equipped online workbench – to support field work on the documentation of endangered languages, to do research on translation, etc.
  • etc.

PORTULAN CLARIN ensures the preservation and fostering of the scientific heritage regarding the Portuguese language, supporting the preservation, promotion, distribution, sharing and reuse of language resources for this language, including text collections, lexicons, processing tools, etc.

It represents an asset of utmost importance for the technological development of the Portuguese language and to its preparation for the digital age, contributing to ensure the citizenship of its speakers in the information society.

PORTULAN CLARIN belongs to the Portuguese National Roadmap of Research Infrastructures of Strategic Relevance and is part of the international research infrastructure CLARIN ERIC.

It fosters Open Science practices by supporting its users in making their results and resources accessible to all sectors of an inquiring society.

User-centered operation

The infrastructure pursues its mission by seeking to bring scientific, professional or personal advantages to its users by means of its operation being primarily centered on its users.

User-centered by design

Users were involved with the infrastructure well before it entered into operation, right from the start of its planning, and contributed to its design and implementation through the Network of Implementation Partners; and they keep involved in its operation through the Helpdesk and, at a higher level, through the Scientific Advisory Forum.

Users are free to access

Users have free access to the infrastructure, with no registration required and without any “members only” constraint.

Users are free to choose

Users are free to choose the distribution licenses for their resources to be distributed by the infrastructure

Users keep in control

Users distributing their resources through the infrastructure grant it only the non-exclusive right to distribute those resources: they keep all rights, including the right to distribute their resources through other means, and to withdraw their resources from the infrastructure.

Users are not tracked

No users’ data is retained related to their usage of the infrastructure, be it their scientific data or their personal data. To protect users’ rights over their resources distributed through the infrastructure, their personal and scientific data and their resources are only recorded and displayed to other users of the infrastructure under their explicit consent and control.

Users are not outrun

The infrastructure is a new type of open research organization that supports science and technology and the optimization of their resources: the infrastructure supports researchers, research centers and any other users but does not compete with its users and does not undertake research.

Users have a reliable infrastructure

The operation of the infrastructure by its users relies on a professionally run, institutional data center, hosted and maintained by the Faculty of Sciences of the University of Lisbon, and on a dedicated management team, and holds several certifications awarded by different independent evaluation organizations.

Users are supported

Users have a helpdesk to resort to for getting support in their utilization of the infrastructure.

Users are involved

Users benefit from initiatives to enhance their engagement with the infrastructure.

Users are listened

Users can address the infrastructure at any moment and are called to provide advice through the Scientific Advisory Forum.

Timeline

2024

May: organisation of the "Pledge by scientific researchers on Artificial Intelligence and Language Technology for Portuguese"

April: collaboration established with the Lisbon Academy of Sciences to make the Academy's dictionary available online also through PORTULAN.

March: the PORTULAN CLARIN repository obtains renewal of the international CoreTrustSeal certification.

March: organisation of the debate "Artificial Intelligence and the Future of the Portuguese Language" as part of the 16th International Conference on the Computational Processing of the Portuguese Language (PROPOR2024).

2023

October: collaboration is established with the International Portuguese Language Institute (IILP), a body of the Community of Portuguese Speaking Countries (CPLP), to make corpora of non-European variants of Portuguese available through PORTULAN.

April: Lisbon hosts the General Assembly of CLARIN ERIC, which was the first face to face after pandemics lockdown, organized by PORTULAN CLARIN.

2022

November: conclusion of the PORTULAN CLARIN implementation project approved by the Foundation for Science and Technology (FCT).

March: organisation of the round table "Languages, Technology and Innovation" as part of the International Conference on the Portuguese and Spanish Languages (CILPE2022) of the Organisation of Ibero-American States (OEI).

2021

June: concluded with full success the implementation project for the first phase of the PORTULAN CLARIN infrastructure, with the goals in the working plan being accomplished and surpassed.

May: Paulo Quaresma, formerly Executive Director of PORTULAN CLARIN, appointed to the Steering Committee of EOSC - European Open Science Cloud.

April: collaboration with LER+ National Plan for Reading established for the reutilization of language processing tools available in the PORTULAN CLARIN infrastructure for educational purposes.

January: partnership with DefinedCrowd, an international company in the area of AI and labelled datasets, for the distribution of scientific resources established.

January: Strategic Advisory Board, with Ana Paula Laborinho (Organization of Ibero-American States, cultural manager), Daniela Braga (DefinedCrowd, entrepreneur) and Nicolau Santos (national news agency Agência Lusa, journalist) in its inaugural constituency, established.

January: Scientific Advisory Forum established, based on the network of implementation partners and open to further contributions.

2020

December: workshop of the network of implementation partners.

June: public event of inauguration of the PORTULAN CLARIN infrastructure temporarily postponed due to the covid-19 outbreak.

March: score "High" assigned to the maturity status of PORTULAN CLARIN in the national round of assessment of the infrastructures in the National Roadmap of Research Infrastructures of Strategic Interest, by an external committee of independent experts, appointed by FCT-Foundation for Science and Technology, of the Portuguese Ministry of Science, Technology and Higher Education.

February: the extension until at least 2023 of the membership of Portugal, and consequently of PORTULAN CLARIN, to the European infrastructure consortium CLARIN ERIC was approved by FCT-Foundation for Science and Technology, of the Portuguese Ministry of Science, Technology and Higher Education.

2019

December: PORTULAN CLARIN teams up with ROSSIO research infrastructure for the Winter School on Digital Humanities.

December: PORTULAN CLARIN repository obtains the international CoreTrustSeal certification.

December: PORTULAN CLARIN obtains from CLARIN ERIC the certification as a national centre.

November: PORTULAN CLARIN is part of the Advisory Board of the portuguese node of the RDA-Research Data Alliance

November: The PORTULAN CLARIN website, beta version, is presented at the CLARIN Annual Conference.

July: PORTULAN CLARIN obtains the K-Centre certification from CLARIN ERIC as a knowledge centre.

2018

November: CLARIN European Research Infrastructure Consortium (CLARIN ERIC) is presented as on of the two highlighted use cases in the launching ceremony of the European Open Science Cloud (EOSC)

June: joint organization with AMA — Agency for the Digital Transformation of Public Administration and DGT-PT — Portuguese Language Department of the Directorate-General for Translation of the Lisbon Workshop of the European Language Resources Consortium

April: CLARIN ERIC constituency reaches 20 members

February: upon the definition of the regulatory framework for scientific employment being concluded by the national authorities, implementation activities of PORTULAN CLARIN intensify

2017

November: successful application, jointly with AMA — Agency for the Digital Transformation of Public Administration and other partners, for a European project for the gathering of language resources, under the CEF program

July: joint organization with META — Multilingual Europe Technology Alliance of the Lisbon METAFORUM conference on language technologies

June: grant contract for the implementation project of PORTULAN CLARIN is signed

April: agreement with INCD — National Infrastructure of Distributed Computation to support PORTULAN CLARIN with compute-intensive services

April: joint organization with DefinedCrowd (Daniela Braga) of the audience with the Secretary of State of Science, Technology and Higher Education, Professor Fernanda Rollo, in April 10, 2017, aiming at the inclusion of the technological preparation of the Portuguese language as a chapter in the National Science and Technology Plan 2030, under preparation.

March: application proposal for the implementation project is successful and the national funding for the implementation of PORTULAN CLARIN is approved

2016

July: joint organization with DGT-PT — Portuguese Language Department of the Directorate-General for Translation of the Workshop on Corpora and Tools for Processing Corpora

July: application of CLARIN PORTULAN submitted to the national competitive call for the funding of implementation projects of research infrastructures

2015

March: FCT — Fundação para a Ciência e Tecnologia, the Portuguese national funding agency for research, proceeds with an analysis of design and preparation maturity of the infrastructures proposed in the national Roadmap of Research Infrastructures, and PORTULAN CLARIN undergoes a most successful analysis

2014

November: Portugal becomes the 11th member of the European distributed research infrastructure CLARIN ERIC after its request for membership being accepted by the CLARIN ERIC General Assembly

October: joint development with Camões — Institute for Cooperation and Language of the online service for the categorization of texts according to their level of proficiency

September: joint organization with Unbabel (João Graça) of the round table about the topic "Preparing the Portuguese Language for the Digital Age", in the conference "Languages: Translating the Future", which took place in Lisbon, in September 26, 2014, associated to the celebration of the European Day of Languages, and jointly promoted by the Directorate-General for Translation of the European Commission, the Assembly of the Republic (Portuguese Parliament), the Prosecutor General of Portugal and the Camões IP, Institute for Cooperation and Language of the Ministry of Foreign Affairs.

February: obtaining the top score from the evaluation committee, the application of PORTULAN CLARIN is successful and this infrastruture is an inaugural member of the first National Roadmap of Research Infrastructures of Strategic Relevance

2013

September: application proposal of PORTULAN CLARIN is submitted to the competitive call for the roadmap of national research infrastructures, by 3 proponent partners and 19 implementation partners, including Camões IP, the national body for the promotion of the Portuguese Language

August: IILP — International Institute for the Portuguese Language, a body of CPLP endorses the application proposal of PORTULAN CLARIN

August: ELRA — European Language Resources Association endorses the application proposal of PORTULAN CLARIN

2012

December: the international workshop “New Technologies and the Future of Languages” is organized in Lisbon by the CLARIN network for the Portuguese language

September: the CLARIN network for the Portuguese language responds to the call of interest to becoming part of the future National Roadmap of Research Infrastructures issued by FCT — Fundação para a Ciência e Tecnologia

April: the joining to CLARIN ERIC is recommended by IILP — International Institute of the Portuguese Language, a body of the international organization CPLP — The Community of Portuguese Speaking Countries

February: CLARIN ERIC is founded, starting with 8 member countries, and becomes part of the first European Roadmap of Research Infrastructures promoted by European Strategy Forum on Research Infrastructures (ESFRI)

2011

June: European preparatory project of CLARIN is successfully concluded, with the CLARIN network for the Portuguese language, with 18 members, being the largest one in CLARIN

2010

September: after a commonly signed letter, seven Brazilian universities become members of the CLARIN network for the Portuguese language, following a positive response to their request

March: Portugal is invited to be founding member of the future CLARIN ERIC

March: workshop supported by FCT — Fundação para a Ciência e Tecnologia to form the CLARIN national network successfully takes place in Lisbon, gathering 17 research units accredited by FCT

2008

January: European preparatory project for CLARIN starts, funded by the European Commission: the University of Lisbon team, coordinated by António Branco, is the consortium partner representing Portugal

2006

the European language research community submits a proposal to be an inaugural member of the first European Roadmap of Research Infrastructures being designed by ESFRI, and another proposal to a competitive call in the FP7 Framework Program for projects to prepare the design of European infrastructures

Name

Scientific knowledge is grounded on falsifiable predictions and thus its credibility and raison d'être relies on the possibility of repeating experiments and getting similar results as originally obtained and reported. Scientific knowledge is also cumulative, with more recent advancements originating from developments obtained over previous breakthroughs.

Crucial for the scientific endeavor, and for science-based activities, it is the availability of data and of companion analytical devices. Also crucial, it is the moral, and many times physical, courage to challenge the status quo, together with the altruism to share the research results.

While seeking to foster this scientific ethos, this research infrastructure is named "portulan", the term designating the maps where discoveries by courageous sailors were documented such that these discoveries and associated data could subsequently be confirmed or corrected, the originating travel could be repeated with increased efficiency, and new discoveries and routes could be reached beyond those already known.

Portolan chart by Jorge de Aguiar (1492), the oldest known signed and dated chart of Portuguese origin (Beinecke Rare Book and Manuscript Library, Yale University, New Haven, USA)

If you are interested to know more about portulan maps, you can start here.

Background image of the PORTULAN CLARIN website.