Completed PROJECT GRANT Swedish Research Council

Semantica: Algorithms and Tools for Multilingual Knowledge Extraction

34M kr SEK

Funder	Swedish Research Council
Recipient Organization	Lund University
Country	Sweden
Start Date	Jan 01, 2022
End Date	Dec 31, 2025
Duration	1,460 days
Number of Grantees	1
Roles	Principal Investigator
Data Source	Swedish Research Council
Grant ID	`2021-04533_VR`

Grant Description

Text is the main repository of human knowledge and automatic knowledge extraction is now made possible at unprecedented levels as a gigantic number of documents has been digitized.

The goal of this project is to create deep-learning algorithms to carry out a large-scale multilingual knowledge extraction from text.

In previous projects, our group developed algorithms to identify real-world entities, events in texts, semantic roles, and temporal relations. We used supervised machine-learning techniques that we trained on semantically annotated corpora. However, such corpora are a scarce resource for languages like Swedish, but even for German or French.

The proposed project will build on the results from earlier projects and create algorithms that require less, possibly no, manual annotation.

We will explore distant supervision with the mapping of entities in loosely parallel texts such as the Swedish and English versions of Wikipedia. We will investigate algorithms linking entities to existing knowledge bases and multilingual embeddings. We will apply these components to much larger corpora that cover substantial portions of human knowledge.

We will process the documents and store the results with frameworks we designed at Lund. Knowledge extraction is a key element to many advanced applications in text processing.

Generic multilingual tools will make it easier to design semantic applications that we believe will be instrumental in the development of the information society.

All Grantees

Lund University

Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant

Semantica: Algorithms and Tools for Multilingual Knowledge Extraction

Grant Description

All Grantees

Interested in applying for this grant?

Quick Summary

Related Grants