Loading…
Loading grant details…
| Funder | Swedish Research Council |
|---|---|
| Recipient Organization | Kth, Royal Institute of Technology |
| Country | Sweden |
| Start Date | Jan 01, 2022 |
| End Date | Dec 31, 2025 |
| Duration | 1,460 days |
| Number of Grantees | 1 |
| Roles | Principal Investigator |
| Data Source | Swedish Research Council |
| Grant ID | 2021-05266_VR |
Recent results in information theory show that the generalization error of stochastic learningalgorithms is bounded by the amount of information that an algorithm (e.g., a neural network) stores in its parameters about the training data.
The bounds suggest that in order to obtain learning algorithms that generalize well to unseen data, the information stored in the parameters needs to be controlled to find the best tradeoff between empirical risk minimization and minimization of the generalization error. How this can be achieved is an open problem that we address in this project.
The goal of this project is to develop new theoretical tools that are required for crafting learning algorithms that perform well, generalize well, and are robust, and that provide new insights into how machines learn.
To reach this goal, the objectives of this project are (1) to develop a new family of neural networks that are derived from parity-check matrices of sparse-graph codes and allow us to relate the amount of information stored to their design parameters, (2) to develop tools for tracking the information in the parameters during training, and (3) to extend these tools to multitask learning.
To achieve this, we combine tools from information and coding theory with Bayesian neural networks.
The work in this project will be carried out by one PhD student over the duration of four years, supervised by the applicant.
Kth, Royal Institute of Technology
Complete our application form to express your interest and we'll guide you through the process.
Apply for This Grant