Curriculum learning

From Robowaifu Institute of Technology
Revision as of 01:24, 28 April 2023 by RobowaifuDev (talk | contribs) (Created page with "'''Curriculum learning''' is a training strategy in machine learning where the examples used to train the model are presented to the model in a meaningful order, from easiest to hardest. This is done in an attempt to increase the speed and effectiveness of the learning process, as well as the final performance of the model. The concept is inspired by the way humans learn, where new knowledge is often presented in a structured and scaffolded way. With curriculum learning,...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Curriculum learning is a training strategy in machine learning where the examples used to train the model are presented to the model in a meaningful order, from easiest to hardest. This is done in an attempt to increase the speed and effectiveness of the learning process, as well as the final performance of the model. The concept is inspired by the way humans learn, where new knowledge is often presented in a structured and scaffolded way. With curriculum learning, the idea is that by starting with simpler examples and gradually increasing the complexity of the examples, the model will be able to better learn the underlying patterns and features of the data.

Methods

A simple way to do automated curriculum learning is to ignore training examples with high noise. Min-SNR found that by attenuating the loss of Stable Diffusion examples below a SNR of 5 sped up training convergence by 3.4x.[1]

Applications

Curriculum learning has been used across a variety of machine learning domains, including computer vision, natural language processing, and reinforcement learning. Some researchers have also investigated the use of curriculum learning in multi-task learning, where the model has to learn multiple tasks simultaneously.

References

  1. Hang et al. "Efficient Diffusion Training via Min-SNR Weighting Strategy." 2023. arXiv:2303.09556