Home Knowledge Base Self-Soupervision: Cooking Model Soups

Self-Soupervision: Cooking Model Soups

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Self-Soupervision: Cooking Model Soups without Labels

Announce Type: replace Abstract: Model soups are strange and strangely effective combinations of parameters. They take a model (the stock), fine-tune it into multiple models (the ingredients), and then mix their parameters back into one model (the soup) to improve predictions. While all known soups require supervised learning, and optimize the same loss on labeled data, our recipes for Self-Soupervision generalize soups to self-supervised learning (SSL).

arXiv CS 7d ago