Hierarchical Few-Shot Generative Models

Paper GitHub

Contributions

  • We increase the expressivity of the input-set representation of latent variable models for sets through hierarchical inference and a learnable, attention-based aggregation mechanism (see the sketch after this list).
  • We explore forward and iterative sampling strategies for the marginal and the predictive distributions implicitly defined by few-shot generative models.
  • We study few-shot transfer for this new class of models, exploring generalization to set cardinality, new classes, and new datasets, and provide evidence that a hierarchical set representation increases the expressivity of few-shot generative models.
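
To make the learnable aggregation concrete, here is a minimal sketch of an attention-based point-to-set aggregation module. It assumes a PyTorch implementation; the class name LearnableAggregation, the learnable-query pooling design, and the hyperparameters are illustrative choices, not the repository's API.

```python
import torch
import torch.nn as nn


class LearnableAggregation(nn.Module):
    """Pool per-sample features into one set-level embedding with
    cross-attention against a learnable query (illustrative sketch)."""

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.query = nn.Parameter(torch.randn(1, 1, dim))  # learnable set-level query
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # h: (batch, set_size, dim) per-sample features for each set
        q = self.query.expand(h.size(0), -1, -1)   # (batch, 1, dim)
        pooled, _ = self.attn(q, h, h)             # attend over the set elements
        return self.norm(pooled.squeeze(1))        # (batch, dim) set embedding


if __name__ == "__main__":
    agg = LearnableAggregation(dim=64)
    feats = torch.randn(8, 5, 64)   # 8 sets of 5 elements each
    print(agg(feats).shape)         # torch.Size([8, 64])
```

Unlike mean pooling, the attention weights let the set embedding emphasize the most informative elements of the set, which is the intuition behind the MEAN vs. LAG comparison in the experiments below.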

Generative and inference models for a Neural Statistician.
Generative and inference models for a Hierarchical Few-Shot Generative Model.

Abstract

A few-shot generative model should be able to generate data from a distribution after observing only a limited set of examples. In few-shot learning, the model is trained on data from many sets drawn from different distributions that share underlying properties, such as sets of characters from different alphabets or sets of images of different object types. We extend current latent variable models for sets to a fully hierarchical approach with an attention-based point-to-set-level aggregation, and call our approach SCHA-VAE for Set-Context-Hierarchical-Aggregation Variational Autoencoder. We explore iterative data sampling, likelihood-based model comparison, and adaptation-free out-of-distribution generalization. Our results show that the hierarchical formulation better captures the intrinsic variability within sets in the small-data regime. With this work we generalize deep latent variable approaches to few-shot learning, taking a step towards large-scale few-shot generation with a formulation that can readily work with current state-of-the-art deep generative models.
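The hierarchical formulation can be read as a set-conditioned VAE: a set-level latent c is inferred from the whole input set, and per-sample latents z are conditioned on c. The sketch below illustrates that structure with a single stochastic layer per latent, simple MLPs, and mean pooling for brevity; SCHA-VAE instead uses a hierarchy over c, the learnable attention-based aggregation, and convolutional networks, so treat this as an illustration of the graphical model rather than the paper's architecture.

```python
import torch
import torch.nn as nn


class FewShotVAE(nn.Module):
    """Minimal set-conditioned VAE: a set-level latent c summarizes the input
    set X, and a per-sample latent z is conditioned on c (illustrative sketch,
    single stochastic layer per latent)."""

    def __init__(self, x_dim=784, h_dim=256, c_dim=64, z_dim=32):
        super().__init__()
        self.encode_x = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.q_c = nn.Linear(h_dim, 2 * c_dim)          # q(c | X), from pooled features
        self.q_z = nn.Linear(h_dim + c_dim, 2 * z_dim)  # q(z | x, c)
        self.p_z = nn.Linear(c_dim, 2 * z_dim)          # conditional prior p(z | c), used in the KL term
        self.decoder = nn.Sequential(
            nn.Linear(z_dim + c_dim, h_dim), nn.ReLU(), nn.Linear(h_dim, x_dim)
        )

    @staticmethod
    def reparam(stats):
        mu, logvar = stats.chunk(2, dim=-1)
        return mu + torch.randn_like(mu) * (0.5 * logvar).exp()

    def forward(self, X):
        # X: (batch, set_size, x_dim) -- a batch of small sets
        h = self.encode_x(X)                              # per-sample features
        c = self.reparam(self.q_c(h.mean(dim=1)))         # set-level latent (mean pooling for brevity)
        c_rep = c.unsqueeze(1).expand(-1, X.size(1), -1)
        z = self.reparam(self.q_z(torch.cat([h, c_rep], dim=-1)))
        return self.decoder(torch.cat([z, c_rep], dim=-1))  # logits of p(x | z, c)
```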


Generative Metrics

Generalization on disjoint Omniglot classes. Models are trained with set size 5; we report results for a VAE, Neural Statisticians (NS) with mean and learnable aggregation (MEAN/LAG) and their convolutional variants (C), and a SCHA-VAE with a hierarchy over c.

Lower bounds when varying the test set cardinality from 2 to 20 on Omniglot.

Lower bounds when varying the test set cardinality from 2 to 20 on CelebA.

Transfer

Models trained on Omniglot with set size 5 and tested on MNIST, DOUBLE-MNIST, and TRIPLE-MNIST (from left to right) with different set sizes. SCHA-VAE with learnable aggregation (LAG) adapts better to the new datasets.

MNIST.
Double-MNIST.
Triple-MNIST.

Sampling

Given a small set from an unseen character (shown on the right, on a black background), we sample from the model and then refine the samples iteratively using the inference model. We show 20 iterations from left to right; the generative process refines its guess at each iteration, as sketched in the code below.
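
In code, the loop reads roughly as follows; the model methods (sample_c_given_set, sample_x_given_c, infer_c, infer_z, decode) are hypothetical names standing in for the conditional prior, inference, and decoder networks, not the repository's API.

```python
import torch


@torch.no_grad()
def iterative_refinement(model, X, steps=20):
    """Sketch of the iterative sampling described above: condition on a small
    input set X, draw initial samples, then repeatedly re-infer the latents
    from the current samples and decode again (hypothetical model methods)."""
    c = model.sample_c_given_set(X)       # set-level latent from the input set
    samples = model.sample_x_given_c(c)   # initial conditional samples
    history = [samples]
    for _ in range(steps):
        c = model.infer_c(samples)        # re-infer the set-level latent
        z = model.infer_z(samples, c)     # re-infer per-sample latents
        samples = model.decode(z, c)      # decode the refined guess
        history.append(samples)
    return history                        # one snapshot per refinement step
```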

Iterative sampling. Stochastic reconstructions, input sets, conditional, refined, and unconditional samples on Omniglot.
Stochastic reconstructions, input sets, conditional, and unconditional samples on CelebA.