By allowing models to actively update their weights during inference, Test-Time Training (TTT) creates a "compressed memory" ...
Diffusion models are widely used in many AI applications, but research on efficient inference-time scalability, particularly for reasoning and planning (known as System 2 abilities), has been lacking.
Bayesian inference provides a robust framework for combining prior knowledge with new evidence to update beliefs about uncertain quantities. In the context of statistical inverse problems, this ...