ARTFEED — Contemporary Art Intelligence

Transformer Neural Process Introduces Kernel Regression for Scalable Stochastic Modeling

ai-technology · 2026-04-20

A new research paper introduces Transformer Neural Process - Kernel Regression (TNP-KR), a scalable model designed to overcome computational bottlenecks in existing Neural Processes. Neural Processes were originally developed as a more scalable alternative to Gaussian Processes, whose exact inference scales as O(n³) in the number of data points. While modern Neural Processes can match Gaussian Processes in accuracy, they still face O(n²) costs from their attention mechanisms. TNP-KR incorporates a Kernel Regression Block that reduces this to O(n_c² + n_c n_t), where n_c and n_t denote the numbers of context and test points, respectively. The architecture also includes a kernel-based attention bias and a novel scan attention mechanism, which improves memory efficiency and, when paired with the kernel bias, provides translation invariance. The research was published on arXiv under identifier 2411.12502v4 as a replace-cross (cross-listed replacement) announcement. The work addresses the fundamental challenge of modeling posterior predictive distributions of stochastic processes with transformer-based approaches.
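
To make the stated complexity concrete, the sketch below shows a generic kernel-biased cross-attention step from test points to context points in NumPy. It is an illustration under assumptions, not the paper's implementation: the RBF kernel, the function and parameter names (including lengthscale), and the example shapes are all hypothetical choices.

    import numpy as np

    def rbf_kernel(xa, xb, lengthscale=1.0):
        # Pairwise RBF similarities between locations xa: (m, d) and xb: (n, d).
        d2 = ((xa[:, None, :] - xb[None, :, :]) ** 2).sum(-1)
        return np.exp(-0.5 * d2 / lengthscale ** 2)

    def softmax(x, axis=-1):
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def kernel_biased_cross_attention(q_t, k_c, v_c, x_t, x_c, lengthscale=1.0):
        # q_t: (n_t, h) test queries; k_c, v_c: (n_c, h) context keys/values;
        # x_t: (n_t, d) and x_c: (n_c, d) are the point locations used for the bias.
        # Each test point attends only to the n_c context points, so this step
        # costs O(n_c * n_t), matching the cross term in O(n_c^2 + n_c * n_t).
        h = q_t.shape[-1]
        logits = q_t @ k_c.T / np.sqrt(h)                                   # dot-product scores, (n_t, n_c)
        logits = logits + np.log(rbf_kernel(x_t, x_c, lengthscale) + 1e-9)  # kernel-based attention bias
        return softmax(logits, axis=-1) @ v_c                               # (n_t, h) outputs

    # Example: 200 context points and 1000 test points in 1D with 16-dim embeddings.
    rng = np.random.default_rng(0)
    x_c, x_t = rng.uniform(0, 10, (200, 1)), rng.uniform(0, 10, (1000, 1))
    k_c, v_c = rng.normal(size=(200, 16)), rng.normal(size=(200, 16))
    q_t = rng.normal(size=(1000, 16))
    out = kernel_biased_cross_attention(q_t, k_c, v_c, x_t, x_c)            # (1000, 16)

Because the RBF bias depends only on the difference between test and context locations, shifting every input location by the same offset leaves the attention weights unchanged, which is the sense in which a kernel bias can support translation invariance. Self-attention among the n_c context points would contribute the remaining O(n_c²) term of the stated complexity.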

Key facts

  • Transformer Neural Process - Kernel Regression (TNP-KR) introduces scalable modeling of stochastic processes
  • Neural Processes were developed as scalable alternatives to Gaussian Processes, whose exact inference is O(n³)
  • Modern Neural Processes can rival Gaussian Processes in accuracy but suffer O(n²) bottlenecks
  • TNP-KR features Kernel Regression Block with complexity O(n_c² + n_c n_t)
  • Model includes a kernel-based attention bias and a novel scan attention mechanism for memory efficiency (generic memory-efficiency sketch after this list)
  • Scan attention paired with kernel bias enables translation invariance
  • Research published on arXiv with identifier 2411.12502v4
  • Paper announcement type is replace-cross
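
The digest does not describe how scan attention works internally. As a rough, generic illustration of the kind of memory saving involved, the sketch below computes attention over key/value chunks with a running (online) softmax, so the full score matrix is never stored; this is a standard chunked-attention pattern rather than the paper's specific mechanism, and the function name and chunk size are illustrative.

    import numpy as np

    def chunked_attention(q, k, v, chunk=128):
        # Attention computed over key/value chunks with a running (online) softmax:
        # the full (n_q, n_k) score matrix is never materialized, so peak memory
        # for scores is O(n_q * chunk) instead of O(n_q * n_k).
        n_q, h = q.shape
        out = np.zeros((n_q, v.shape[-1]))
        denom = np.zeros((n_q, 1))
        running_max = np.full((n_q, 1), -np.inf)
        for start in range(0, k.shape[0], chunk):
            kc, vc = k[start:start + chunk], v[start:start + chunk]
            scores = q @ kc.T / np.sqrt(h)                                  # scores for this chunk only
            new_max = np.maximum(running_max, scores.max(axis=1, keepdims=True))
            scale = np.exp(running_max - new_max)                           # rescale previous accumulators
            p = np.exp(scores - new_max)
            out = out * scale + p @ vc
            denom = denom * scale + p.sum(axis=1, keepdims=True)
            running_max = new_max
        return out / denom                                                  # equals softmax(q @ k.T / sqrt(h)) @ v

    # Example: same result as dense attention, without the full score matrix.
    rng = np.random.default_rng(0)
    q = rng.normal(size=(500, 16))
    k, v = rng.normal(size=(4000, 16)), rng.normal(size=(4000, 16))
    out = chunked_attention(q, k, v, chunk=256)                             # (500, 16)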

Entities

Institutions

  • arXiv

Sources