SAE-RNA Interprets RNA Language Model Representations
Researchers introduce SAE-RNA, a sparse autoencoder model designed to interpret representations from the RiNALMo RNA language model. The study applies sparse autoencoders (SAEs) to RNA language model representations, following similar work on protein language models like ESM. SAE-RNA maps RiNALMo's internal representations to known human-level biological features, serving as a probe for understanding how RNA language models organize biological information. The authors frame this as a representation-level analysis rather than definitive biological concept discovery, acknowledging limitations in the approach.
Key facts
- SAE-RNA is a sparse autoencoder model for interpreting RNA language model representations.
- It analyzes representations from the RiNALMo RNA language model.
- The study follows recent work applying SAEs to protein language models like ESM.
- SAE-RNA maps representations to known human-level biological features.
- The approach is framed as a representation-level probe, not definitive concept discovery.
- The paper is available on arXiv under identifier 2510.02734.
Entities
—