Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training
arXiv [Project]
A generalization-preserving VLA post-training method that mitigates policy lock-in under low-data post-training, maintaining pretrained visual grounding and enabling steerable instruction following across novel concepts, spatial targets, and task configurations, without requiring extra adaptation data!






