
Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning
Prism addresses a critical friction point in multimodal LLM research: the lack of standardized infrastructure for continual instruction tuning. Current MCIT work requires researchers to fork and modify base model codebases, creating isolated implementations that resist comparison and slow iteration. By decoupling algorithmic innovation from engineering scaffolding, Prism enables plug-and-play method development and reproducible benchmarking. This matters because continual adaptation to new tasks is essential for real-world deployment, yet the field has been bottlenecked by implementation overhead rather than fundamental breakthroughs. A shared codebase accelerates the pace at which the community can validate and combine techniques.58























