DeepMMPlanner: Deep Multimodal Large Language Model Training Planner

DeepMMPlanner: Deep Multimodal Large Language Model Training Planner
Outstanding Young Researcher Grant, National Research Foundation of Korea (NRF)

November 1, 2025

By BDAI Lab

Project Duration: September 2025 ~ August 2026

Total Grant Amount: KRW ~100,000,000 (approx. USD 100K)

Project Overview

Optimization for Multimodal LLM Training: Proposes a unified system that dynamically selects CUDA kernels, allocates GPU resources per module (e.g., encoders, connectors, decoders), and schedules data batches based on sequence lengths, to eliminate performance bottlenecks in training large multimodal models
Kernel and Resource Planning: Builds a kernel planner that profiles and selects optimal CUDA kernels at runtime based on input features and hardware specs, and a model planner that performs fine-grained GPU resource partitioning and asynchronous execution tailored to each modality (text, image, speech, video)
Adaptive Scheduling: Designs a data planner that reduces padding and synchronization overhead by scheduling batches according to sequence length and predicted compute time, improving efficiency in distributed GPU training
Open, Extensible System Architecture: Implements the DeepMMPlanner system with compatibility for mainstream training frameworks (e.g., PyTorch, DeepSpeed), and releases it as open-source to foster community adoption and industrial deployment
Expected Impact: Reduces training cost and time significantly, enabling broader accessibility to multimodal LLMs in domains such as healthcare, autonomous driving, and smart interfaces. Promotes sustainable AI by reducing energy use and hardware demand

Join BDAI Lab

Masters/Ph.D Students

BDAI Lab is accepting Masters/Ph.D students. Please send your CV and transcript to Prof. Park.

Undergraduate Internship

BDAI Lab has multiple openings for undergraduate research internship. Please send your CV and transcript to Prof. Park.

Post-Doctoral Researcher

BDAI Lab is recruiting post-doctoral researchers. Please send your CV and transcript to Prof. Park.

Contact Infomation

Engineering Hall4 D802, 50 Yonsei-ro, Seodaemun-gu, Seoul, Republic of Korea (03722)

+82-2-2123-2718

kwanghyun.park@yonsei.ac.kr

Get Direction