1. Real-Time 3D Scene Understanding with Vision-Language Models. IJRAI. 2025;8(3):12255-12257. doi:10.15662/IJRAI.2025.0803001