A research team led by Professor Jiwoong Yang of the Department of Energy Science and Engineering at DGIST has developed next ...
Abstract: Knowledge-based Visual Question Answering (VQA) is a challenging task that requires models to access external knowledge for reasoning. Large Language Models (LLMs) have recently been ...
Abstract: Visual odometry (VO) is a key part of autonomous navigation systems, particularly for robots and autonomous vehicles. Conventional feature-based or direct approaches for VO are powerful but ...