notesum.ai

Published on October 30

Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization

cs.RO
cs.AI
cs.CV

Release Date: October 30, 2024

Authors: Kento Kawaharazuka¹, Yoshiki Obinata¹, Naoaki Kanazawa¹, Kei Okada¹, Masayuki Inaba¹

Aff.: ¹Institution 1

arXiv: http://arxiv.org/abs/2410.22707v1