Chain-of-Caption: Training-free improvement of multimodal large language model on referring expression comprehension
Comments 4 pages, 5 figures, 2 tables
Comments 10 pages, 4 figures
Comments 11 pages, 5 tables, 10 figures. Under peer review
Comments Accepted at ACM Web Conference 2026 (WWW2026)
Comments 11 pages (main text), 90 pages total. Project page: https://konstantinosmitsides.github.io/dreaming-in-code
Comments 8 pages, IEEE Robot. Automat. Lett. (RA-L) 2026
Comments Accepted at PAKDD 2026
Comments 13 pages
Comments TMLR camera-ready version
Comments 22 pages, long conference paper
Comments Accepted at the 39th International Conference on Advanced Information Networking and Applications (AINA 2025)
Journal ref Lecture Notes in Networks and Systems, vol 1210, pp. 222-233, 2025
Comments 8 pages, 5 figures
Comments 20 pages, 20 figures
Comments 8 pages, 3 figures
Comments Project website: https://albertchen98.github.io/DwD-project/