VisualSiteDiary: A detector-free Vision-Language Transformer model for captioning photologs for daily construction reporting and image retrievals
Author(s): | Yoonhwa Jung, Ikhyun Cho, Shun-Hsiang Hsu, Mani Golparvar-Fard |
---|---|
Medium: | journal article |
Language(s): | English |
Published in: | Automation in Construction, September 2024, v. 165 |
Page(s): | 105483 |
DOI: | 10.1016/j.autcon.2024.105483 |
- About this data sheet
- Reference-ID: 10786021
- Published on: 20/06/2024
- Last updated on: 20/06/2024