Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Lee, Seung Hyun, Jiang, Jijun, Xu, Yiran, Li, Zhuofang, Ke, Junjie, Li, Yinxiao, He, Junfeng, Hickson, Steven, Datsenko, Katie, Kim, Sangpil, Yang, Ming-Hsuan, Essa, Irfan, Yang, Feng
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (10.06.2025)
Conference Proceeding
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Lee, Seung Hyun, Jiang, Jijun, Xu, Yiran, Li, Zhuofang, Ke, Junjie, Li, Yinxiao, He, Junfeng, Hickson, Steven, Datsenko, Katie, Kim, Sangpil, Yang, Ming-Hsuan, Essa, Irfan, Yang, Feng
Year of Publication 14.08.2024
Journal Article
Standardization of Neuromuscular Reflex Analysis -- Role of Fine-Tuned Vision-Language Model Consortium and OpenAI gpt-oss Reasoning LLM Enabled Decision Support System
Bandara, Eranga, Gore, Ross, Shetty, Sachin, Mukkamala, Ravi, Rhea, Christopher, Yarlagadda, Atmaram, Kaushik, Shaifali, De Silva, L. H. M. P, Maznychenko, Andriy, Sokolowska, Inna, Hass, Amin, De Zoysa, Kasun
Year of Publication 17.08.2025
Journal Article
THRONE: An Object-Based Hallucination Benchmark for the Free-Form Generations of Large Vision-Language Models
Kaul, Prannay, Li, Zhizhong, Yang, Hao, Dukler, Yonatan, Swaminathan, Ashwin, Taylor, C. J., Soatto, Stefano
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)
Conference Proceeding
VISUAL INSPECTION METHOD AND RATIONALE-GENERATIVE ESTIMATION METHOD USING LARGE VISION-LANGUAGE MODEL
KATO Kunihito, YOSHIDA Haruto, NAKATSUKA Shunsuke, YAMADA Yusei, HAYASHI Yoshikazu, AIZAWA Hiroaki, UENO Shiryu, TAKI Yukiya, OSHITA Takumi, TERADA Kazunori
Year of Publication 05.06.2025
Patent
Alifuse: Aligning and Fusing Multimodal Medical Data for Computer-Aided Diagnosis
Chen, Qiuhui, Hong, Yi
Published in Proceedings (IEEE International Conference on Bioinformatics and Biomedicine) (03.12.2024)
Conference Proceeding
A novel approach with vision-language models for custom e-commerce product listings
Huynh Ngoc Nhu, Y, Nguyen, Quoc-Dung, Kingkan, Cherdsak
Published in Multimedia tools and applications (30.04.2025)
Journal Article
SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples
Howard, Phillip, Madasu, Avinash, Le, Tiep, Moreno, Gustavo Lujan, Bhiwandiwalla, Anahita, Lal, Vasudev
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)
Conference Proceeding
Vision Language Models are Biased
Vo, An, Nguyen, Khai-Nguyen, Taesiri, Mohammad Reza, Dang, Vy Tuong, Nguyen, Anh Totti, Kim, Daeyoung
Year of Publication 29.05.2025
Journal Article
Low-Rank Few-Shot Adaptation of Vision-Language Models
Zanella, Maxime, Ayed, Ismail Ben
Published in IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops (17.06.2024)
Conference Proceeding