jingyi0000/VLM_survey

Vision-Language Models for Vision Tasks: A Survey

computer-visiondeep-learningsurveytransfer-learningclipknowledge-distillationvision-language-model
This is stars and forks stats for /jingyi0000/VLM_survey repository. As of 12 May, 2024 this repository has 1287 stars and 138 forks.

Vision Language Models for Vision Tasks: A Survey This is the repository of Vision Language Models for Vision Tasks: a Survey, a systematic survey of VLM studies in various visual recognition tasks including image classification, object detection, semantic segmentation, etc. For details, please refer to: Vision-Language Models for Vision Tasks: A Survey [Paper] Feel free to contact us or pull requests if you find any related papers that are not included here. News Last update on 2023/10/2 VLM Pre-training...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
rese1f/StableVideoPython1.1k0640
MrNeRF/gaussian-splatting-cudaCCudaC++4290280
roboflow/inferencePythonOther5990180
Nixtla/nixtlaJupyter NotebookPythonOther668+2452+2
QwenLM/Qwen-VLPythonShell1.5k0970
hybridgroup/gocvGoC++C5.8k+13851+1
luban-agi/Awesome-AIGC-Tutorials1.9k01030
mathworks/MATLAB-Simulink-Challenge-Project-HubMATLAB839+10210+2
xuebinqin/U-2-NetPython7.5k01.3k0
camel-ai/camelPythonOther3.3k04000