UniversalImageRestoration | 多任务图像修复

Controlling Vision-Language Models for Universal Image Restoration

daclip

daclip

为了在混合降解数据集上训练 DA-CLIP，我们使用引导式视觉语言框架 BLIP 为所有 HQ 图像生成合成字幕。由于输入是干净的，因此假定生成的字幕是准确和高质量的。然后，我们可以直接将这些干净的标题、LQ 图像和相应的降解类型结合起来，构建图像-文本-降解对。

img

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

BLIP是引导语言图像预训练，实现统一的视觉语言理解和生成

image-20231012120149853

https://github.com/salesforce/LAVIS BLIP集成到了LAVIS里面
- Image Description Generation 功能可以构造数据

img

image-20231012120629567

代码语言：javascript复制

    # !pip3 install transformers==4.15.0 timm==0.4.12 fairscale==0.4.4
    !pip3 install transformers timm==0.4.12 fairscale==0.4.4

image-20231012122457536

image-20231012135330101

daclip

image-20231012115849651

0 人点赞