整理:AI算法与图像处理
CVPR2022论文和代码整理:https://github.com/DWCTOD/CVPR2022-Papers-with-Code-Demo
ECCV2022论文和代码整理:https://github.com/DWCTOD/ECCV2022-Papers-with-Code-Demo
最新成果demo展示:
标题:
eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
论文:https://arxiv.org/pdf/2211.01324.pdf
主页:https://deepimagination.cc/eDiffi/
官方表述:Compared to the open-source text to image methods (Stable diffusion) and (DALL-E2), our model conistently leads to improved synthesis quality.
我们提出了eDiffi,一种用于合成给定文本的图像的扩散模型。基于扩散模型在不同采样阶段的行为不同的经验观察,我们建议训练一组专家去噪网络,每个网络专门针对特定的噪声区间。我们的模型以T5文本嵌入、CLIP图像嵌入和CLIP文本嵌入为条件。我们的方法可以生成与任何输入文本提示相对应的照片级真实感图像。除了文本到图像合成,我们还提供了两个额外的功能-(1)样式转换,它使我们能够使用参考样式图像控制生成的样本的样式;(2)“用文字绘制”-一个用户可以通过在画布上绘制分割图来生成图像的应用程序,这对于在脑海中绘制所需图像非常方便。
最新论文整理
ECCV2022
Updated on : 8 Nov 2022
total number : 1
Contrastive Learning for Diverse Disentangled Foreground Generation
- 论文/Paper: http://arxiv.org/pdf/2211.02707
- 代码/Code: None
CVPR2022
NeurIPS
Updated on : 8 Nov 2022
total number : 6
TAP-Vid: A Benchmark for Tracking Any Point in a Video
- 论文/Paper: http://arxiv.org/pdf/2211.03726
- 代码/Code: None
CASA: Category-agnostic Skeletal Animal Reconstruction
- 论文/Paper: http://arxiv.org/pdf/2211.03568
- 代码/Code: None
UATTA-ENS: Uncertainty Aware Test Time Augmented Ensemble for PIRC Diabetic Retinopathy Detection
- 论文/Paper: http://arxiv.org/pdf/2211.03148
- 代码/Code: https://github.com/ptnv-s/uatta-ens
Distilling Representations from GAN Generator via Squeeze and Span
- 论文/Paper: http://arxiv.org/pdf/2211.03000
- 代码/Code: https://github.com/yangyu12/squeeze-and-span.
Momentum-based Weight Interpolation of Strong Zero-Shot Models for Continual Learning
- 论文/Paper: http://arxiv.org/pdf/2211.03186
- 代码/Code: None
Deep Learning for Rapid Landslide Detection using Synthetic Aperture Radar (SAR) Datacubes
- 论文/Paper: http://arxiv.org/pdf/2211.02869
- 代码/Code: https://github.com/iprapas/landslide-sar-unet