📃 Computer Vision Papers of the week: A Brand New Program to Dig Into the Field of Computer Vision
| Week | Papers | Paper | Code |
|---|---|---|---|
| Week 1 : 24 Oct 2022 to 28 Oct 2022 - Linkedin Post | |||
| 1 | MetaFormer Baselines for Vision | ||
| 2 | Monocular Dynamic View Synthesis: A Reality Check | ||
| 3 | Gallery Filter Network for Person Search | ||
| 4 | Weakly-Supervised Temporal Article Grounding(DUAL-MIL) | ||
| 5 | A Task-aware Dual Similarity Network for Fine-grained Few-shot Learning | ||
| 6 | Rethinking Learning Approaches for Long Term Action Anticipation | ||
| 7 | Human Behavior Animation: Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings | ||
| Week 2 : 24 Oct 2022 to 28 Oct 2022 - Linkedin Post | |||
| 8 | DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models | ||
| 9 | High Fidelity Neural Audio Compression | ||
| 10 | DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | ||
| 11 | SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware Feature | ||
| 12 | NeRFPlayer : A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields | ||
| 13 | Imagic: Text-Based Real Image Editing with Diffusion Models | ||
| Week3: 1 Nov 2022 to 12 Nov 2022 - Linkedin Post | |||
| 14 | OneFormer: One Transformer to Rule Universal Image Segmentation | ||
| 15 | Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training | ||
| 16 | Unifying Flow, Stereo and Depth Estimation | ||
| 17 | InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | ||
| 18 | Dungeons and Data: A Large-Scale NetHack Dataset | ||
| 19 | Probabilistic Deep Metric Learning for Hyperspectral Image Classification | ||