Code Monkey home page Code Monkey logo

adversesceneunderstanding_paper_reading's Introduction

场景理解 域适应

This repo will keep updating ... 🤗

UDA

训练方法

  • (ISBI 2024) "When Visual Prompt Tuning Meets Source-Free Domain Adaptive Semantic Segmentation" [PDF] [code]

  • (ISBI 2024)"ConvLoRA and AdaBN Based Domain Adaptation via Self-Training" [PDF] [code]

  • (AAAI 2024)"Parsing All Adverse Scenes Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency" [PDF] [code]

  • (ICCV 2023)"CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation" [PDF] [code]

    • P1: Conventional cameras fail to capture structural details and boundary information in low-light conditions. -> Event cameras -> unsupervised Cross-Modality Domain Adaptation (CMDA) framework to leverage multi-modality (Images and Events) information for nighttime semantic segmentation (数据不够,引入了额外的event信息,有点像先验)
  • (ICCV 2023)"Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation" [PDF] [code]

    • 利用参考图像的信息缩短域差异,进行信息交换和对比学习
  • (ICCV 2023 Oral)"Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation"[PDF] [code]

    • existing works rely heavily on domain knowledge derived from the task specific nighttime dataset. (训练数据依赖特定任务的数据集)-> zero-shot
    • image-level methods simply consider synthetic nighttime as pseudo-labeled data and overlook model-level feature extraction;model-level methods focus on adjusting model architecture but neglect image-level nighttime characteristics.(现有模型没有统筹考虑图像层面和模型层面的特点)-> 图像方面迁移后和原图相似性最小化,模型层面域适应最大化图像相似性
  • (CVPR 2023)"MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation"[pdf] [code]

    P1:Most previous UDA methods struggle with classes that have a similar visual appearance on the target domain as no ground truth is available to learn the slight appearance differences.(大多数以前的UDA方法都很难处理在目标域上具有相似视觉外观的类,因为没有基本事实可以用来学习轻微的外观差异。比如road/sidewalk,pedestrian/rider)->To address this problem, we propose to enhance UDA with spatial context relations as additional clues for robust visual recognition

  • (CVPR 2023)"DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" [PDF]

  • (CVPR 2023)"FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding" [PDF] [code]

  • (CVPR 2023)"DA-DETR: Domain Adaptive Detection Transformer with Information Fusion"[PDF]

  • (AAAI 2023 oral)"VBLC: Visibility Boosting and Logit-Constraint Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions" [PDF] [code]

    • P1: However, previous methods often reckon on additional reference images of the same scenes taken from normal conditions, which are quite tough to collect in reality. (训练数据的获取方式依赖参考)

    • P2:Furthermore, most of them mainly focus on individual adverse condition such as nighttime or foggy, weakening the model versatility when encountering other adverse weathers. (训练数据的主要关注夜晚和雾天,忽视其他恶劣场景)

    • P3: Second, for the self-training schemes prevailing in UDA, we observe the insufficient exploitation of predictions on unlabeled target samples for fear of overconfidence. (自训练方式过于自信,导致目标域样本利用不足)

  • (NIPS 2022)"Unsupervised Domain Adaptation for Semantic Segmentation using Depth Distribution"[PDF] [code]

恶劣天气风格特征

  • (ICCV 2023) "PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization"[PDF]

  • (arxiv 2023.5) "Condition-Invariant Semantic Segmentation" [PDF] [code]

  • (ICCV 2023) "PODA: Prompt-driven Zero-shot Domain Adaptation"[PDF] [code]

    • New task:‘Prompt-driven Zero-shot Domain Adaptation’, where we adapt a model trained on a source domain using only a general description in natural language of the target domain
  • (CVPR 2022) "FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation" [PDF] [code]

伪标签

  • (CVPR 2023) "Learning Pseudo-Relations for Cross-domain Semantic Segmentation" [PDF] [code]

  • (ICCV Workshop 2023) "SegDA: Maximum Separable Segment Mask with Pseudo Labels for Domain Adaptive Semantic Segmentation". [PDF] [code]

  • (CVPR 2023) "Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation with Implicit Neural Representations" [PDF] [code]

  • (CVPR 2023) "UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration" [PDF] [code]

  • (CVPR 2023) "Guiding Pseudo-labels with Uncertainty Estimation for Source-free Unsupervised Domain Adaptation"[PDF] [code]

  • (WACV 2023) "Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse Conditions" [PDF] [code]
    Ranks #1 on both the ACDC leaderboard—72.05 mIoU—and the Dark Zurich leaderboard—63.91 mIoU.

    P1:A critical issue in this procedure is the error propagation of noisy labels, leading to a drift in pseudo-labels if unmitigated.

chain-of-domain

  • (Arxiv 202403) "CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning"[PDF] [code]

数据生成

  • (Arxiv 202403) "ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer" [PDF]

  • (Arxiv 202402) "ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation"[PDF]

One-shot UDA

  • (CVPR 2023) "Informative Data Mining for One-shot Cross-Domain Semantic Segmentation" [PDF] [code]

Online UDA

  • (CVPR 2024) "Memory-based Adapters for Online 3D Scene Perception" [PDF] [code]

  • (ICCV 2023) "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation" [PDF] [code]

  • (ECCV 2022) "Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions"[code]

  • (CVPR 2022) "On the Road to Online Adaptation for Semantic Image Segmentation" [PDF]

Test Time DA

  • (AAAI 2024) "Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction" [PDF] [code]

  • (CVPR 2024)"Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation" [PDF]

  • (arxiv 2404)"Test-Time Model Adaptation with Only Forward Passes" [PDF]

  • (arxiv 2402)"BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation" [PDF] [code]

  • (CVPR 2023)"Robust Test-Time Adaptation in Dynamic Scenarios" [PDF] [code]

  • (CVPR 2023)"Dynamically Instance-Guided Adaptation: A Backward-free Approach for Test-Time Domain Adaptive Semantic Segmentation" [PDF] [code]

Domain Generalization

  • (CVPR 2024) "A Simple Recipe for Language-guided Domain Generalized Segmentation"[PDF] [code]

  • (CVPR 2024) "Collaborating Foundation models for Domain Generalized Semantic Segmentation"[PDF] [code]

  • (CVPR 2024) "Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation"[PDF] [code]

  • (Arxiv 202312) "DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control"[PDF] [code]

  • (CVPR 2024)"Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning" [PDF] [code]

  • (CVPR 2023) "CLIP the Gap: A Single Domain Generalization Approach for Object Detection" [PDF] [code]

  • (NIPS 2022)"Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation" [PDF]

Anomaly Detection

  • (ICCV 2023 oral)"Unmasking Anomalies in Road-Scene Segmentation" [PDF] [code]

  • (ICCV 2023)"RbA: Segmenting Unknown Regions Rejected by All" [PDF] [code]

  • (NIPS 2023)"ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" [PDF] [code]

其他相关

  • (CVPR 2023)"StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning" [PDF] [code]

  • (ICCV 2023 Oral) "Iterative Prompt Learning for Unsupervised Backlit Image Enhancement" [PDF] [code]

  • (Arxiv 2023) "MoWE: Mixture of Weather Experts for Multiple Adverse Weather Removal". [PDF]

  • (CVPR 2023) "Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions" [PDF]

Dataset

  • FoggyCityscape
  • FoggyDriving
  • FoggyZurich
  • ACDC
  • BDD100K
  • DarkZurich
  • Nighttime Driving
  • ACG (Adverse-Condition Generalization):o 121 fog, 225 rain, 276 snow, and 300 night images [link]
  • DSEC Night_Semantic: 5 nighttime sequences of Zurich City 09a-e, and includes 1,692 training samples and 150 testing samples. [link]
  • MUAD Dataset a synthetic dataset for autonomous driving with multiple uncertainty types and tasks. It contains 10413 in total: 3420 images in the train set, 492 in the validation set and 6501 in the test set. The test set is divided as follows: 551 in the normal set, 102 in the normal set no shadow, 1668 in the OOD set, 605 in the low adversity set and 602 images in the high adversity set 1552 in the low adversity with OOD set and 1421 images in the high adversity with OOD set. All of these sets cover day and night conditions, with 2/3 being day images and 1/3 night images. Test datasets address diverse weather conditions (rain, snow, and fog with two different intensity levels) and multiple OOD objects. [link1] [link2]

video dataset

KITTI

SHIFT

Rainy-foggy cityscape

adversesceneunderstanding_paper_reading's People

Contributors

cameltr avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.