About 365,000 results
Open links in new tab
  1. Scene Graph Generation - Papers With Code

    We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. In this paper, we present a method that improves scene graph generation by explicitly modeling inter-dependency among the entire object instances.

  2. ChocoWu/Awesome-Scene-Graph-Generation - GitHub

    Spatio-Temporal (Video) Scene Graph Generation, a.k.a, dynamic scene graph generation, aims to provide a detailed and structured interpretation of the whole scene by parsing an event into a sequence of interactions between different visual entities.

  3. Scene Graph Generation with Role-Playing Large Language Models

    Oct 20, 2024 · Current approaches for open-vocabulary scene graph generation (OVSGG) use vision-language models such as CLIP and follow a standard zero-shot pipeline -- computing similarity between the query image and the text embeddings …

  4. From Pixels to Graphs: Open-Vocabulary Scene Graph Generation

    Apr 1, 2024 · Our framework leverages vision-language pre-trained models (VLM) by incorporating an image-to-graph generation paradigm. Specifically, we generate scene graph sequences via image-to-text generation with VLM and …

  5. Generate Any Scene

    We introduce Generate Any Scene, a groundbreaking framework that systematically enumerates scene graphs representing a vast array of visual scenes, spanning realistic to imaginative compositions.

  6. [1811.09543] An Interpretable Model for Scene Graph Generation

    Nov 21, 2018 · We propose an efficient and interpretable scene graph generator. We consider three types of features: visual, spatial and semantic, and we use a late fusion strategy such that each feature's contribution can be explicitly investigated.

  7. Mutual introspective distillation for unbiased scene graph generation ...

    4 days ago · Existing scene graph generation methods primarily focus on addressing the long-tail problem in the labeling. However, most debiasing approaches struggle with a trade-off between head and tail class performance compared to biased-trained models. In this paper, we propose a balanced fusion strategy to leverage the strengths of both model types. From the perspective of causality, we proposed a ...

  8. BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation

    6 days ago · Specifically, we introduce an end-to-end scene graph generation model, the Bidirectional Conditioning Transformer (BCTR), to implement this factorization. BCTR consists of two key modules. First, the Bidirectional Conditioning Generator (BCG) performs multi-stage interactive feature augmentation between entities and predicates, enabling mutual ...

  9. DDS: Decoupled Dynamic Scene-Graph Generation Network

    Scene-graph generation involves creating a structural representation of the relationships between objects in a scene by predicting subject-object-relation triplets from input data. Existing methods show poor performance in detecting triplets outside of a predefined set, primarily due to their reliance on dependent feature learning. To address this …

  10. Scene Graph Generation: A comprehensive survey

    Jan 21, 2024 · Scene Graph Generation (SGG) refers to the task of automatically mapping an image or a video into a semantic structural scene graph, which requires the correct labeling of detected objects and their relationships. In this paper, a comprehensive survey of recent achievements is provided.

Refresh