Media Summary: This semester research project was the final outcome of the CS8803 Computer Vision and Language class at Georgia Tech, ... Learn all the ways Microsoft is a part of CVPR 2020: Authors: Sai Raam Venkataraman; Rishi Sridhar Rao; S. Balasubramanian; R. Raghunatha Sarma; Chandra Sekhar Vorugunti ...

Modeling Compositionality In Vqa - Detailed Analysis & Overview

This semester research project was the final outcome of the CS8803 Computer Vision and Language class at Georgia Tech, ... Learn all the ways Microsoft is a part of CVPR 2020: Authors: Sai Raam Venkataraman; Rishi Sridhar Rao; S. Balasubramanian; R. Raghunatha Sarma; Chandra Sekhar Vorugunti ... Mikyas Desta, Larry Chen, Tomasz Kornuta Visual Question Answering is a novel problem domain where multi-modal inputs must ... In this episode we discuss the highlight paper: Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual ... Authors: Peixi Xiong, Ying Wu Description: There are two main challenges in Visual Question Answering (

Photo Gallery

Modeling Compositionality in VQA
Towards Compositionality in VQA - CS8803 CVL research project
Dr. Richard Socher: Recursive Deep Learning for Modeling Semantic Compositionality
Towards Compositionality in Visual Question Answering Systems
SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions
Can You Even Tell Left From Right? Presenting a New Challenge for VQA
WACV18: Object-based reasoning in VQA
A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021
Exploring Weaknesses of VQA Models through Attribution Driven Insights
MERCON2022 VQA
MICCAI2022 Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer
CVPR 2023 - Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning
View Detailed Profile
Modeling Compositionality in VQA

Modeling Compositionality in VQA

Modeling Compositionality in VQA

Towards Compositionality in VQA - CS8803 CVL research project

Towards Compositionality in VQA - CS8803 CVL research project

This semester research project was the final outcome of the CS8803 Computer Vision and Language class at Georgia Tech, ...

Dr. Richard Socher: Recursive Deep Learning for Modeling Semantic Compositionality

Dr. Richard Socher: Recursive Deep Learning for Modeling Semantic Compositionality

Recursive Deep Learning for

Towards Compositionality in Visual Question Answering Systems

Towards Compositionality in Visual Question Answering Systems

The project is aimed at making the

SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions

SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions

Learn all the ways Microsoft is a part of CVPR 2020: https://www.microsoft.com/en-us/research/event/cvpr-2020/

Can You Even Tell Left From Right? Presenting a New Challenge for VQA

Can You Even Tell Left From Right? Presenting a New Challenge for VQA

Authors: Sai Raam Venkataraman; Rishi Sridhar Rao; S. Balasubramanian; R. Raghunatha Sarma; Chandra Sekhar Vorugunti ...

WACV18: Object-based reasoning in VQA

WACV18: Object-based reasoning in VQA

Mikyas Desta, Larry Chen, Tomasz Kornuta Visual Question Answering is a novel problem domain where multi-modal inputs must ...

A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021

A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021

"A Transformer-based Cross-modal Fusion

Exploring Weaknesses of VQA Models through Attribution Driven Insights

Exploring Weaknesses of VQA Models through Attribution Driven Insights

"Exploring Weaknesses of

MERCON2022 VQA

MERCON2022 VQA

Visual Question Answering (

MICCAI2022 Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer

MICCAI2022 Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer

arxiv: https://arxiv.org/abs/2206.11053 GitHub: https://github.com/lalithjets/Surgical_VQA.

CVPR 2023 - Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning

CVPR 2023 - Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning

In this episode we discuss the highlight paper: Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual ...

TA-Student VQA: Multi-Agents Training by Self-Questioning

TA-Student VQA: Multi-Agents Training by Self-Questioning

Authors: Peixi Xiong, Ying Wu Description: There are two main challenges in Visual Question Answering (