GitHub - slewyh/vqa: This is a CS6240 project on visual question answering.

Generative Question Answering for Image and Video QA.

GQA, a generative method proposed by Lewis et al., is shown to perform well for image and text data. Our paper aims to extend GQA in 2 ways:

1, Incorporate a generative answer model in for Image-based GQA in order to expand the choices of candidate answers for a question by introducing a new seq2seq model for the answer generator that takes the image and some ‘weak representation’ of the question as inputs.

To view the sub-module and its contents, run:

cd  ImageQA
ls

See README.md for instructions on how to run the model scripts.

2, Apply GQA to video QA on questions that requires spatial and temporal grounding to obtain a relevant answer. The model is evaluated on TVQAplus dataset.

cd  TVQAplus
ls

See README.md for instructions on how to run the model scripts.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
ImageQA		ImageQA
TVQAplus		TVQAplus
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Generative Question Answering for Image and Video QA.

About

Releases

Packages

Contributors 2

Languages

slewyh/vqa

Folders and files

Latest commit

History

Repository files navigation

Generative Question Answering for Image and Video QA.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages