This repository contains the code for the paper: "Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models".
Please follow the instructions in [Grounded Segment Anything](https://github.com/IDEA-Research/Grounded-Segment-Anything) to set up the environment.
The pipeline consists of two steps:
- Building VPrompt: marking the key objects referenced by the question on the image as visual prompts (a hedged sketch follows this list).
- Using TPrompt to prompt Multimodal Large Language Models with the marked image for generating answers (see the second sketch below).
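
As a rough illustration of the VPrompt step, the sketch below draws numbered, labeled boxes on an image with Pillow. It assumes detections (boxes and phrases) have already been produced by a detector such as Grounded Segment Anything; the `build_vprompt` helper, the box format, and the example values are hypothetical, not this repository's actual interface.

```python
# Hypothetical sketch of the VPrompt step: overlay detector output on the image
# so the MLLM can refer to the marked objects. Box format and values are made up.
from PIL import Image, ImageDraw


def build_vprompt(image_path, detections, out_path):
    """Draw a numbered box for each detection: (x0, y0, x1, y1, phrase)."""
    image = Image.open(image_path).convert("RGB")
    draw = ImageDraw.Draw(image)
    for idx, (x0, y0, x1, y1, phrase) in enumerate(detections, start=1):
        draw.rectangle([x0, y0, x1, y1], outline="red", width=3)
        # Tag each box with an index + phrase so the text prompt can reference it.
        draw.text((x0 + 4, y0 + 4), f"{idx}: {phrase}", fill="red")
    image.save(out_path)
    return out_path


if __name__ == "__main__":
    # Example detections, e.g. from Grounding DINO (coordinates are illustrative).
    dets = [(40, 60, 220, 300, "person"), (250, 120, 400, 280, "dog")]
    build_vprompt("example.jpg", dets, "example_vprompt.jpg")
```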
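
And a minimal sketch of the TPrompt step, assuming the marked image from above is sent to an OpenAI-style MLLM endpoint. The model name, prompt wording, and function name are assumptions for illustration, not the paper's actual TPrompt or configuration; the call requires an `OPENAI_API_KEY` in the environment.

```python
# Hypothetical sketch of the TPrompt step: send the marked image together with
# a text prompt that refers to the visual marks. Prompt wording is illustrative.
import base64

from openai import OpenAI


def ask_with_tprompt(image_path, question, model="gpt-4o"):
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode("utf-8")
    # Assumed prompt template; the paper's actual TPrompt differs.
    tprompt = (
        "Key objects in the image are marked with numbered red boxes. "
        f"Using those marks, answer: {question}"
    )
    client = OpenAI()
    resp = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": tprompt},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
    )
    return resp.choices[0].message.content


if __name__ == "__main__":
    print(ask_with_tprompt("example_vprompt.jpg", "What is the dog doing?"))
```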
We are currently organizing detailed evaluation code and usage tutorials. Please stay tuned for updates!