ALEAHallu

This repository is the official implementation of ALEAHallu, the method proposed in paper "Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs".

Accepted by AAAI'25

Requirements

conda create -n aleahallu python=3.7
conda activate aleahallu
conda install pytorch==1.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install -r requirements.txt

Model details

LLaVA is an open-source chatbot trained by fine-tuning LLM on multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture. Base LLM: liuhaotian/llava-v1.5-7b

Training dataset

COCO val2017
POPE benchmark

Train ALEAHallu

python chair_eval.py --model llava-1.5 --data_path /images --gpu-id 3 --beam 2 --scale_factor 50 --threshold 15 --num_attn_candidates 5 --penalty_weights 1

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
dataset/train		dataset/train
eval_configs		eval_configs
minigpt4		minigpt4
pope_coco		pope_coco
transformers-4.29.2		transformers-4.29.2
.gitignore		.gitignore
README.md		README.md
chair_eval.py		chair_eval.py
pope_eval.py		pope_eval.py
requirments.txt		requirments.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ALEAHallu

Requirements

Model details

Training dataset

Train ALEAHallu

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ALEAHallu

Requirements

Model details

Training dataset

Train ALEAHallu

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages