[IJCAI 2025] Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning (Trident)

🧑‍💻 Authors: Xudong Yan, Songhe Feng, Yang Zhang, Jian Yang, Yueguan Lin, Haojun Fei
🏛️ Affiliations: Beijing Jiaotong University, Qifu Technology
🔍 More details: [arXiv version] | [IJCAI version] | [code]

Note: The supplementary material is provided in the paper's arXiv version.

📝 Overview

TL;DR: We employ both LLM and MLLM to guide attribute-object disentanglement by generating auxiliary attributes and representing primitive words for CZSL, respectively.

⚙️ Setup

Our work is implemented in PyTorch framework. Create a conda environment trident using:

conda create --name trident python=3.8.0
conda activate trident
pip install -r requirements.txt

⬇️ Download

Datasets: In our work, we conduct experiments on three datasets: MIT-States, C-GQA, and VAW-CZSL. For VAW-CZSL, you can download this dataset from this website. For MIT-States and C-GQA, please using:

bash utils/download_data.sh

Pre-trained models: ViT-Large-Patch14-336px (the backbone) can be downloaded here. LLaVA-v1.5-7b can be found here.

🏋️ Training

Before training Trident, please obtain the auxiliary attributes by GPT-3.5 through OpenAI official API, and get the last hidden states of LLaVA v1.5 offline， which can be found in utils folder.
Train Trident model with a specified configure file using:
```
python train.py --cfg config/{DATASET_NAME}.yml
```

📊 Evaluation

Evaluate Trident model using:

python test.py --cfg config/{DATASET_NAME}.yml --load TRIDENT_MODEL.pth

📚 Citation

If you find our work helpful, please cite our paper:

@inproceedings{Yan_2025_IJCAI,
  title     = {Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning},
  author    = {Yan, Xudong and Feng, Songhe and Zhang, Yang and Yang, Jian and Lin, Yueguan and Fei, Haojun},
  booktitle = {Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, {IJCAI-25}},
  pages     = {2179--2187},
  year      = {2025},
}

or

@inproceedings{Yan_2025_IJCAI,
   title={Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning},
   author={Yan, Xudong and Feng, Songhe and Zhang, Yang and Yang, Jian and Lin, Yueguan and Fei, Haojun},
   journal={arXiv preprint arXiv:2411.12584},
   year={2024}
}

🙏 Acknowledgement

Thanks for the publicly available code of OADis and LLaVA.

📬 Contact

If you have any questions or are interested in collaboration, please feel free to contact me at [email protected] / [email protected] .

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
config		config
images		images
models		models
utils		utils
LICENSE		LICENSE
README.md		README.md
dataset.py		dataset.py
evaluator_ge.py		evaluator_ge.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[IJCAI 2025] Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning (Trident)

📝 Overview

⚙️ Setup

⬇️ Download

🏋️ Training

📊 Evaluation

📚 Citation

🙏 Acknowledgement

📬 Contact

About

Uh oh!

Releases

Packages

Languages

License

xud-yan/Trident

Folders and files

Latest commit

History

Repository files navigation

[IJCAI 2025] Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning (Trident)

📝 Overview

⚙️ Setup

⬇️ Download

🏋️ Training

📊 Evaluation

📚 Citation

🙏 Acknowledgement

📬 Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages