The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

This repository hosts ImageCritic, a reference-guided post-editing approach designed to correct inconsistencies in generated images. It aims to solve the inconsistency problem in generated images by applying attention alignment and a detail encoder, providing significant improvements over existing methods in various customized generation scenarios.

The model was presented in the paper The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment.

πŸ–ΌοΈ Visual Results

ImageCritic can effectively resolve detail-related issues in various customized generation scenarios, providing significant improvements over existing methods.

Online HuggingFace Demo

You can try ImageCritic demo on HuggingFace.

Citation

If you find this project useful for your research, please consider citing our paper:

@article{ouyang2025consistency,
  title={The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment},
  author={Ouyang, Ziheng and Song, Yiren and Liu, Yaoli and Zhu, Shihao and Hou, Qibin and Cheng, Ming-Ming and Shou, Mike Zheng},
  journal={arXiv preprint arXiv:2511.20614},
  year={2025}
}

πŸ“§ Contact

If you have any comments or questions, please open a new issue or contact Ziheng Ouyang

License

Licensed under a Creative Commons Attribution-NonCommercial 4.0 International for Non-commercial use only. Any commercial use should get formal permission first.

Downloads last month
485
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Space using ziheng1234/ImageCritic 1