Detailed info

PromptMix: Textto-image diffusion models enhance the performance of lightweight networks

Authors	Arian Bakhtiarnia, Qi Zhang, and Alexandros Iosifidis
Title	PromptMix: Textto-image diffusion models enhance the performance of lightweight networks
Abstract	Many deep learning tasks require annotations that are too time consuming for human operators, resulting in small dataset sizes. This is especially true for dense regression problems such as crowd counting which requires the location of every person in the image to be annotated. Techniques such as data augmentation and synthetic data generation based on simulations can help in such cases. In this paper, we introduce PromptMix, a method for artificially boosting the size of existing datasets, that can be used to improve the performance of lightweight networks. First, synthetic images are generated in an end-to-end data-driven manner, where text prompts are extracted from existing datasets via an image captioning deep network, and subsequently introduced to text-to-image diffusion models. The generated images are then annotated using one or more high-performing deep networks, and mixed with the real dataset for training the lightweight network. By extensive experiments on five datasets and two tasks, we show that PromptMix can significantly increase the performance of lightweight networks by up to 26%.
ISBN	TBA
Conference	International Joint Conference on Neural Networks (IJCNN 2023)
Date	June 18-23
Location	Queensland, Australia
Year of Publication	2023
Url	https://zenodo.org/record/8086752
DOI

Key Facts

Project Coordinator: Dr. Sotiris Ioannidis
Institution: Foundation for Research and Technology Hellas (FORTH)
E-mail: marvel-info@marvel-project.eu
Start: 01.01.2021
Duration: 36 months
Participating Organisations: 17
Number of countries: 12

Get Connected

Funding

This project has received funding from the European Union’s Horizon 2020 Research and Innovation program under grant agreement No 957337. The website reflects only the view of the author(s) and the Commission is not responsible for any use that may be made of the information it contains.

Detailed info

PromptMix: Textto-image diffusion models enhance the performance of lightweight networks

Key Facts

Get Connected

Menu

Funding