Try out Step-Audio-EditX
Chat with a powerful language model
Generate detailed images from text prompts