Dear maintainers,
I would like to suggest adding our work "DIFFA: Large Language Diffusion Models Can Listen and Understand" to the Awesome-Audio-LLM repository.
Our paper introduces DIFFA, which explores the capabilities of large language diffusion models in listening to and understanding audio. The relevant information is as follows:
We believe this work contributes to the field of audio-language models and would be a valuable addition to the collection.
Thank you for your consideration!