Qualitative scene descriptions from images for integrated speech and image understanding
Gudrun Socher