From descriptions to images: what reasoning in between?