Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat