Learning to detect violent videos using convolutional long short-term memory