Multilingual part-of-speech tagging with bidirectional long short-term memory models and auxiliary loss