1. We found this model to work poorly when we trained it on standard human-annotated parsing datasets (1M tokens), so we constructed an artifi cial dataset by labelling a large corpus with the BerkeleyParser. Equal contribution 1 arXiv:1412.7449v3 cs.CL 9 Jun 2015 . Go LSTM1 in LS 10...