Baum, Leonard, “An Inequality and Associated Maximization Technique in Statistical Estimation for Probabilistic Functions of Markov Processes”, 1972, Inequalities 3:1-8. Bikel et al., “An Algorithm that Learns What's in a Name,” 1999, Machine Learning Journal Special Issue on Natural Language...