When the process is terminated fortfa.text.crf.crf_log_likelihood, it looks like the gradient calculation throughtf.scanincrf_forwardis stuck, if we addback_prop=False, it manage to do the tracing, but it is a part of the loss function, so we need to backdrop the gradient. ...