AI needs a lot of human feedback. For example, LLMs train using a process calledreinforcement learning from human feedbackwhere people fine tune models by repeatedly ranking outputs from best to worst. A May 2023paperalso describes the phenomenon ofmodel collapse, which states that LLMs malfunct...