Stop words, which are commonly used words that do not carry significant meaning, were then removed from the tokenized text columns. This step helped in reducing noise and improving the quality of the data. Lemmatization was performed using the Stanza library [28], which reduces terms to their ...