2004; García & Fishman,2011; Yu,2013; Eberhard et al.,2022).Footnote1It is commonly used in colloquial scenarios (e.g., daily conversation and social media) but also in formal and written contexts, such as in the Legislative Council of the Hong Kong Special Administrative...
stemming, lemmatization etc. Stop words such as ‘in’, ‘the’, ‘and’ etc. are removed as they don’t contribute to any meaningful interpretation and their frequency is also high which may affect the computation time