You may want to avoid using features that are not available at prediction time.")
+ new_string = f"There are {len(leakage_cols)} columns with data leakage. Double check whether you should use this variable."
+ dq_df1.loc[bad_col,new_col] += dq_df1.loc[bad_col,'first_comma'] ...