Hi, congratulations on the article! I'm thinking: if someone really wants to use get_dummies() instead of OHE and avoid ending up with a different number of features in the test set, they could simply transform the data before splitting it into train and test sets, or maybe stratifying the split would work. But if the model is put into production, I think it could still run into trouble when it receives new data, since that data can contain categories that were not there when the dummy columns were created.
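
To illustrate the production concern, here is a minimal sketch with made-up toy data (the `color` column and its values are purely hypothetical, not from the article). It shows get_dummies() producing different columns for new data, one possible workaround using `reindex`, and scikit-learn's `OneHotEncoder` with `handle_unknown="ignore"` for comparison:

```python
import pandas as pd
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy data: "color" is the only categorical feature.
train = pd.DataFrame({"color": ["red", "green", "blue", "green"]})
new_data = pd.DataFrame({"color": ["red", "purple"]})  # "purple" was never seen in training

# get_dummies() builds its columns from the categories present in each call,
# so the training data and the new data end up with different columns.
train_dummies = pd.get_dummies(train)
new_dummies = pd.get_dummies(new_data)
print(list(train_dummies.columns))
print(list(new_dummies.columns))

# Possible workaround: force the new data onto the training columns.
# Unseen categories ("purple") are dropped and missing ones are filled with 0.
aligned = new_dummies.reindex(columns=train_dummies.columns, fill_value=0).astype(int)
print(aligned)

# OneHotEncoder remembers the training categories after fit(), so transform()
# always returns the same number of columns; unknown values become all zeros.
encoder = OneHotEncoder(handle_unknown="ignore")
encoder.fit(train)
encoded_new = encoder.transform(new_data).toarray()
print(encoded_new)
```

With the `reindex` approach, the list of training columns has to be saved and shipped alongside the model, which is essentially what a fitted `OneHotEncoder` already does for you.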