Neto Figueira
Jan 25, 2021

--

Hi, congratulations for the article! I’m thinking… if someone really wants to use get_dummies() instead of OHE to avoid the problem with different number of features on test set it could simply transform the data before splitting into train and test set, or stratifying the split maybe should work… but if the model will be put into production then it could get into trouble when receiving new data i think.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Neto Figueira
Neto Figueira

Responses (1)

Write a response