One-hot encoding is computationally intensive and can lead to a large increase in the dimensionality of the data, especially when dealing with categorical variables that have many unique values. This can make storage and computation more resource-intensive and inefficient. Therefore, it is often more practical to perform one-hot encoding on smaller samples of training sets rather than within the feature repository for broader applications.
RMSE (Root Mean Squared Error) is a widely accepted and valid evaluation metric for regression problems. Therefore, the explanation stating that the RMSE is an invalid evaluation metric for regression problems is invalid.
