Dear all,
I have the following problem:
I have a data.frame in which some of the columns are numeric and some are categorical. Just for data exploration, I would like to do some sort of clustering or compute a correlation matrix on the columns to see which of them are actually correlated. But how can I measure correlation between numeric and categorical values?
Thanks in advance for any hints and suggestions on that!
cheers, jo
I didn't want to transform the data and was hoping there is some distance metric out there that would work with both categorical and numerical data.
But I'll definitely look into the idea with the random forest, thanks Sean!
Just a second thought: Gower's distance wouldn't help here, right?
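
To illustrate why I doubt it, here is a minimal sketch, assuming daisy() from the cluster package and a made-up toy data.frame (the column names and values are purely hypothetical). As far as I can tell, Gower's dissimilarity copes with mixed column types, but it compares rows (observations) rather than columns, so it would cluster samples, not variables:

```r
library(cluster)  # recommended package, shipped with standard R installations

# Hypothetical toy data: two numeric and two categorical columns.
df <- data.frame(
  height = c(1.2, 3.4, 2.2, 5.1, 4.0),
  weight = c(10, 30, 22, 51, 39),
  group  = factor(c("a", "b", "a", "b", "b")),
  status = factor(c("x", "x", "y", "y", "x"))
)

# Gower dissimilarity accepts numeric and factor columns together.
d <- daisy(df, metric = "gower")

# The result is a dissimilarity between rows, one object per observation,
# so its size matches nrow(df), not ncol(df).
n_objects <- attr(d, "Size")
print(n_objects == nrow(df))

# The rows can then be clustered as usual, e.g. hierarchically.
hc <- hclust(d, method = "average")
```

So this would group the observations, which is not quite what I am after for the columns.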