I'm experimenting with pattern matching, clustering, and sorting, and for some of it I've been using Euclidean distance. I'm wondering what the best approach is when the arrays being compared have different dimensionalities. That is, when two arrays have different lengths, is it better to pad the shorter one (presumably with 0.0s), or should I find some way of truncating the longer one? Truncating seems somewhat arbitrary, since it removes information from the longer array, but padding the shorter one is also arbitrary in its own way. If I think of it in terms of 2D and 3D spaces, padding seems reasonable: it's like placing the 2D point at 0.0 on the z-axis of the 3D space, which is certainly arbitrary, but not particularly disagreeable.
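For concreteness, the two options could be sketched roughly like this in Python. The function names `padded_distance` and `truncated_distance` are hypothetical, chosen only for illustration, and padding with 0.0 is just the assumption mentioned above, not a recommendation.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences of numbers."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def padded_distance(a, b, fill=0.0):
    """Pad the shorter sequence with `fill` (assumed 0.0), then measure."""
    n = max(len(a), len(b))
    a_padded = list(a) + [fill] * (n - len(a))
    b_padded = list(b) + [fill] * (n - len(b))
    return euclidean(a_padded, b_padded)


def truncated_distance(a, b):
    """Cut the longer sequence down to the shorter one's length, then measure."""
    n = min(len(a), len(b))
    return euclidean(a[:n], b[:n])


point_2d = [3.0, 4.0]
point_3d = [0.0, 0.0, 12.0]

print(padded_distance(point_2d, point_3d))     # 13.0
print(truncated_distance(point_2d, point_3d))  # 5.0
```

With 0.0 padding, every extra coordinate of the longer array contributes its full squared value to the distance, while truncation ignores those coordinates entirely, which is why the two results differ so much here.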
Any thoughts? Or is there some better overall method I should be considering when the arrays to be compared are of different lengths?
Thanks in advance.