-
The core clustering code has been completely re-written and is now far more efficient.
- Clustering is typically about ten times faster now.
- Due to changes in how chi-squared calculations are now made and combined clusters are tracked along the way, potential row combinations that are "tied" (or have chi-squared reductions that are so extremely close as to be essentially tied) may be clustered in a different order compared to the version 1.0 function.
- This generally only affects earlier clustering steps, well above where most cut points will be in practice.
- If results from a previous version 1.0 clustering need to be reproduced exactly for every step, you can force the use of the old function by calling
greenclust:::.greenclust.v1()
. This feature is not guaranteed to remain in future versions.
-
Verbose output now includes the names of the two rows that were combined at each step.
-
Fix for passing data frames to greenclust()
(#8)