Automatic PC threshold
We have a function that automatically sets a threshold on how many PCs to use. It does this by finding the knee in the per-PC explained variance, i.e. the grey bars in the plot.
In cases like this one, it would select only about 5 dimensions, which is too strict. It may be better to apply the knee finding to the cumulative explained variance instead. Alternatively, we could keep adding PCs until more than x% of the variance is explained.
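To make the options concrete for the discussion, here is a minimal sketch of the three rules side by side. It is not our existing function: the names `knee_index`, `n_pcs_by_knee` and `n_pcs_by_variance` are hypothetical, and the knee finder is a simple "farthest point from the chord" heuristic that may differ from what we currently use. The input is assumed to be an array of per-PC explained variance ratios (for example `explained_variance_ratio_` from scikit-learn's `PCA`).

```python
import numpy as np


def knee_index(y):
    """Index of the point farthest from the chord joining the first and last points.

    Assumption: a simple geometric knee heuristic, not necessarily the
    method used by the existing function.
    """
    y = np.asarray(y, dtype=float)
    span = y.max() - y.min()
    if len(y) < 3 or span == 0:
        return 0  # no meaningful knee on a very short or flat curve
    # Normalise both axes to [0, 1] so the distance does not depend on scale.
    xn = np.arange(len(y), dtype=float) / (len(y) - 1)
    yn = (y - y.min()) / span
    x0, y0, x1, y1 = xn[0], yn[0], xn[-1], yn[-1]
    # Perpendicular distance of every point to the line through the endpoints.
    dist = np.abs((y1 - y0) * xn - (x1 - x0) * yn + x1 * y0 - y1 * x0)
    dist /= np.hypot(x1 - x0, y1 - y0)
    return int(np.argmax(dist))


def n_pcs_by_knee(explained_variance_ratio, cumulative=False):
    """Number of PCs up to and including the knee (hypothetical helper).

    cumulative=False uses the per-PC variance (the bars);
    cumulative=True uses the running sum instead.
    """
    ratio = np.asarray(explained_variance_ratio, dtype=float)
    curve = np.cumsum(ratio) if cumulative else ratio
    return knee_index(curve) + 1


def n_pcs_by_variance(explained_variance_ratio, min_fraction=0.9):
    """Smallest number of PCs whose cumulative variance exceeds min_fraction."""
    cum = np.cumsum(np.asarray(explained_variance_ratio, dtype=float))
    # side="right" returns the first index where cum is strictly greater.
    n = int(np.searchsorted(cum, min_fraction, side="right")) + 1
    return min(n, len(cum))  # fall back to all PCs if the target is never exceeded
```

Running all three on the same variance spectrum should show quickly how far apart the rules land, and whether a fixed variance fraction is easier to reason about than either knee.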
Anyway, some discussion on this is needed.