Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Numbered figure captions
SubtitleTextGraph-based clustering results visualized by t-SNE. Colors correspond to clusters. Resolution Clustering resolution of 0.3 (left), 1 (center), and 2 (right). Note the number of clusters per resolution
AnchorNameGraph-based clustering Resolution

...

Removes links between pairs of points if their similarity is below the threshold. Larger values lead to a shorter run time, but can result in many singleton clusters. Default is 0.0. 

Number of nearest

...

neighbors 

Clustering preserves the local structure of the data by focusing on the distances between each point and its k nearest neighbors. The optimal perplexity depends on the size and density of the data. Generally, a larger and/or more dense data set will benefit from a larger number of nearest neighbors. Increasing the number of nearest neighbors will increase the size of clusters and vice versa (Figure 3). Default is 30. The range of possible values is 3 to 100.

...

Numbered figure captions
SubtitleTextGraph-based clustering results visualized by t-SNE. Colors correspond to clusters. Number of nearest neighbors set to 5 (left), 30 (center), 100 (right).
AnchorNameGraph-based clustering - Number of nearest neighbors

Scale

...

This parameter can be used to speed up clustering at the expense of accuracy. Larger scale implies greater accuracy and helps avoid singletons, but takes more time to run. To maximize accuracy, the total count of observations being clustered should be below the product of nearest neighbors and scale. Default is 100,000. The range of possible values is 1 to 100,000. 

...