Partek Flow Documentation

Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Latent semantic indexing (LSI)  was first introduced for the analysis of scATAC-seq data by Cusanovich et al. 2018[1]. LSI combines steps of frequency-inverse document frequency (TF-IDF) normalization followed by singular value decomposition (SVD). Partek Flow wraps Signac's TF-IDF normalization for single cell ATAC-seq dataset. It is a two-step normalization procedure that both normalizes across cells to correct for differences in cellular sequencing depth, and across peaks to give higher values to more rare peaks[2].

TF-IDF normalization in Flow can be invoked in Normalization and scaling section by clicking any single cell counts data node (Figure 1).

Figure 1. TF-IDF normalization task in Normalization and scaling section in Flow.

To run TF-IDF normalization

  • Click a single cell counts data node
  • Click the Normalization and scaling section in the toolbox
  • Click TF-IDF normalization

The output of TF-IDF normalization is a new data node that has been normalized by log(TF x IDF). We can then use this new normalized matrix for downstream analysis and visualization (Figure 2).

Figure 3. Example workflows to demonstrate downstream analysis and visualization of Scran deconvolution output.


Other parameters in this task that you can adjust include:

Pool size: A numeric vector of pool sizes, i.e., number of cells per pool.

Max cluster size: An integer scalar specifying the maximum number of cells in each cluster.

Enforce positive estimates: A logical scalar indicating whether linear inverse models should be used to enforce positive estimates.

Scaling factor: A numeric scalar containing scaling factors to adjust the counts prior to computing size factors.




References

  1. Lun, A. T., K. Bach, and J. C. Marioni. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts. Genome Biol. 2016.

  https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-0947-7



Additional Assistance

If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.

  • No labels