singlet.dataset.correlations¶
-
class
singlet.dataset.correlations.
Correlation
(dataset)[source]¶ Bases:
singlet.dataset.plugins.Plugin
Correlate gene expression and phenotype in single cells
-
correlate_features_features
(features='all', features2=None, method='spearman')[source]¶ Correlate feature expression with one or more phenotypes.
Parameters: - features (list or string) – list of features to correlate. Use a string for a single feature. The special string ‘all’ (default) uses all features.
- features2 (list or string) – list of features to correlate with. Use a string for a single feature. The special string ‘all’ uses all features. None (default) takes the same list as features, returning a square matrix.
- method (string) – type of correlation. Must be one of ‘pearson’ or ‘spearman’.
Returns: - pandas.DataFrame with the correlation coefficients. If either
features or features2 is a single string, the function returns a pandas.Series. If both are a string, it returns a single correlation coefficient.
-
correlate_features_phenotypes
(phenotypes, features='all', method='spearman', fillna=None)[source]¶ Correlate feature expression with one or more phenotypes.
Parameters: - phenotypes (list of string) – list of phenotypes, i.e. columns of the samplesheet. Use a string for a single phenotype.
- features (list or string) – list of features to correlate. Use a string for a single feature. The special string ‘all’ (default) uses all features.
- method (string) – type of correlation. Must be one of ‘pearson’ or ‘spearman’.
- fillna (dict, int, or None) – a dictionary with phenotypes as keys and numbers to fill for NaNs as values. None will do nothing.
Returns: - pandas.DataFrame with the correlation coefficients. If either
phenotypes or features is a single string, the function returns a pandas.Series. If both are a string, it returns a single correlation coefficient.
-
correlate_phenotypes_phenotypes
(phenotypes, phenotypes2=None, method='spearman', fillna=None, fillna2=None)[source]¶ Correlate feature expression with one or more phenotypes.
Parameters: - phenotypes (list of string) – list of phenotypes, i.e. columns of the samplesheet. Use a string for a single phenotype.
- phenotypes2 (list of string) – list of phenotypes, i.e. columns of the samplesheet. Use a string for a single phenotype. None (default) uses the same as phenotypes.
- method (string) – type of correlation. Must be one of ‘pearson’ or ‘spearman’.
- fillna (dict, int, or None) – a dictionary with phenotypes as keys and numbers to fill for NaNs as values. None will do nothing, potentially yielding NaN as correlation coefficients.
- fillna2 (dict, int, or None) – as fillna, but for phenotypes2.
Returns: - pandas.DataFrame with the correlation coefficients. If either
phenotypes or features is a single string, the function returns a pandas.Series. If both are a string, it returns a single correlation coefficient.
-
correlate_samples
(samples='all', samples2=None, phenotypes=None, method='spearman')[source]¶ Correlate feature expression with one or more phenotypes.
Parameters: - samples (list or string) – list of samples to correlate. Use a string for a single sample. The special string ‘all’ (default) uses all samples.
- samples2 (list or string) – list of samples to correlate with. Use a string for a single sample. The special string ‘all’ uses all samples. None (default) takes the same list as samples, returning a square matrix.
- method (string) – type of correlation. Must be one of ‘pearson’ or ‘spearman’.
- phenotypes (list) – phenotypes to include as additional features in the correlation calculation. None (default) means only feature counts are used.
Returns: - pandas.DataFrame with the correlation coefficients. If either
samples or samples2 is a single string, the function returns a pandas.Series. If both are a string, it returns a single correlation coefficient.
-