pycallingcards.tools.rank_peak_groups_mu¶
- pycallingcards.tools.rank_peak_groups_mu(mdata, groupby, adata_cc='CC', groups='all', reference=None, n_peaks=None, key_added=None, copy=False, rankby='pvalues', method='fisher_exact', alternative='None')[source]¶
Rank peaks for characterizing groups. Designed for mudata object.
- Parameters:
mdata (
MuData) – mdata for both RNA and CC data.groupby (
str) – The key of the groups.adata_cc (
str(default:'CC')) – Name for Anndata of CC. Anndata is mdata[adata_cc].groups (
Union[Literal['all'],Iterable[str]] (default:'all')) – Subset of groups (list), e.g. [‘g1’, ‘g2’, ‘g3’], to which comparison shall be restricted, or all (default), for all groups.reference (
Optional[str] (default:None)) – If rest, compare each group to the union of the rest of the group. If a group identifier, compare with respect to this group.n_peaks (
Optional[int] (default:None)) – The number of peaks that appear in the returned tables. Default includes all peaks.key_added (
Optional[str] (default:None)) – The key in adata.uns information is saved to.rankby (
Optional[Literal['pvalues','logfoldchanges','pvalues_adj']] (default:'pvalues')) – [‘pvalues’, ‘logfoldchanges’]. The list we rank by.copy (
bool(default:False)) – If copy, it will return a copy of the AnnData object and leave the passed adata unchanged.method (
Optional[Literal['binomtest','binomtest2','fisher_exact']] (default:'fisher_exact')) – [“binomtest”, “binomtest2”,”fisher_exact”]. binomtest uses binomial test, binomtest2 uses binomial test but stands on a different hypothesis of binomtest, fisher_exact uses fisher exact test.alternative (
Optional[Literal['two-sided','greater','None']] (default:'None')) – [‘two-sided’, ‘greater’,’None’]. If it has two samples/cluster, ‘two-sided’ is recommended. Otherwise, please use ‘greater’. For default (‘None’), if groupby == “Index”, it will be ‘two-sided’. Otherwise, please use ‘greater’.
- Returns:
- names - structured np.ndarray (.uns[‘rank_peaks_groups’]). Structured array is to be indexed by the group ID storing the peak names. Ordered according to scores.return pvalues - structured np.ndarray (.uns[‘rank_peaks_groups’])return logfoldchanges - structured np.ndarray (.uns[‘rank_peaks_groups’])number - pandas.DataFrame (.uns[‘rank_peaks_groups’]). The number of peaks or the number of cells that contain peaks (depending on the method).number_rest - pandas.DataFrame (.uns[‘rank_peaks_groups’]). The number of peaks or the number of cells that contain peaks (depending on the method).
- Return type:
Optional[AnnData] | total - pandas.DataFrame (.uns[‘rank_peaks_groups’]). The total number of cells that contain peaks. | total_rest - pandas.DataFrame (.uns[‘rank_peaks_groups’]). The total number of cells that contain peaks.- Example:
>>> import pycallingcards as cc >>> mdata = cc.datasets.mousecortex_data(data="Mudata") >>> cc.tl.rank_peak_groups_mu(mdata,"RNA:cluster",method = 'binomtest',key_added = 'binomtest')