Signac is an extension of Seurat for the analysis of single-cell chromatin data (DNA-based single-cell assays). We have extended the Seurat object to include information about the genome sequence and genomic coordinates of sequenced fragments per cell, and include functions needed for the analysis of single-cell chromatin data.
Signac uses the Seurat object structure, and so all the Seurat commands can be used when analysing data with Signac. See the Data Structures and Object Interaction vignette for an explanation of the classes defined in Signac and how to use them. See the Seurat documentation for more information about the Seurat object: https://satijalab.org/seurat/
See the merge and integration vignettes for information on combining multiple single-cell chromatin datasets.
The fragment file is provided in the output of cellranger-atac, so if you use the 10x Genomics scATAC-seq kit and process the data using Cellranger you will have a fragment file generated.
If you are using another method that does not provide a fragment file as output, you can use the sinto package to generate a fragment file from the BAM file. See here for more information on using Sinto to generate a fragment file: https://timoast.github.io/sinto/basic_usage.html#create-scatac-seq-fragments-file
Choosing the dimensionality is a general problem in single-cell analysis for which there is no simple solution. There has been discussion about this for scRNA-seq, and you can read our recommendations for scRNA-seq in the Seurat vignettes: https://satijalab.org/seurat/v3.1/pbmc3k_tutorial.html (see “Determine the ‘dimensionality’ of the dataset”).
Here are some general tips/suggestions that might help guide you in the choice for number of dimensions:
If you are studying an organism that does not have a BSgenome
genome package or EnsDB
annotation package available on BioConductor, you can still use your own GTF file or FASTA files with Signac.
To create your own BSgenome
data package, see this vignette.
To use a GTF file, you can import it using rtracklayer
, for example:
gtf <- rtracklayer::import('genes.gtf')
gene.coords <- gtf[gtf$type == 'gene']
seqlevelsStyle(gene.coords) <- 'UCSC'
gene.coords <- keepStandardChromosomes(gene.coords, pruning.mode = 'coarse')
If you use Signac, please cite Stuart et al., 2020:
@UNPUBLISHED{signac,
"Multimodal single-cell chromatin analysis with Signac",
title = "Stuart, Tim and Srivastava, Avi and Lareau, Caleb and Satija,
author = Rahul",
"bioRxiv",
journal = "2020.11.09.373613",
pages = nov,
month = 2020,
year = "https://www.biorxiv.org/content/10.1101/2020.11.09.373613v1",
url = "en"
language = }
Signac is an extension of Seurat, and uses the Seurat object structure, so you should consider citing the Seurat paper if you have used Signac.