atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cells

2,789
Estimated number of cells (hg19)
2,692
Estimated number of cells (mm10)
26,958
Median fragments per cell (hg19)
18,256
Median fragments per cell (mm10)
83.4%
Fraction of fragments overlapping any targeted region (hg19)
71.4%
Fraction of fragments overlapping any targeted region (mm10)
52.2%
Fraction of transposition events in peaks in cell barcodes

Sample

Sample IDatac_v1_hgmm_5k
Sample description1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cells
FASTQ path
‎/mnt/scratch2/cellranger-atac/fastqs/atac_v1_hgmm_5k_fastqs‎
Pipeline version1.1.0
Reference path
‎/mnt/scratch2/cellranger-atac/references/refdata-cellranger-atac-hg19-and-mm10-1.1.0‎
OrganismHomo_sapiens_and_Mus_musculus
Assemblyhg19_and_mm10
Annotationbarnyard

Sequencing

Total number of read pairs
Total number of read pairs that were assigned to this library in demultiplexing.
Fraction of read pairs with a valid barcode
Fraction of read pairs with barcodes that match the whitelist after barcode correction.
Q30 bases in Read 1
Fraction of read 1 bases with Q-score >= 30.
Q30 bases in Read 2
Fraction of read 2 bases with Q-score >= 30.
Q30 bases in Barcode
Fraction of cell barcode bases with Q-score >= 30.
Q30 bases in Sample Index
Fraction of sample index bases with Q-score >= 30.
Total number of read pairs223,622,583
Fraction of read pairs with a valid barcode97.5%
Q30 bases in Read 195.2%
Q30 bases in Read 295.0%
Q30 bases in Barcode73.8%
Q30 bases in Sample Index87.4%

Barnyard

Observed multiplet rate
The observed fraction of cell barcodes that appear to have cells from both species present.
Inferred multiplet rate
The estimated fraction of cell barcodes containing more than one cell.
Median barcode purity
The median, across all cell barcodes, of the fraction of fragments in the barcode that align uniquely to the species assigned to the barcode.
Plots
(left) Barnyard scatter plot, where each dot represents a barcode and its coordinates indicate number of fragments, from each species, assigned to the barcode. Groups are estimated computationally.
(right) Histograms of per barcode purities estimated for barcodes in each species.
Observed multiplet rate2.3%
Inferred multiplet rate4.7%
Median barcode purity (hg19)99.9%
Median barcode purity (mm10)99.8%
110010k512510251002510002510k25100k
Non-cellsMultipletshg19 Cellsmm10 CellsBarnyardSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellshg19 Fragments Per Cellmm10 Fragments Per Cell
0.60.70.80.905001000150020002500
hg19 Cellsmm10 CellsPurity (Cell Barcodes)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsFraction of Fragments in Primary Species# of Cell Barcodes

Cells

Estimated number of cells
The total number of barcodes identified as cells.
Lower threshold on the number of fragments overlapping peaks per barcode to annotate barcode as cell
If the number of fragments (that passed all filters and overlap peaks) associated with a barcode is greater than this threshold (as determined by the cell calling algorithm), the barcode is annotated as cell.
Median fragments per cell
Among barcodes identified as cells, the median number of fragments per barcode.
Median fragments per non-cell barcode
Among barcodes not identified as cells, the median number of fragments per barcode.
Plots
(left) Knee plot of number of fragments overlapping peaks for all the barcodes in the library. This number is used to call cells.
(right) Histograms of number of fragments per cell barcode for non-cells and cells.
Estimated number of cells (hg19)2,789
Estimated number of cells (mm10)2,692
Lower threshold on the number of fragments overlapping peaks per barcode to annotate barcode as cell (hg19)277.00
Lower threshold on the number of fragments overlapping peaks per barcode to annotate barcode as cell (mm10)150.00
Median fragments per cell (hg19)26,958
Median fragments per cell (mm10)18,256
Median fragments per non-cell barcode2
110010k12510251002510002510k25100k
Non-cellshg19 Cellshg19 CellsSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsBarcodeshg19 Fragments Overlapping Peaks
110010k12510251002510002510k25100k2
Non-cellshg19 Cellshg19 Fragment DistributionSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellshg19 Fragments Per BarcodeBarcodes
110010k12510251002510002510k25
Non-cellsmm10 Cellsmm10 CellsSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsBarcodesmm10 Fragments Overlapping Peaks
110100100010k512510251002510002510k25100k25
Non-cellsmm10 Cellsmm10 Fragment DistributionSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsmm10 Fragments Per BarcodeBarcodes

Cell Clustering

Plots
(left) Scatter plot of barcodes annotated as cells, colored by automatically computed clusters via graph clustering.
(right) Scatter plot of barcodes annotated as cells, colored by number of fragments in the barcode.
(bottom) Scatter plot of barcodes annotated as cells, colored by species.
−500−40−2002040
Cluster 1 (949)Cluster 2 (675)Cluster 3 (570)Cluster 4 (537)Cluster 5 (487)Cluster 6 (482)Cluster 7 (364)Cluster 8 (359)Cluster 9 (336)Cluster 10 (287)Cluster 11 (210)Cluster 12 (97)Cell Clustering (By Cluster)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellstSNE axis 1tSNE axis 2
−500−40−2002040
2.533.544.55log10 FragmentsCell Clustering (By Depth)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellstSNE axis 1tSNE axis 2
−500−40−2002040
hg19 (2661)mm10 (2564)Doublet (128)Cell Clustering (By Species)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellstSNE axis 1tSNE axis 2

Insert Sizes

Fragments in nucleosome-free regions
Fraction of fragments (that passed all filters) of size smaller than 147 basepairs.
Fragments flanking a single nucleosome
Fraction of fragments (that passed all filters) of size between 147 and 294 basepairs.
Plots
Insert size distribution in linear scale.
Fragments in nucleosome-free regions32.4%
Fragments flanking a single nucleosome43.9%
2004006000100k200k300k400k500k600k700k
Insert Size DistributionSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsInsert SizeFragment Count (linear scale)

Targeting

Enrichment score of transcription start sites
The TSS profile is the summed accessibility signal (defined as number of cut sites per base) in a window of 2,000 bases around all the annotated TSSs, normalized by the minimum signal in the window. This metric reports the maximum value in the profile.
Fraction of fragments overlapping TSS
The fraction of fragments (that passed all filters) overlapping transcription start sites, as defined by the GENCODE basic annotation.
Fraction of fragments overlapping called peaks
The fraction of fragments (that passed all filters) overlapping the set of peaks called for the library.
Fraction of fragments overlapping any targeted region
The fraction of fragments (that passed all filters) overlapping targeted regions (transcription start sites, DNase hypersensitive regions, enhancer or promoter regions).
Fraction of total read pairs mapped confidently to genome (>30 mapq)
Fraction of all the sequenced read pairs that mapped to the genome with high mapping quality. Includes unique and duplicate read pairs from any barcode.
Fraction of total read pairs that are unmapped and in cell barcodes
Fraction of all the sequenced read pairs that come from cell barcodes and could not be mapped to the genome with confidence.
Fraction of total read pairs in mitochondria and in cell barcodes
Fraction of all the sequenced read pairs that come from cell barcodes and map to the mitochondrial genome.
Plots
(left) TSS profile, as described above.
(right) Targeting scatter plot. Each dot represents a barcode. Horizontal axis is the barcode's number of fragments, vertical axis is the percentage of those fragments that overlap peaks. Non-cell and cell groups are represented with different colors.
Enrichment score of transcription start sites7.21
Fraction of fragments overlapping TSS (hg19)33.9%
Fraction of fragments overlapping TSS (mm10)35.9%
Fraction of fragments overlapping called peaks (hg19)54.7%
Fraction of fragments overlapping called peaks (mm10)55.4%
Fraction of fragments overlapping any targeted region (hg19)83.4%
Fraction of fragments overlapping any targeted region (mm10)71.4%
Fraction of total read pairs mapped confidently to genome (>30 mapq)86.9%
Fraction of total read pairs that are unmapped and in cell barcodes0.7%
Fraction of total read pairs in mitochondria and in cell barcodes0.2%
−1000−500050010001234567
Enrichment around TSS (normalized)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsRelative Position (bp from TSS)Relative Enrichment
110010k1M00.20.40.60.81
Non-cellshg19mm10Singlecell Targeting (Peaks)Sample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsFragments per BarcodeFraction Fragments Overlapping Peaks

Library Complexity

Percent duplicates
Fraction of all the sequenced read pairs that come from cell barcodes and are deemed to be PCR duplicates due to alignment to the same genomic position as another read pair in the library.
Sequencing saturation
Estimated sequencing saturation of high-quality fragment pool. Computed as the ratio of observed unique read pairs to estimated library complexity.
Estimated bulk library complexity
Estimated complexity of the library given the observed unique read pairs when sequenced to current depth.
Plots
Observed per cell complexity as a function of downsampling rate in mean reads per cell.
Percent duplicates29.8%
Sequencing saturation42.4%
Estimated bulk library complexity297,709,083
020k40k60k80k05k10k15k20k
Per-Cell Library Complexity At Read DepthSample atac_v1_hgmm_5k - 1:1 mixture of fresh frozen human (GM12878) and mouse (A20) cellsMean Reads Per CellMedian Unique Fragments Per Cell