This page includes General Transfer Format (.gtf) or related files which allow one to reproduce the alignment from raw (fastq) sequencing data to a specific version of the genome/transcriptome in RNA-seq analysis. Most individuals wanting to use processed versions of Allen Institute sequencing data (e.g., cell x gene matrices) can safely ignore this page. The top table shows a data type-centric view of these files, while the bottom table shows at project-centric view of the same files. In both tables links are colored in blue for convenience.
In the tables below, 'SSv4' indicates any single cell or single nucleus RNA-sequencing or Patch-Seq data that was processed using the SMARTerV4 method, while '10x' indicates any single cell or single nucleus RNA-sequencing data processed using 10x Genomics single-omics or multiomics methods. All mouse and human single cell or single nucleus RNA-sequencing data was aligned to some version of the GRCm38 and GRCh38 genomics, respectively, although file formats and transcriptomics versions differ between rows. NA in the table means no data has every been generated in the relevant slot, while "in process" means that data has been generated but currently is not included in any Allen Brain Map tools. Finally, data from several projects with data generated prior to 2022 (including Patch-seq data) were originally processed using one transcriptome version and have since been converted into the current version. In this case, refer to the relevant Allen Brain Map or scientific public to determine which genome/transcriptome version was used. Abbreviations: GBM = Ivy Glioblastoma Atlas Project; TBI = Aging, Dementia and Traumatic Brain Injury Study; WHB = Whole human brain atlas included in ABC Atlas (May 2024); CTKE = Cell Type Knowledge Explorer; LGN = Lateral Geniculate Nucleus (part of Comparative LGN project)
Reference files by category and species
Data sets used | Human | Mouse | Marmoset | Macaque | Other mammals |
---|---|---|---|---|---|
10x processed using CellRanger V6 (~2022 - current) | GRCh38/gencode.v32 | mm10/genecode.vM23 | in process | in process | in process |
SSv4 (~2022 - current; conversion date same as 10x) | GRCh38/gencode.v32 | mm10/genecode.vM23 | in process | in process | in process |
10x processed using CellRanger V3 (start - ~2021) | GRCh38.p2/pre-mRNA | mm10/10x pre-mRNA v3.0.0 | NA | NA | NA |
SSv4 (start - ~2021; conversion date same as 10x) | GRCh38.p2 | GRCm38.p3 | NA | ref_Mmul_10_top_level (LGN) | NA |
Projects with data processed by collaborators | GRCh38.p13/gencode.v35 (WHB) | Gencode v10 (BrainSpan) | NA | mCalJa1.2.pat.X (CTKE) | NA | NA |
Historical bulk RNA-Seq data sets | GRCh38.p2 (TBI) | GRCh37.p5 (GBM) | NA | NA | NA | NA |
Reference files by data set
Update your browser to view this website correctly.