Archive Page
This page is maintained as a historical record and is no longer being updated.
About
The olfactory (OR) and vomeronasal receptor (VR) repertoires are collectively encoded by 1700 genes and pseudogenes in the mouse genome. The are among the largest gene families in mammals. Yet most only have coding sequences annotated in existing databases, and many lack experimental support due to the similarity between them and their unusual monogenic, sensory cell-specific expression in olfactory tissues.
The aim of the olfactory transcriptomes project was to catalogue the genes expressed in the two major olfactory tissues of C57BL/6J mice, the olfactory mucosa (OM) and vomeronasal organs (VNO). Then to use these data to generate new, extended gene annotations for the OR and VR gene repertoires.
The raw sequence data are available from the European Nucleotide Archive (ENA) under accessions PRJEB2572 and PRJEB1365, and can be reused without restriction.
Downloads
The raw and normalized expression values for all the genes in the OM of mice (xlsx).
The raw and normalized expression values for all the genes in the VNO of mice (xlsx).
Gene models for OR genes based on the OM RNAseq dataset, provided in GTF format. The ENSEMBL gene models are also included. Each model has been given a unique gene-transcript_id pair. The Ensembl models are annotated as ENSEMBL_transcript on column 2 and the transcript_id on column 9 corresponds to that of Ensembl. The gene models that differ to the Ensembl ones, are annotated as reconstructed_transcript on column 2. For those exons that overlap with the Ensembl model, the id has been included as a reference.
Gene models for VR genes based on the VNO RNAseq dataset, provided in GTF format. The ENSEMBL gene models are also included. Each model has been given a unique gene-transcript_id pair. The Ensembl models are annotated as ENSEMBL_transcript on column 2 and the transcript_id on column 9 corresponds to that of Ensembl. The gene models that differ to the Ensembl ones, are annotated as reconstructed_transcript on column 2. For those exons that overlap with the Ensembl model, the id has been included as a reference.
The extended gene model sequences for OR genes in FASTA format.
The extended gene model sequences for VR genes in FASTA format.
Data use
This sequencing centre plans on publishing the completed and annotated sequences in a peer-reviewed journal as soon as possible. Permission of the principal investigator should be obtained before publishing analyses of the sequence/open reading frames/genes on a chromosome or genome scale. See our data sharing policy.