bodymapRat 1.4.0
The bodymapRat package contains gene expression
data on 652 RNA-Seq samples from a comprehensive rat
transcriptomic BodyMap study. These samples include
the sequence identifier information provided in the
header of the FASTQ files which can be used as a
surrogate for batch. These samples have not been
normalized or pre-processed.
The data are provided in a SummarizedExperiment. The
phenotypic information can be extracted using the
colData() function and a description of the phenotypic
data is listed in the table below:
| Title | Description | 
|---|---|
| sraExperiment | SRA Experiment ID | 
| title | Title of sample provided by the authors | 
| geoAccession | GEO Accession ID | 
| BioSample | BioSample ID | 
| avgLength | Average read length | 
| instrument | Machine identifier (from FASTQ header) | 
| runID | Run ID (from FASTQ header) | 
| fcID | Flow cell ID (from FASTQ header) | 
| fcLane | Flow cell lane (from FASTQ header) | 
| tile | Tile (from FASTQ header) | 
| xtile | xtile (from FASTQ header) | 
| ytile | ytile (from FASTQ header) | 
| organ | Body organ | 
| sex | Gender | 
| stage | Stage | 
| techRep | Technical replicate number | 
| colOrgan | Column of colors to help with plotting | 
| rnaRIN | RIN number | 
| barcode | barcode number | 
The data can be accessed as follows:
library(SummarizedExperiment) 
library(bodymapRat)
We use the bodymapRat() function to download the
relevant files from Bioconductor’s
ExperimentHub web
resource. Running this function will download a
SummarizedExperiment object, which contains read counts, as
well as the metadata on the rows (genes) and columns (cells).
bm_rat <- bodymapRat()
# Get the expression data
counts = assay(bm_rat)
dim(counts)
## [1] 32637   652
counts[1:5, 1:5]
##                    SRR1169893 SRR1169894 SRR1169895 SRR1169896 SRR1169897
## ENSRNOG00000000001          1          0          0          1          4
## ENSRNOG00000000007          1          1          0          3          0
## ENSRNOG00000000008          7          4          2          3          7
## ENSRNOG00000000009          0          0          0          0          1
## ENSRNOG00000000010          0          1          0          0          0
# Get the meta data along columns
head(colData(bm_rat))
## DataFrame with 6 rows and 22 columns
##            sraExperiment      sraRun       title geoAccession sraSample
##              <character> <character> <character>     <factor>  <factor>
## SRR1169893     SRX471368  SRR1169893 Adr_F_002_1   GSM1328469 SRS558114
## SRR1169894     SRX471368  SRR1169894 Adr_F_002_1   GSM1328469 SRS558114
## SRR1169895     SRX471369  SRR1169895 Adr_F_002_2   GSM1328470 SRS558115
## SRR1169896     SRX471369  SRR1169896 Adr_F_002_2   GSM1328470 SRS558115
## SRR1169897     SRX471370  SRR1169897 Adr_F_002_3   GSM1328471 SRS558116
## SRR1169898     SRX471370  SRR1169898 Adr_F_002_3   GSM1328471 SRS558116
##               BioSample avgLength       organ         sex     stage   techRep
##                <factor> <integer> <character> <character> <numeric> <integer>
## SRR1169893 SAMN02642886        50     Adrenal           F         2         1
## SRR1169894 SAMN02642886        50     Adrenal           F         2         2
## SRR1169895 SAMN02642867        50     Adrenal           F         2         1
## SRR1169896 SAMN02642867        50     Adrenal           F         2         2
## SRR1169897 SAMN02642894        50     Adrenal           F         2         1
## SRR1169898 SAMN02642894        50     Adrenal           F         2         2
##               colOrgan         mix      rnaRIN     barcode  instrument
##            <character> <character> <character> <character> <character>
## SRR1169893       brown          M1         9.3          11   HWI-ST845
## SRR1169894       brown          M1         9.3          11   HWI-ST845
## SRR1169895       brown          M1         9.1           5   HWI-ST845
## SRR1169896       brown          M1         9.1           5   HWI-ST845
## SRR1169897       brown          M1         9.5           3  HWI-ST1131
## SRR1169898       brown          M1         9.5           3  HWI-ST1195
##                  runID        fcID      fcLane        tile       xtile
##            <character> <character> <character> <character> <character>
## SRR1169893      120326   D0VTJACXX           2        1101        1506
## SRR1169894      120525   D10G7ACXX           2        1101        1394
## SRR1169895      120326   D0VTJACXX           5        1101        1170
## SRR1169896      120525   D10G7ACXX           5        1101        1650
## SRR1169897      120424   C0P4UACXX           4        1101        1675
## SRR1169898      120525   C0TDUACXX           4        1101        1138
##                  ytile
##            <character>
## SRR1169893        2000
## SRR1169894        2133
## SRR1169895        2029
## SRR1169896        2126
## SRR1169897        2216
## SRR1169898        2067
The data in this package are used as an example
data set in the
qsmooth Bioconductor package.
sessionInfo()
## R version 4.0.0 (2020-04-24)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 18.04.4 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.11-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.11-bioc/R/lib/libRlapack.so
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## attached base packages:
## [1] parallel  stats4    stats     graphics  grDevices utils     datasets 
## [8] methods   base     
## 
## other attached packages:
##  [1] bodymapRat_1.4.0            ExperimentHub_1.14.0       
##  [3] AnnotationHub_2.20.0        BiocFileCache_1.12.0       
##  [5] dbplyr_1.4.3                SummarizedExperiment_1.18.1
##  [7] DelayedArray_0.14.0         matrixStats_0.56.0         
##  [9] Biobase_2.48.0              GenomicRanges_1.40.0       
## [11] GenomeInfoDb_1.24.0         IRanges_2.22.1             
## [13] S4Vectors_0.26.0            BiocGenerics_0.34.0        
## [15] knitr_1.28                  BiocStyle_2.16.0           
## 
## loaded via a namespace (and not attached):
##  [1] Rcpp_1.0.4.6                  lattice_0.20-41              
##  [3] assertthat_0.2.1              digest_0.6.25                
##  [5] mime_0.9                      R6_2.4.1                     
##  [7] RSQLite_2.2.0                 evaluate_0.14                
##  [9] httr_1.4.1                    pillar_1.4.4                 
## [11] zlibbioc_1.34.0               rlang_0.4.6                  
## [13] curl_4.3                      blob_1.2.1                   
## [15] Matrix_1.2-18                 rmarkdown_2.1                
## [17] stringr_1.4.0                 RCurl_1.98-1.2               
## [19] bit_1.1-15.2                  shiny_1.4.0.2                
## [21] compiler_4.0.0                httpuv_1.5.2                 
## [23] xfun_0.13                     pkgconfig_2.0.3              
## [25] htmltools_0.4.0               tidyselect_1.0.0             
## [27] tibble_3.0.1                  GenomeInfoDbData_1.2.3       
## [29] interactiveDisplayBase_1.26.0 bookdown_0.18                
## [31] crayon_1.3.4                  dplyr_0.8.5                  
## [33] later_1.0.0                   bitops_1.0-6                 
## [35] rappdirs_0.3.1                grid_4.0.0                   
## [37] xtable_1.8-4                  lifecycle_0.2.0              
## [39] DBI_1.1.0                     magrittr_1.5                 
## [41] stringi_1.4.6                 XVector_0.28.0               
## [43] promises_1.1.0                ellipsis_0.3.0               
## [45] vctrs_0.2.4                   tools_4.0.0                  
## [47] bit64_0.9-7                   glue_1.4.0                   
## [49] purrr_0.3.4                   BiocVersion_3.11.1           
## [51] fastmap_1.0.1                 yaml_2.2.1                   
## [53] AnnotationDbi_1.50.0          BiocManager_1.30.10          
## [55] memoise_1.1.0