Download the table of metadata for each transcriptome from the 1KP website
(http://www.onekp.com/public_data.html). The metadata are wrapped into
a OneKp
S4 object. This object contains two data.frames: 1)
@table
, the main metadata table and 2) @links
a map from
resource to URL (mostly for internal use).
Arguments
- add_taxids
If TRUE, add NCBI taxon ids for each species. This requires downloading the NCBI taxonomy database, which will require a few extra minutes the first time you run the function. This step is necessary only if you wish to filter by NCBI taxon ids.
- filter
If TRUE, filter out entries that are associated with a single species (for example crosses or datasets pooled across a genus). If set to TRUE, then add_taxids will also be set to TRUE.
Details
This dataset is also saved as package data, you can access this with
data(onekp)
.
The metadata table contains the following columns:
species - species scientific name
code - 4-letter 1KP transcriptome unique identifier
family - the taxonomic family
tissue - the tissue(s) that where sequenced
peptides - the filename for the transcript proteins
nucleotides - the filename for the transcript DNA
tax_id (optional) - the species NCBI taxonomy ID
Examples
if (FALSE) {
# scrape data from the OneKP website
kp <- retrieve_onekp()
# print to see data summary
kp
# access the metadata table
head(kp@table)
}