
Overview
Najko Jahn
2021-08-24
Source:vignettes/introducing-europepmc.Rmd
introducing-europepmc.Rmd
What is searched?
Europe PMC is a repository of life science literature. Europe PMC ingests all PubMed content and extends its index with other literature and patent sources.
For more background on Europe PMC, see:
Levchenko, M., Gou, Y., Graef, F., Hamelers, A., Huang, Z., Ide-Smith, M., … McEntyre, J. (2017). Europe PMC in 2017. Nucleic Acids Research, 46(D1), D1254–D1260. https://doi.org/10.1093/nar/gkx1005
How to search Europe PMC with R?
This client supports the Europe PMC search syntax. If you are unfamiliar with searching Europe PMC, check out the Europe PMC query builder, a very nice tool that helps you to build queries. To make use of Europe PMC queries in R, copy & paste the search string to the search functions of this package.
In the following, some examples demonstrate how to search Europe PMC with R.
Free search
empc_search()
is the main function to query Europe PMC. It searches both metadata and fulltexts.
library(europepmc)
europepmc::epmc_search('malaria')
#> # A tibble: 100 × 29
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 34100426 MED 34100426 10.4… New … Lima MN, Ba… Neural Rege… 1 17
#> 2 33341138 MED 33341138 10.1… Trip… Wang J, Xu … Lancet 10267 396
#> 3 33341139 MED 33341139 10.1… Trip… van der Plu… Lancet 10267 396
#> 4 33535760 MED 33535760 10.3… THE … Damiani E, … Acta Med Hi… 2 18
#> 5 33530764 MED 33530764 10.1… Disc… Hoarau M, V… J Enzyme In… 1 36
#> 6 33372863 MED 33372863 10.1… ATP2… Lamy A, Mac… Emerg Micro… 1 10
#> 7 33594960 MED 33594960 10.1… Mana… Kambale-Kom… Hematology 1 26
#> 8 34283002 MED 34283002 10.1… <i>P… Alhassan AM… Pharm Biol 1 59
#> 9 34184352 MED 34184352 10.1… Stru… Chhibber-Go… Protein Sci 9 30
#> 10 34419123 MED 34419123 10.1… Burd… Dao F, Djon… Parasit Vec… 1 14
#> # … with 90 more rows, and 20 more variables: pubYear <chr>, journalIssn <chr>,
#> # pageInfo <chr>, pubType <chr>, isOpenAccess <chr>, inEPMC <chr>,
#> # inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>, versionNumber <int>
It is worth noting that Europe PMC expands queries with MeSH synonyms by default, a behavior which can be turned off with the synonym
parameter.
europepmc::epmc_search('malaria', synonym = FALSE)
#> # A tibble: 100 × 29
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 33341139 MED 33341139 10.1… Trip… van der Plu… Lancet 10267 396
#> 2 33341138 MED 33341138 10.1… Trip… Wang J, Xu … Lancet 10267 396
#> 3 34100426 MED 34100426 10.4… New … Lima MN, Ba… Neural Rege… 1 17
#> 4 34184352 MED 34184352 10.1… Stru… Chhibber-Go… Protein Sci 9 30
#> 5 34380494 MED 34380494 10.1… Publ… Heuschen AK… Malar J 1 20
#> 6 33530764 MED 33530764 10.1… Disc… Hoarau M, V… J Enzyme In… 1 36
#> 7 34399767 MED 34399767 10.1… Inve… Njau J, Sil… Malar J 1 20
#> 8 PPR385006 PPR <NA> 10.2… Temp… Ingholt MM,… <NA> <NA> <NA>
#> 9 34419123 MED 34419123 10.1… Burd… Dao F, Djon… Parasit Vec… 1 14
#> 10 34376219 MED 34376219 10.1… An a… Wanzira H, … BMC Health … 1 21
#> # … with 90 more rows, and 20 more variables: pubYear <chr>, journalIssn <chr>,
#> # pageInfo <chr>, pubType <chr>, isOpenAccess <chr>, inEPMC <chr>,
#> # inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>, versionNumber <int>
To get an exact match, use quotes as in the following example:
europepmc::epmc_search('"Human malaria parasites"')
#> # A tibble: 100 × 29
#> id source pmid doi title authorString journalTitle pubYear journalIssn
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 34415329 MED 34415329 10.1… Func… Kimata-Arig… J Biochem 2021 "0021-924x…
#> 2 34087264 MED 34087264 10.1… Dive… Goh XT, Lim… Mol Biochem… 2021 "0166-6851…
#> 3 34400833 MED 34400833 10.1… A he… Tintó-Font … Nat Microbi… 2021 "2058-5276"
#> 4 33789941 MED 33789941 10.1… Addi… Kwon H, Sim… mSphere 2021 "2379-5042"
#> 5 34211355 MED 34211355 <NA> An E… Clark NF, T… Yale J Biol… 2021 "0044-0086…
#> 6 34362867 MED 34362867 10.4… High… Lai MY, Raf… Trop Biomed 2021 "0127-5720…
#> 7 33693917 MED 33693917 10.1… Non-… Antinori S,… J Travel Med 2021 "1195-1982…
#> 8 32470136 MED 32470136 10.1… C-te… Kimata-Arig… J Biochem 2020 "0021-924x…
#> 9 PPR353209 PPR <NA> 10.1… 5-me… Liu M, Guo … <NA> 2021 <NA>
#> 10 33797521 MED 33797521 10.4… Comp… Mat Salleh … Trop Biomed 2021 "0127-5720…
#> # … with 90 more rows, and 20 more variables: pubType <chr>,
#> # isOpenAccess <chr>, inEPMC <chr>, inPMC <chr>, hasPDF <chr>, hasBook <chr>,
#> # hasSuppl <chr>, citedByCount <int>, hasReferences <chr>,
#> # hasTextMinedTerms <chr>, hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, journalVolume <chr>, pageInfo <chr>,
#> # issue <chr>, pmcid <chr>, versionNumber <int>
Managing search results
By default, 100 records are returned, but the number of results can be expanded or limited with the limit
parameter.
europepmc::epmc_search('"Human malaria parasites"', limit = 10)
#> # A tibble: 10 × 28
#> id source pmid doi title authorString journalTitle pubYear journalIssn
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 34415329 MED 34415329 10.1… Func… Kimata-Arig… J Biochem 2021 "0021-924x…
#> 2 34087264 MED 34087264 10.1… Dive… Goh XT, Lim… Mol Biochem… 2021 "0166-6851…
#> 3 34400833 MED 34400833 10.1… A he… Tintó-Font … Nat Microbi… 2021 "2058-5276"
#> 4 33789941 MED 33789941 10.1… Addi… Kwon H, Sim… mSphere 2021 "2379-5042"
#> 5 34211355 MED 34211355 <NA> An E… Clark NF, T… Yale J Biol… 2021 "0044-0086…
#> 6 34362867 MED 34362867 10.4… High… Lai MY, Raf… Trop Biomed 2021 "0127-5720…
#> 7 33693917 MED 33693917 10.1… Non-… Antinori S,… J Travel Med 2021 "1195-1982…
#> 8 32470136 MED 32470136 10.1… C-te… Kimata-Arig… J Biochem 2020 "0021-924x…
#> 9 PPR353209 PPR <NA> 10.1… 5-me… Liu M, Guo … <NA> 2021 <NA>
#> 10 33797521 MED 33797521 10.4… Comp… Mat Salleh … Trop Biomed 2021 "0127-5720…
#> # … with 19 more variables: pubType <chr>, isOpenAccess <chr>, inEPMC <chr>,
#> # inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, journalVolume <chr>, pageInfo <chr>,
#> # issue <chr>, pmcid <chr>
Results are sorted by relevance. Other options via the sort
parameter are
-
sort = 'cited'
by the number of citation, descending from the most cited publication -
sort = 'date'
by date published starting with the most recent publication
Search by DOIs
Sometimes, you would like to check, if articles are indexed in Europe PMC using DOI names, a widely used identifier for scholarly articles. Use epmc_search_by_doi()
for this purpose.
my_dois <- c(
"10.1159/000479962",
"10.1002/sctm.17-0081",
"10.1161/strokeaha.117.018077",
"10.1007/s12017-017-8447-9"
)
europepmc::epmc_search_by_doi(doi = my_dois)
#> # A tibble: 4 × 28
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 28957815 MED 28957815 10.1… Clin… Schnieder M… Eur Neurol 5-6 78
#> 2 28941317 MED 28941317 10.1… Conc… Doeppner TR… Stem Cells … 11 6
#> 3 29018132 MED 29018132 10.1… One-… Psychogios … Stroke 11 48
#> 4 28623611 MED 28623611 10.1… Defe… Carboni E, … Neuromolecu… 2-3 19
#> # … with 19 more variables: pubYear <chr>, journalIssn <chr>, pageInfo <chr>,
#> # pubType <chr>, isOpenAccess <chr>, inEPMC <chr>, inPMC <chr>, hasPDF <chr>,
#> # hasBook <chr>, hasSuppl <chr>, citedByCount <int>, hasReferences <chr>,
#> # hasTextMinedTerms <chr>, hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>
Output options
By default, a non-nested data frame printed as tibble is returned. Other formats are output = "id_list"
returning a list of IDs and sources, and output = “‘raw’”” for getting full metadata as list. Please be aware that these lists can become very large.
More advanced options to search Europe PMC
Author search
Use the Europe PMC query syntax to search by author names:
europepmc::epmc_search('AUTH:"Salmon Maelle"')
#> # A tibble: 10 × 28
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 30378432 MED 30378432 10.1… When… Milà C, Sal… Environ Sci… 22 52
#> 2 29778830 MED 29778830 10.1… Wear… Salmon M, M… Environ Int <NA> 117
#> 3 29751338 MED 29751338 10.1… Use … Kumar MK, S… Environ Pol… <NA> 239
#> 4 29330030 MED 29330030 10.1… Heal… Mueller N, … Prev Med <NA> 109
#> 5 29626773 MED 29626773 10.1… Deve… Sanchez M, … Sci Total E… <NA> 634
#> 6 29088243 MED 29088243 10.1… Time… Schumacher … PLoS One 10 12
#> 7 28606699 MED 28606699 10.1… Inte… Tonne C, Sa… Int J Hyg E… 6 220
#> 8 28708095 MED 28708095 10.3… Pred… Sanchez M, … Int J Envir… 7 14
#> 9 27063588 MED 27063588 10.2… A sy… Salmon M, S… Euro Survei… 13 21
#> 10 26250543 MED 26250543 10.1… Baye… Salmon M, S… Biom J 6 57
#> # … with 19 more variables: pubYear <chr>, journalIssn <chr>, pageInfo <chr>,
#> # pubType <chr>, isOpenAccess <chr>, inEPMC <chr>, inPMC <chr>, hasPDF <chr>,
#> # hasBook <chr>, hasSuppl <chr>, citedByCount <int>, hasReferences <chr>,
#> # hasTextMinedTerms <chr>, hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>
Europe PMC Advanced Search has a auto-suggest field for author names if you feel unsure how the name you are searching for is indexed in Europe PMC. Using the Boolean OR
operator allows searching for more than one spelling variant:
q <- 'AUTH:"PÜHLER Alfred" OR AUTH:"Pühler Alfred Prof. Dr." OR AUTH:"Puhler A"'
europepmc::epmc_search(q, limit = 1000)
#> # A tibble: 590 × 29
#> id source pmid pmcid doi title authorString journalTitle journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 34367203 MED 34367203 PMC8… 10.3… ExoS… Geiger O, S… Front Plant… 12
#> 2 34361893 MED 34361893 PMC8… 10.3… Indi… Hassa J, Kl… Microorgani… 9
#> 3 34040261 MED 34040261 PMC8… 10.1… Swar… Warnat-Herr… Nature 594
#> 4 33589928 MED 33589928 <NA> 10.1… Impl… Mayer G, Mü… Brief Bioin… <NA>
#> 5 33643369 MED 33643369 PMC7… 10.3… Exop… Castellani … Front Plant… 12
#> 6 33441124 MED 33441124 PMC7… 10.1… Dise… Aschenbrenn… Genome Med 13
#> 7 PPR264825 PPR <NA> <NA> 10.2… The … Droste J, O… <NA> <NA>
#> 8 33220679 MED 33220679 <NA> 10.1… Glob… Nilsson JF,… FEMS Microb… 97
#> 9 33348776 MED 33348776 PMC7… 10.3… The … Maus I, Tub… Microorgani… 8
#> 10 33296687 MED 33296687 PMC7… 10.1… Long… Bernardes J… Immunity 53
#> # … with 580 more rows, and 20 more variables: pubYear <chr>,
#> # journalIssn <chr>, pageInfo <chr>, pubType <chr>, isOpenAccess <chr>,
#> # inEPMC <chr>, inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, issue <chr>, versionNumber <int>
There is a considerable overlap between common names. The integration of ORCID, a persistent author identifier, allows unambiguous search for personal publications in Europe PMC. For example, here’s how to search for publications written by Bernd Weisshaar (ORCID: https://orcid.org/0000-0002-7635-3473) sorted by the number of times cited in descending order:
europepmc::epmc_search('AUTHORID:"0000-0002-7635-3473"', limit = 200, sort = "cited")
#> # A tibble: 150 × 28
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 21873998 MED 21873998 10.1… The … Wang X, Wan… Nat Genet 10 43
#> 2 20674465 MED 20674465 10.1… MYB … Dubos C, St… Trends Plan… 10 15
#> 3 11597504 MED 11597504 10.1… The … Stracke R, … Curr Opin P… 5 4
#> 4 11906833 MED 11906833 10.1… bZIP… Jakoby M, W… Trends Plan… 3 7
#> 5 14756321 MED 14756321 10.1… An A… Rosso MG, L… Plant Mol B… 1-2 53
#> 6 12679534 MED 12679534 10.1… The … Heim MA, Ja… Mol Biol Ev… 5 20
#> 7 11080161 MED 11080161 10.1… Tran… Jin H, Comi… EMBO J 22 19
#> 8 15361138 MED 15361138 10.1… Comp… Zimmermann … Plant J 1 40
#> 9 15255866 MED 15255866 10.1… TT2,… Baudry A, H… Plant J 3 39
#> 10 17419845 MED 17419845 10.1… Diff… Stracke R, … Plant J 4 50
#> # … with 140 more rows, and 19 more variables: pubYear <chr>,
#> # journalIssn <chr>, pageInfo <chr>, pubType <chr>, isOpenAccess <chr>,
#> # inEPMC <chr>, inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>
Annotations
Europe PMC provides text-mined annotations contained in abstracts and open access full-text articles.
These automatically identified concepts and term can be retrieved at the article-level:
europepmc::epmc_annotations_by_id(c("MED:28585529", "PMC:PMC1664601"))
#> # A tibble: 774 × 13
#> source ext_id pmcid prefix exact postfix name uri id type section
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 MED 28585529 PMC5467160 "tive… Beta… " allo… Beta… http… http… Clin… Title …
#> 2 MED 28585529 PMC5467160 "nomi… genes ".\nRa… gene http… http… Sequ… Title …
#> 3 MED 28585529 PMC5467160 "nomi… genes " is o… gene http… http… Sequ… Abstra…
#> 4 MED 28585529 PMC5467160 " One… genes " are … gene http… http… Sequ… Abstra…
#> 5 MED 28585529 PMC5467160 " ide… beet " (Bet… Beta… http… http… Clin… Abstra…
#> 6 MED 28585529 PMC5467160 "ify … Beta… " ssp.… Beta… http… http… Clin… Abstra…
#> 7 MED 28585529 PMC5467160 "ulga… gene " Rz2 … gene http… http… Sequ… Abstra…
#> 8 MED 28585529 PMC5467160 "e ge… geno… " sequ… geno… http… http… Sequ… Abstra…
#> 9 MED 28585529 PMC5467160 "eque… beet ". Our… Beta… http… http… Clin… Abstra…
#> 10 MED 28585529 PMC5467160 "disc… genes " rele… gene http… http… Sequ… Abstra…
#> # … with 764 more rows, and 2 more variables: provider <chr>, subType <chr>
To obtain a list of articles where Europe PMC has text-minded annotations, either subset the resulting data.frame
tt <- epmc_search("malaria")
tt[tt$hasTextMinedTerms == "Y" | tt$hasTMAccessionNumbers == "Y",]
#> # A tibble: 94 × 29
#> id source pmid doi title authorString journalTitle issue journalVolume
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 34100426 MED 34100426 10.4… New … Lima MN, Ba… Neural Rege… 1 17
#> 2 33535760 MED 33535760 10.3… THE … Damiani E, … Acta Med Hi… 2 18
#> 3 33530764 MED 33530764 10.1… Disc… Hoarau M, V… J Enzyme In… 1 36
#> 4 33372863 MED 33372863 10.1… ATP2… Lamy A, Mac… Emerg Micro… 1 10
#> 5 33594960 MED 33594960 10.1… Mana… Kambale-Kom… Hematology 1 26
#> 6 34283002 MED 34283002 10.1… <i>P… Alhassan AM… Pharm Biol 1 59
#> 7 34184352 MED 34184352 10.1… Stru… Chhibber-Go… Protein Sci 9 30
#> 8 34362867 MED 34362867 10.4… High… Lai MY, Raf… Trop Biomed 3 38
#> 9 34399767 MED 34399767 10.1… Inve… Njau J, Sil… Malar J 1 20
#> 10 PPR385006 PPR <NA> 10.2… Temp… Ingholt MM,… <NA> <NA> <NA>
#> # … with 84 more rows, and 20 more variables: pubYear <chr>, journalIssn <chr>,
#> # pageInfo <chr>, pubType <chr>, isOpenAccess <chr>, inEPMC <chr>,
#> # inPMC <chr>, hasPDF <chr>, hasBook <chr>, hasSuppl <chr>,
#> # citedByCount <int>, hasReferences <chr>, hasTextMinedTerms <chr>,
#> # hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>, pmcid <chr>, versionNumber <int>
or expand the query choosing an annotation type or provider from the Europe PMC Advanced Search query builder.
epmc_search('malaria AND (ANNOTATION_TYPE:"Cell") AND (ANNOTATION_PROVIDER:"Europe PMC")')
#> # A tibble: 100 × 28
#> id source pmid pmcid doi title authorString journalTitle issue
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 31782768 MED 31782768 PMC79… 10.1… Incre… Jongo SA, Ch… Clin Infect… 11
#> 2 31808816 MED 31808816 PMC76… 10.1… Retin… Villaverde C… J Pediatric… 5
#> 3 30989220 MED 30989220 PMC73… 10.1… Clini… Enane LA, Su… J Pediatric… 3
#> 4 31300826 MED 31300826 PMC72… 10.1… Black… Opoka RO, Wa… Clin Infect… 11
#> 5 31807752 MED 31807752 <NA> 10.1… Malar… Marcombe S, … J Med Entom… 3
#> 6 31505001 MED 31505001 <NA> 10.1… Acute… Oshomah-Bell… J Trop Pedi… 2
#> 7 31687768 MED 31687768 <NA> 10.1… Evalu… Ferdinand DY… Trans R Soc… 3
#> 8 31693130 MED 31693130 PMC71… 10.1… Reduc… Kingston HWF… J Infect Dis 9
#> 9 31679146 MED 31679146 <NA> 10.1… A Sys… Thiengsusuk … Eur J Drug … 2
#> 10 30852586 MED 30852586 <NA> 10.1… An Ex… Woodford J, … J Infect Dis 6
#> # … with 90 more rows, and 19 more variables: journalVolume <chr>,
#> # pubYear <chr>, journalIssn <chr>, pageInfo <chr>, pubType <chr>,
#> # isOpenAccess <chr>, inEPMC <chr>, inPMC <chr>, hasPDF <chr>, hasBook <chr>,
#> # hasSuppl <chr>, citedByCount <int>, hasReferences <chr>,
#> # hasTextMinedTerms <chr>, hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>
Data integrations
Another nice feature of Europe PMC is to search for cross-references between Europe PMC to other databases. For instance, to get publications cited by entries in the Protein Data bank in Europe published 2016:
europepmc::epmc_search('(HAS_PDB:y) AND FIRST_PDATE:2016')
#> # A tibble: 100 × 28
#> id source pmid pmcid doi title authorString journalTitle issue
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 27989121 MED 27989121 PMC58… 10.1… Short… Lin J, Pozha… Biochemistry 2
#> 2 27815281 MED 27815281 PMC52… 10.1… Struc… Wakamatsu T,… Appl Enviro… 2
#> 3 28035004 MED 28035004 PMC53… 10.1… Struc… Waz S, Nakam… J Biol Chem 7
#> 4 28030602 MED 28030602 PMC51… 10.1… Struc… Christensen … PLoS One 12
#> 5 28066558 MED 28066558 PMC51… 10.1… Struc… Gai Z, Wang … Cell Discov <NA>
#> 6 28024149 MED 28024149 PMC53… 10.1… Cryst… Kuk AC, Mash… Nat Struct … 2
#> 7 28031486 MED 28031486 PMC52… 10.1… Struc… Sevrioukova … Proc Natl A… 3
#> 8 28011634 MED 28011634 PMC53… 10.1… Struc… Levdikov VM,… J Biol Chem 7
#> 9 28009010 MED 28009010 PMC51… 10.1… Struc… Zhao H, Wei … Sci Rep <NA>
#> 10 28197319 MED 28197319 PMC53… 10.1… Struc… Johannes JW,… ACS Med Che… 2
#> # … with 90 more rows, and 19 more variables: journalVolume <chr>,
#> # pubYear <chr>, journalIssn <chr>, pageInfo <chr>, pubType <chr>,
#> # isOpenAccess <chr>, inEPMC <chr>, inPMC <chr>, hasPDF <chr>, hasBook <chr>,
#> # hasSuppl <chr>, citedByCount <int>, hasReferences <chr>,
#> # hasTextMinedTerms <chr>, hasDbCrossReferences <chr>, hasLabsLinks <chr>,
#> # hasTMAccessionNumbers <chr>, firstIndexDate <chr>,
#> # firstPublicationDate <chr>
The following sources are supported
- CHEBI a database and ontology of chemical entities of biological interest https://www.ebi.ac.uk/chebi/
- CHEMBL a database of bioactive drug-like small molecules https://www.ebi.ac.uk/chembldb/
- EMBL now ENA, provides a comprehensive record of the world’s nucleotide sequencing information https://www.ebi.ac.uk/ena/browser/
- INTACT provides a freely available, open source database system and analysis tools for molecular interaction data https://www.ebi.ac.uk/intact/
- INTERPRO provides functional analysis of proteins by classifying them into families and predicting domains and important sites https://www.ebi.ac.uk/interpro/
- OMIM a comprehensive and authoritative compendium of human genes and genetic phenotypes https://www.omim.org/about
- PDB European resource for the collection, organisation and dissemination of data on biological macromolecular structures https://www.ebi.ac.uk/pdbe/
- UNIPROT comprehensive and freely accessible resource of protein sequence and functional information https://www.uniprot.org/
- PRIDE PRIDE Archive - proteomics data repository https://www.ebi.ac.uk/pride/archive/
To retrieve metadata about these external database links, use europepmc_epmc_db()
.
Citations and reference sections
Europe PMC let us also obtain citation metadata and reference sections. For retrieving citation metadata per article, use
europepmc::epmc_citations("9338777", limit = 500)
#> # A tibble: 233 × 11
#> id source citationType title authorString journalAbbrevia… pubYear volume
#> <chr> <chr> <chr> <chr> <chr> <chr> <int> <chr>
#> 1 33353… MED review-arti… Xeno… Galow AM, G… Int J Mol Sci 2020 21
#> 2 31565… MED research-ar… Regu… Chung HC, N… J Vet Sci 2019 20
#> 3 30230… MED research su… Bioe… Legallais C… Adv Healthc Mat… 2018 7
#> 4 30264… MED research su… Porc… Fiebig U, F… Xenotransplanta… 2018 25
#> 5 29756… MED historical … Infe… Weiss RA. Xenotransplanta… 2018 25
#> 6 29642… MED research su… Trac… Kawasaki J,… Viruses 2018 10
#> 7 28768… MED research su… Pres… Kawasaki J,… J Virol 2017 91
#> 8 28437… MED research su… Thre… Colon-Moran… Virology 2017 507
#> 9 28054… MED research su… Anti… Inoue Y, Yo… Ann Biomed Eng 2017 45
#> 10 27832… MED research-ar… Tran… Kim N, Choi… PLoS One 2016 11
#> # … with 223 more rows, and 3 more variables: issue <chr>, citedByCount <int>,
#> # pageInfo <chr>
For reference section from an article:
europepmc::epmc_refs("28632490", limit = 200)
#> # A tibble: 169 × 19
#> id source citationType title authorString journalAbbrevia… issue pubYear
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <int>
#> 1 12002480 MED JOURNAL ART… Tric… Adolfsson-E… Chemosphere 9-10 2002
#> 2 18795164 MED JOURNAL ART… In v… Ahn KC, Zha… Environ Health … 9 2008
#> 3 18556606 MED JOURNAL ART… Effe… Aiello AE, … Am J Public Hea… 8 2008
#> 4 17683018 MED JOURNAL ART… Cons… Aiello AE, … Clin Infect Dis <NA> 2007
#> 5 15273108 MED JOURNAL ART… Rela… Aiello AE, … Antimicrob Agen… 8 2004
#> 6 18207219 MED JOURNAL ART… The … Allmyr M, H… Sci Total Envir… 1 2008
#> 7 17007908 MED JOURNAL ART… Tric… Allmyr M, A… Sci Total Envir… 1 2006
#> 8 26948762 MED JOURNAL ART… Pres… Alvarez-Riv… J Chromatogr A <NA> 2016
#> 9 23192912 MED JOURNAL ART… Expo… Anderson SE… Toxicol Sci 1 2012
#> 10 25837385 MED JOURNAL ART… Obse… Vladar EK, … Methods Cell Bi… <NA> 2015
#> # … with 159 more rows, and 11 more variables: volume <chr>, pageInfo <chr>,
#> # citedOrder <int>, match <chr>, essn <chr>, issn <chr>,
#> # publicationTitle <chr>, publisherLoc <chr>, publisherName <chr>,
#> # externalLink <chr>, doi <chr>
Fulltext access
Europe PMC gives not only access to metadata, but also to full-texts. Adding AND (OPEN_ACCESS:y)
to your search query, returns only those articles where Europe PMC has also the fulltext.
Fulltext as xml document can accessed via the PMID or the PubMed Central ID (PMCID):
europepmc::epmc_ftxt("PMC3257301")
#> {xml_document}
#> <article article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML">
#> [1] <front>\n <journal-meta>\n <journal-id journal-id-type="nlm-ta">PLoS ...
#> [2] <body>\n <sec id="s1">\n <title>Introduction</title>\n <p>Atmosphe ...
#> [3] <back>\n <ack>\n <p>We would like to thank Dr. C. Gourlay and Dr. T. ...