Hi I am using the XML package in R to scrape html pages. The page of interest is http://www.ncbi.nlm.nih.gov/protein/225903367?report=fasta and on that page there is a sequence
The problem is that the page is created dynamically using javascript, and the sequence is not visible in the rendering returned to R.
The CRAN package "rentrez" provides an interface to eutils, which is the programmatic way to query Entrez
library(rentrez)
entrez_fetch(db="protein", id="225903367", rettype="fasta")