Scraping dynamic content in a website

后端未结

关注

 4  1577

梦如初夏 2020-11-28 13:18

I need to scrape news announcements from this website, Link. The announcements seem to be generated dynamically. They dont appear in the source. I usually use mechanize but

4条回答

失恋的感觉 (楼主)

2020-11-28 13:51

In python you can use urllib and urllib2 to connect to a website and collect data. For example:

from urllib2 import urlopen
myUrl = "http://www.marketvectorsindices.com/#!News/List"
inStream = urlopen(myUrl)
instream.read(1024) # etc, in a while loop
# all your fun page parsing code (perhaps: import from xml.dom.minidom import parse)

0 讨论(0)

查看其它4个回答