Python 3: using requests does not get the full content of a web page

后端未结

关注

 2  1399

甜味超标 2020-12-14 12:27

I am testing using the requests module to get the content of a webpage. But when I look at the content I see that it does not get the full content of the page.<

2条回答

青春惊慌失措 (楼主)

2020-12-14 12:54

The page is rendered with JavaScript making more requests to fetch additional data. You can fetch the complete page with selenium.

from bs4 import BeautifulSoup
from selenium import webdriver
driver = webdriver.Chrome()
url = "https://shop.nordstrom.com/c/womens-dresses-shop?origin=topnav&cm_sp=Top%20Navigation-_-Women-_-Dresses&offset=11&page=3&top=72"
driver.get(url)
soup = BeautifulSoup(driver.page_source, 'html.parser')
driver.quit()
print(soup.prettify())

For other solutions see my answer to Scraping Google Finance (BeautifulSoup)

0 讨论(0)

查看其它2个回答