urllib2

Logging into website with multiple pages using Python (urllib2 and cookielib)

谁说胖子不能爱 submitted on 2019-12-08 05:02:36

Question: I am writing a script to retrieve transaction information from my bank's home banking website, for use in a personal mobile application. The website is laid out like so:

https://homebanking.purduefed.com/OnlineBanking/Login.aspx -> Enter username -> Submit form ->
https://homebanking.purduefed.com/OnlineBanking/AOP/Password.aspx -> Enter password -> Submit form ->
https://homebanking.purduefed.com/OnlineBanking/AccountSummary.aspx

The problem I am having is that, since there are 2 separate pages…
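
A common way to handle a login that spans multiple pages is to share a single cookielib.CookieJar across every request, so the session cookie set on the first page survives both form submissions. A minimal sketch, assuming hypothetical form field names (UsernameField, PasswordField) that would have to be read from the site's actual HTML:

    import urllib
    import urllib2
    import cookielib

    # One cookie jar shared by all requests made through this opener.
    cj = cookielib.CookieJar()
    opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))

    # Step 1: submit the username form (field name is a placeholder).
    data = urllib.urlencode({'UsernameField': 'myuser'})
    opener.open('https://homebanking.purduefed.com/OnlineBanking/Login.aspx', data)

    # Step 2: submit the password form on the second page.
    data = urllib.urlencode({'PasswordField': 'mypassword'})
    opener.open('https://homebanking.purduefed.com/OnlineBanking/AOP/Password.aspx', data)

    # Step 3: the cookies stored above authenticate this request.
    summary = opener.open('https://homebanking.purduefed.com/OnlineBanking/AccountSummary.aspx').read()

Note that ASP.NET pages usually also require hidden __VIEWSTATE and __EVENTVALIDATION fields to be scraped from each page and echoed back in the POST data.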

urllib2 timeout

丶灬走出姿态 submitted on 2019-12-08 02:45:29

Question: I'm using the urllib2 library in my code, with a lot of urlopen (EDIT: loadurl) calls. I have a problem on my network: when I'm browsing sites, my browser sometimes gets stuck on "Connecting" to a certain website, and sometimes returns a timeout. My question is: if I use urllib2 in my code, can it time out when trying to connect to a certain website for too long, or will the code get stuck on that line? I know that urllib2 can handle timeouts without specifying one in the code, but it can apply for…
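
For reference, urllib2.urlopen() accepts a timeout argument (since Python 2.6); without it the call falls back to the global socket default, which may block for a long time. A minimal sketch showing the per-call timeout and the exceptions it can raise (example.com is a placeholder):

    import socket
    import urllib2

    try:
        # Give up if connecting (or a later read) takes longer than 10 seconds.
        response = urllib2.urlopen('http://example.com', timeout=10)
        data = response.read()
    except urllib2.URLError as e:
        # A connect timeout surfaces as URLError with a socket.timeout reason.
        print 'Request failed:', e.reason
    except socket.timeout:
        # A timeout during read() can also raise socket.timeout directly.
        print 'Request timed out'

Alternatively, socket.setdefaulttimeout(10) sets the same limit process-wide for code paths that don't expose the timeout parameter.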

urllib2 SSL3_CHECK_CERT_AND_ALGORITHM: dh key too small

不羁的心 submitted on 2019-12-08 01:59:17

Question: Attempting to send a SOAP request using suds, on Python 2.7.6. I'm not very well versed in security, but I am led to believe that the security key, on either my machine or the server's machine, is too small; I'm not sure how to resolve this. Do I generate some new key and create a custom opener? Any assistance/guidance would be helpful. Stacktrace:

Traceback (most recent call last):
  File "read_xml.py", line 71, in <module>
    client.service.PO(purchase_orders)
  File "/usr/local/lib/python2…
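
The "dh key too small" error means the server offers a Diffie-Hellman key that your OpenSSL build rejects as too weak. One commonly suggested workaround, sketched below under the assumption of Python 2.7.9+ (where urlopen grew a context argument), is an SSL context whose cipher list skips DH key exchange entirely; on 2.7.6 the equivalent requires a custom HTTPSHandler or an interpreter upgrade:

    import ssl
    import urllib2

    ctx = ssl.create_default_context()
    # Exclude Diffie-Hellman key exchange so OpenSSL never evaluates the weak DH key.
    ctx.set_ciphers('HIGH:!DH:!aNULL')

    # The endpoint URL is a placeholder for the SOAP service suds is calling.
    response = urllib2.urlopen('https://example.com/soap-endpoint', context=ctx)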

Python urllib2 returning an empty string

若如初见. submitted on 2019-12-08 00:29:22

Question: I'm trying to retrieve the following URL: http://www.winkworth.co.uk/sale/property/flat-for-sale-in-masefield-court-london-n5/HIH140004

import urllib2
response = urllib2.urlopen('http://www.winkworth.co.uk/rent/property/terraced-house-to-rent-in-mill-road--/WOT140129')
response.read()

However, I'm getting an empty string. When I try it through the browser or with cURL it works fine. Any ideas what's going on?

Answer 1: I got a response when using the requests library but not when using urllib2,…
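
When a browser and cURL get content but urllib2 gets an empty body, the server is often keying on request headers, most commonly User-Agent. A sketch worth trying first, sending browser-like headers (the exact header values are illustrative):

    import urllib2

    url = 'http://www.winkworth.co.uk/rent/property/terraced-house-to-rent-in-mill-road--/WOT140129'
    req = urllib2.Request(url, headers={
        # Many sites serve an empty or error page to unrecognized agents.
        'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64)',
        'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
    })
    html = urllib2.urlopen(req).read()
    print len(html)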

urllib2: reading https url failure

岁酱吖の submitted on 2019-12-07 23:59:54

Question: This code fails on my Ubuntu machine, but works well on other hosts.

>>> from urllib2 import urlopen
>>> urlopen("https://courtapps.utcourts.gov/XchangeWEB/login")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python2.7/urllib2.py", line 126, in urlopen
    return _opener.open(url, data, timeout)
  File "/usr/lib/python2.7/urllib2.py", line 400, in open
    response = self._open(req, data)
  File "/usr/lib/python2.7/urllib2.py", line 418, in _open
    '_open', req)
  File "/usr…
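
On older 2.7 builds a traceback like this often ends in an SSL handshake error, typically because the SSLv23 default fails against a server that only negotiates one specific TLS version. A sketch of a common workaround, assuming the server wants TLSv1 (a guess; the truncated traceback doesn't confirm it):

    import httplib
    import socket
    import ssl
    import urllib2

    class TLSv1Connection(httplib.HTTPSConnection):
        def connect(self):
            sock = socket.create_connection((self.host, self.port), self.timeout)
            # Pin the handshake to TLSv1 instead of the SSLv23 default.
            self.sock = ssl.wrap_socket(sock, self.key_file, self.cert_file,
                                        ssl_version=ssl.PROTOCOL_TLSv1)

    class TLSv1Handler(urllib2.HTTPSHandler):
        def https_open(self, req):
            return self.do_open(TLSv1Connection, req)

    opener = urllib2.build_opener(TLSv1Handler())
    page = opener.open('https://courtapps.utcourts.gov/XchangeWEB/login').read()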

Does urllib2.urlopen() actually fetch the page?

时光毁灭记忆、已成空白 submitted on 2019-12-07 20:30:22

Question: I was wondering: when I use urllib2.urlopen(), does it just read the headers, or does it actually bring back the entire webpage? I.e., does the HTML page actually get fetched on the urlopen() call or on the read() call?

handle = urllib2.urlopen(url)
html = handle.read()

The reason I ask is this workflow: I have a list of URLs (some of them using short-URL services). I only want to read a webpage if I haven't seen that URL before, so I need to call urlopen() and use geturl() to get the final page that…
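
The short answer: urlopen() sends the request and consumes the status line and headers, but the response body stays on the socket until read() is called. That means the dedup workflow can resolve redirects without downloading bodies, roughly like this (the URLs are placeholders):

    import urllib2

    seen = set()

    def final_url(url):
        # urlopen() performs the request and follows redirects,
        # but does not consume the response body.
        handle = urllib2.urlopen(url)
        resolved = handle.geturl()   # the URL after all redirects
        handle.close()               # close without reading the body
        return resolved

    for url in ['http://bit.ly/someshort', 'http://example.com/page']:
        target = final_url(url)
        if target in seen:
            continue
        seen.add(target)
        html = urllib2.urlopen(target).read()  # fetch only unseen pages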

Form Submission in Python Without Name Attribute

穿精又带淫゛_ submitted on 2019-12-07 19:57:31

Question: Background: Using urllib and urllib2 in Python, you can do a form submission. You first create a dictionary:

formdictionary = { 'search' : 'stackoverflow' }

Then you use the urlencode method of urllib to transform this dictionary:

params = urllib.urlencode(formdictionary)

You can now make a URL request with urllib2, passing the variable params as the second parameter, with the first parameter being the URL:

open = urllib2.urlopen('www.searchpage.com', params)

From my understanding, urlencode…
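
The detail this question hinges on: a form control with no name attribute is never included in a submission by the browser, so there is simply nothing to encode for it; only named fields belong in the dictionary. A sketch against a hypothetical search form (note also that urllib2 requires a scheme in the URL, which the snippet above omits):

    import urllib
    import urllib2

    # Only named inputs participate in a form submission; an unnamed
    # submit button contributes no key/value pair at all.
    formdictionary = {'search': 'stackoverflow'}
    params = urllib.urlencode(formdictionary)

    # The scheme is required, or urllib2 raises ValueError: unknown url type.
    response = urllib2.urlopen('http://www.searchpage.com/search', params)
    print response.read()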

urllib2 doesn't use proxy (Fiddler2), set using ProxyHandler

試著忘記壹切 submitted on 2019-12-07 18:26:35

Question: I have Fiddler2 listening on 0.0.0.0:8888.

try:
    data = ''
    proxy = urllib2.ProxyHandler({'http': '127.0.0.1:8888'})  # also tried {'http': 'http://127.0.0.1:8888/'}
    opener = urllib2.build_opener(proxy)
    urllib2.install_opener(opener)
    req = urllib2.Request('http://www.google.com')
    response = urllib2.urlopen(req)
    the_page = response.read()
    print the_page
except Exception, detail:
    print "Err ", detail

I don't see the GET or any request to Google in Fiddler (but I can see other requests); is there a…
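
One thing worth ruling out: install_opener() only changes what urllib2.urlopen() uses, and any other code that installs an opener later silently replaces yours. Calling the opener directly removes that variable; a sketch:

    import urllib2

    proxy = urllib2.ProxyHandler({'http': 'http://127.0.0.1:8888'})
    opener = urllib2.build_opener(proxy)

    # Use the opener directly rather than install_opener(), so nothing
    # else in the process can swap in a proxy-less opener.
    response = opener.open('http://www.google.com')
    print response.read()[:200]

Also note that ProxyHandler keys are per scheme: if the request ends up on https (e.g. via redirect), the dictionary needs an 'https' entry too, or that request bypasses Fiddler.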

HTTP Error 504: Gateway Time-out when trying to read a reddit comments post

一笑奈何 submitted on 2019-12-07 16:57:52

Question: I am encountering an error when trying to fetch a comments page from reddit. This has happened with various URLs (not all of them containing special characters), and this is one of them. Within a one-hour time frame there may be 1000 or more requests to the reddit.com domain.

hdr = {"User-Agent": "My Agent"}

try:
    req = urllib2.Request("http://www.reddit.com/r/gaming/"
                          "comments/1bjuee/when_pokémon_was_good", headers=hdr)
    htmlSource = urllib2.urlopen(req).read()
except Exception as inst:
    print inst

Output:…
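
Two separate issues are plausibly in play: the é in the path should be percent-encoded before urllib2 sends it, and reddit throttles clients that exceed its request rate, which a 504 at 1000+ requests per hour is consistent with. A sketch of the encoding step:

    # -*- coding: utf-8 -*-
    import urllib
    import urllib2

    hdr = {"User-Agent": "My Agent"}

    path = u'/r/gaming/comments/1bjuee/when_pok\u00e9mon_was_good'
    # Percent-encode the non-ASCII bytes in the path; '/' stays literal.
    encoded = urllib.quote(path.encode('utf-8'), safe='/')

    req = urllib2.Request('http://www.reddit.com' + encoded, headers=hdr)
    htmlSource = urllib2.urlopen(req).read()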

Python: clicking a button [duplicate]

六月ゝ 毕业季﹏ submitted on 2019-12-07 16:48:41

Question: This question already has answers here: Python: Clicking a button with urllib or urllib2 (3 answers). Closed 6 years ago. I have problems clicking a button that looks like this in the HTML code:

<form method="post">
  <br>
  <input type="hidden" value="6" name="deletetree">
  <input type="submit" value="Delete Tree" name="pushed">
</form>

and the URL that needs to be generated looks like this:

http://mysite.com/management.php?Category=2&id_user=19&deteletree=6&pushed=Delete+Tree

Update: I tried…
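
Since the form has method="post", "clicking" the button amounts to POSTing the two named fields to the page's URL; passing a data argument to urlopen() switches the request to POST. A sketch using the URL from the question:

    import urllib
    import urllib2

    # Submitting the form == sending its named fields as the POST body.
    data = urllib.urlencode({'deletetree': '6', 'pushed': 'Delete Tree'})
    url = 'http://mysite.com/management.php?Category=2&id_user=19'
    response = urllib2.urlopen(url, data)
    print response.read()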