Python open html file, take screenshot, crop and save as image

随声附和 提交于 2019-12-14 00:32:20

问题


I am using the Bokeh package to generate maps to show the results of a simulation. The output is individual maps in html format with interactivity. The interactivity is needed for individual maps.

See this link for an example:

http://docs.bokeh.org/en/0.10.0/docs/gallery/texas.html

The simulation can automatically be set to run a number of times and will produce a map for each run. This could be 100's of maps. I would like to be able to stitch together the maps to create a movie - the interactivity is not required for this. Bokeh has functionality to create PNG files via the browser so it is possible to manually save each map as a file and use ffmpeg to create a movie. However this is not really an option if you need to do it for 100's of files. Currently there is no way to automatically generate PNG files via Bokeh but I believe it will be added at some point.

So I need a workaround. My thought is to open each html file from the location they are stored on the local drive, take a screen shot, crop the image to keep the required section and save. But I have not yet found a solution that works.

Cropping an image is easy:

from PIL import Image

img = Image.open(file_name)
box = (1, 1, 1000, 1000)
area = img.crop(box)
area.save('saved_image', 'jpeg')

My problem is opening the html file and taking the screen shot in the first place to feed to the above code.

For this I have tried the following but both require a URL rather than an html file. Also both use Firefox which doesn't work for me but I have installed chrome and altered the code appropriately.

How to take partial screenshot with Selenium WebDriver in python?

http://www.idiotinside.com/2015/10/20/take-screenshot-selenium-python/

My code for this is:

from selenium import webdriver

driver = webdriver.Chrome()
driver.get('file_name')
driver.save_screenshot('image.png')
driver.quit()

Which returns:

{"code":-32603,"message":"Cannot navigate to invalid URL"}

Clearly a filename is not a url so that is clear. It works fine if you pass it a website. Any help in getting a html loaded and a picture taken would be greatly appreciated! It does not have to involve Selenium.


回答1:


As of Bokeh 0.12.6, it is now easier to take those screenshots directly from Python without needing to open a browser.

Exporting PNGs looks like this

export_png(plot, filename="plot.png")

And exporting SVGs looks like this

plot.output_backend = "svg"
export_svgs(plot, filename="plot.svg")

There are some optional dependencies that need to be installed. You can find more information in the Exporting Plots section of the User Guide.




回答2:


hgazibara comment proved to be the simplest fix. Some simplified code is below to provide the answer. It would be nice if the web page didn't actually have to show itself for the screen shot to be taken. I will see if I can add this later.

import glob
from PIL import Image
from selenium import webdriver 

# get a list of all the files to open
glob_folder = os.path.join(file_location, '*.html')

html_file_list = glob.glob(glob_folder)
index = 1

for html_file in html_file_list:

    # get the name into the right format
    temp_name = "file://" + html_file

    # open in webpage
    driver = webdriver.Chrome()
    driver.get(temp_name)
    save_name = '00' + str(index) + '.png'       
    driver.save_screenshot(save_path, save_name))
    driver.quit()
    index += 1

    # crop as required
    img = Image.open(save_path, save_name))
    box = (1, 1, 1000, 1000)
    area = img.crop(box)
    area.save('cropped_image' + str(index), 'png')



回答3:


You can use SimpleHTTPServer to create a basic webserver and expose your local file.

You should run this in the path where the html is python -m SimpleHTTPServer 8000

Then just change driver.get('file_name') to driver.get('localhost:8000/file_name.html')

I recommend you to use "wait until" to be sure everything is loaded before take the screenshot:

driver.wait.until(EC.visibility_of_element_located((By.XPATH,'//*[@id="someID"]')))




回答4:


If you're running Linux, you may have to do the following steps.

First open the html file via command line in the default app. You'll probably have to do something like

import os

os.system('command to open html file')

Use the solution listed to take your screenshot.

Then do the processing in PIL as suggested.

If you're doing this in windows,

You may want to setup the AutoIT drivers so that you can use your python script to manipulate GUI windows etc.



来源:https://stackoverflow.com/questions/38568804/python-open-html-file-take-screenshot-crop-and-save-as-image

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!