How to download all files (but not HTML) from a website using wget?

前端 未结 8 997
生来不讨喜
生来不讨喜 2020-11-29 14:19

How to use wget and get all the files from website?

I need all files except the webpage files like HTML, PHP, ASP etc.

8条回答
  •  无人及你
    2020-11-29 14:53

    I was trying to download zip files linked from Omeka's themes page - pretty similar task. This worked for me:

    wget -A zip -r -l 1 -nd http://omeka.org/add-ons/themes/
    
    • -A: only accept zip files
    • -r: recurse
    • -l 1: one level deep (ie, only files directly linked from this page)
    • -nd: don't create a directory structure, just download all the files into this directory.

    All the answers with -k, -K, -E etc options probably haven't really understood the question, as those as for rewriting HTML pages to make a local structure, renaming .php files and so on. Not relevant.

    To literally get all files except .html etc:

    wget -R html,htm,php,asp,jsp,js,py,css -r -l 1 -nd http://yoursite.com
    

提交回复
热议问题