So, I\'m working on a PHP script, and part of it needs to be able to query a website, then get text from it.
First off, I need to be able to query a certain website
Can this be done by getting all of the content from the webpage utilizing methods already listed above, and then using regex to remove all characters between open and closed brackets?
A page that looks like this:
stuff in here
Would then become this after regex:
h1 { font-style:... }stuff in here
And because we want to remove all of the code in between various tags such as the [style] tag, we could then first use regex to remove all characters between [style and /style] so that we are just left with:
stuff in here
Would this work then? Please reply if you think it would or if you foresee errors as I would like to create a tool with this parsing.