What are some good ways of keeping content from being copied to other sites

喜欢而已 提交于 2019-12-11 04:27:29

问题


I understand that no matter what I do, someone will be able to copy it. However I can still make them work hard for it. What are some good ways of making data not easily copied using php compatible coding.

--- Added ----

The data is a listing of results for certain local sports events. We send people out to collect the information, post the information, make corrections and such. However a competing website takes our results (I know they are directly copying them) and never updates them which causes people to call our office and complain.

---- Answer for my Use ----

I picked one of them, however I am going to use multiple of your answers. I am going to add my link in a using the copy pasta trick. I am going to put fake hidden text into it. I am also going to do the fake hidden text trick with different versions of the div tag that are fake (making it even harder to scrape or to do something like copy to textpad and replace it real easily), and I am going to talk to a lawyer as well about legal recourse and what I can do to make it illegal for them to copy the data (such as creative bios or something cool like that). Thanks for your help.


回答1:


Place some <div style="display: inline; position: absolute; overflow: hidden; width: 0px">useless words</div> in the text. It won't display for reading, but if someone copy and paste... "WOW where it came from WTF!! *CRY*"




回答2:


Joe, you can't really make them work really hard to get your data. It's essentially just a single request to any of your pages. Your best option is to explicitly state that you own the rights to all of your content, and that any infringement on that ownership will lead to legal ramifications*.

* Not a lawyer




回答3:


Your data will be copied to every computer that requests the page and it will stay there until the person clears their cache. To answer your question, you can't.

What you can do is create a CSS style such as:

.copy-pasta { display: none; }

And then throughout your content, add something like this:

<p class="copy-pasta">Content provided via <a href="[your url]">[your website here]</a></p>

This will increase your page rank when copy-pasters blatantly steal your content, meaning you will show up first in search results.




回答4:


How about putting links to your site in with the displayed data? No big fanfare, but just suggest that the for the most up to date figures, they can go to the real website that publishes them.

Most of what you try will only work for a time. Until you exceed their laziness factor. (What they're doing suggests a high laziness factor.)

Laws don't protect publicly available data, but you may be able to protect the packaging and presentation.




回答5:


Programs used to copy out data look for the data using pattern-matching. You could 'decorate' your data with randomly-chosen tags (like one row would have a span tag surrounding it, the next row a div, etc...). Just a thought.

Clarification: With screen-scraper at least, the user of the program specifies what HTML comes before the data they want, and what HTML comes after it. You can make it more difficult for them to automatically retrieve the data.




回答6:


Why are people calling your office to complain if the data is on a competing website? If they have a domain name that is similar enough to yours that people are confusing the two of you or if they've put something on their site that makes it look like you've endorsed them, then you've got them for trademark infringement.




回答7:


Disable the context menu is a start.

$(document).bind('contextmenu', function(e)
{
    return false;
});

Or

<body oncontextmenu="return false;">



回答8:


Forbidding people to get data is almost impossible. You can mess up your tags and make the code really dirty and hard to parse... but it's not really enough. You could also generate a big image with the data in it, this would be painful to parse! ... but you don't want to do that.

Because you said...

However a competing website takes our results (I know they are directly copying them) and never updates them which causes people to call our office and complain.

... my call would be to take this the other way and create an API allowing people to get your content in a way that YOU designed.

Also if they are just shamelessly stealing your data and they don't have the right to do it, consider a legal option.




回答9:


Another option is to use PHP code to generate images from the site's HTML. You would use the images to display the content, instead of HTML which can be easily copied out. Example code is here, and I bet you could find more code to do this by Googling:

http://www.acasystems.com/en/web-thumb-activex/faq-php-convert-html-to-image.htm




回答10:


Try Copyscape it wont prevent your content from being copied, but it will make finding the copies very easy.




回答11:


You may encrypt the data on the page, and have javascript obfuscated decoding routine that will decode it for you viewers. You may switch keys and encryption algorithms from time to time. Same javascript should disable ability to select text and/or copy it to prevent manual copy-pasting.

They won't be able to copy manually and their scraper would have to be able to run javascript to get the data.

Caveat is that the data won't be visible for Google, but if data is rather numeric it might not be such a big harm.

If they scrape automatically and very often you may also try to pinpoint their IP by observing most active IP-s on your site and serve them fake data.

Please don't use lawyers, that's hitting below the belt.




回答12:


use swf to display your data, just like other online books



来源:https://stackoverflow.com/questions/1380315/what-are-some-good-ways-of-keeping-content-from-being-copied-to-other-sites

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!