Python: computing pariwise distances causes memory error

▼魔方 西西 提交于 2019-12-06 09:17:34

Doing matrix-based algorithms on large data sets is prohibitive.

The memory requirements are straightforward to estimate. Even with exploiting symmetry, many implementations will max out at about 65000 instances. But even 64 bit implementations and big machines will eventually give up. A 1000000x1000000 matrix with double precision and exploiting symmetry needs 4 TB of RAM.

Use better algorithms that don't need O(n^2) memory and runtime.

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!