How to calculate or approximate the median of a list without storing the list

后端未结

关注

 10  1190

你的背包 2020-11-28 22:24

I\'m trying to calculate the median of a set of values, but I don\'t want to store all the values as that could blow memory requirements. Is there a way of calculating or ap

10条回答

悲&欢浪女 (楼主)

2020-11-28 22:41
David's suggestion seems like the most sensible approach for approximating the median.

A running mean for the same problem is a much easier to calculate:

M_n = M_n-1 + ((V_n - M_n-1) / n)

Where M_n is the mean of n values, M_n-1 is the previous mean, and V_n is the new value.

In other words, the new mean is the existing mean plus the difference between the new value and the mean, divided by the number of values.

In code this would look something like:
```
new_mean = prev_mean + ((value - prev_mean) / count)
```
though obviously you may want to consider language-specific stuff like floating-point rounding errors etc.
0 讨论(0)

查看其它10个回答
发布评论:

提交评论
- 加载中...