how does searchsort in python work?

匿名 (未验证) 提交于 2019-12-03 02:22:01

问题:

To make my question clear say if I have an array a as Out[123]: [1, 3, 4, 6, 9, 10, 54] When I try to search the numbers in the list, searchsort returns correct value but when I try something not in the list, it returns an absurd value

here is some of the results

In [131]: a Out[131]: [1, 3, 4, 6, 9, 10, 54]  In [132]: searchsorted(a,1) Out[132]: 0  In [133]: searchsorted(a,6) Out[133]: 3  In [134]: searchsorted(a,[9,54,1]) Out[134]: array([4, 6, 0])  In [135]: searchsorted(a,[9,54,1,0]) Out[135]: array([4, 6, 0, 0]) ***> # here 0 is not in the list, but turns up @ position 0***  In [136]: searchsorted(a,740) Out[136]: 7 ***> # here 0 is not in the list, but turns up @ position 7*** 

why is this happening?

回答1:

searchsorted tells you where the element belongs to guarantee ordering:

Find the indices into a sorted array a such that, if the corresponding elements in v were inserted before the indices, the order of a would be preserved.

inserting 740 at position 7 would preserve ordering, as would inserting 0 at position 0.



回答2:

searchsorted doesn't tell you where things are, it tells you where things should go to keep the list sorted.

So 0 would have to be inserted at position 0, before the 1. Similarly, 740 needs to be inserted at position 7, beyond the current end of the list.

You can see this by reading the docs here:

numpy.searchsorted(a, v, side='left', sorter=None)

Find indices where elements should be inserted to maintain order.

Find the indices into a sorted array a such that, if the corresponding elements in v were inserted before the indices, the order of a would be preserved.



回答3:

from the docs it states that it uses binary search to spot insertion point of an item in a sorted list.

the word 'insertion point' means, if item I is inserted to the insertion point index N in sorted array A, the array A will remain sorted with new item I.

your examples like [9, 54, 1] is meaningless since the array is not sorted.

you can use bisect module in python to do the same thing, without numpy.



标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!