Algorithms Every Data Scientist Should Know: Reservoir Sampling

Great Stuff for Data Scientist

Kenkyuu

Say you have a stream of items of large and unknown length that we can only iterate over once. Create an algorithm that randomly chooses an item from this stream such that each item is equally likely to be selected.

Algorithms Every Data Scientist Should Know: Reservoir Sampling

View original post

Advertisements