Generalized k-matches

Suppose that an urn contains m distinguishable balls, and that these balls are sampled (with replacement), thus generating a sequence of colors. Many questions can be asked about this sequence; the distribution of the time until a color is sampled twice within a memory window of size k (i.e., the wa...

Full description

Saved in:
Bibliographic Details
Published inStatistics & probability letters Vol. 38; no. 2; pp. 167 - 175
Main Authors Herzog, Jonathan, McLaren, Christopher, Godbole, Anant P.
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 01.06.1998
Elsevier
SeriesStatistics & Probability Letters
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Suppose that an urn contains m distinguishable balls, and that these balls are sampled (with replacement), thus generating a sequence of colors. Many questions can be asked about this sequence; the distribution of the time until a color is sampled twice within a memory window of size k (i.e., the waiting time till the first k-match) was derived by Arnold (1972). Next, Burghardt et al. (1994) proved that the limiting distribution of the number of k-matches in the first n draws is Poisson if k = o( m). An even more general question is discussed here: if, for every draw from the urn, a random k-sample is taken of the previous draws, what is the distribution of the number of generalized k-matches? Our solution resolves a question of Glen Meeden (see Arnold, 1972). Extensions to the case where the k-sample is drawn from the (union of the) past and the future are provided, and the case of non-uniform selection probabilities is treated.
ISSN:0167-7152
1879-2103
DOI:10.1016/S0167-7152(97)00169-7