gotchaMinor
Why does linear search have $\frac{n}{2}$ comparisons on average?
Viewed 0 times
whylinearfracsearchaveragedoeshavecomparisons
Problem
I'm reading the Wikipedia page on Linear Search and it is mentioned that there are on average $\frac{n}{2}$ comparisons.
I tried working this out on my own.
First I considered the number of cases. There are $n$ cases corresponding to when the target value is the $1$st, $2$nd, ..., $n$th entry of the list. There is also the case where the entry is not in the list at all, so we have $n + 1$ cases.
Next I consider the individual number of comparisons needed per case. For the cases where the target value is in the list, we need $1, 2, \dots, n$ comparisons respectively. For the case where the target value is not in the list, we need $n$ comparisons (after which the search terminates and concludes with a return value of NIL).
Finally, I sum up the individual number of comparisons to get the total number of comparisons, and divide by the number of cases to get average number of comparisons. I get
$$\frac{1 + 2 + \dots + n + n}{n + 1} = \frac{n^2 + 3n}{2n + 2}$$
which is different from $\frac{n}{2}$.
May I where is the fallacy in my reasoning? Thank you!
I tried working this out on my own.
First I considered the number of cases. There are $n$ cases corresponding to when the target value is the $1$st, $2$nd, ..., $n$th entry of the list. There is also the case where the entry is not in the list at all, so we have $n + 1$ cases.
Next I consider the individual number of comparisons needed per case. For the cases where the target value is in the list, we need $1, 2, \dots, n$ comparisons respectively. For the case where the target value is not in the list, we need $n$ comparisons (after which the search terminates and concludes with a return value of NIL).
Finally, I sum up the individual number of comparisons to get the total number of comparisons, and divide by the number of cases to get average number of comparisons. I get
$$\frac{1 + 2 + \dots + n + n}{n + 1} = \frac{n^2 + 3n}{2n + 2}$$
which is different from $\frac{n}{2}$.
May I where is the fallacy in my reasoning? Thank you!
Solution
Let us assume that required element is equally likely to be in any position between $1$ and $n$. Let $x _i $ be a random variable. Random variable $x_i = 1$ means required element is in ith position and vice-versa. Now probability that required element is in position between $1$ and $n$ is $$ Pr[x_i = 1] = \frac{1}{n}$$ Now Expected value is given by $$\mathbb{E}(x) = 1 \times\frac{1}{n} + 2 \times \frac{1}{n} + 3 \times \frac{1}{n} +\cdots + n \times\frac{1}{n} $$ which can be written as
$$\mathbb{E}(x) = \frac{n(n+1)}{2n} = \frac{(n+1)}{2}$$
which is a required answer
Reference : https://www.macs.hw.ac.uk/~pjbk/pathways/cpp2/node56.html
$$\mathbb{E}(x) = \frac{n(n+1)}{2n} = \frac{(n+1)}{2}$$
which is a required answer
Reference : https://www.macs.hw.ac.uk/~pjbk/pathways/cpp2/node56.html
Context
StackExchange Computer Science Q#71547, answer score: 6
Revisions (0)
No revisions yet.