Statistics
Table of Contents
Combination, Permutation and Ordering
Multinomial Coefficient
In context of combinations, the number of ways to choose objects from a total of is given by . This is because the act of choosing is equvilant to categorizing or labeling the objects into 2 distinct categories, the chosen ones or the ones that is not chosen. There are ways to order objects. However, the order of both the chosen group and the non chosen group does not matter, and there are ways to order the chosen group, ways of ordering the non chosen group; hence, the number of ways to categorize the objects is .
Extending this concept, to partitioning the objects into categories, then if we need indistinguishable objects in the th category, then the number of ways to categorize is given by the multinomial coefficient .
Note: must be true.
Example
The number of distinct words that can be produced from the letters STATISTICS
is , because the count of each letter is given by {S
: 3, T
: 3, A
: 1, I
: 2, C
: 1}.
Permutation and Ordering
When the ordering is important, it is equivalent to categorizing objects into distinct categories, the category for the non-chosen objects and the categories with labels first, second, third, etc... Each category except for the non-chosen category contains exactly 1 object. Therefore, the number of ways to choose objects from is .
In other words, the act of ordering is fundamentally the same as giving unique labels.
Partition into Multiple Categories
The number of ways to partition indistinguishable objects into categories is given by
. To partition the objects into categories, we need sticks to seperate them. Hence, it requires spaces in total to place all objects and sticks. Out of those spots, of them would be for sticks and of them for objects.
To ensure that each category has at least object, the sticks have to be seperated by at least object (e.g. for , ). Then, we have to categorized (i.e. insert) the remaining objects, and by the result above, there are ways of doing so.
Note: Each category is distinguishable, so is different from .
The difference with the multinomial coefficient scenario is that this partition does not restrict the number of objects in each category, but both requires that every object is in one of the categories.
Examples
- Number of ways to give indistinguishable coconuts to distinguishable friends:
- Number of possible tallies of 20 indistinguishable votes to 6 distinguishable canidates:
Probability Distributions
Binomial and Multinomial Distribution
For Successes in bernoulli trials, there are ways of arranging the Successes and Failures. Therefore, the probability of successes is given by where is the probability of success in each bernoulli trial.
For experiments that have more than 2 possible outcomes in each trial, the multinomial coefficient would be useful.
Note: each trial is independent and the union of all outcomes requires to be the entire sample space.
Example
3 dices are rolled independently, find the probability of getting 2 six's and 1 five. The outcome of each trial is either 5, 6 or other, so the probability is: .
Alternatively, the only outcomes of the experiment that satisfy the conditions are , and we get the same result: .
Poisson Distribution
For any interval of space time, it can be uniformly partitioned into non overlapping sub-intervals (). Assume that each event is independent from each other. If an event occured in , then it is equally likely to occur in any of the sub-intervals. For each of the event that happened, the probability of it being in any of the sub-interval follows the discrete uniform distribution.
This is a different interpretation of the binomial distribution, this interpretation can be extended further to obtain the poisson distribution. As approaches , each sub-interval becomes a single moment in space time. The distribution of the probability of an occurance being at any of the moments is still uniform due to the property of independence; In fact, a continuous uniform distribution. The probability of occurances at a given moment, however, approaches 0.
If occurances are expected in the interval , then the probability of occurances actually takes place is .
Due to their similarity, the poisson distribution can be approximated with the binomial distribution when is large.
Product of Two Normal PDFs
Notation
- represents the PDF with , and
- represent the CDF of the standard normal distribution
Derivation
Consider and independently, and try to find .
Consider the exponent part and by completing the squares,
So,
, which yields the result of a scaled normal pdf with .
Product of Three Normal PDFs
For the case of , by using the above results, we get the result of a scaled normal pdf with a mean of , a variance of , and a scale factor of .
Filters
Consider altitude sensors on a rocket, their measurements often comes with some error in repsect to the real altitude. Assume that the error is normally distributed centered at 0, and the actual altitude of the rocket is uniformlly distributed for some values and .
Let be independent random variables for the predicted and measured values of the current altitude. For simplicity, only consider the bivariate case with and .
The purpose of sample filters are to filter out the error in measured and predicted value to give an best estimation of the actual value, in other words, find .
Bayes Filter
By Bayes Theorem, , and .
The denominator is an integral, . However, since is independent from , then
Since and are independent, then . By the results from the Product of Two Normal PDFs, . Therefore, , where and .
For the special case and , .