11.4 Poisson processes

A Poisson process is the simplest model of random arrivals: events happen at a constant average rate, independently of each other and of past history. Photon arrivals from a faint source, radioactive decays in a sample, customer arrivals at a bank, action potentials in an auditory-nerve fibre — all are well-modelled as Poisson processes over short enough timescales. The Poisson process has a single parameter (the rate λ\lambda) and three statistical signatures: exponentially-distributed waiting times, Poisson-distributed counts, and complete temporal independence.

This lesson develops all three.

Setup: events at a constant rate

A Poisson process is defined by these properties:

  1. Independence. Events in disjoint time intervals are independent.
  2. Constant rate. Over a very short interval dtdt, the probability of one event is λdt\lambda\, dt and the probability of two or more is o(dt)o(dt) (vanishingly small).
  3. Initial condition. The process starts at t=0t = 0 with zero events recorded.

From these axioms, every statistical property of the process follows.

The Poisson distribution: count statistics

Let N(T)N(T) be the number of events occurring in the interval [0,T][0, T]. By the axioms above, N(T)N(T) is a discrete random variable. Its distribution turns out to be Poisson with mean λT\lambda T:

  Pr(N(T)=k)  =  (λT)keλTk!,k=0,1,2,  \boxed{\;\mathrm{Pr}(N(T) = k) \;=\; \frac{(\lambda T)^k\, e^{-\lambda T}}{k!}, \qquad k = 0, 1, 2, \ldots\;}
Poisson as the limit of binomial

Divide the interval [0,T][0, T] into nn subintervals of length Δt=T/n\Delta t = T/n. In each subinterval, the probability of an event is approximately p=λΔt=λT/np = \lambda \Delta t = \lambda T / n (treating Δt\Delta t as small enough that at most one event occurs). The events across subintervals are independent.

The total count N(T)N(T) is the number of “successes” in nn Bernoulli trials each with probability pp — a binomial distribution:

Pr(N(T)=k)  =  (nk)pk(1p)nk.\mathrm{Pr}(N(T) = k) \;=\; \binom{n}{k}\, p^k\, (1 - p)^{n - k}.

Now take the limit nn \to \infty with λT=np\lambda T = np held fixed (i.e. p=λT/n0p = \lambda T / n \to 0):

(nk)pk(1p)nk  =  n!k!(nk)!nk/k!(λTn)k=(λT)k/nk(1λTn)neλT(1λTn)k1.\binom{n}{k}\, p^k\, (1 - p)^{n - k} \;=\; \underbrace{\frac{n!}{k!\, (n - k)!}}_{\approx\, n^k / k!} \cdot \underbrace{\left( \frac{\lambda T}{n} \right)^k}_{= (\lambda T)^k / n^k} \cdot \underbrace{\left( 1 - \frac{\lambda T}{n} \right)^{n}}_{\to\, e^{-\lambda T}} \cdot \underbrace{\left( 1 - \frac{\lambda T}{n} \right)^{-k}}_{\to\, 1}.

Putting the limits together:

Pr(N(T)=k)    (λT)keλTk!.\mathrm{Pr}(N(T) = k) \;\to\; \frac{(\lambda T)^k\, e^{-\lambda T}}{k!}.

This is the Poisson distribution. It is the natural limit of a thinned binomial as the trials become many and individually unlikely. The result is independent of how you take the limit — the rate λT\lambda T is the only thing that matters.

Mean and variance of Poisson(λT)\mathrm{Poisson}(\lambda T):

E[N(T)]  =  λT,Var[N(T)]  =  λT.\mathbb{E}[N(T)] \;=\; \lambda T, \qquad \mathrm{Var}[N(T)] \;=\; \lambda T.

The mean equals the variance. This is the Poisson’s defining signature: the standard deviation of the count grows as λT\sqrt{\lambda T}, so the fractional spread σ/μ=1/λT\sigma / \mu = 1/\sqrt{\lambda T} shrinks as the count grows. Doubling the observation time halves the fractional uncertainty (but only by 2\sqrt{2}).

Exponential inter-arrival times

A Poisson process generates events at random times. The waiting time between consecutive events — the inter-arrival time — has a remarkable property: it is exponentially distributed with mean 1/λ1/\lambda.

Inter-arrival times are exponential

Let T1T_1 be the time until the first event. The probability that no event has occurred by time tt is the same as Pr(N(t)=0)\mathrm{Pr}(N(t) = 0), which by the Poisson formula above is

Pr(T1>t)  =  Pr(N(t)=0)  =  (λt)0eλt0!  =  eλt.\mathrm{Pr}(T_1 > t) \;=\; \mathrm{Pr}(N(t) = 0) \;=\; \frac{(\lambda t)^0\, e^{-\lambda t}}{0!} \;=\; e^{-\lambda t}.

The CDF of T1T_1 is

FT1(t)  =  Pr(T1t)  =  1eλt.F_{T_1}(t) \;=\; \mathrm{Pr}(T_1 \leq t) \;=\; 1 - e^{-\lambda t}.

Differentiating gives the PDF:

fT1(t)  =  dFT1dt  =  λeλt,t0.f_{T_1}(t) \;=\; \frac{d F_{T_1}}{dt} \;=\; \lambda\, e^{-\lambda t}, \qquad t \geq 0.

This is the exponential distribution with rate λ\lambda. By the independence axiom, the same argument applies to every subsequent inter-arrival time: T2,T3,T4,T_2, T_3, T_4, \ldots are all i.i.d. exponential with rate λ\lambda.

The exponential distribution has the memoryless property: Pr(T>s+tT>s)=Pr(T>t)\mathrm{Pr}(T > s + t \mid T > s) = \mathrm{Pr}(T > t). Given that you’ve already waited ss seconds for the next event, the additional waiting time is distributed the same as if you’d just started — the process has no “memory” of how long you’ve already waited. This is the only continuous distribution with this property.

For an auditory-nerve fibre firing at λ=100\lambda = 100 spikes per second, the mean inter-spike interval is 1/λ=101/\lambda = 10 ms. But individual intervals can be much longer or shorter — the distribution is exponential, so a fraction e137%e^{-1} \approx 37\% of intervals exceed the mean, and a fraction e50.7%e^{-5} \approx 0.7\% exceed five times the mean.

A Poisson process, made visible

spikes05 sλ = 5 events/s · 20 events showninter-spike intervals — expected exponential, mean 1/λ = 0.20 s00.60 sspikes in 1 s — expected Poisson(λ = 5)020

A Poisson process at rate λ produces events at independent, uniformly-distributed times — no event "remembers" when the previous one happened (the memoryless property). Three statistical consequences, all visible above. The inter-spike intervals are exponentially distributed with mean 1/λ (the red curve over the left histogram). The number of events in any time window T is Poisson-distributed with mean λT (the red dots over the right histogram). The events themselves cluster and gap unpredictably; the apparent rhythm of a Poisson raster is an artefact of the human visual system, not a property of the process. Used to model radioactive decay, photon arrivals, customer-queue arrivals, and (most relevant for this bookshelf) the spike trains of auditory-nerve fibres in [Hearing Ch 5](/hearing/auditory-nerve).

The top panel is a single 5-second realisation, drawn as a spike raster — each vertical line is an event. The bottom-left histogram is the distribution of inter-spike intervals across many trials, overlaid with the theoretical exponential. The bottom-right histogram is the distribution of spike counts in a 1-second window, overlaid with the theoretical Poisson PMF.

Three things to take from playing with λ\lambda:

Adding Poisson processes

A useful property: the superposition of two independent Poisson processes with rates λ1\lambda_1 and λ2\lambda_2 is a Poisson process with rate λ1+λ2\lambda_1 + \lambda_2. Conversely, thinning a Poisson process by independently retaining each event with probability pp gives a Poisson process with rate pλp\lambda.

These properties are why Poisson processes are so easy to combine and decompose. If nn identical auditory-nerve fibres each fire as a Poisson process at rate λ\lambda, the total spike count in the population is Poisson at rate nλn\lambda. If a population of fibres collectively fires at rate Λ\Lambda and we subsample some fraction pp of them, the subsample’s spike count is Poisson at rate pΛp\Lambda. Both consequences are central to modelling neural populations and to the population-coding analyses of Hearing Ch 5.

When the Poisson model breaks

The Poisson process is the simplest possible model of stochastic events. Real neural firing is not perfectly Poisson — most real spike trains show:

The Fano factor F=Var[N]/E[N]F = \mathrm{Var}[N] / \mathbb{E}[N] measures departure from Poisson: F=1F = 1 exactly for Poisson, F<1F < 1 for refractory-period-dominated regularity, F>1F > 1 for over-dispersion. Auditory-nerve fibres typically have F1F \approx 1 at moderate firing rates, dropping below 1 at high rates where the refractory period dominates.

What we use this for

Poisson processes (and the Poisson distribution as their count statistic) are everywhere in noise and event statistics:

The next lesson, 11.5, develops Bayesian inference and signal detection theory — the inferential machinery that combines a likelihood (often based on Gaussian or Poisson noise from this and previous lessons) with a prior to produce a posterior belief.