MDS codes

An MDS (Maximum Distance Separable) code is a linear code of dimension $k$ and length $n$ over a finite field $\mathbb{F}_q$ (a vector subspace of dimension $k$ of $\mathbb{F}_q^n$) which reaches the Singleton bound: the minimal Hamming distance between any two codewords is $d = n - k + 1$. It therefore provides the maximum erasure-correcting capability possible, as a linear code cannot satisfy $d > n - k + 1$, again by the Singleton bound. So we can correct up to $n - k$ erasures by finding the closest codeword under the Hamming distance. A generating matrix $G \in \mathbb{F}_q^{n \times k}$ of a linear code $C$ is defined by $C = \{Gm : m \in \mathbb{F}_q^k\}$. The property that any $k$ coefficients of a codeword determine it comes from the fact that any set of $k$ rows of the generating matrix forms a full-rank $k \times k$ matrix $G'$ (the equation $G'm = c'$, where $m$ is the unknown, has a unique solution, as $G'$ is invertible).
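To make the linear-algebra view concrete, here is a minimal sketch (toy parameters and helper names are ours, and the Vandermonde generator matrix is used purely for illustration): recovering the message from any $k$ surviving symbols amounts to solving the $k \times k$ system $G'm = c'$.

```python
# Toy illustration: recovering a message from k surviving codeword symbols
# by solving G' m = c' over a small prime field (illustrative parameters).
q = 257                      # prime field F_q
k, n = 3, 5

def inv(a):                  # modular inverse via Fermat's little theorem
    return pow(a, q - 2, q)

# A generator matrix whose k x k row-submatrices are all invertible:
# a Vandermonde matrix on the distinct points 1..n.
G = [[pow(x, j, q) for j in range(k)] for x in range(1, n + 1)]

def encode(m):
    return [sum(G[i][j] * m[j] for j in range(k)) % q for i in range(n)]

def solve(rows, rhs):
    """Gaussian elimination over F_q for a k x k system."""
    A = [row[:] + [b] for row, b in zip(rows, rhs)]
    for col in range(k):
        piv = next(r for r in range(col, k) if A[r][col] % q != 0)
        A[col], A[piv] = A[piv], A[col]
        s = inv(A[col][col])
        A[col] = [x * s % q for x in A[col]]
        for r in range(k):
            if r != col and A[r][col]:
                f = A[r][col]
                A[r] = [(x - f * y) % q for x, y in zip(A[r], A[col])]
    return [A[r][k] for r in range(k)]

m = [5, 42, 17]
c = encode(m)
survivors = [0, 2, 4]        # any k positions suffice (n - k = 2 erasures)
recovered = solve([G[i] for i in survivors], [c[i] for i in survivors])
assert recovered == m
```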

Reed-Solomon codes

Let $\mathbb{F}_q$ be a prime field of order $q$. Let $n$ be such that $n \mid q - 1$. Let $\omega \in \mathbb{F}_q$ be a primitive $n$-th root of unity. Aside: how do we find such primitive roots of unity in practice?
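One standard answer (a sketch under the assumptions above, i.e. $q$ prime and $n \mid q - 1$; the function names are ours): pick a random element $g$, raise it to $(q-1)/n$ so that the result has order dividing $n$, and keep it if its order is exactly $n$.

```python
# Sketch: finding a primitive n-th root of unity in F_q, assuming q is prime
# and n divides q - 1.
import random

def prime_factors(m):
    """Distinct prime factors of m by trial division (fine for small m)."""
    out, d = set(), 2
    while d * d <= m:
        while m % d == 0:
            out.add(d)
            m //= d
        d += 1
    if m > 1:
        out.add(m)
    return out

def primitive_root_of_unity(q, n):
    assert (q - 1) % n == 0
    while True:
        g = random.randrange(2, q)
        w = pow(g, (q - 1) // n, q)       # w^n = 1 by Lagrange's theorem
        # w has order exactly n iff w^(n/p) != 1 for every prime p | n
        if all(pow(w, n // p, q) != 1 for p in prime_factors(n)):
            return w

print(primitive_root_of_unity(257, 8))    # e.g. 64, which has order 8 mod 257
```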

We consider the Reed-Solomon code of parameters $k \le n$ and evaluation points $\{1, \omega, \omega^2, \dots, \omega^{n-1}\}$, the subgroup of the $n$-th roots of unity. We denote it $\mathrm{RS}[n, k]$.

$\mathrm{RS}[n, k]$ has rate $k/n$. To a vector $m = (m_0, \dots, m_{k-1}) \in \mathbb{F}_q^k$ we associate the polynomial $P_m(x) = \sum_{i=0}^{k-1} m_i x^i$, and reciprocally. $\mathrm{RS}[n, k]$ is a linear code, whose generating matrix is the following Vandermonde matrix, for which every $k \times k$ submatrix obtained by selecting $k$ rows is invertible (being a Vandermonde matrix on distinct points), so $\mathrm{RS}[n, k]$ is indeed MDS:

$$G = \begin{pmatrix}
1 & 1 & \cdots & 1 \\
1 & \omega & \cdots & \omega^{k-1} \\
1 & \omega^2 & \cdots & \omega^{2(k-1)} \\
\vdots & & & \vdots \\
1 & \omega^{n-1} & \cdots & \omega^{(n-1)(k-1)}
\end{pmatrix}$$

Encoding

Encoding a message $m$ amounts to evaluating its associated polynomial $P_m$ at the evaluation points $1, \omega, \dots, \omega^{n-1}$. This can be done with an $n$-point discrete Fourier transform supported by $\langle\omega\rangle$ in time $O(n \log n)$:

Input: $m = (m_0, \dots, m_{k-1}) \in \mathbb{F}_q^k$

Output: $c = \bigl(P_m(1), P_m(\omega), \dots, P_m(\omega^{n-1})\bigr) \in \mathbb{F}_q^n$

Return $\mathrm{FFT}_\omega(m_0, \dots, m_{k-1}, 0, \dots, 0)$, the $n$-point DFT of the message padded with $n - k$ zeros
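A sketch of this encoder in Python, assuming $n$ is a power of two so that a radix-2 NTT applies; the toy parameters ($q = 257$, $n = 8$, $k = 4$, $\omega = 64$) and names are ours.

```python
# Sketch: Reed-Solomon encoding as an n-point DFT over F_q (radix-2 NTT),
# assuming n is a power of two and n | q - 1. Parameters are illustrative.
q = 257
n, k = 8, 4
w = 64                        # primitive 8th root of unity mod 257 (64^4 = -1)

def ntt(coeffs, root):
    """Evaluate the polynomial `coeffs` at 1, root, root^2, ... (length must
    be a power of two); classic recursive Cooley-Tukey, O(n log n)."""
    m = len(coeffs)
    if m == 1:
        return coeffs[:]
    even = ntt(coeffs[0::2], root * root % q)
    odd = ntt(coeffs[1::2], root * root % q)
    out = [0] * m
    t = 1
    for i in range(m // 2):
        out[i] = (even[i] + t * odd[i]) % q
        out[i + m // 2] = (even[i] - t * odd[i]) % q
        t = t * root % q
    return out

def encode(message):
    """Pad the k message coefficients to length n and evaluate P_m on <w>."""
    return ntt(list(message) + [0] * (n - k), w)

print(encode([1, 2, 3, 4]))   # codeword c = (P_m(1), P_m(w), ..., P_m(w^7))
```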

Decoding from erasures

As we saw earlier, we can decode a codeword with at most $n - k$ erasures, i.e. from at least $k$ components of $c$.

Without loss of generality, let $(c_0, \dots, c_{k-1})$ be the received codeword with $n - k$ erasures, the first $k$ components of a codeword $c$. We can retrieve the original message from any $k$ components of $c$ thanks to the Lagrange interpolation polynomial

$$P_m(x) = \sum_{i=0}^{k-1} c_i \prod_{j \ne i} \frac{x - x_j}{x_i - x_j},$$

where the $x_i = \omega^i$ are the evaluation points of the received components.
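A direct $O(k^2)$ sketch of this interpolation step (toy parameters and helper names are ours; the fast method described in the rest of this section avoids the quadratic cost):

```python
# Sketch: erasure decoding by naive Lagrange interpolation over a toy prime field.
q, n, k = 257, 8, 4
w = 64                                  # primitive 8th root of unity mod 257

def inv(a):
    return pow(a, q - 2, q)

def poly_eval(p, x):
    acc = 0
    for c in reversed(p):
        acc = (acc * x + c) % q
    return acc

def interpolate(points):
    """Coefficients of the unique degree < k polynomial through (x_i, y_i)."""
    result = [0] * len(points)
    for i, (xi, yi) in enumerate(points):
        # basis polynomial prod_{j != i} (x - x_j) / (x_i - x_j)
        basis = [1]
        denom = 1
        for j, (xj, _) in enumerate(points):
            if j != i:
                basis = [(a - xj * b) % q
                         for a, b in zip([0] + basis, basis + [0])]
                denom = denom * (xi - xj) % q
        scale = yi * inv(denom) % q
        result = [(r + scale * b) % q for r, b in zip(result, basis)]
    return result

# message -> codeword (naive evaluation), then keep only k of the n symbols
message = [1, 2, 3, 4]
codeword = [poly_eval(message, pow(w, i, q)) for i in range(n)]
received = {i: codeword[i] for i in (0, 2, 5, 7)}   # any k survivors
points = [(pow(w, i, q), y) for i, y in received.items()]
assert interpolate(points) == message
```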

As detailed in https://arxiv.org/pdf/0907.1788v1.pdf, the idea is to rewrite $P_m$ as a product of two polynomials $A$ and $N$ so that the convolution theorem allows us to recover $P_m$ using FFTs in $\mathbb{F}_q$. (The authors consider sums up to $n - 1$ while we can consider sums up to $k - 1$ since $P_m$ has degree at most $k - 1$.)

To do so, let $A(x) = \prod_{i=0}^{k-1} (x - x_i)$.

Let $A_i(x) = \dfrac{A(x)}{x - x_i} = \prod_{j \ne i} (x - x_j)$.

The interpolation polynomial becomes:

$$P_m(x) = \sum_{i=0}^{k-1} c_i \frac{A_i(x)}{A_i(x_i)} = A(x) \sum_{i=0}^{k-1} \frac{c_i}{A_i(x_i)} \cdot \frac{1}{x - x_i}.$$

Note that $A_i(x_i) \ne 0$ by definition, so it is invertible in $\mathbb{F}_q$.

In order to replace the costly products $A_i(x_i) = \prod_{j \ne i}(x_i - x_j)$ in this expression, we use the fact that the formal derivative of $A$ satisfies, for all $i$: $A'(x_i) = A_i(x_i)$. So we can compute the $A_i(x_i)$ by evaluating $A'$ at the points $x_i$ with an FFT.

Indeed, $A'(x) = \sum_{j=0}^{k-1} A_j(x)$, so $A'(x_i) = A_i(x_i)$ as the other polynomials $A_j$, $j \ne i$, have $x_i$ as a root.
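A quick numerical check of this identity, with toy parameters and our own helpers:

```python
# Check A'(x_i) = prod_{j != i} (x_i - x_j) on four of the 8th roots of unity mod 257.
q = 257
xs = [pow(64, i, q) for i in range(4)]

def poly_mul(a, b):
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] = (out[i + j] + ai * bj) % q
    return out

def poly_eval(p, x):
    acc = 0
    for c in reversed(p):
        acc = (acc * x + c) % q
    return acc

# A(x) = prod_i (x - x_i) and its formal derivative A'(x)
A = [1]
for xi in xs:
    A = poly_mul(A, [(-xi) % q, 1])
dA = [(i * c) % q for i, c in enumerate(A)][1:]

for i, xi in enumerate(xs):
    direct = 1
    for j, xj in enumerate(xs):
        if j != i:
            direct = direct * (xi - xj) % q
    assert poly_eval(dA, xi) == direct
```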

The resulting expression,

$$P_m(x) = A(x) \sum_{i=0}^{k-1} \frac{c_i}{A'(x_i)} \cdot \frac{1}{x - x_i},$$

corresponds to the barycentric formula.

Writing the fraction $\frac{1}{x - x_i}$ as a formal power series, we obtain

$$\frac{1}{x - x_i} = -\frac{1}{x_i} \cdot \frac{1}{1 - x/x_i} = -\sum_{l \ge 0} \frac{x^l}{x_i^{\,l+1}}.$$

But if we let $d_i = \dfrac{c_i}{A'(x_i)}$ then

$$N(x) := \sum_{i=0}^{k-1} \frac{d_i}{x - x_i} = -\sum_{l \ge 0} x^l \sum_{i=0}^{k-1} d_i\, x_i^{-(l+1)} = -\sum_{l \ge 0} x^l \sum_{i=0}^{k-1} d_i\, \omega^{-i(l+1)}.$$

The truncation $N(x) \bmod x^k$ is thus given by the components $1, \dots, k$ of $-\mathrm{DFT}_{\omega^{-1}}(d)$, where $d = (d_0, \dots, d_{k-1}, 0, \dots, 0) \in \mathbb{F}_q^n$: one more FFT.

So $P_m(x) = A(x)\,N(x)$, and since $\deg P_m < k$ we only need the coefficients of degree $< k$ of $A(x)\,\bigl(N(x) \bmod x^k\bigr)$. This product is given by the convolution theorem, with a pairwise product and one more (inverse) FFT:

$$A \cdot (N \bmod x^k) = \mathrm{DFT}^{-1}\bigl(\mathrm{DFT}(A) \odot \mathrm{DFT}(N \bmod x^k)\bigr),$$

using DFTs large enough (at least $2k$ points) that no wrap-around affects the first $k$ coefficients.
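For illustration, a sketch of NTT-based polynomial multiplication via the convolution theorem; the toy parameters ($q = 257$, root $64$) and the helper names are ours.

```python
# Sketch: multiplying two polynomials via pairwise products of their NTTs,
# with an NTT size that is a power of two >= deg(a) + deg(b) + 1.
q = 257

def ntt(v, root):
    m = len(v)
    if m == 1:
        return v[:]
    even = ntt(v[0::2], root * root % q)
    odd = ntt(v[1::2], root * root % q)
    out = [0] * m
    t = 1
    for i in range(m // 2):
        out[i] = (even[i] + t * odd[i]) % q
        out[i + m // 2] = (even[i] - t * odd[i]) % q
        t = t * root % q
    return out

def multiply(a, b, size, root):
    """size-point NTT multiplication; size must be a power of two >= len(a)+len(b)-1."""
    fa = ntt(a + [0] * (size - len(a)), root)
    fb = ntt(b + [0] * (size - len(b)), root)
    fc = [x * y % q for x, y in zip(fa, fb)]              # pairwise product
    inv_root = pow(root, q - 2, q)
    inv_size = pow(size, q - 2, q)
    return [c * inv_size % q for c in ntt(fc, inv_root)]  # inverse NTT

# (1 + 2x)(3 + x) = 3 + 7x + 2x^2, computed with an 8-point NTT (root 64 mod 257)
print(multiply([1, 2], [3, 1], 8, 64)[:3])                # -> [3, 7, 2]
```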

The total cost is $O(n \log^2 n + n \log n)$: the first term accounts for the computation of the product $A(x) = \prod_i (x - x_i)$ with a divide-and-conquer approach for the multiplication of its factors with FFT multiplication, and the second one for the other steps.
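A sketch of this divide-and-conquer product (a subproduct tree); here the subproducts are combined with naive multiplication for brevity, whereas the $O(n \log^2 n)$ bound assumes FFT-based multiplication at each level. Parameters and names are ours.

```python
# Sketch: divide-and-conquer computation of A(x) = prod_i (x - x_i) over F_q.
q = 257

def poly_mul(a, b):
    # naive multiplication; swap in NTT-based multiplication for the stated bound
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] = (out[i + j] + ai * bj) % q
    return out

def product_tree(xs):
    """Recursively multiply the linear factors (x - x_i)."""
    if len(xs) == 1:
        return [(-xs[0]) % q, 1]
    mid = len(xs) // 2
    return poly_mul(product_tree(xs[:mid]), product_tree(xs[mid:]))

xs = [pow(64, i, q) for i in range(4)]
print(product_tree(xs))      # coefficients of A(x), degree 4
```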

Sharding

In some applications, such as database sharding, it can be advantageous to split the codewords into chunks (aka shards). For this purpose, let $s$ be the number of shards, $\ell = n/s$ the length of a shard, and $\omega^s$ a primitive $\ell$-th root of unity.

The domain of evaluation is then split into $s$ cosets: $\omega^i \langle \omega^s \rangle = \{\omega^{i + js}\}$, for $0 \le i < s$ and $0 \le j < \ell$.

For a set of shard indices $J \subseteq \{0, \dots, s-1\}$, we reorganize the product into

$$A(x) = \prod_{i \in J} \prod_{j=0}^{\ell-1} \bigl(x - \omega^i \omega^{js}\bigr).$$

We notice that $\prod_{j=0}^{\ell-1}(x - \omega^{js}) = x^\ell - 1$ (as its roots are exactly the elements of $\langle\omega^s\rangle$, the group of $\ell$-th roots of unity) entails

$$\prod_{j=0}^{\ell-1} \bigl(x - \omega^i \omega^{js}\bigr) = x^\ell - \omega^{i\ell}$$

(multiplying all terms by a constant in an integral domain), which is as sparse as it can be. More formally: every element of the coset $\omega^i\langle\omega^s\rangle$ is of the form $\omega^i\omega^{js}$ for $0 \le j < \ell$. Thus $(\omega^i\omega^{js})^\ell = \omega^{i\ell}(\omega^{s\ell})^j = \omega^{i\ell}$. So every element of the coset is a root of $x^\ell - \omega^{i\ell}$. Moreover, $x^\ell - \omega^{i\ell}$ is a degree $\ell$ polynomial so has at most $\ell$ roots: its only roots are the $\ell$ elements of the coset.
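A quick check of this collapse with toy parameters ($q = 257$, $n = 8$, $s = 2$ shards of length $\ell = 4$; names are ours):

```python
# Check that the product over one coset collapses to x^l - w^(i*l).
q, n, s = 257, 8, 2
l = n // s
w = 64                                   # primitive 8th root of unity mod 257

def poly_mul(a, b):
    out = [0] * (len(a) + len(b) - 1)
    for ii, ai in enumerate(a):
        for jj, bj in enumerate(b):
            out[ii + jj] = (out[ii + jj] + ai * bj) % q
    return out

for i in range(s):                       # coset w^i * <w^s>
    prod = [1]
    for j in range(l):
        root = pow(w, i + j * s, q)
        prod = poly_mul(prod, [(-root) % q, 1])
    sparse = [(-pow(w, i * l, q)) % q] + [0] * (l - 1) + [1]
    assert prod == sparse                # prod_j (x - w^i w^{js}) = x^l - w^{il}
```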

With this little observation, we’ve reduced the number of leaves of the recursion tree for the divide-and-conquer multiplication of the factors of $A$ from $|J|\,\ell$ to $|J|$.