Abridged Optics

▸

1Introduction to Polarization

1.1The Transverse Nature of Light

Light is a transverse electromagnetic wave: the electric field vector $\vec{E}$ and the magnetic field vector $\vec{B}$ oscillate perpendicular to the direction of propagation. For a monochromatic plane wave traveling in the $z$ -direction, the electric field can be written as the superposition of two orthogonal components:

Electric Field of a Plane Wave

\vec{E}(z,t) = E_{0x}\cos(kz - \omega t + \delta_x)\,\hat{x} + E_{0y}\cos(kz - \omega t + \delta_y)\,\hat{y}

Here $E_{0x}$ and $E_{0y}$ are the amplitudes of the x- and y-components, $k = 2\pi/\lambda$ is the wavenumber, $\omega$ is the angular frequency, and $\delta_x$ and $\delta_y$ are the phase offsets of each component. The polarization state of the wave is determined entirely by the amplitude ratio $E_{0y}/E_{0x}$ and the phase difference $\delta = \delta_y - \delta_x$ . The magnetic field carries the same information and is fully determined by Maxwell's equations, so polarization is conventionally described using only the electric field.

1.2Historical Context

The concept of polarization emerged from observations that could not be explained by a simple scalar wave theory. In 1669, Erasmus Bartholinus discovered double refraction (birefringence) in calcite crystals, observing that a single incident ray produced two refracted rays. Christiaan Huygens attempted to explain this with his wave theory in 1690, but the full explanation required the concept of transverse oscillations, which was not yet established.

Etienne-Louis Malus discovered polarization by reflection in 1808 when he observed that light reflected from a glass window, viewed through a calcite crystal, showed intensity variations as the crystal was rotated. He coined the term “polarization” by analogy with the poles of a magnet. Sir David Brewster subsequently determined the angle of incidence at which reflected light is fully polarized (now called Brewster's angle) in 1815 [1, 4].

Augustin-Jean Fresnel provided the definitive theoretical framework in the 1820s by establishing that light waves are transverse, deriving the amplitude reflection and transmission coefficients (the Fresnel equations), and explaining birefringence as a consequence of directional variation in the refractive index. The Jones calculus (1941) and Stokes-Mueller calculus (Stokes 1852, Mueller 1948) later provided the matrix formalisms that are essential tools in modern optical engineering [4, 5].

1.3Why Polarization Matters

Polarization is a fundamental property of light that affects nearly every aspect of optical system design. Fresnel reflection and transmission at any interface depend on polarization: s-polarized and p-polarized light reflect differently, which means that any optical surface acts as a partial polarizer. Thin-film coating performance (anti-reflection coatings, mirrors, filters, beam splitters) is inherently polarization-dependent because the Fresnel coefficients at each layer boundary depend on the polarization state [1, 3].

In laser systems, polarization control is essential for beam combining, frequency doubling (which requires a specific linear polarization orientation relative to the crystal axis), electro-optic modulation, and isolation (optical isolators use the Faraday effect to create non-reciprocal polarization rotation). In imaging and metrology, polarization is used for stress analysis (photoelasticity), surface characterization (ellipsometry), glare reduction, and contrast enhancement. In telecommunications, polarization-division multiplexing doubles fiber capacity by encoding independent data streams on orthogonal polarization states [3, 5].

magneto-optic Faraday effect physics and Verdet constants

Understanding polarization is therefore not optional for the optical engineer — it is required for predicting system performance, diagnosing unexpected losses, and designing components that control the polarization state as needed.

▸

2Polarization States

2.1Linear Polarization

When the phase difference between the x- and y-components is $\delta = 0$ or $\delta = \pi$ , the electric field vector traces a straight line in the xy-plane as the wave propagates. The orientation of this line depends on the amplitude ratio $E_{0y}/E_{0x}$ . If $E_{0y} = 0$ , the wave is horizontally polarized (x-polarized); if $E_{0x} = 0$ , the wave is vertically polarized (y-polarized). Any intermediate ratio gives linear polarization at an angle $\theta = \arctan(E_{0y}/E_{0x})$ to the x-axis.

Linear polarization is the most common polarization state encountered in practice. Most laser sources emit linearly polarized light (due to Brewster windows in the cavity, or polarization-selective coatings). Linear polarizers transmit one orientation and block the orthogonal orientation, producing linearly polarized light from an unpolarized source [1, 2].

2.2Circular Polarization

When the x- and y-amplitudes are equal ( $E_{0x} = E_{0y} = E_0$ ) and the phase difference is $\delta = \pm\pi/2$ , the electric field vector traces a circle in the xy-plane as the wave propagates. The condition for circular polarization is:

Circular Polarization Condition

E_{0x} = E_{0y} = E_0, \quad \delta = \delta_y - \delta_x = \pm\frac{\pi}{2}

The sign of $\delta$ determines the handedness. In the optics convention (looking into the oncoming beam), $\delta = +\pi/2$ gives right-hand circular polarization (RCP) — the electric field vector rotates clockwise — and $\delta = -\pi/2$ gives left-hand circular polarization (LCP) — the electric field vector rotates counter-clockwise. Note that the physics convention (looking along the direction of propagation) reverses the handedness labels. The optics convention is used throughout this guide [1, 5].

Circular polarization is produced by passing linearly polarized light through a quarter-wave plate with the linear polarization oriented at 45° to the fast axis. It is used in optical isolators, 3D cinema systems, satellite communications, and anywhere rotation-invariant polarization is needed.

2.3Elliptical Polarization

The most general polarization state is elliptical: the tip of the electric field vector traces an ellipse in the xy-plane. Linear and circular polarization are special cases of elliptical polarization. For arbitrary $E_{0x}$ , $E_{0y}$ , and $\delta$ , the electric field components satisfy the polarization ellipse equation:

Polarization Ellipse

\frac{E_x^2}{E_{0x}^2} + \frac{E_y^2}{E_{0y}^2} - 2\frac{E_x E_y}{E_{0x} E_{0y}}\cos\delta = \sin^2\delta

The ellipse is characterized by two angles. The orientation angle $\psi$ gives the tilt of the major axis relative to the x-axis:

Orientation Angle

\tan 2\psi = \frac{2E_{0x}E_{0y}\cos\delta}{E_{0x}^2 - E_{0y}^2}

The ellipticity angle $\chi$ describes the shape of the ellipse, ranging from $\chi = 0$ (linear) to $\chi = \pm\pi/4$ (circular):

Ellipticity Angle

\sin 2\chi = \frac{2E_{0x}E_{0y}\sin\delta}{E_{0x}^2 + E_{0y}^2}

In practice, elliptical polarization arises whenever light passes through a birefringent element that introduces a phase retardation other than 0, $\pi/2$ , or $\pi$ , or when linearly polarized light reflects from a metallic surface (which introduces different phase shifts for s- and p-components) [1, 5].

Figure 2.1 — End-on view of the electric field vector tip for linear, circular, and elliptical polarization states. Toggle between states to see the characteristic trace geometry and defining angles.

2.4Unpolarized and Partially Polarized Light

Natural light sources (incandescent lamps, LEDs, the sun) emit unpolarized light: the polarization state changes randomly on timescales much shorter than any practical measurement interval. The time-averaged electric field has no preferred orientation. Unpolarized light cannot be described by a single Jones vector (which assumes complete coherence and full polarization) but is naturally represented by the Stokes formalism.

Partially polarized light is a mixture of fully polarized and unpolarized components. The degree of polarization (DOP) quantifies the fraction of the total intensity that is polarized:

Degree of Polarization

\text{DOP} = \frac{I_{\text{pol}}}{I_{\text{total}}} = \frac{\sqrt{S_1^2 + S_2^2 + S_3^2}}{S_0}

DOP ranges from 0 (completely unpolarized) to 1 (fully polarized). For example, skylight is partially polarized (DOP typically 0.3–0.7, depending on the scattering angle relative to the sun), while a well-collimated laser beam is typically fully polarized (DOP ≈ 1). Reflection from a dielectric surface at Brewster's angle produces fully polarized light (DOP = 1) in the reflected beam [1, 6].

▸

3Jones Calculus

3.1Jones Vectors

The Jones vector provides a compact, complex-valued representation of the polarization state of a fully polarized, monochromatic plane wave. It is a 2×1 column vector containing the complex amplitudes of the x- and y-components of the electric field:

Jones Vector

\vec{J} = \begin{pmatrix} E_{0x}\,e^{i\delta_x} \\ E_{0y}\,e^{i\delta_y} \end{pmatrix}

Two Jones vectors represent the same polarization state if they differ only by a common phase factor $e^{i\phi}$ or a real scaling factor. The normalized Jones vectors for the most common polarization states are listed below.

Polarization State	Jones Vector
Horizontal (x)	\begin{pmatrix} 1 \\ 0 \end{pmatrix}
Vertical (y)	\begin{pmatrix} 0 \\ 1 \end{pmatrix}
Linear at +45°	\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix}
Linear at −45°	\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix}
Right-hand circular (RCP)	\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix}
Left-hand circular (LCP)	\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix}

3.2Jones Matrices

Each polarization-modifying optical element (polarizer, waveplate, rotator) is represented by a 2×2 complex matrix called the Jones matrix. The output Jones vector is obtained by multiplying the input Jones vector by the element's Jones matrix:

Jones Matrix Transformation

\vec{J}_{\text{out}} = \mathbf{M}\,\vec{J}_{\text{in}}

The Jones matrix for a horizontal polarizer (transmits x-component, blocks y-component):

Horizontal Polarizer

\mathbf{M}_{\text{H-pol}} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}

A linear polarizer with its transmission axis at angle $\theta$ to the x-axis:

Polarizer at Angle \u03B8

\mathbf{M}_{\text{pol}}(\theta) = \begin{pmatrix} \cos^2\theta & \sin\theta\cos\theta \\ \sin\theta\cos\theta & \sin^2\theta \end{pmatrix}

A quarter-wave plate (QWP) with its fast axis along x:

Quarter-Wave Plate (fast axis horizontal)

\mathbf{M}_{\text{QWP}} = e^{-i\pi/4}\begin{pmatrix} 1 & 0 \\ 0 & -i \end{pmatrix}

A half-wave plate (HWP) with its fast axis along x:

Half-Wave Plate (fast axis horizontal)

\mathbf{M}_{\text{HWP}} = e^{-i\pi/2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}

A general waveplate with retardance $\Gamma$ and fast axis along x:

General Waveplate

\mathbf{M}_{\text{WP}}(\Gamma) = e^{-i\Gamma/2}\begin{pmatrix} 1 & 0 \\ 0 & e^{-i\Gamma} \end{pmatrix}

To model an element whose principal axis is rotated by angle $\theta$ from the x-axis, the Jones matrix is rotated using the rotation matrix $\mathbf{R}(\theta)$ :

Rotated Jones Matrix

\mathbf{M}'(\theta) = \mathbf{R}(-\theta)\,\mathbf{M}\,\mathbf{R}(\theta), \quad \mathbf{R}(\theta) = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}

3.3Cascading Elements

When light passes through a sequence of optical elements, the overall Jones matrix is the product of the individual Jones matrices in reverse order (the first element encountered is on the right). For N elements:

Cascaded Jones Matrices

\vec{J}_{\text{out}} = \mathbf{M}_N \cdot \mathbf{M}_{N-1} \cdots \mathbf{M}_2 \cdot \mathbf{M}_1 \cdot \vec{J}_{\text{in}}

Worked Example: Light Through a Polarizer and Quarter-Wave Plate

Consider horizontally polarized light passing first through a linear polarizer at 45°, then through a quarter-wave plate with its fast axis horizontal. What is the output polarization state?

\vec{J}_{\text{in}} = \begin{pmatrix} 1 \\ 0 \end{pmatrix}

\vec{J}_1 = \mathbf{M}_{\text{pol}}(45^{\circ})\,\vec{J}_{\text{in}} = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{1}{2}\begin{pmatrix} 1 \\ 1 \end{pmatrix}

\vec{J}_{\text{out}} = \mathbf{M}_{\text{QWP}}\,\vec{J}_1 = e^{-i\pi/4}\begin{pmatrix} 1 & 0 \\ 0 & -i \end{pmatrix}\frac{1}{2}\begin{pmatrix} 1 \\ 1 \end{pmatrix} = \frac{e^{-i\pi/4}}{2}\begin{pmatrix} 1 \\ -i \end{pmatrix}

Ignoring the global phase factor, the output is proportional to $\begin{pmatrix} 1 \\ -i \end{pmatrix}$ , which is right-hand circularly polarized light. The 45° polarizer projects the input onto the +45° linear state, and the QWP converts that linear state to circular polarization.

3.4Limitations

The Jones calculus applies only to fully polarized, coherent, monochromatic light. It cannot represent unpolarized or partially polarized light, because the Jones vector inherently describes a definite polarization state. For partially polarized light, depolarizing elements, or incoherent superposition of beams, the Stokes-Mueller formalism (Section 4) is required. Additionally, the Jones calculus does not account for absolute intensity — it tracks the relative complex amplitudes, not the absolute power. For systems involving both coherent and incoherent effects, a hybrid approach using coherency matrices or the Mueller calculus is needed [5, 6].

▸

4Stokes & Mueller Calculus

4.1Stokes Vector

The Stokes vector is a 4×1 real-valued vector that completely describes the polarization state of a light beam, including partially polarized and unpolarized light. Unlike the Jones vector, it is defined in terms of measurable intensities rather than complex amplitudes:

Stokes Vector

\vec{S} = \begin{pmatrix} S_0 \\ S_1 \\ S_2 \\ S_3 \end{pmatrix} = \begin{pmatrix} I_{\text{total}} \\ I_H - I_V \\ I_{+45} - I_{-45} \\ I_{\text{RCP}} - I_{\text{LCP}} \end{pmatrix}

Here $S_0$ is the total intensity, $S_1$ is the preference for horizontal over vertical polarization, $S_2$ is the preference for +45° over −45° polarization, and $S_3$ is the preference for right-circular over left-circular polarization. Each Stokes parameter can be determined by measuring the transmitted intensity through appropriate polarizers and waveplates.

For a fully polarized beam described by field amplitudes $E_{0x}$ , $E_{0y}$ , and phase difference $\delta$ , the Stokes parameters are:

Stokes Parameters from Field Amplitudes

\vec{S} = \begin{pmatrix} E_{0x}^2 + E_{0y}^2 \\ E_{0x}^2 - E_{0y}^2 \\ 2E_{0x}E_{0y}\cos\delta \\ 2E_{0x}E_{0y}\sin\delta \end{pmatrix}

4.2Degree of Polarization

The degree of polarization (DOP), degree of linear polarization (DOLP), and degree of circular polarization (DOCP) are derived from the Stokes parameters:

DOP, DOLP, DOCP

\text{DOP} = \frac{\sqrt{S_1^2 + S_2^2 + S_3^2}}{S_0}, \quad \text{DOLP} = \frac{\sqrt{S_1^2 + S_2^2}}{S_0}, \quad \text{DOCP} = \frac{|S_3|}{S_0}

For fully polarized light, $S_0^2 = S_1^2 + S_2^2 + S_3^2$ and DOP = 1. For partially polarized light, $S_0^2 > S_1^2 + S_2^2 + S_3^2$ and 0 < DOP < 1. For completely unpolarized light, $S_1 = S_2 = S_3 = 0$ and DOP = 0.

Worked Example: DOP from Measured Stokes Parameters

A polarimeter measures the Stokes vector of a beam as $\vec{S} = (1.0,\; 0.6,\; 0.0,\; 0.3)^T$ . Determine the DOP, DOLP, and DOCP.

\text{DOP} = \frac{\sqrt{0.6^2 + 0.0^2 + 0.3^2}}{1.0} = \frac{\sqrt{0.36 + 0 + 0.09}}{1.0} = \frac{\sqrt{0.45}}{1.0} = 0.671

\text{DOLP} = \frac{\sqrt{0.6^2 + 0.0^2}}{1.0} = \frac{0.6}{1.0} = 0.600

\text{DOCP} = \frac{|0.3|}{1.0} = 0.300

The beam is partially polarized (DOP = 0.671), with a stronger linear component (DOLP = 0.600) than circular component (DOCP = 0.300). The predominant polarization is horizontal (S₁ = 0.6 > 0), consistent with partial horizontal linear polarization with a small right-circular component (S₃ = 0.3 > 0).

4.3Mueller Matrices

Each optical element in the Mueller formalism is represented by a 4×4 real-valued matrix. The output Stokes vector is obtained by multiplying the input Stokes vector by the Mueller matrix:

Mueller Transformation

\vec{S}_{\text{out}} = \mathbf{M}\,\vec{S}_{\text{in}}

Mueller matrix for a horizontal linear polarizer:

Horizontal Polarizer (Mueller)

\mathbf{M}_{\text{H-pol}} = \frac{1}{2}\begin{pmatrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}

Mueller matrix for a quarter-wave plate with fast axis horizontal:

QWP (Mueller, fast axis horizontal)

\mathbf{M}_{\text{QWP}} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \end{pmatrix}

Mueller matrix for a half-wave plate with fast axis horizontal:

HWP (Mueller, fast axis horizontal)

\mathbf{M}_{\text{HWP}} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}

Mueller matrix for a linear polarizer at angle $\theta$ :

Polarizer at Angle \u03B8 (Mueller)

\mathbf{M}_{\text{pol}}(\theta) = \frac{1}{2}\begin{pmatrix} 1 & \cos 2\theta & \sin 2\theta & 0 \\ \cos 2\theta & \cos^2 2\theta & \sin 2\theta\cos 2\theta & 0 \\ \sin 2\theta & \sin 2\theta\cos 2\theta & \sin^2 2\theta & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}

4.4Jones vs Mueller

The Jones and Mueller formalisms are complementary. The Jones calculus is simpler (2×2 matrices vs 4×4), preserves absolute phase information (important for interference calculations), and is computationally efficient. However, it is limited to fully polarized, coherent light.

The Mueller calculus handles any polarization state including partially polarized and unpolarized light, can represent depolarizing elements (scattering surfaces, integrating spheres, rough surfaces), and works directly with measurable intensities. Every Jones matrix can be converted to an equivalent Mueller matrix, but the reverse is not always possible — Mueller matrices that represent depolarization have no Jones equivalent [5, 6].

In practice, use the Jones calculus for coherent systems (laser optics, interferometers, fiber optics) where all elements are non-depolarizing and the light is fully polarized. Use the Mueller calculus when dealing with partially polarized sources, depolarizing elements, or when the system includes both polarized and unpolarized beams.

4.5Poincaré Sphere

The Poincaré sphere is a geometric representation of all possible polarization states on the surface of a unit sphere. The three Cartesian coordinates of a point on the sphere correspond to the normalized Stokes parameters $(S_1/S_0,\; S_2/S_0,\; S_3/S_0)$ . Fully polarized states lie on the surface of the sphere, while partially polarized states lie inside the sphere (with unpolarized light at the origin).

On the Poincaré sphere, the equator represents all linear polarization states: horizontal polarization is at (1, 0, 0), vertical at (−1, 0, 0), +45° at (0, 1, 0), and −45° at (0, −1, 0). The north pole (0, 0, 1) represents right-hand circular polarization and the south pole (0, 0, −1) represents left-hand circular polarization. All other points on the surface represent elliptical polarization states.

The Poincaré sphere is particularly useful for visualizing the action of waveplates: a waveplate with retardance $\Gamma$ and fast axis at angle $\theta$ rotates the polarization state on the sphere by an angle $\Gamma$ about an axis that passes through the equatorial point at longitude $2\theta$ . A quarter-wave plate rotates by 90°, a half-wave plate by 180°, and a full-wave plate by 360° (identity). This geometric interpretation makes it easy to predict the effect of any waveplate on any input polarization state [5, 6].

▸

5Polarization by Reflection

5.1Fresnel Equations

When light strikes the interface between two media with refractive indices $n_1$ and $n_2$ , the reflected and transmitted amplitudes depend on the polarization state. The two independent linear polarization states used in reflection analysis are s-polarization (electric field perpendicular to the plane of incidence, from the German “senkrecht”) and p-polarization (electric field parallel to the plane of incidence). The Fresnel amplitude reflection and transmission coefficients are:

Fresnel Reflection Coefficients

r_s = \frac{n_1\cos\theta_i - n_2\cos\theta_t}{n_1\cos\theta_i + n_2\cos\theta_t}, \quad r_p = \frac{n_2\cos\theta_i - n_1\cos\theta_t}{n_2\cos\theta_i + n_1\cos\theta_t}

Fresnel Transmission Coefficients

t_s = \frac{2n_1\cos\theta_i}{n_1\cos\theta_i + n_2\cos\theta_t}, \quad t_p = \frac{2n_1\cos\theta_i}{n_2\cos\theta_i + n_1\cos\theta_t}

Here $\theta_i$ is the angle of incidence and $\theta_t$ is the angle of refraction, related by Snell's law: $n_1\sin\theta_i = n_2\sin\theta_t$ . The power reflectance and transmittance are:

Power Reflectance and Transmittance

R_s = |r_s|^2, \quad R_p = |r_p|^2, \quad T_s = \frac{n_2\cos\theta_t}{n_1\cos\theta_i}|t_s|^2, \quad T_p = \frac{n_2\cos\theta_t}{n_1\cos\theta_i}|t_p|^2

Note that the factor $n_2\cos\theta_t / (n_1\cos\theta_i)$ in the transmittance accounts for the change in beam cross-section and wave impedance between the two media. Energy conservation requires $R + T = 1$ for each polarization (in a lossless medium) [1, 2, 4].

how optical coatings control s- and p-polarization reflectance

5.2Brewster's Angle

Brewster's angle $\theta_B$ is the angle of incidence at which the p-polarized reflection coefficient vanishes ( $r_p = 0$ ). At this angle, the reflected beam is entirely s-polarized. Brewster's angle satisfies:

Brewster's Angle

\theta_B = \arctan\!\left(\frac{n_2}{n_1}\right)

At Brewster's angle, the reflected and refracted rays are perpendicular to each other ( $\theta_i + \theta_t = 90^{\circ}$ ). This is exploited in Brewster windows (used in gas lasers to introduce a polarization-selective loss that forces the laser to oscillate in the p-polarization), and in pile-of-plates polarizers (multiple glass plates at Brewster's angle, each reflection removing more s-polarized light) [1, 2].

Worked Example: Brewster's Angle for BK7 Glass

Calculate Brewster's angle for light incident from air onto a BK7 glass surface at 550 nm, where the refractive index of BK7 is approximately 1.518.

\theta_B = \arctan\!\left(\frac{n_2}{n_1}\right) = \arctan\!\left(\frac{1.518}{1.000}\right) = 56.6^{\circ}

At 56.6° angle of incidence, the p-polarized reflectance is zero. The reflected beam is purely s-polarized, and the refracted angle is $\theta_t = 90^{\circ} - 56.6^{\circ} = 33.4^{\circ}$ .

Figure 5.1 — Brewster's angle geometry. At θ_B ≈ 56.6° for glass, the reflected beam is purely s-polarized and the reflected and refracted rays are perpendicular.

how beam splitters exploit polarization-dependent reflectance

5.3TIR and Phase Shifts

When light travels from a denser medium to a less dense medium ( $n_1 > n_2$ ) and the angle of incidence exceeds the critical angle $\theta_c = \arcsin(n_2/n_1)$ , total internal reflection (TIR) occurs. All incident power is reflected, but the s- and p-components acquire different phase shifts. The phase shifts upon TIR are:

Phase Shifts at TIR

\tan\!\left(\frac{\delta_s}{2}\right) = -\frac{\sqrt{\sin^2\theta_i - (n_2/n_1)^2}}{\cos\theta_i}, \quad \tan\!\left(\frac{\delta_p}{2}\right) = -\frac{\sqrt{\sin^2\theta_i - (n_2/n_1)^2}}{(n_2/n_1)^2\cos\theta_i}

The difference $\delta_p - \delta_s$ between the p- and s-phase shifts is a function of the angle of incidence and the refractive index ratio. By choosing the angle of incidence appropriately, a specific retardance can be introduced between the s- and p-components. This is the operating principle of the Fresnel rhomb: two TIR reflections at a carefully chosen angle produce a total phase difference of $\pi/2$ (quarter-wave retardation), converting linearly polarized input to circular polarization. The Fresnel rhomb has the advantage of being achromatic — the retardance varies only slowly with wavelength because it depends on geometry rather than material dispersion [1, 4].

5.4Metallic Surfaces

Metallic surfaces have a complex refractive index $\tilde{n} = n + i\kappa$ , where $\kappa$ is the extinction coefficient. The Fresnel equations remain valid with $n_2$ replaced by $\tilde{n}$ , but the reflection coefficients become complex even at normal incidence. Both s- and p-polarized light acquire different phase shifts upon reflection, so linearly polarized light reflected from a metal surface generally becomes elliptically polarized.

Metals do not exhibit a true Brewster's angle (since $r_p$ never reaches exactly zero), but they have a pseudo-Brewster angle where the p-reflectance reaches a minimum. The pseudo-Brewster angle and the associated reflectance minimum depend on the optical constants of the metal. For highly reflective metals like silver and aluminum, the p-reflectance minimum is shallow (a few percent dip), while for less reflective metals the minimum is more pronounced. Ellipsometry exploits these polarization-dependent reflection properties to determine the optical constants $n$ and $\kappa$ of thin films and surfaces [1, 4].

5.5Fresnel Reflectance at 45°

Worked Example: Fresnel Reflectance at 45\u00B0 for BK7

Calculate the s- and p-polarized power reflectance for light at 550 nm incident at 45° on an uncoated BK7 surface (n = 1.518).

\sin\theta_t = \frac{n_1}{n_2}\sin\theta_i = \frac{1.000}{1.518}\sin 45^{\circ} = \frac{0.7071}{1.518} = 0.4659

\theta_t = \arcsin(0.4659) = 27.76^{\circ}

r_s = \frac{1.000 \times \cos 45^{\circ} - 1.518 \times \cos 27.76^{\circ}}{1.000 \times \cos 45^{\circ} + 1.518 \times \cos 27.76^{\circ}} = \frac{0.7071 - 1.3427}{0.7071 + 1.3427} = \frac{-0.6356}{2.0498} = -0.3101

r_p = \frac{1.518 \times \cos 45^{\circ} - 1.000 \times \cos 27.76^{\circ}}{1.518 \times \cos 45^{\circ} + 1.000 \times \cos 27.76^{\circ}} = \frac{1.0733 - 0.8849}{1.0733 + 0.8849} = \frac{0.1884}{1.9582} = 0.0962

R_s = |r_s|^2 = 0.3101^2 = 0.0962 = 9.62\%

R_p = |r_p|^2 = 0.0962^2 = 0.0093 = 0.93\%

At 45°, s-polarized light reflects about 10 times more strongly than p-polarized light. This large polarization-dependent reflectance is why uncoated glass plates at 45° act as partial polarizers, and why anti-reflection coatings must be optimized for both polarizations at non-normal incidence.

▸

6Polarizers

6.1Dichroic Polarizers

Dichroic polarizers selectively absorb one polarization component while transmitting the orthogonal component. The most common example is Polaroid film, which uses aligned long-chain polymer molecules (polyvinyl alcohol doped with iodine) that strongly absorb light with its electric field parallel to the polymer chains. Modern dichroic sheet polarizers achieve extinction ratios of 10³ to 10⁴ across the visible spectrum and are available in large formats at low cost [1, 8].

Dichroic polarizers have several practical advantages: they are thin, lightweight, available in large apertures, and work over a wide acceptance angle. Their main limitations are absorption of the rejected polarization (which limits power handling and causes heating), moderate extinction ratio compared to crystalline polarizers, and limited UV and IR performance. They are widely used in displays, photography (polarizing filters), optical instruments, and educational laboratories.

6.2Birefringent Polarizers

Birefringent (or crystal) polarizers exploit the different refractive indices experienced by the ordinary and extraordinary rays in an anisotropic crystal to spatially separate the two polarization components. They achieve very high extinction ratios (10⁵ to 10⁶) and handle high optical powers because the rejected polarization is deflected rather than absorbed.

Glan-Thompson polarizer: Two calcite prisms cemented together with the optic axes parallel. The ordinary ray is totally internally reflected at the cement layer (which has a refractive index between the ordinary and extraordinary indices), while the extraordinary ray is transmitted. Provides the highest extinction ratio (> 10⁵) and largest acceptance angle (up to ±15°) among crystal polarizers, but the cement layer limits the damage threshold and UV transmission.

Glan-Taylor polarizer: Similar geometry to the Glan-Thompson but with an air gap between the prisms instead of cement. The air gap provides much higher damage threshold and UV transmission (down to about 214 nm with calcite), but the acceptance angle is smaller (±5° to ±8°) and the extinction ratio is somewhat lower (10⁴ to 10⁵).

Wollaston prism: Two calcite wedges with perpendicular optic axes cemented together. Both the ordinary and extraordinary rays exit the prism, symmetrically deviated about the input beam axis. This is useful when both polarization components are needed (e.g., in polarimeters or differential measurements). The deviation angle depends on the wedge angle and is typically 15° to 45°.

Rochon prism: Similar to the Wollaston but with a different optic axis arrangement. The ordinary ray passes through undeviated while the extraordinary ray is deviated. This provides a convenient undeviated reference beam [1, 5, 6].

6.3Wire-Grid Polarizers

Wire-grid polarizers consist of a parallel array of thin metallic wires (or lithographically patterned metal strips) on a transparent substrate. The wire spacing must be much smaller than the wavelength. Light polarized parallel to the wires drives conduction currents along the wires, causing reflection (and some absorption). Light polarized perpendicular to the wires cannot drive currents efficiently and is transmitted.

Historically, wire-grid polarizers were limited to the infrared (where wire spacings of a few micrometers suffice), but advances in nanolithography have enabled wire-grid polarizers for the visible and near-UV, with wire pitches below 150 nm. Modern visible wire-grid polarizers achieve extinction ratios of 10³ to 10⁴ with high transmission (80–90%) and work over wide angles of incidence. They are used in projection displays, imaging polarimeters, and applications requiring polarization components on flat substrates [5, 8].

6.4Thin-Film Polarizers

Thin-film polarizers use multilayer dielectric coatings designed to reflect one polarization and transmit the other at a specific angle of incidence (typically 45° or near Brewster's angle for the coating materials). They are commonly implemented as polarizing beam splitter (PBS) cubes and thin-film plate polarizers.

PBS cubes: A dielectric multilayer coating is deposited on the hypotenuse of one right-angle prism, and a second prism is cemented to it. P-polarized light is transmitted through the cube, while s-polarized light is reflected at 90°. PBS cubes provide both output beams in convenient, well-defined directions and are widely used in laser systems, interferometers, and optical instruments. Extinction ratios of 10³ for the reflected beam and 10² to 10³ for the transmitted beam are typical, though high-performance designs can achieve 10⁴ or better in transmission [5, 8].

Plate polarizers: A dielectric multilayer on a flat glass substrate at Brewster's angle (or a design angle near 56°). These are thinner, lighter, and handle higher power than PBS cubes (no cement layer), but produce only one clean output beam (the reflected s-polarized beam exits at an angle that depends on the plate orientation).

Figure 6.1 — Three common polarizer mechanisms. Toggle to compare dichroic absorption, birefringent splitting (Wollaston prism), and thin-film PBS cube operation.

6.5Malus's Law

When linearly polarized light with intensity $I_0$ passes through a linear polarizer whose transmission axis makes an angle $\theta$ with the polarization direction, the transmitted intensity is given by Malus's law:

Malus's Law

I(\theta) = I_0\cos^2\theta

At $\theta = 0^{\circ}$ , all the light is transmitted; at $\theta = 90^{\circ}$ , no light is transmitted (for an ideal polarizer). Malus's law is a direct consequence of the projection of the input electric field onto the polarizer's transmission axis. For unpolarized light incident on an ideal polarizer, the transmitted intensity is $I_0/2$ (averaging $\cos^2\theta$ over all angles gives 1/2) [1, 2].

Figure 6.2 — Malus's law: the transmitted intensity through an analyzer depends on the projection of the input E-field onto the transmission axis. The right panel shows the vector decomposition.

Worked Example: Three-Polarizer Paradox

Two crossed polarizers (transmission axes at 0° and 90°) transmit no light. What happens if a third polarizer is inserted between them at 45°?

I_1 = \frac{I_0}{2}

The output is linearly polarized at 0°.

I_2 = I_1\cos^2 45^{\circ} = \frac{I_0}{2} \times \frac{1}{2} = \frac{I_0}{4}

The output is now linearly polarized at 45°.

I_3 = I_2\cos^2 45^{\circ} = \frac{I_0}{4} \times \frac{1}{2} = \frac{I_0}{8}

Inserting the intermediate polarizer increases the transmitted intensity from zero to $I_0/8$ . The 45° polarizer projects the horizontal polarization onto its axis, creating a component that can then pass through the final vertical polarizer. This demonstrates that a polarizer is not merely a filter — it redefines the polarization state of the transmitted light.

6.6Extinction Ratio

The extinction ratio (ER) of a polarizer quantifies the contrast between the transmitted intensities for the two orthogonal polarization states. It is defined as the ratio of maximum to minimum transmission:

Extinction Ratio

\text{ER} = \frac{T_{\text{max}}}{T_{\text{min}}}

Extinction ratio is often expressed in decibels: $\text{ER (dB)} = 10\log_{10}(T_{\text{max}}/T_{\text{min}})$ . A higher extinction ratio means better polarization purity. The required extinction ratio depends on the application: visual displays may tolerate 10³ (30 dB), while precision polarimetry may require 10⁵ (50 dB) or better [5, 6].

Polarizer Type	Extinction Ratio	Transmission (%)	Acceptance Angle	Damage Threshold	Spectral Range
Dichroic sheet	10³–10⁴	40–85	Wide (±60°)	Low (< 1 W/cm²)	Visible
Glan-Thompson	> 10⁵	90–95	±15°	Moderate (cement limited)	350–2300 nm
Glan-Taylor	10⁴–10⁵	85–90	±5–±8°	High (air-spaced)	214–2300 nm
Wollaston prism	10⁵	90–95	±10°	Moderate	350–2300 nm
Wire-grid (visible)	10³–10⁴	80–90	Wide (±50°)	Moderate	400–2000 nm
Thin-film PBS cube	10²–10⁴	90–95	±2–±5°	Moderate (cement limited)	Narrowband
Thin-film plate	10³–10⁴	85–95	±2°	High	Narrowband

▸

7Waveplates and Retarders

7.1Birefringence

A waveplate (also called a retarder or retardation plate) is a flat optical element made from a birefringent material. Birefringent materials have two principal refractive indices: the ordinary index $n_o$ and the extraordinary index $n_e$ . The axis with the lower refractive index is called the fast axis (light polarized along it travels faster), and the orthogonal axis is the slow axis.

When linearly polarized light enters a waveplate, the component along the fast axis and the component along the slow axis travel at different speeds. After traversing a thickness $d$ , the two components accumulate a phase difference (retardance) $\Gamma$ :

Retardance

\Gamma = \frac{2\pi}{\lambda}(n_e - n_o)\,d

The retardance is typically specified in fractions of a wavelength: $\lambda/4$ (quarter-wave), $\lambda/2$ (half-wave), or $\lambda$ (full-wave). The corresponding phase differences are $\pi/2$ , $\pi$ , and $2\pi$ , respectively [1, 2, 8].

Figure 7.1 — Quarter-wave plate operation. Linearly polarized input at 45° to the fast axis is converted to right-hand circular polarization. Insets show the E-field trace before and after the QWP.

7.2Quarter-Wave Plates

A quarter-wave plate (QWP) introduces a retardance of $\Gamma = \pi/2$ (one quarter of a wavelength). Its key functions are converting linearly polarized light to circularly polarized light (when the input polarization is at 45° to the fast axis) and converting circularly polarized light to linearly polarized light.

When linearly polarized light at an arbitrary angle $\theta$ to the fast axis enters a QWP, the output is generally elliptically polarized. Circular output occurs only when $\theta = \pm 45^{\circ}$ . For $\theta = +45^{\circ}$ , the output is right-hand circular; for $\theta = -45^{\circ}$ , the output is left-hand circular (using the optics convention). Quarter-wave plates are essential components in optical isolators, circular polarimetry, and ellipsometry [1, 8, 9].

7.3Half-Wave Plates

A half-wave plate (HWP) introduces a retardance of $\Gamma = \pi$ (one half of a wavelength). Its primary function is rotating the orientation of linearly polarized light. If linearly polarized light enters the HWP at angle $\theta$ to the fast axis, the output is linearly polarized at angle $-\theta$ to the fast axis — effectively a rotation by $2\theta$ :

HWP Polarization Rotation

\theta_{\text{out}} = 2\theta_{\text{fast}} - \theta_{\text{in}}

where $\theta_{\text{fast}}$ is the fast-axis angle and $\theta_{\text{in}}$ is the input polarization angle, both measured from the same reference. The HWP is the most common component for continuously adjustable polarization rotation: rotating the HWP by angle $\alpha$ rotates the output polarization by $2\alpha$ . It is also used to swap the handedness of circularly polarized light (RCP becomes LCP and vice versa) [1, 8, 9].

7.4Full-Wave Plates

A full-wave plate introduces a retardance of $\Gamma = 2\pi$ (one full wavelength) at the design wavelength. At this wavelength, the full-wave plate has no effect on the polarization state — it is the identity element. However, at other wavelengths, the retardance differs from $2\pi$ , and the plate acts as a waveplate with a wavelength-dependent retardance.

Full-wave plates are used as tunable color filters in polarization microscopy (the technique is called “polarization color imaging” or “first-order red plate”). When placed between crossed polarizers, a full-wave plate transmits white light minus the design wavelength, producing a characteristic magenta color. Birefringent specimens then shift the effective retardance, changing the transmitted color and enabling qualitative assessment of birefringence magnitude and sign [1].

7.5Multiple-Order vs Zero-Order vs True Zero-Order

The retardance formula $\Gamma = 2\pi(n_e - n_o)d/\lambda$ allows multiple solutions for the thickness $d$ that gives a quarter-wave or half-wave retardance. These solutions differ by integer multiples of $\lambda/(n_e - n_o)$ .

Multiple-order waveplates use a single thick plate where the retardance is $(m + 1/4)\lambda$ or $(m + 1/2)\lambda$ for integer $m \gg 0$ . They are inexpensive and easy to manufacture but highly sensitive to wavelength, temperature, and angle of incidence because the fractional change in retardance depends on the total retardance, which is large.

Compound zero-order waveplates consist of two multiple-order plates with their fast axes crossed. The retardance is the difference between the two plates' retardances, which can be made small (quarter-wave or half-wave) even though each individual plate has many waves of retardance. Temperature and wavelength sensitivities largely cancel because both plates are affected equally.

True zero-order waveplates use a single thin plate with thickness $d = \lambda/(4(n_e - n_o))$ or $d = \lambda/(2(n_e - n_o))$ . For quartz ( $n_e - n_o \approx 0.009$ at 550 nm), the true zero-order QWP thickness is approximately 15 μm, which is too thin to handle as a free-standing plate. True zero-order waveplates are therefore made by depositing birefringent polymer films on a glass substrate. They offer the best wavelength, temperature, and angular performance [8, 9].

Worked Example: Quartz QWP Thickness Calculation

Calculate the thickness of a true zero-order quartz quarter-wave plate for 633 nm. The birefringence of quartz is $n_e - n_o = 0.00920$ at 633 nm.

\Gamma = \frac{\pi}{2} = \frac{2\pi}{\lambda}(n_e - n_o)\,d

d = \frac{\lambda}{4(n_e - n_o)} = \frac{633\;\text{nm}}{4 \times 0.00920} = \frac{633}{0.0368}\;\text{nm} = 17{,}200\;\text{nm} = 17.2\;\mu\text{m}

A true zero-order quartz QWP at 633 nm is only 17.2 μm thick — too thin to polish as a free-standing element. In practice, quartz waveplates at this thickness are either cemented to a glass substrate, or a compound zero-order design is used instead (two thicker plates with their fast axes crossed, with a thickness difference of 17.2 μm).

7.6Achromatic and Super-Achromatic Waveplates

Standard waveplates provide the design retardance at only one wavelength. The retardance varies inversely with wavelength ( $\Gamma \propto 1/\lambda$ for a fixed thickness), plus the birefringence itself is wavelength-dependent. For broadband applications, achromatic and super-achromatic designs are available.

Achromatic waveplates use two or three plates of different birefringent materials (commonly quartz and magnesium fluoride) with their fast axes at specific angles. The material dispersion of one material partially compensates the other, flattening the retardance vs. wavelength curve. Achromatic QWPs typically maintain λ/4 ± 2% over a 200–300 nm bandwidth.

Super-achromatic waveplates (also called Pancharatnam designs) use three or more birefringent plates at precisely calculated orientations. They achieve nearly constant retardance over very broad wavelength ranges (e.g., 325–1100 nm for a quartz/MgF₂ design). The design principle exploits the geometric (Pancharatnam) phase on the Poincaré sphere [5, 8, 9].

Figure 7.2 — Retardation vs. wavelength for multiple-order, compound zero-order, and true zero-order waveplates. Toggle to isolate each curve. True zero-order provides the flattest response across wavelength.

Waveplate Type	Bandwidth	Temperature Sensitivity	Angular Sensitivity	Typical Cost
Multiple-order	± 1–2 nm	High	High	Low
Compound zero-order	± 10–20 nm	Low	Low	Moderate
True zero-order (polymer)	± 30–50 nm	Very low	Very low	Moderate
Achromatic (2-material)	200–300 nm	Low	Low	High
Super-achromatic	400–1000+ nm	Very low	Low	Very high
Fresnel rhomb	Inherently broadband	Very low	N/A (geometry)	Moderate–high

7.7Fresnel Rhomb Retarders

The Fresnel rhomb is a solid glass prism (typically a parallelogram or rhombus shape) that produces retardation by exploiting the differential phase shift between s- and p-polarized light at total internal reflection. Two TIR events at a carefully chosen internal angle (typically near 54.6° for BK7 glass) produce a total retardance of $\lambda/4$ .

The key advantage of the Fresnel rhomb is its achromatic performance: the retardance depends on the TIR phase shifts, which vary only slowly with wavelength. A well-designed Fresnel rhomb maintains quarter-wave retardance within a few percent over the entire transparent range of the glass substrate (e.g., 350–2000 nm for BK7). The main disadvantages are the beam displacement introduced by the rhomb geometry, the sensitivity to the angle of incidence (which affects the internal reflection angle), and the larger physical size compared to a thin waveplate. Double Fresnel rhombs (two rhombs in series) can produce half-wave retardation with achromatic performance [1, 4].

▸

8Optical Activity and Faraday Rotation

8.1Natural Optical Activity

Optical activity (also called optical rotation) is the property of certain materials to rotate the plane of polarization of linearly polarized light as it propagates through the material. Optically active materials include crystalline quartz (along the optic axis), sugar solutions, amino acids, and many organic molecules. The rotation angle $\phi$ is proportional to the path length $d$ :

Optical Rotation

\phi = \rho \cdot d

where $\rho$ is the rotary power (degrees per unit length) of the material. For crystalline quartz at 589.3 nm, $\rho \approx 21.7^{\circ}/\text{mm}$ . Optically active materials exist in two enantiomeric forms: dextrorotatory (d- or +, rotates polarization clockwise when viewed facing the source) and levorotatory (l- or −, rotates counter-clockwise) [1, 4].

Optical activity arises from circular birefringence: the material has different refractive indices for right-hand and left-hand circularly polarized light. Linearly polarized light, which can be decomposed into equal RCP and LCP components, accumulates a differential phase between these components as it propagates, resulting in a net rotation of the linear polarization direction.

8.2Specific Rotation

For optically active solutions, the rotation is proportional to both the path length and the concentration. Biot's law gives the rotation as:

Biot's Law

\phi = [\alpha]_\lambda^T \cdot c \cdot d

where $[\alpha]_\lambda^T$ is the specific rotation (degrees · mL / (g · dm)) at wavelength $\lambda$ and temperature $T$ , $c$ is the concentration (g/mL), and $d$ is the path length (dm). The specific rotation is a material property that depends on wavelength (approximately as $1/\lambda^2$ , known as optical rotatory dispersion) and temperature. Saccharimetry (measuring sugar concentration by optical rotation) is one of the oldest applications of polarimetry in chemistry and food science [1, 4].

Worked Example: Optical Rotation in Crystalline Quartz

Calculate the rotation angle for linearly polarized light at 589.3 nm (sodium D-line) passing through a 10 mm thick quartz plate oriented along the optic axis. The rotary power of quartz at this wavelength is 21.7°/mm.

\phi = \rho \cdot d = 21.7^{\circ}\text{/mm} \times 10\;\text{mm} = 217^{\circ}

The polarization is rotated by 217°, which is equivalent to 217° − 180° = 37° beyond a half-turn. This large rotation illustrates why quartz optical rotators are practical in compact packages — significant rotation angles are achieved with short path lengths.

8.3Faraday Effect

The Faraday effect is the rotation of the polarization plane of linearly polarized light when it propagates through a material in the presence of a magnetic field parallel to the propagation direction. The Faraday rotation angle is:

Faraday Rotation

\theta_F = V \cdot B \cdot d

where $V$ is the Verdet constant of the material (rad/(T·m) or min/(Oe·cm)), $B$ is the magnetic flux density (T), and $d$ is the path length through the material (m). The Verdet constant is material- and wavelength-dependent: it is large for heavy glasses (SF-57: $V \approx 23$ rad/(T·m) at 633 nm), terbium gallium garnet (TGG: $V \approx -134$ rad/(T·m) at 633 nm), and terbium-doped glasses [1, 3].

The crucial distinction between Faraday rotation and natural optical activity is that Faraday rotation is non-reciprocal: the rotation direction is determined by the magnetic field direction, not the light propagation direction. A beam that traverses the material forward and then backward accumulates double the rotation (rather than canceling). This non-reciprocal property is the basis of optical isolators, which use a Faraday rotator combined with polarizers to allow light to pass in one direction while blocking light traveling in the opposite direction. Optical isolators are essential for protecting lasers from back-reflections [3, 5].

8.4Connection to Magnetism

The Faraday effect provides a direct link between polarization optics and magnetism. The Verdet constant is proportional to the dispersion of the refractive index ( $V \propto \lambda\,dn/d\lambda$ ), which means materials with strong dispersion in the visible (heavy flint glasses, rare-earth doped crystals) have the largest Verdet constants and are the most efficient Faraday rotators.

Magneto-optic effects beyond simple Faraday rotation include the magneto-optic Kerr effect (MOKE, polarization change upon reflection from a magnetized surface), used extensively in magnetic domain imaging and magneto-optic data storage readout; and the Cotton-Mouton effect (magnetic linear birefringence), which is typically much weaker than the Faraday effect. For system-level design involving magneto-optic components, including Faraday rotator specifications, magnetic circuit design, and isolator performance prediction, see the Magneto-Optic Design Calculator →.

▸

9Practical Considerations

9.1Wavelength Dependence

Nearly all polarization components exhibit wavelength dependence. Waveplate retardance scales approximately as $1/\lambda$ (plus the material dispersion of birefringence), so a quarter-wave plate at 633 nm provides more retardance at shorter wavelengths and less at longer wavelengths. Dichroic polarizers have wavelength-dependent extinction ratios and transmission. Thin-film polarizers (PBS cubes, plate polarizers) are designed for specific wavelength bands and can perform poorly outside their design range.

When operating at a wavelength different from the design wavelength, the effective retardance of a waveplate changes. For a zero-order waveplate designed at $\lambda_0$ , the retardance at wavelength $\lambda$ is approximately $\Gamma(\lambda) \approx \Gamma_0 \times \lambda_0/\lambda$ (neglecting the dispersion of birefringence). For a multiple-order waveplate, the deviation is amplified by the order number $m$ , making the wavelength sensitivity much worse [8, 9].

9.2Temperature Sensitivity

Temperature affects waveplate retardance through two mechanisms: thermal expansion (changes in thickness $d$ ) and the temperature dependence of birefringence ( $d(n_e - n_o)/dT$ ). For quartz, the birefringence temperature coefficient is approximately $-1 \times 10^{-7}$ /°C, which is small. However, for a multiple-order waveplate with total retardance of, say, 30 waves, even this small coefficient produces a significant change in the fractional retardance.

True zero-order and compound zero-order waveplates minimize temperature sensitivity because the total retardance (and therefore the temperature-induced change) is small. Polymer true zero-order waveplates typically achieve retardance stability of $\pm 0.5\%$ over a 50°C temperature range. For cryogenic or high-temperature applications, specify waveplates tested and characterized at the operating temperature [8, 9].

9.3Angle of Incidence Effects

Waveplates are designed for normal incidence. At non-normal incidence, the effective path length through the birefringent material increases, and the projections of the ordinary and extraordinary refractive indices change. The result is an angle-dependent retardance shift. For a zero-order waveplate, the retardance change is approximately proportional to $\theta^2$ (for small angles $\theta$ ), and is typically a few percent for ±5° tilt.

Crystal polarizers (Glan-Thompson, Glan-Taylor) have specified acceptance half-angles beyond which the polarization purity degrades. Exceeding the acceptance angle causes the rejected ray to leak into the transmitted beam, reducing the extinction ratio. PBS cubes also have limited angular acceptance: the coating design is optimized for a specific angle of incidence (45° inside the cube), and deviations reduce the extinction ratio [5, 8].

9.4Damage Threshold

Polarization components can be damaged by high laser powers or fluences. The damage mechanism depends on the component type: dichroic polarizers fail by thermal damage to the absorbing dye (low LIDT, typically < 1 W/cm² CW); cemented crystal polarizers and PBS cubes are limited by the cement layer (typically 1–5 J/cm² at 10 ns, or 10–100 W/cm² CW); air-spaced crystal polarizers (Glan-Taylor) are limited by the crystal surface damage threshold (typically 5–20 J/cm² at 10 ns); and waveplates are limited by coating damage (AR coatings on the surfaces) or the substrate material.

For high-power applications, use air-spaced Glan-Taylor or Glan-laser polarizers, plate-type thin-film polarizers (no cement), and appropriately coated waveplates. Always verify the LIDT specification against the actual beam parameters (peak fluence, not average power) at the component location [5, 8].

9.5Common Specification Parameters

Material	Birefringence (nₑ − nₒ)	Transparency Range (nm)	Damage Threshold	Common Use
Crystalline quartz	0.0091 (at 589 nm)	200–2700	High	Waveplates, rotators
Calcite	0.172 (at 589 nm)	350–2300	Moderate	Polarizers (Glan types)
Magnesium fluoride	0.012 (at 589 nm)	120–7000	High	UV/IR achromatic waveplates
α-BBO	0.116 (at 589 nm)	189–3500	High	UV polarizers, high-power
YVO₄	0.204 (at 633 nm)	400–5000	High	Compact waveplates
Polymer (stretched)	0.001–0.005	400–800	Low	True zero-order waveplates
Mica	0.005 (at 589 nm)	350–6000	Moderate	Low-cost QWPs
LiNbO₃	0.086 (at 633 nm)	350–5200	Moderate	Electro-optic retarders

▸

10Selection Workflow

10.1Identifying the Need

The first step in selecting polarization components is identifying which of three fundamental tasks is required: generating a specific polarization state from unpolarized or arbitrarily polarized light, analyzing an unknown polarization state (measuring its Stokes parameters or degree of polarization), or manipulating a known polarization state (rotating, converting between linear and circular, compensating unwanted polarization effects).

Generating polarized light: Start with a polarizer to produce linear polarization, then add waveplates as needed to produce circular or elliptical states. The polarizer type is selected based on the required extinction ratio, wavelength, aperture, power level, and cost.

Analyzing polarization: A polarimeter requires at minimum a rotating polarizer (for linear Stokes parameters) and a quarter-wave plate (for the circular Stokes parameter). Division-of-amplitude or division-of-time architectures are used depending on speed and accuracy requirements.

Manipulating polarization: Half-wave plates rotate linear polarization; quarter-wave plates convert between linear and circular; Faraday rotators provide non-reciprocal rotation for isolation. The waveplate type (multiple-order, zero-order, achromatic) is selected based on the wavelength range, temperature stability, and angular sensitivity requirements [5, 6].

10.2Selecting Polarizer Type

Step 1 — Determine the extinction ratio requirement. For visual applications and displays, 10³ (30 dB) is usually sufficient; a dichroic sheet polarizer is the most cost-effective choice. For scientific measurements, 10⁴ to 10⁵ is needed; a Glan-Taylor, Glan-Thompson, or high-quality PBS cube is appropriate. For precision polarimetry, 10⁵ to 10⁶ requires a Glan-Thompson or calcite Wollaston prism.

Step 2 — Determine the operating wavelength. For UV below 350 nm, use α-BBO or air-spaced calcite polarizers. For the visible (400–700 nm), all polarizer types are available. For the near-IR (700–2000 nm), calcite, wire-grid, or thin-film polarizers are used. For the mid- and far-IR, wire-grid polarizers on IR-transparent substrates are standard.

Step 3 — Assess the power and damage requirements. For high CW power or high peak fluence, avoid cemented components and dichroic sheet polarizers. Use air-spaced Glan-Taylor polarizers, plate-type thin-film polarizers, or wire-grid polarizers on damage-resistant substrates.

Step 4 — Consider the form factor. Sheet polarizers are thin and available in large sizes. Crystal polarizers are compact but limited to small apertures (typically 5–25 mm). PBS cubes are convenient for beam steering applications (providing two orthogonal output beams). Plate polarizers require operation at near-Brewster angles and produce a displaced beam.

10.3Selecting Waveplate Type

Step 1 — Determine the required retardance. Quarter-wave for linear-to-circular conversion; half-wave for polarization rotation; variable retarders (liquid crystal, electro-optic, photoelastic modulators) for dynamic retardance control.

Step 2 — Determine the wavelength range. For single-wavelength operation (laser line), a zero-order or compound zero-order waveplate is most cost-effective. For broadband operation, achromatic or super-achromatic waveplates, or a Fresnel rhomb, are required. For tunability, a Soleil-Babinet compensator or liquid crystal variable retarder provides continuously adjustable retardance at any wavelength.

Step 3 — Assess environmental requirements. Temperature stability favors true zero-order or compound zero-order designs over multiple-order. For cryogenic or high-temperature operation, verify the waveplate performance at the operating temperature. For high power, verify the LIDT of the AR coatings on the waveplate surfaces.

Step 4 — Consider cost and availability. Multiple-order waveplates are the least expensive but worst performing over wavelength and temperature. True zero-order polymer waveplates offer good performance at moderate cost. Achromatic and super-achromatic waveplates are the most expensive but essential for broadband systems. Fresnel rhombs are moderately expensive, inherently broadband, but bulky [8, 9].

10.4System-Level Management

In a complete optical system, managing polarization requires attention to every optical surface and element. Each uncoated surface at non-normal incidence acts as a partial polarizer (different reflectivities for s and p). Each mirror reflection can introduce a phase shift between s and p, changing the polarization state. Stress birefringence in windows, lenses, and prisms introduces unwanted retardance.

Best practices for system-level polarization management include: designing the optical path to minimize the number of reflections at non-normal incidence; using polarization-maintaining coatings (anti-reflection and mirror coatings optimized for equal s and p performance); orienting birefringent elements with their axes aligned to the system's principal polarization directions; placing polarizers and waveplates as close to the source as practical (before the beam encounters polarization-altering surfaces); and performing a full polarization ray trace (using the Jones or Mueller calculus through the entire system) to predict the polarization state at each point in the system.

For imaging systems, field-dependent polarization effects (variation of the polarization state across the field of view) can limit contrast and resolution. These effects arise from the angle-dependent Fresnel coefficients at each surface and are particularly important in high-numerical-aperture systems such as lithographic projection lenses and microscope objectives [5, 6].

Polarization & Polarizers