Skip to main content

Polarization & Polarizers

A complete guide to polarization states, formalisms, polarizing components, waveplates, and system-level design for optical engineers.

Comprehensive Guide

1Introduction to Polarization

1.1The Transverse Nature of Light

Light is a transverse electromagnetic wave: the electric field vector E\vec{E} and the magnetic field vector B\vec{B} oscillate perpendicular to the direction of propagation. For a monochromatic plane wave traveling in the zz-direction, the electric field can be written as the superposition of two orthogonal components:

Electric Field of a Plane Wave
E(z,t)=E0xcos(kzωt+δx)x^+E0ycos(kzωt+δy)y^\vec{E}(z,t) = E_{0x}\cos(kz - \omega t + \delta_x)\,\hat{x} + E_{0y}\cos(kz - \omega t + \delta_y)\,\hat{y}

Here E0xE_{0x} and E0yE_{0y} are the amplitudes of the x- and y-components, k=2π/λk = 2\pi/\lambda is the wavenumber, ω\omega is the angular frequency, and δx\delta_x and δy\delta_y are the phase offsets of each component. The polarization state of the wave is determined entirely by the amplitude ratio E0y/E0xE_{0y}/E_{0x} and the phase difference δ=δyδx\delta = \delta_y - \delta_x. The magnetic field carries the same information and is fully determined by Maxwell's equations, so polarization is conventionally described using only the electric field.

1.2Historical Context

The concept of polarization emerged from observations that could not be explained by a simple scalar wave theory. In 1669, Erasmus Bartholinus discovered double refraction (birefringence) in calcite crystals, observing that a single incident ray produced two refracted rays. Christiaan Huygens attempted to explain this with his wave theory in 1690, but the full explanation required the concept of transverse oscillations, which was not yet established.

Etienne-Louis Malus discovered polarization by reflection in 1808 when he observed that light reflected from a glass window, viewed through a calcite crystal, showed intensity variations as the crystal was rotated. He coined the term polarization by analogy with the poles of a magnet. Sir David Brewster subsequently determined the angle of incidence at which reflected light is fully polarized (now called Brewster's angle) in 1815 [1, 4].

Augustin-Jean Fresnel provided the definitive theoretical framework in the 1820s by establishing that light waves are transverse, deriving the amplitude reflection and transmission coefficients (the Fresnel equations), and explaining birefringence as a consequence of directional variation in the refractive index. The Jones calculus (1941) and Stokes-Mueller calculus (Stokes 1852, Mueller 1948) later provided the matrix formalisms that are essential tools in modern optical engineering [4, 5].

1.3Why Polarization Matters

Polarization is a fundamental property of light that affects nearly every aspect of optical system design. Fresnel reflection and transmission at any interface depend on polarization: s-polarized and p-polarized light reflect differently, which means that any optical surface acts as a partial polarizer. Thin-film coating performance (anti-reflection coatings, mirrors, filters, beam splitters) is inherently polarization-dependent because the Fresnel coefficients at each layer boundary depend on the polarization state [1, 3].

In laser systems, polarization control is essential for beam combining, frequency doubling (which requires a specific linear polarization orientation relative to the crystal axis), electro-optic modulation, and isolation (optical isolators use the Faraday effect to create non-reciprocal polarization rotation). In imaging and metrology, polarization is used for stress analysis (photoelasticity), surface characterization (ellipsometry), glare reduction, and contrast enhancement. In telecommunications, polarization-division multiplexing doubles fiber capacity by encoding independent data streams on orthogonal polarization states [3, 5].

Understanding polarization is therefore not optional for the optical engineer it is required for predicting system performance, diagnosing unexpected losses, and designing components that control the polarization state as needed.

2Polarization States

2.1Linear Polarization

When the phase difference between the x- and y-components is δ=0\delta = 0 or δ=π\delta = \pi, the electric field vector traces a straight line in the xy-plane as the wave propagates. The orientation of this line depends on the amplitude ratio E0y/E0xE_{0y}/E_{0x}. If E0y=0E_{0y} = 0, the wave is horizontally polarized (x-polarized); if E0x=0E_{0x} = 0, the wave is vertically polarized (y-polarized). Any intermediate ratio gives linear polarization at an angle θ=arctan(E0y/E0x)\theta = \arctan(E_{0y}/E_{0x}) to the x-axis.

Linear polarization is the most common polarization state encountered in practice. Most laser sources emit linearly polarized light (due to Brewster windows in the cavity, or polarization-selective coatings). Linear polarizers transmit one orientation and block the orthogonal orientation, producing linearly polarized light from an unpolarized source [1, 2].

2.2Circular Polarization

When the x- and y-amplitudes are equal (E0x=E0y=E0E_{0x} = E_{0y} = E_0) and the phase difference is δ=±π/2\delta = \pm\pi/2, the electric field vector traces a circle in the xy-plane as the wave propagates. The condition for circular polarization is:

Circular Polarization Condition
E0x=E0y=E0,δ=δyδx=±π2E_{0x} = E_{0y} = E_0, \quad \delta = \delta_y - \delta_x = \pm\frac{\pi}{2}

The sign of δ\delta determines the handedness. In the optics convention (looking into the oncoming beam), δ=+π/2\delta = +\pi/2 gives right-hand circular polarization (RCP) the electric field vector rotates clockwise and δ=π/2\delta = -\pi/2 gives left-hand circular polarization (LCP) the electric field vector rotates counter-clockwise. Note that the physics convention (looking along the direction of propagation) reverses the handedness labels. The optics convention is used throughout this guide [1, 5].

Circular polarization is produced by passing linearly polarized light through a quarter-wave plate with the linear polarization oriented at 45° to the fast axis. It is used in optical isolators, 3D cinema systems, satellite communications, and anywhere rotation-invariant polarization is needed.

2.3Elliptical Polarization

The most general polarization state is elliptical: the tip of the electric field vector traces an ellipse in the xy-plane. Linear and circular polarization are special cases of elliptical polarization. For arbitrary E0xE_{0x}, E0yE_{0y}, and δ\delta, the electric field components satisfy the polarization ellipse equation:

Polarization Ellipse
Ex2E0x2+Ey2E0y22ExEyE0xE0ycosδ=sin2δ\frac{E_x^2}{E_{0x}^2} + \frac{E_y^2}{E_{0y}^2} - 2\frac{E_x E_y}{E_{0x} E_{0y}}\cos\delta = \sin^2\delta

The ellipse is characterized by two angles. The orientation angle ψ\psi gives the tilt of the major axis relative to the x-axis:

Orientation Angle
tan2ψ=2E0xE0ycosδE0x2E0y2\tan 2\psi = \frac{2E_{0x}E_{0y}\cos\delta}{E_{0x}^2 - E_{0y}^2}

The ellipticity angle χ\chi describes the shape of the ellipse, ranging from χ=0\chi = 0 (linear) to χ=±π/4\chi = \pm\pi/4 (circular):

Ellipticity Angle
sin2χ=2E0xE0ysinδE0x2+E0y2\sin 2\chi = \frac{2E_{0x}E_{0y}\sin\delta}{E_{0x}^2 + E_{0y}^2}

In practice, elliptical polarization arises whenever light passes through a birefringent element that introduces a phase retardation other than 0, π/2\pi/2, or π\pi, or when linearly polarized light reflects from a metallic surface (which introduces different phase shifts for s- and p-components) [1, 5].

Polarization StatesxyθLinear PolarizationView: looking into beam (toward source)
Figure 2.1 — End-on view of the electric field vector tip for linear, circular, and elliptical polarization states. Toggle between states to see the characteristic trace geometry and defining angles.

2.4Unpolarized and Partially Polarized Light

Natural light sources (incandescent lamps, LEDs, the sun) emit unpolarized light: the polarization state changes randomly on timescales much shorter than any practical measurement interval. The time-averaged electric field has no preferred orientation. Unpolarized light cannot be described by a single Jones vector (which assumes complete coherence and full polarization) but is naturally represented by the Stokes formalism.

Partially polarized light is a mixture of fully polarized and unpolarized components. The degree of polarization (DOP) quantifies the fraction of the total intensity that is polarized:

Degree of Polarization
DOP=IpolItotal=S12+S22+S32S0\text{DOP} = \frac{I_{\text{pol}}}{I_{\text{total}}} = \frac{\sqrt{S_1^2 + S_2^2 + S_3^2}}{S_0}

DOP ranges from 0 (completely unpolarized) to 1 (fully polarized). For example, skylight is partially polarized (DOP typically 0.30.7, depending on the scattering angle relative to the sun), while a well-collimated laser beam is typically fully polarized (DOP 1). Reflection from a dielectric surface at Brewster's angle produces fully polarized light (DOP = 1) in the reflected beam [1, 6].

3Jones Calculus

3.1Jones Vectors

The Jones vector provides a compact, complex-valued representation of the polarization state of a fully polarized, monochromatic plane wave. It is a 2×1 column vector containing the complex amplitudes of the x- and y-components of the electric field:

Jones Vector
J=(E0xeiδxE0yeiδy)\vec{J} = \begin{pmatrix} E_{0x}\,e^{i\delta_x} \\ E_{0y}\,e^{i\delta_y} \end{pmatrix}

Two Jones vectors represent the same polarization state if they differ only by a common phase factor eiϕe^{i\phi} or a real scaling factor. The normalized Jones vectors for the most common polarization states are listed below.

Polarization StateJones Vector
Horizontal (x)\begin{pmatrix} 1 \\ 0 \end{pmatrix}
Vertical (y)\begin{pmatrix} 0 \\ 1 \end{pmatrix}
Linear at +45°\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix}
Linear at −45°\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix}
Right-hand circular (RCP)\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix}
Left-hand circular (LCP)\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix}

3.2Jones Matrices

Each polarization-modifying optical element (polarizer, waveplate, rotator) is represented by a 2×2 complex matrix called the Jones matrix. The output Jones vector is obtained by multiplying the input Jones vector by the element's Jones matrix:

Jones Matrix Transformation
Jout=MJin\vec{J}_{\text{out}} = \mathbf{M}\,\vec{J}_{\text{in}}

The Jones matrix for a horizontal polarizer (transmits x-component, blocks y-component):

Horizontal Polarizer
MH-pol=(1000)\mathbf{M}_{\text{H-pol}} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}

A linear polarizer with its transmission axis at angle θ\theta to the x-axis:

Polarizer at Angle \u03B8
Mpol(θ)=(cos2θsinθcosθsinθcosθsin2θ)\mathbf{M}_{\text{pol}}(\theta) = \begin{pmatrix} \cos^2\theta & \sin\theta\cos\theta \\ \sin\theta\cos\theta & \sin^2\theta \end{pmatrix}

A quarter-wave plate (QWP) with its fast axis along x:

Quarter-Wave Plate (fast axis horizontal)
MQWP=eiπ/4(100i)\mathbf{M}_{\text{QWP}} = e^{-i\pi/4}\begin{pmatrix} 1 & 0 \\ 0 & -i \end{pmatrix}

A half-wave plate (HWP) with its fast axis along x:

Half-Wave Plate (fast axis horizontal)
MHWP=eiπ/2(1001)\mathbf{M}_{\text{HWP}} = e^{-i\pi/2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}

A general waveplate with retardance Γ\Gamma and fast axis along x:

General Waveplate
MWP(Γ)=eiΓ/2(100eiΓ)\mathbf{M}_{\text{WP}}(\Gamma) = e^{-i\Gamma/2}\begin{pmatrix} 1 & 0 \\ 0 & e^{-i\Gamma} \end{pmatrix}

To model an element whose principal axis is rotated by angle θ\theta from the x-axis, the Jones matrix is rotated using the rotation matrix R(θ)\mathbf{R}(\theta):

Rotated Jones Matrix
M(θ)=R(θ)MR(θ),R(θ)=(cosθsinθsinθcosθ)\mathbf{M}'(\theta) = \mathbf{R}(-\theta)\,\mathbf{M}\,\mathbf{R}(\theta), \quad \mathbf{R}(\theta) = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}

3.3Cascading Elements

When light passes through a sequence of optical elements, the overall Jones matrix is the product of the individual Jones matrices in reverse order (the first element encountered is on the right). For N elements:

Cascaded Jones Matrices
Jout=MNMN1M2M1Jin\vec{J}_{\text{out}} = \mathbf{M}_N \cdot \mathbf{M}_{N-1} \cdots \mathbf{M}_2 \cdot \mathbf{M}_1 \cdot \vec{J}_{\text{in}}
Worked Example: Light Through a Polarizer and Quarter-Wave Plate

Consider horizontally polarized light passing first through a linear polarizer at 45°, then through a quarter-wave plate with its fast axis horizontal. What is the output polarization state?

Jin=(10)\vec{J}_{\text{in}} = \begin{pmatrix} 1 \\ 0 \end{pmatrix}
J1=Mpol(45)Jin=(1/21/21/21/2)(10)=12(11)\vec{J}_1 = \mathbf{M}_{\text{pol}}(45^{\circ})\,\vec{J}_{\text{in}} = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{1}{2}\begin{pmatrix} 1 \\ 1 \end{pmatrix}
Jout=MQWPJ1=eiπ/4(100i)12(11)=eiπ/42(1i)\vec{J}_{\text{out}} = \mathbf{M}_{\text{QWP}}\,\vec{J}_1 = e^{-i\pi/4}\begin{pmatrix} 1 & 0 \\ 0 & -i \end{pmatrix}\frac{1}{2}\begin{pmatrix} 1 \\ 1 \end{pmatrix} = \frac{e^{-i\pi/4}}{2}\begin{pmatrix} 1 \\ -i \end{pmatrix}

Ignoring the global phase factor, the output is proportional to (1i)\begin{pmatrix} 1 \\ -i \end{pmatrix}, which is right-hand circularly polarized light. The 45° polarizer projects the input onto the +45° linear state, and the QWP converts that linear state to circular polarization.

3.4Limitations

The Jones calculus applies only to fully polarized, coherent, monochromatic light. It cannot represent unpolarized or partially polarized light, because the Jones vector inherently describes a definite polarization state. For partially polarized light, depolarizing elements, or incoherent superposition of beams, the Stokes-Mueller formalism (Section 4) is required. Additionally, the Jones calculus does not account for absolute intensity it tracks the relative complex amplitudes, not the absolute power. For systems involving both coherent and incoherent effects, a hybrid approach using coherency matrices or the Mueller calculus is needed [5, 6].

4Stokes & Mueller Calculus

4.1Stokes Vector

The Stokes vector is a 4×1 real-valued vector that completely describes the polarization state of a light beam, including partially polarized and unpolarized light. Unlike the Jones vector, it is defined in terms of measurable intensities rather than complex amplitudes:

Stokes Vector
S=(S0S1S2S3)=(ItotalIHIVI+45I45IRCPILCP)\vec{S} = \begin{pmatrix} S_0 \\ S_1 \\ S_2 \\ S_3 \end{pmatrix} = \begin{pmatrix} I_{\text{total}} \\ I_H - I_V \\ I_{+45} - I_{-45} \\ I_{\text{RCP}} - I_{\text{LCP}} \end{pmatrix}

Here S0S_0 is the total intensity, S1S_1 is the preference for horizontal over vertical polarization, S2S_2 is the preference for +45° over 45° polarization, and S3S_3 is the preference for right-circular over left-circular polarization. Each Stokes parameter can be determined by measuring the transmitted intensity through appropriate polarizers and waveplates.

For a fully polarized beam described by field amplitudes E0xE_{0x}, E0yE_{0y}, and phase difference δ\delta, the Stokes parameters are:

Stokes Parameters from Field Amplitudes
S=(E0x2+E0y2E0x2E0y22E0xE0ycosδ2E0xE0ysinδ)\vec{S} = \begin{pmatrix} E_{0x}^2 + E_{0y}^2 \\ E_{0x}^2 - E_{0y}^2 \\ 2E_{0x}E_{0y}\cos\delta \\ 2E_{0x}E_{0y}\sin\delta \end{pmatrix}

4.2Degree of Polarization

The degree of polarization (DOP), degree of linear polarization (DOLP), and degree of circular polarization (DOCP) are derived from the Stokes parameters:

DOP, DOLP, DOCP
DOP=S12+S22+S32S0,DOLP=S12+S22S0,DOCP=S3S0\text{DOP} = \frac{\sqrt{S_1^2 + S_2^2 + S_3^2}}{S_0}, \quad \text{DOLP} = \frac{\sqrt{S_1^2 + S_2^2}}{S_0}, \quad \text{DOCP} = \frac{|S_3|}{S_0}

For fully polarized light, S02=S12+S22+S32S_0^2 = S_1^2 + S_2^2 + S_3^2 and DOP = 1. For partially polarized light, S02>S12+S22+S32S_0^2 > S_1^2 + S_2^2 + S_3^2 and 0 < DOP < 1. For completely unpolarized light, S1=S2=S3=0S_1 = S_2 = S_3 = 0 and DOP = 0.

Worked Example: DOP from Measured Stokes Parameters

A polarimeter measures the Stokes vector of a beam as S=(1.0,  0.6,  0.0,  0.3)T\vec{S} = (1.0,\; 0.6,\; 0.0,\; 0.3)^T. Determine the DOP, DOLP, and DOCP.

DOP=0.62+0.02+0.321.0=0.36+0+0.091.0=0.451.0=0.671\text{DOP} = \frac{\sqrt{0.6^2 + 0.0^2 + 0.3^2}}{1.0} = \frac{\sqrt{0.36 + 0 + 0.09}}{1.0} = \frac{\sqrt{0.45}}{1.0} = 0.671
DOLP=0.62+0.021.0=0.61.0=0.600\text{DOLP} = \frac{\sqrt{0.6^2 + 0.0^2}}{1.0} = \frac{0.6}{1.0} = 0.600
DOCP=0.31.0=0.300\text{DOCP} = \frac{|0.3|}{1.0} = 0.300

The beam is partially polarized (DOP = 0.671), with a stronger linear component (DOLP = 0.600) than circular component (DOCP = 0.300). The predominant polarization is horizontal (S = 0.6 > 0), consistent with partial horizontal linear polarization with a small right-circular component (S = 0.3 > 0).

4.3Mueller Matrices

Each optical element in the Mueller formalism is represented by a 4×4 real-valued matrix. The output Stokes vector is obtained by multiplying the input Stokes vector by the Mueller matrix:

Mueller Transformation
Sout=MSin\vec{S}_{\text{out}} = \mathbf{M}\,\vec{S}_{\text{in}}

Mueller matrix for a horizontal linear polarizer:

Horizontal Polarizer (Mueller)
MH-pol=12(1100110000000000)\mathbf{M}_{\text{H-pol}} = \frac{1}{2}\begin{pmatrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}

Mueller matrix for a quarter-wave plate with fast axis horizontal:

QWP (Mueller, fast axis horizontal)
MQWP=(1000010000010010)\mathbf{M}_{\text{QWP}} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \end{pmatrix}

Mueller matrix for a half-wave plate with fast axis horizontal:

HWP (Mueller, fast axis horizontal)
MHWP=(1000010000100001)\mathbf{M}_{\text{HWP}} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}

Mueller matrix for a linear polarizer at angle θ\theta:

Polarizer at Angle \u03B8 (Mueller)
Mpol(θ)=12(1cos2θsin2θ0cos2θcos22θsin2θcos2θ0sin2θsin2θcos2θsin22θ00000)\mathbf{M}_{\text{pol}}(\theta) = \frac{1}{2}\begin{pmatrix} 1 & \cos 2\theta & \sin 2\theta & 0 \\ \cos 2\theta & \cos^2 2\theta & \sin 2\theta\cos 2\theta & 0 \\ \sin 2\theta & \sin 2\theta\cos 2\theta & \sin^2 2\theta & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}

4.4Jones vs Mueller

The Jones and Mueller formalisms are complementary. The Jones calculus is simpler (2×2 matrices vs 4×4), preserves absolute phase information (important for interference calculations), and is computationally efficient. However, it is limited to fully polarized, coherent light.

The Mueller calculus handles any polarization state including partially polarized and unpolarized light, can represent depolarizing elements (scattering surfaces, integrating spheres, rough surfaces), and works directly with measurable intensities. Every Jones matrix can be converted to an equivalent Mueller matrix, but the reverse is not always possible Mueller matrices that represent depolarization have no Jones equivalent [5, 6].

In practice, use the Jones calculus for coherent systems (laser optics, interferometers, fiber optics) where all elements are non-depolarizing and the light is fully polarized. Use the Mueller calculus when dealing with partially polarized sources, depolarizing elements, or when the system includes both polarized and unpolarized beams.

4.5Poincaré Sphere

The Poincaré sphere is a geometric representation of all possible polarization states on the surface of a unit sphere. The three Cartesian coordinates of a point on the sphere correspond to the normalized Stokes parameters (S1/S0,  S2/S0,  S3/S0)(S_1/S_0,\; S_2/S_0,\; S_3/S_0). Fully polarized states lie on the surface of the sphere, while partially polarized states lie inside the sphere (with unpolarized light at the origin).

On the Poincaré sphere, the equator represents all linear polarization states: horizontal polarization is at (1, 0, 0), vertical at (1, 0, 0), +45° at (0, 1, 0), and 45° at (0, 1, 0). The north pole (0, 0, 1) represents right-hand circular polarization and the south pole (0, 0, 1) represents left-hand circular polarization. All other points on the surface represent elliptical polarization states.

The Poincaré sphere is particularly useful for visualizing the action of waveplates: a waveplate with retardance Γ\Gamma and fast axis at angle θ\theta rotates the polarization state on the sphere by an angle Γ\Gamma about an axis that passes through the equatorial point at longitude 2θ2\theta. A quarter-wave plate rotates by 90°, a half-wave plate by 180°, and a full-wave plate by 360° (identity). This geometric interpretation makes it easy to predict the effect of any waveplate on any input polarization state [5, 6].

5Polarization by Reflection

5.1Fresnel Equations

When light strikes the interface between two media with refractive indices n1n_1 and n2n_2, the reflected and transmitted amplitudes depend on the polarization state. The two independent linear polarization states used in reflection analysis are s-polarization (electric field perpendicular to the plane of incidence, from the German senkrecht) and p-polarization (electric field parallel to the plane of incidence). The Fresnel amplitude reflection and transmission coefficients are:

Fresnel Reflection Coefficients
rs=n1cosθin2cosθtn1cosθi+n2cosθt,rp=n2cosθin1cosθtn2cosθi+n1cosθtr_s = \frac{n_1\cos\theta_i - n_2\cos\theta_t}{n_1\cos\theta_i + n_2\cos\theta_t}, \quad r_p = \frac{n_2\cos\theta_i - n_1\cos\theta_t}{n_2\cos\theta_i + n_1\cos\theta_t}
Fresnel Transmission Coefficients
ts=2n1cosθin1cosθi+n2cosθt,tp=2n1cosθin2cosθi+n1cosθtt_s = \frac{2n_1\cos\theta_i}{n_1\cos\theta_i + n_2\cos\theta_t}, \quad t_p = \frac{2n_1\cos\theta_i}{n_2\cos\theta_i + n_1\cos\theta_t}

Here θi\theta_i is the angle of incidence and θt\theta_t is the angle of refraction, related by Snell's law: n1sinθi=n2sinθtn_1\sin\theta_i = n_2\sin\theta_t. The power reflectance and transmittance are:

Power Reflectance and Transmittance
Rs=rs2,Rp=rp2,Ts=n2cosθtn1cosθits2,Tp=n2cosθtn1cosθitp2R_s = |r_s|^2, \quad R_p = |r_p|^2, \quad T_s = \frac{n_2\cos\theta_t}{n_1\cos\theta_i}|t_s|^2, \quad T_p = \frac{n_2\cos\theta_t}{n_1\cos\theta_i}|t_p|^2

Note that the factor n2cosθt/(n1cosθi)n_2\cos\theta_t / (n_1\cos\theta_i) in the transmittance accounts for the change in beam cross-section and wave impedance between the two media. Energy conservation requires R+T=1R + T = 1 for each polarization (in a lossless medium) [1, 2, 4].

5.2Brewster's Angle

Brewster's angle θB\theta_B is the angle of incidence at which the p-polarized reflection coefficient vanishes (rp=0r_p = 0). At this angle, the reflected beam is entirely s-polarized. Brewster's angle satisfies:

Brewster's Angle
θB=arctan ⁣(n2n1)\theta_B = \arctan\!\left(\frac{n_2}{n_1}\right)

At Brewster's angle, the reflected and refracted rays are perpendicular to each other (θi+θt=90\theta_i + \theta_t = 90^{\circ}). This is exploited in Brewster windows (used in gas lasers to introduce a polarization-selective loss that forces the laser to oscillate in the p-polarization), and in pile-of-plates polarizers (multiple glass plates at Brewster's angle, each reflection removing more s-polarized light) [1, 2].

Worked Example: Brewster's Angle for BK7 Glass

Calculate Brewster's angle for light incident from air onto a BK7 glass surface at 550 nm, where the refractive index of BK7 is approximately 1.518.

θB=arctan ⁣(n2n1)=arctan ⁣(1.5181.000)=56.6\theta_B = \arctan\!\left(\frac{n_2}{n_1}\right) = \arctan\!\left(\frac{1.518}{1.000}\right) = 56.6^{\circ}

At 56.6° angle of incidence, the p-polarized reflectance is zero. The reflected beam is purely s-polarized, and the refracted angle is θt=9056.6=33.4\theta_t = 90^{\circ} - 56.6^{\circ} = 33.4^{\circ}.

Brewster's AngleNormalθBθBθtUnpolarizeds-polarized onlyPartially polarizedAir (n₁ = 1.00)Glass (n₂ = 1.52)● s-pol (out of plane)↔ p-pol (in plane)θB + θt = 90°θB = arctan(n₂/n₁) ≈ 56.6°
Figure 5.1 — Brewster's angle geometry. At θ_B ≈ 56.6° for glass, the reflected beam is purely s-polarized and the reflected and refracted rays are perpendicular.

5.3TIR and Phase Shifts

When light travels from a denser medium to a less dense medium (n1>n2n_1 > n_2) and the angle of incidence exceeds the critical angle θc=arcsin(n2/n1)\theta_c = \arcsin(n_2/n_1), total internal reflection (TIR) occurs. All incident power is reflected, but the s- and p-components acquire different phase shifts. The phase shifts upon TIR are:

Phase Shifts at TIR
tan ⁣(δs2)=sin2θi(n2/n1)2cosθi,tan ⁣(δp2)=sin2θi(n2/n1)2(n2/n1)2cosθi\tan\!\left(\frac{\delta_s}{2}\right) = -\frac{\sqrt{\sin^2\theta_i - (n_2/n_1)^2}}{\cos\theta_i}, \quad \tan\!\left(\frac{\delta_p}{2}\right) = -\frac{\sqrt{\sin^2\theta_i - (n_2/n_1)^2}}{(n_2/n_1)^2\cos\theta_i}

The difference δpδs\delta_p - \delta_s between the p- and s-phase shifts is a function of the angle of incidence and the refractive index ratio. By choosing the angle of incidence appropriately, a specific retardance can be introduced between the s- and p-components. This is the operating principle of the Fresnel rhomb: two TIR reflections at a carefully chosen angle produce a total phase difference of π/2\pi/2 (quarter-wave retardation), converting linearly polarized input to circular polarization. The Fresnel rhomb has the advantage of being achromatic the retardance varies only slowly with wavelength because it depends on geometry rather than material dispersion [1, 4].

5.4Metallic Surfaces

Metallic surfaces have a complex refractive index n~=n+iκ\tilde{n} = n + i\kappa, where κ\kappa is the extinction coefficient. The Fresnel equations remain valid with n2n_2 replaced by n~\tilde{n}, but the reflection coefficients become complex even at normal incidence. Both s- and p-polarized light acquire different phase shifts upon reflection, so linearly polarized light reflected from a metal surface generally becomes elliptically polarized.

Metals do not exhibit a true Brewster's angle (since rpr_p never reaches exactly zero), but they have a pseudo-Brewster angle where the p-reflectance reaches a minimum. The pseudo-Brewster angle and the associated reflectance minimum depend on the optical constants of the metal. For highly reflective metals like silver and aluminum, the p-reflectance minimum is shallow (a few percent dip), while for less reflective metals the minimum is more pronounced. Ellipsometry exploits these polarization-dependent reflection properties to determine the optical constants nn and κ\kappa of thin films and surfaces [1, 4].

5.5Fresnel Reflectance at 45°

Worked Example: Fresnel Reflectance at 45\u00B0 for BK7

Calculate the s- and p-polarized power reflectance for light at 550 nm incident at 45° on an uncoated BK7 surface (n = 1.518).

sinθt=n1n2sinθi=1.0001.518sin45=0.70711.518=0.4659\sin\theta_t = \frac{n_1}{n_2}\sin\theta_i = \frac{1.000}{1.518}\sin 45^{\circ} = \frac{0.7071}{1.518} = 0.4659
θt=arcsin(0.4659)=27.76\theta_t = \arcsin(0.4659) = 27.76^{\circ}
rs=1.000×cos451.518×cos27.761.000×cos45+1.518×cos27.76=0.70711.34270.7071+1.3427=0.63562.0498=0.3101r_s = \frac{1.000 \times \cos 45^{\circ} - 1.518 \times \cos 27.76^{\circ}}{1.000 \times \cos 45^{\circ} + 1.518 \times \cos 27.76^{\circ}} = \frac{0.7071 - 1.3427}{0.7071 + 1.3427} = \frac{-0.6356}{2.0498} = -0.3101
rp=1.518×cos451.000×cos27.761.518×cos45+1.000×cos27.76=1.07330.88491.0733+0.8849=0.18841.9582=0.0962r_p = \frac{1.518 \times \cos 45^{\circ} - 1.000 \times \cos 27.76^{\circ}}{1.518 \times \cos 45^{\circ} + 1.000 \times \cos 27.76^{\circ}} = \frac{1.0733 - 0.8849}{1.0733 + 0.8849} = \frac{0.1884}{1.9582} = 0.0962
Rs=rs2=0.31012=0.0962=9.62%R_s = |r_s|^2 = 0.3101^2 = 0.0962 = 9.62\%
Rp=rp2=0.09622=0.0093=0.93%R_p = |r_p|^2 = 0.0962^2 = 0.0093 = 0.93\%

At 45°, s-polarized light reflects about 10 times more strongly than p-polarized light. This large polarization-dependent reflectance is why uncoated glass plates at 45° act as partial polarizers, and why anti-reflection coatings must be optimized for both polarizations at non-normal incidence.

6Polarizers

6.1Dichroic Polarizers

Dichroic polarizers selectively absorb one polarization component while transmitting the orthogonal component. The most common example is Polaroid film, which uses aligned long-chain polymer molecules (polyvinyl alcohol doped with iodine) that strongly absorb light with its electric field parallel to the polymer chains. Modern dichroic sheet polarizers achieve extinction ratios of 10³ to 10 across the visible spectrum and are available in large formats at low cost [1, 8].

Dichroic polarizers have several practical advantages: they are thin, lightweight, available in large apertures, and work over a wide acceptance angle. Their main limitations are absorption of the rejected polarization (which limits power handling and causes heating), moderate extinction ratio compared to crystalline polarizers, and limited UV and IR performance. They are widely used in displays, photography (polarizing filters), optical instruments, and educational laboratories.

6.2Birefringent Polarizers

Birefringent (or crystal) polarizers exploit the different refractive indices experienced by the ordinary and extraordinary rays in an anisotropic crystal to spatially separate the two polarization components. They achieve very high extinction ratios (10 to 10) and handle high optical powers because the rejected polarization is deflected rather than absorbed.

Glan-Thompson polarizer: Two calcite prisms cemented together with the optic axes parallel. The ordinary ray is totally internally reflected at the cement layer (which has a refractive index between the ordinary and extraordinary indices), while the extraordinary ray is transmitted. Provides the highest extinction ratio (> 10) and largest acceptance angle (up to ±15°) among crystal polarizers, but the cement layer limits the damage threshold and UV transmission.

Glan-Taylor polarizer: Similar geometry to the Glan-Thompson but with an air gap between the prisms instead of cement. The air gap provides much higher damage threshold and UV transmission (down to about 214 nm with calcite), but the acceptance angle is smaller (±5° to ±8°) and the extinction ratio is somewhat lower (10 to 10).

Wollaston prism: Two calcite wedges with perpendicular optic axes cemented together. Both the ordinary and extraordinary rays exit the prism, symmetrically deviated about the input beam axis. This is useful when both polarization components are needed (e.g., in polarimeters or differential measurements). The deviation angle depends on the wedge angle and is typically 15° to 45°.

Rochon prism: Similar to the Wollaston but with a different optic axis arrangement. The ordinary ray passes through undeviated while the extraordinary ray is deviated. This provides a convenient undeviated reference beam [1, 5, 6].

6.3Wire-Grid Polarizers

Wire-grid polarizers consist of a parallel array of thin metallic wires (or lithographically patterned metal strips) on a transparent substrate. The wire spacing must be much smaller than the wavelength. Light polarized parallel to the wires drives conduction currents along the wires, causing reflection (and some absorption). Light polarized perpendicular to the wires cannot drive currents efficiently and is transmitted.

Historically, wire-grid polarizers were limited to the infrared (where wire spacings of a few micrometers suffice), but advances in nanolithography have enabled wire-grid polarizers for the visible and near-UV, with wire pitches below 150 nm. Modern visible wire-grid polarizers achieve extinction ratios of 10³ to 10 with high transmission (8090%) and work over wide angles of incidence. They are used in projection displays, imaging polarimeters, and applications requiring polarization components on flat substrates [5, 8].

6.4Thin-Film Polarizers

Thin-film polarizers use multilayer dielectric coatings designed to reflect one polarization and transmit the other at a specific angle of incidence (typically 45° or near Brewster's angle for the coating materials). They are commonly implemented as polarizing beam splitter (PBS) cubes and thin-film plate polarizers.

PBS cubes: A dielectric multilayer coating is deposited on the hypotenuse of one right-angle prism, and a second prism is cemented to it. P-polarized light is transmitted through the cube, while s-polarized light is reflected at 90°. PBS cubes provide both output beams in convenient, well-defined directions and are widely used in laser systems, interferometers, and optical instruments. Extinction ratios of 10³ for the reflected beam and 10² to 10³ for the transmitted beam are typical, though high-performance designs can achieve 10 or better in transmission [5, 8].

Plate polarizers: A dielectric multilayer on a flat glass substrate at Brewster's angle (or a design angle near 56°). These are thinner, lighter, and handle higher power than PBS cubes (no cement layer), but produce only one clean output beam (the reflected s-polarized beam exits at an angle that depends on the plate orientation).

Polarizer TypesDichroic (Absorption)AbsorbedUnpolarizedLinearly polarizedTransmission axis
Figure 6.1 — Three common polarizer mechanisms. Toggle to compare dichroic absorption, birefringent splitting (Wollaston prism), and thin-film PBS cube operation.

6.5Malus's Law

When linearly polarized light with intensity I0I_0 passes through a linear polarizer whose transmission axis makes an angle θ\theta with the polarization direction, the transmitted intensity is given by Malus's law:

Malus's Law
I(θ)=I0cos2θI(\theta) = I_0\cos^2\theta

At θ=0\theta = 0^{\circ}, all the light is transmitted; at θ=90\theta = 90^{\circ}, no light is transmitted (for an ideal polarizer). Malus's law is a direct consequence of the projection of the input electric field onto the polarizer's transmission axis. For unpolarized light incident on an ideal polarizer, the transmitted intensity is I0/2I_0/2 (averaging cos2θ\cos^2\theta over all angles gives 1/2) [1, 2].

Malus's LawE₀Polarized inputAnalyzerE₀θE₀cosθI = I₀cos²θ
Figure 6.2 — Malus's law: the transmitted intensity through an analyzer depends on the projection of the input E-field onto the transmission axis. The right panel shows the vector decomposition.
Worked Example: Three-Polarizer Paradox

Two crossed polarizers (transmission axes at 0° and 90°) transmit no light. What happens if a third polarizer is inserted between them at 45°?

I1=I02I_1 = \frac{I_0}{2}

The output is linearly polarized at 0°.

I2=I1cos245=I02×12=I04I_2 = I_1\cos^2 45^{\circ} = \frac{I_0}{2} \times \frac{1}{2} = \frac{I_0}{4}

The output is now linearly polarized at 45°.

I3=I2cos245=I04×12=I08I_3 = I_2\cos^2 45^{\circ} = \frac{I_0}{4} \times \frac{1}{2} = \frac{I_0}{8}

Inserting the intermediate polarizer increases the transmitted intensity from zero to I0/8I_0/8. The 45° polarizer projects the horizontal polarization onto its axis, creating a component that can then pass through the final vertical polarizer. This demonstrates that a polarizer is not merely a filter it redefines the polarization state of the transmitted light.

6.6Extinction Ratio

The extinction ratio (ER) of a polarizer quantifies the contrast between the transmitted intensities for the two orthogonal polarization states. It is defined as the ratio of maximum to minimum transmission:

Extinction Ratio
ER=TmaxTmin\text{ER} = \frac{T_{\text{max}}}{T_{\text{min}}}

Extinction ratio is often expressed in decibels: ER (dB)=10log10(Tmax/Tmin)\text{ER (dB)} = 10\log_{10}(T_{\text{max}}/T_{\text{min}}). A higher extinction ratio means better polarization purity. The required extinction ratio depends on the application: visual displays may tolerate 10³ (30 dB), while precision polarimetry may require 10 (50 dB) or better [5, 6].

Polarizer TypeExtinction RatioTransmission (%)Acceptance AngleDamage ThresholdSpectral Range
Dichroic sheet10³–10⁴40–85Wide (±60°)Low (< 1 W/cm²)Visible
Glan-Thompson> 10⁵90–95±15°Moderate (cement limited)350–2300 nm
Glan-Taylor10⁴–10⁵85–90±5–±8°High (air-spaced)214–2300 nm
Wollaston prism10⁵90–95±10°Moderate350–2300 nm
Wire-grid (visible)10³–10⁴80–90Wide (±50°)Moderate400–2000 nm
Thin-film PBS cube10²–10⁴90–95±2–±5°Moderate (cement limited)Narrowband
Thin-film plate10³–10⁴85–95±2°HighNarrowband

7Waveplates and Retarders

7.1Birefringence

A waveplate (also called a retarder or retardation plate) is a flat optical element made from a birefringent material. Birefringent materials have two principal refractive indices: the ordinary index non_o and the extraordinary index nen_e. The axis with the lower refractive index is called the fast axis (light polarized along it travels faster), and the orthogonal axis is the slow axis.

When linearly polarized light enters a waveplate, the component along the fast axis and the component along the slow axis travel at different speeds. After traversing a thickness dd, the two components accumulate a phase difference (retardance) Γ\Gamma:

Retardance
Γ=2πλ(neno)d\Gamma = \frac{2\pi}{\lambda}(n_e - n_o)\,d

The retardance is typically specified in fractions of a wavelength: λ/4\lambda/4 (quarter-wave), λ/2\lambda/2 (half-wave), or λ\lambda (full-wave). The corresponding phase differences are π/2\pi/2, π\pi, and 2π2\pi, respectively [1, 2, 8].

Waveplate Operation (QWP)Before QWP45°LinearAfter QWPRCPCircularFastSlowQWPλ/4Input at 45° to fast axis → λ/4 retardation → circular output
Figure 7.1 — Quarter-wave plate operation. Linearly polarized input at 45° to the fast axis is converted to right-hand circular polarization. Insets show the E-field trace before and after the QWP.

7.2Quarter-Wave Plates

A quarter-wave plate (QWP) introduces a retardance of Γ=π/2\Gamma = \pi/2 (one quarter of a wavelength). Its key functions are converting linearly polarized light to circularly polarized light (when the input polarization is at 45° to the fast axis) and converting circularly polarized light to linearly polarized light.

When linearly polarized light at an arbitrary angle θ\theta to the fast axis enters a QWP, the output is generally elliptically polarized. Circular output occurs only when θ=±45\theta = \pm 45^{\circ}. For θ=+45\theta = +45^{\circ}, the output is right-hand circular; for θ=45\theta = -45^{\circ}, the output is left-hand circular (using the optics convention). Quarter-wave plates are essential components in optical isolators, circular polarimetry, and ellipsometry [1, 8, 9].

7.3Half-Wave Plates

A half-wave plate (HWP) introduces a retardance of Γ=π\Gamma = \pi (one half of a wavelength). Its primary function is rotating the orientation of linearly polarized light. If linearly polarized light enters the HWP at angle θ\theta to the fast axis, the output is linearly polarized at angle θ-\theta to the fast axis effectively a rotation by 2θ2\theta:

HWP Polarization Rotation
θout=2θfastθin\theta_{\text{out}} = 2\theta_{\text{fast}} - \theta_{\text{in}}

where θfast\theta_{\text{fast}} is the fast-axis angle and θin\theta_{\text{in}} is the input polarization angle, both measured from the same reference. The HWP is the most common component for continuously adjustable polarization rotation: rotating the HWP by angle α\alpha rotates the output polarization by 2α2\alpha. It is also used to swap the handedness of circularly polarized light (RCP becomes LCP and vice versa) [1, 8, 9].

7.4Full-Wave Plates

A full-wave plate introduces a retardance of Γ=2π\Gamma = 2\pi (one full wavelength) at the design wavelength. At this wavelength, the full-wave plate has no effect on the polarization state it is the identity element. However, at other wavelengths, the retardance differs from 2π2\pi, and the plate acts as a waveplate with a wavelength-dependent retardance.

Full-wave plates are used as tunable color filters in polarization microscopy (the technique is called polarization color imaging or first-order red plate). When placed between crossed polarizers, a full-wave plate transmits white light minus the design wavelength, producing a characteristic magenta color. Birefringent specimens then shift the effective retardance, changing the transmitted color and enabling qualitative assessment of birefringence magnitude and sign [1].

7.5Multiple-Order vs Zero-Order vs True Zero-Order

The retardance formula Γ=2π(neno)d/λ\Gamma = 2\pi(n_e - n_o)d/\lambda allows multiple solutions for the thickness dd that gives a quarter-wave or half-wave retardance. These solutions differ by integer multiples of λ/(neno)\lambda/(n_e - n_o).

Multiple-order waveplates use a single thick plate where the retardance is (m+1/4)λ(m + 1/4)\lambda or (m+1/2)λ(m + 1/2)\lambda for integer m0m \gg 0. They are inexpensive and easy to manufacture but highly sensitive to wavelength, temperature, and angle of incidence because the fractional change in retardance depends on the total retardance, which is large.

Compound zero-order waveplates consist of two multiple-order plates with their fast axes crossed. The retardance is the difference between the two plates' retardances, which can be made small (quarter-wave or half-wave) even though each individual plate has many waves of retardance. Temperature and wavelength sensitivities largely cancel because both plates are affected equally.

True zero-order waveplates use a single thin plate with thickness d=λ/(4(neno))d = \lambda/(4(n_e - n_o)) or d=λ/(2(neno))d = \lambda/(2(n_e - n_o)). For quartz (neno0.009n_e - n_o \approx 0.009 at 550 nm), the true zero-order QWP thickness is approximately 15 μm, which is too thin to handle as a free-standing plate. True zero-order waveplates are therefore made by depositing birefringent polymer films on a glass substrate. They offer the best wavelength, temperature, and angular performance [8, 9].

Worked Example: Quartz QWP Thickness Calculation

Calculate the thickness of a true zero-order quartz quarter-wave plate for 633 nm. The birefringence of quartz is neno=0.00920n_e - n_o = 0.00920 at 633 nm.

Γ=π2=2πλ(neno)d\Gamma = \frac{\pi}{2} = \frac{2\pi}{\lambda}(n_e - n_o)\,d
d=λ4(neno)=633  nm4×0.00920=6330.0368  nm=17,200  nm=17.2  μmd = \frac{\lambda}{4(n_e - n_o)} = \frac{633\;\text{nm}}{4 \times 0.00920} = \frac{633}{0.0368}\;\text{nm} = 17{,}200\;\text{nm} = 17.2\;\mu\text{m}

A true zero-order quartz QWP at 633 nm is only 17.2 μm thick too thin to polish as a free-standing element. In practice, quartz waveplates at this thickness are either cemented to a glass substrate, or a compound zero-order design is used instead (two thicker plates with their fast axes crossed, with a thickness difference of 17.2 μm).

7.6Achromatic and Super-Achromatic Waveplates

Standard waveplates provide the design retardance at only one wavelength. The retardance varies inversely with wavelength (Γ1/λ\Gamma \propto 1/\lambda for a fixed thickness), plus the birefringence itself is wavelength-dependent. For broadband applications, achromatic and super-achromatic designs are available.

Achromatic waveplates use two or three plates of different birefringent materials (commonly quartz and magnesium fluoride) with their fast axes at specific angles. The material dispersion of one material partially compensates the other, flattening the retardance vs. wavelength curve. Achromatic QWPs typically maintain λ/4 ± 2% over a 200300 nm bandwidth.

Super-achromatic waveplates (also called Pancharatnam designs) use three or more birefringent plates at precisely calculated orientations. They achieve nearly constant retardance over very broad wavelength ranges (e.g., 3251100 nm for a quartz/MgF design). The design principle exploits the geometric (Pancharatnam) phase on the Poincaré sphere [5, 8, 9].

Waveplate Order ComparisonWavelength (nm)Retardation (waves)4005006007008000.150.200.250.300.35λ/4600 nmMultiple-orderFlatter curves maintain target retardation across broader wavelength range
Figure 7.2 — Retardation vs. wavelength for multiple-order, compound zero-order, and true zero-order waveplates. Toggle to isolate each curve. True zero-order provides the flattest response across wavelength.
Waveplate TypeBandwidthTemperature SensitivityAngular SensitivityTypical Cost
Multiple-order± 1–2 nmHighHighLow
Compound zero-order± 10–20 nmLowLowModerate
True zero-order (polymer)± 30–50 nmVery lowVery lowModerate
Achromatic (2-material)200–300 nmLowLowHigh
Super-achromatic400–1000+ nmVery lowLowVery high
Fresnel rhombInherently broadbandVery lowN/A (geometry)Moderate–high

7.7Fresnel Rhomb Retarders

The Fresnel rhomb is a solid glass prism (typically a parallelogram or rhombus shape) that produces retardation by exploiting the differential phase shift between s- and p-polarized light at total internal reflection. Two TIR events at a carefully chosen internal angle (typically near 54.6° for BK7 glass) produce a total retardance of λ/4\lambda/4.

The key advantage of the Fresnel rhomb is its achromatic performance: the retardance depends on the TIR phase shifts, which vary only slowly with wavelength. A well-designed Fresnel rhomb maintains quarter-wave retardance within a few percent over the entire transparent range of the glass substrate (e.g., 3502000 nm for BK7). The main disadvantages are the beam displacement introduced by the rhomb geometry, the sensitivity to the angle of incidence (which affects the internal reflection angle), and the larger physical size compared to a thin waveplate. Double Fresnel rhombs (two rhombs in series) can produce half-wave retardation with achromatic performance [1, 4].

8Optical Activity and Faraday Rotation

8.1Natural Optical Activity

Optical activity (also called optical rotation) is the property of certain materials to rotate the plane of polarization of linearly polarized light as it propagates through the material. Optically active materials include crystalline quartz (along the optic axis), sugar solutions, amino acids, and many organic molecules. The rotation angle ϕ\phi is proportional to the path length dd:

Optical Rotation
ϕ=ρd\phi = \rho \cdot d

where ρ\rho is the rotary power (degrees per unit length) of the material. For crystalline quartz at 589.3 nm, ρ21.7/mm\rho \approx 21.7^{\circ}/\text{mm}. Optically active materials exist in two enantiomeric forms: dextrorotatory (d- or +, rotates polarization clockwise when viewed facing the source) and levorotatory (l- or , rotates counter-clockwise) [1, 4].

Optical activity arises from circular birefringence: the material has different refractive indices for right-hand and left-hand circularly polarized light. Linearly polarized light, which can be decomposed into equal RCP and LCP components, accumulates a differential phase between these components as it propagates, resulting in a net rotation of the linear polarization direction.

8.2Specific Rotation

For optically active solutions, the rotation is proportional to both the path length and the concentration. Biot's law gives the rotation as:

Biot's Law
ϕ=[α]λTcd\phi = [\alpha]_\lambda^T \cdot c \cdot d

where [α]λT[\alpha]_\lambda^T is the specific rotation (degrees · mL / (g · dm)) at wavelength λ\lambda and temperature TT, cc is the concentration (g/mL), and dd is the path length (dm). The specific rotation is a material property that depends on wavelength (approximately as 1/λ21/\lambda^2, known as optical rotatory dispersion) and temperature. Saccharimetry (measuring sugar concentration by optical rotation) is one of the oldest applications of polarimetry in chemistry and food science [1, 4].

Worked Example: Optical Rotation in Crystalline Quartz

Calculate the rotation angle for linearly polarized light at 589.3 nm (sodium D-line) passing through a 10 mm thick quartz plate oriented along the optic axis. The rotary power of quartz at this wavelength is 21.7°/mm.

ϕ=ρd=21.7/mm×10  mm=217\phi = \rho \cdot d = 21.7^{\circ}\text{/mm} \times 10\;\text{mm} = 217^{\circ}

The polarization is rotated by 217°, which is equivalent to 217° 180° = 37° beyond a half-turn. This large rotation illustrates why quartz optical rotators are practical in compact packages significant rotation angles are achieved with short path lengths.

8.3Faraday Effect

The Faraday effect is the rotation of the polarization plane of linearly polarized light when it propagates through a material in the presence of a magnetic field parallel to the propagation direction. The Faraday rotation angle is:

Faraday Rotation
θF=VBd\theta_F = V \cdot B \cdot d

where VV is the Verdet constant of the material (rad/(T·m) or min/(Oe·cm)), BB is the magnetic flux density (T), and dd is the path length through the material (m). The Verdet constant is material- and wavelength-dependent: it is large for heavy glasses (SF-57: V23V \approx 23 rad/(T·m) at 633 nm), terbium gallium garnet (TGG: V134V \approx -134 rad/(T·m) at 633 nm), and terbium-doped glasses [1, 3].

The crucial distinction between Faraday rotation and natural optical activity is that Faraday rotation is non-reciprocal: the rotation direction is determined by the magnetic field direction, not the light propagation direction. A beam that traverses the material forward and then backward accumulates double the rotation (rather than canceling). This non-reciprocal property is the basis of optical isolators, which use a Faraday rotator combined with polarizers to allow light to pass in one direction while blocking light traveling in the opposite direction. Optical isolators are essential for protecting lasers from back-reflections [3, 5].

8.4Connection to Magnetism

The Faraday effect provides a direct link between polarization optics and magnetism. The Verdet constant is proportional to the dispersion of the refractive index (Vλdn/dλV \propto \lambda\,dn/d\lambda), which means materials with strong dispersion in the visible (heavy flint glasses, rare-earth doped crystals) have the largest Verdet constants and are the most efficient Faraday rotators.

Magneto-optic effects beyond simple Faraday rotation include the magneto-optic Kerr effect (MOKE, polarization change upon reflection from a magnetized surface), used extensively in magnetic domain imaging and magneto-optic data storage readout; and the Cotton-Mouton effect (magnetic linear birefringence), which is typically much weaker than the Faraday effect. For system-level design involving magneto-optic components, including Faraday rotator specifications, magnetic circuit design, and isolator performance prediction, see the 🔧 Magneto-Optic Design Calculator.

9Practical Considerations

9.1Wavelength Dependence

Nearly all polarization components exhibit wavelength dependence. Waveplate retardance scales approximately as 1/λ1/\lambda (plus the material dispersion of birefringence), so a quarter-wave plate at 633 nm provides more retardance at shorter wavelengths and less at longer wavelengths. Dichroic polarizers have wavelength-dependent extinction ratios and transmission. Thin-film polarizers (PBS cubes, plate polarizers) are designed for specific wavelength bands and can perform poorly outside their design range.

When operating at a wavelength different from the design wavelength, the effective retardance of a waveplate changes. For a zero-order waveplate designed at λ0\lambda_0, the retardance at wavelength λ\lambda is approximately Γ(λ)Γ0×λ0/λ\Gamma(\lambda) \approx \Gamma_0 \times \lambda_0/\lambda (neglecting the dispersion of birefringence). For a multiple-order waveplate, the deviation is amplified by the order number mm, making the wavelength sensitivity much worse [8, 9].

9.2Temperature Sensitivity

Temperature affects waveplate retardance through two mechanisms: thermal expansion (changes in thickness dd) and the temperature dependence of birefringence (d(neno)/dTd(n_e - n_o)/dT). For quartz, the birefringence temperature coefficient is approximately 1×107-1 \times 10^{-7} /°C, which is small. However, for a multiple-order waveplate with total retardance of, say, 30 waves, even this small coefficient produces a significant change in the fractional retardance.

True zero-order and compound zero-order waveplates minimize temperature sensitivity because the total retardance (and therefore the temperature-induced change) is small. Polymer true zero-order waveplates typically achieve retardance stability of ±0.5%\pm 0.5\% over a 50°C temperature range. For cryogenic or high-temperature applications, specify waveplates tested and characterized at the operating temperature [8, 9].

9.3Angle of Incidence Effects

Waveplates are designed for normal incidence. At non-normal incidence, the effective path length through the birefringent material increases, and the projections of the ordinary and extraordinary refractive indices change. The result is an angle-dependent retardance shift. For a zero-order waveplate, the retardance change is approximately proportional to θ2\theta^2 (for small angles θ\theta), and is typically a few percent for ±5° tilt.

Crystal polarizers (Glan-Thompson, Glan-Taylor) have specified acceptance half-angles beyond which the polarization purity degrades. Exceeding the acceptance angle causes the rejected ray to leak into the transmitted beam, reducing the extinction ratio. PBS cubes also have limited angular acceptance: the coating design is optimized for a specific angle of incidence (45° inside the cube), and deviations reduce the extinction ratio [5, 8].

9.4Damage Threshold

Polarization components can be damaged by high laser powers or fluences. The damage mechanism depends on the component type: dichroic polarizers fail by thermal damage to the absorbing dye (low LIDT, typically < 1 W/cm² CW); cemented crystal polarizers and PBS cubes are limited by the cement layer (typically 15 J/cm² at 10 ns, or 10100 W/cm² CW); air-spaced crystal polarizers (Glan-Taylor) are limited by the crystal surface damage threshold (typically 520 J/cm² at 10 ns); and waveplates are limited by coating damage (AR coatings on the surfaces) or the substrate material.

For high-power applications, use air-spaced Glan-Taylor or Glan-laser polarizers, plate-type thin-film polarizers (no cement), and appropriately coated waveplates. Always verify the LIDT specification against the actual beam parameters (peak fluence, not average power) at the component location [5, 8].

9.5Common Specification Parameters

MaterialBirefringence (nₑ − nₒ)Transparency Range (nm)Damage ThresholdCommon Use
Crystalline quartz0.0091 (at 589 nm)200–2700HighWaveplates, rotators
Calcite0.172 (at 589 nm)350–2300ModeratePolarizers (Glan types)
Magnesium fluoride0.012 (at 589 nm)120–7000HighUV/IR achromatic waveplates
α-BBO0.116 (at 589 nm)189–3500HighUV polarizers, high-power
YVO₄0.204 (at 633 nm)400–5000HighCompact waveplates
Polymer (stretched)0.001–0.005400–800LowTrue zero-order waveplates
Mica0.005 (at 589 nm)350–6000ModerateLow-cost QWPs
LiNbO₃0.086 (at 633 nm)350–5200ModerateElectro-optic retarders

10Selection Workflow

10.1Identifying the Need

The first step in selecting polarization components is identifying which of three fundamental tasks is required: generating a specific polarization state from unpolarized or arbitrarily polarized light, analyzing an unknown polarization state (measuring its Stokes parameters or degree of polarization), or manipulating a known polarization state (rotating, converting between linear and circular, compensating unwanted polarization effects).

Generating polarized light: Start with a polarizer to produce linear polarization, then add waveplates as needed to produce circular or elliptical states. The polarizer type is selected based on the required extinction ratio, wavelength, aperture, power level, and cost.

Analyzing polarization: A polarimeter requires at minimum a rotating polarizer (for linear Stokes parameters) and a quarter-wave plate (for the circular Stokes parameter). Division-of-amplitude or division-of-time architectures are used depending on speed and accuracy requirements.

Manipulating polarization: Half-wave plates rotate linear polarization; quarter-wave plates convert between linear and circular; Faraday rotators provide non-reciprocal rotation for isolation. The waveplate type (multiple-order, zero-order, achromatic) is selected based on the wavelength range, temperature stability, and angular sensitivity requirements [5, 6].

10.2Selecting Polarizer Type

Step 1 Determine the extinction ratio requirement. For visual applications and displays, 10³ (30 dB) is usually sufficient; a dichroic sheet polarizer is the most cost-effective choice. For scientific measurements, 10 to 10 is needed; a Glan-Taylor, Glan-Thompson, or high-quality PBS cube is appropriate. For precision polarimetry, 10 to 10 requires a Glan-Thompson or calcite Wollaston prism.

Step 2 Determine the operating wavelength. For UV below 350 nm, use α-BBO or air-spaced calcite polarizers. For the visible (400700 nm), all polarizer types are available. For the near-IR (7002000 nm), calcite, wire-grid, or thin-film polarizers are used. For the mid- and far-IR, wire-grid polarizers on IR-transparent substrates are standard.

Step 3 Assess the power and damage requirements. For high CW power or high peak fluence, avoid cemented components and dichroic sheet polarizers. Use air-spaced Glan-Taylor polarizers, plate-type thin-film polarizers, or wire-grid polarizers on damage-resistant substrates.

Step 4 Consider the form factor. Sheet polarizers are thin and available in large sizes. Crystal polarizers are compact but limited to small apertures (typically 525 mm). PBS cubes are convenient for beam steering applications (providing two orthogonal output beams). Plate polarizers require operation at near-Brewster angles and produce a displaced beam.

10.3Selecting Waveplate Type

Step 1 Determine the required retardance. Quarter-wave for linear-to-circular conversion; half-wave for polarization rotation; variable retarders (liquid crystal, electro-optic, photoelastic modulators) for dynamic retardance control.

Step 2 Determine the wavelength range. For single-wavelength operation (laser line), a zero-order or compound zero-order waveplate is most cost-effective. For broadband operation, achromatic or super-achromatic waveplates, or a Fresnel rhomb, are required. For tunability, a Soleil-Babinet compensator or liquid crystal variable retarder provides continuously adjustable retardance at any wavelength.

Step 3 Assess environmental requirements. Temperature stability favors true zero-order or compound zero-order designs over multiple-order. For cryogenic or high-temperature operation, verify the waveplate performance at the operating temperature. For high power, verify the LIDT of the AR coatings on the waveplate surfaces.

Step 4 Consider cost and availability. Multiple-order waveplates are the least expensive but worst performing over wavelength and temperature. True zero-order polymer waveplates offer good performance at moderate cost. Achromatic and super-achromatic waveplates are the most expensive but essential for broadband systems. Fresnel rhombs are moderately expensive, inherently broadband, but bulky [8, 9].

10.4System-Level Management

In a complete optical system, managing polarization requires attention to every optical surface and element. Each uncoated surface at non-normal incidence acts as a partial polarizer (different reflectivities for s and p). Each mirror reflection can introduce a phase shift between s and p, changing the polarization state. Stress birefringence in windows, lenses, and prisms introduces unwanted retardance.

Best practices for system-level polarization management include: designing the optical path to minimize the number of reflections at non-normal incidence; using polarization-maintaining coatings (anti-reflection and mirror coatings optimized for equal s and p performance); orienting birefringent elements with their axes aligned to the system's principal polarization directions; placing polarizers and waveplates as close to the source as practical (before the beam encounters polarization-altering surfaces); and performing a full polarization ray trace (using the Jones or Mueller calculus through the entire system) to predict the polarization state at each point in the system.

For imaging systems, field-dependent polarization effects (variation of the polarization state across the field of view) can limit contrast and resolution. These effects arise from the angle-dependent Fresnel coefficients at each surface and are particularly important in high-numerical-aperture systems such as lithographic projection lenses and microscope objectives [5, 6].

References

  1. [1]E. Hecht, Optics, 5th ed. Pearson, 2017.
  2. [2]F. L. Pedrotti, L. M. Pedrotti, and L. S. Pedrotti, Introduction to Optics, 3rd ed. Cambridge University Press, 2017.
  3. [3]B. E. A. Saleh and M. C. Teich, Fundamentals of Photonics, 3rd ed. Wiley, 2019.
  4. [4]M. Born and E. Wolf, Principles of Optics, 7th ed. Cambridge University Press, 2019.
  5. [5]R. A. Chipman, W. S. T. Lam, and G. Young, Polarized Light and Optical Systems. CRC Press, 2018.
  6. [6]E. Collett, Field Guide to Polarization, SPIE Press, 2005.
  7. [7]SCHOTT, Optical Glass Data Sheets, 2024.
  8. [8]Edmund Optics, “Understanding Waveplates,” Technical Application Note.
  9. [9]Newport/MKS, “Introduction to Waveplates,” Application Note.
  10. [10]Thorlabs, “Polarization Optics Tutorial,” Web Resource.

All information, equations, and calculations have been compiled and verified to the best of our ability. For mission-critical applications, we recommend independent verification of all values. If you find an error, please let us know.