Polarization & Polarizers
A complete guide to polarization states, formalisms, polarizing components, waveplates, and system-level design for optical engineers.
▸1Introduction to Polarization
1.1The Transverse Nature of Light
Light is a transverse electromagnetic wave: the electric field vector and the magnetic field vector oscillate perpendicular to the direction of propagation. For a monochromatic plane wave traveling in the -direction, the electric field can be written as the superposition of two orthogonal components:
Here and are the amplitudes of the x- and y-components, is the wavenumber, is the angular frequency, and and are the phase offsets of each component. The polarization state of the wave is determined entirely by the amplitude ratio and the phase difference . The magnetic field carries the same information and is fully determined by Maxwell's equations, so polarization is conventionally described using only the electric field.
1.2Historical Context
The concept of polarization emerged from observations that could not be explained by a simple scalar wave theory. In 1669, Erasmus Bartholinus discovered double refraction (birefringence) in calcite crystals, observing that a single incident ray produced two refracted rays. Christiaan Huygens attempted to explain this with his wave theory in 1690, but the full explanation required the concept of transverse oscillations, which was not yet established.
Etienne-Louis Malus discovered polarization by reflection in 1808 when he observed that light reflected from a glass window, viewed through a calcite crystal, showed intensity variations as the crystal was rotated. He coined the term “polarization” by analogy with the poles of a magnet. Sir David Brewster subsequently determined the angle of incidence at which reflected light is fully polarized (now called Brewster's angle) in 1815 [1, 4].
Augustin-Jean Fresnel provided the definitive theoretical framework in the 1820s by establishing that light waves are transverse, deriving the amplitude reflection and transmission coefficients (the Fresnel equations), and explaining birefringence as a consequence of directional variation in the refractive index. The Jones calculus (1941) and Stokes-Mueller calculus (Stokes 1852, Mueller 1948) later provided the matrix formalisms that are essential tools in modern optical engineering [4, 5].
1.3Why Polarization Matters
Polarization is a fundamental property of light that affects nearly every aspect of optical system design. Fresnel reflection and transmission at any interface depend on polarization: s-polarized and p-polarized light reflect differently, which means that any optical surface acts as a partial polarizer. Thin-film coating performance (anti-reflection coatings, mirrors, filters, beam splitters) is inherently polarization-dependent because the Fresnel coefficients at each layer boundary depend on the polarization state [1, 3].
In laser systems, polarization control is essential for beam combining, frequency doubling (which requires a specific linear polarization orientation relative to the crystal axis), electro-optic modulation, and isolation (optical isolators use the Faraday effect to create non-reciprocal polarization rotation). In imaging and metrology, polarization is used for stress analysis (photoelasticity), surface characterization (ellipsometry), glare reduction, and contrast enhancement. In telecommunications, polarization-division multiplexing doubles fiber capacity by encoding independent data streams on orthogonal polarization states [3, 5].
Understanding polarization is therefore not optional for the optical engineer — it is required for predicting system performance, diagnosing unexpected losses, and designing components that control the polarization state as needed.
▸2Polarization States
2.1Linear Polarization
When the phase difference between the x- and y-components is or , the electric field vector traces a straight line in the xy-plane as the wave propagates. The orientation of this line depends on the amplitude ratio . If , the wave is horizontally polarized (x-polarized); if , the wave is vertically polarized (y-polarized). Any intermediate ratio gives linear polarization at an angle to the x-axis.
Linear polarization is the most common polarization state encountered in practice. Most laser sources emit linearly polarized light (due to Brewster windows in the cavity, or polarization-selective coatings). Linear polarizers transmit one orientation and block the orthogonal orientation, producing linearly polarized light from an unpolarized source [1, 2].
2.2Circular Polarization
When the x- and y-amplitudes are equal () and the phase difference is , the electric field vector traces a circle in the xy-plane as the wave propagates. The condition for circular polarization is:
The sign of determines the handedness. In the optics convention (looking into the oncoming beam), gives right-hand circular polarization (RCP) — the electric field vector rotates clockwise — and gives left-hand circular polarization (LCP) — the electric field vector rotates counter-clockwise. Note that the physics convention (looking along the direction of propagation) reverses the handedness labels. The optics convention is used throughout this guide [1, 5].
Circular polarization is produced by passing linearly polarized light through a quarter-wave plate with the linear polarization oriented at 45° to the fast axis. It is used in optical isolators, 3D cinema systems, satellite communications, and anywhere rotation-invariant polarization is needed.
2.3Elliptical Polarization
The most general polarization state is elliptical: the tip of the electric field vector traces an ellipse in the xy-plane. Linear and circular polarization are special cases of elliptical polarization. For arbitrary , , and , the electric field components satisfy the polarization ellipse equation:
The ellipse is characterized by two angles. The orientation angle gives the tilt of the major axis relative to the x-axis:
The ellipticity angle describes the shape of the ellipse, ranging from (linear) to (circular):
In practice, elliptical polarization arises whenever light passes through a birefringent element that introduces a phase retardation other than 0, , or , or when linearly polarized light reflects from a metallic surface (which introduces different phase shifts for s- and p-components) [1, 5].
2.4Unpolarized and Partially Polarized Light
Natural light sources (incandescent lamps, LEDs, the sun) emit unpolarized light: the polarization state changes randomly on timescales much shorter than any practical measurement interval. The time-averaged electric field has no preferred orientation. Unpolarized light cannot be described by a single Jones vector (which assumes complete coherence and full polarization) but is naturally represented by the Stokes formalism.
Partially polarized light is a mixture of fully polarized and unpolarized components. The degree of polarization (DOP) quantifies the fraction of the total intensity that is polarized:
DOP ranges from 0 (completely unpolarized) to 1 (fully polarized). For example, skylight is partially polarized (DOP typically 0.3–0.7, depending on the scattering angle relative to the sun), while a well-collimated laser beam is typically fully polarized (DOP ≈ 1). Reflection from a dielectric surface at Brewster's angle produces fully polarized light (DOP = 1) in the reflected beam [1, 6].
▸3Jones Calculus
3.1Jones Vectors
The Jones vector provides a compact, complex-valued representation of the polarization state of a fully polarized, monochromatic plane wave. It is a 2×1 column vector containing the complex amplitudes of the x- and y-components of the electric field:
Two Jones vectors represent the same polarization state if they differ only by a common phase factor or a real scaling factor. The normalized Jones vectors for the most common polarization states are listed below.
| Polarization State | Jones Vector |
|---|---|
| Horizontal (x) | \begin{pmatrix} 1 \\ 0 \end{pmatrix} |
| Vertical (y) | \begin{pmatrix} 0 \\ 1 \end{pmatrix} |
| Linear at +45° | \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} |
| Linear at −45° | \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix} |
| Right-hand circular (RCP) | \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix} |
| Left-hand circular (LCP) | \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix} |
3.2Jones Matrices
Each polarization-modifying optical element (polarizer, waveplate, rotator) is represented by a 2×2 complex matrix called the Jones matrix. The output Jones vector is obtained by multiplying the input Jones vector by the element's Jones matrix:
The Jones matrix for a horizontal polarizer (transmits x-component, blocks y-component):
A linear polarizer with its transmission axis at angle to the x-axis:
A quarter-wave plate (QWP) with its fast axis along x:
A half-wave plate (HWP) with its fast axis along x:
A general waveplate with retardance and fast axis along x:
To model an element whose principal axis is rotated by angle from the x-axis, the Jones matrix is rotated using the rotation matrix :
3.3Cascading Elements
When light passes through a sequence of optical elements, the overall Jones matrix is the product of the individual Jones matrices in reverse order (the first element encountered is on the right). For N elements:
Consider horizontally polarized light passing first through a linear polarizer at 45°, then through a quarter-wave plate with its fast axis horizontal. What is the output polarization state?
Ignoring the global phase factor, the output is proportional to , which is right-hand circularly polarized light. The 45° polarizer projects the input onto the +45° linear state, and the QWP converts that linear state to circular polarization.
3.4Limitations
The Jones calculus applies only to fully polarized, coherent, monochromatic light. It cannot represent unpolarized or partially polarized light, because the Jones vector inherently describes a definite polarization state. For partially polarized light, depolarizing elements, or incoherent superposition of beams, the Stokes-Mueller formalism (Section 4) is required. Additionally, the Jones calculus does not account for absolute intensity — it tracks the relative complex amplitudes, not the absolute power. For systems involving both coherent and incoherent effects, a hybrid approach using coherency matrices or the Mueller calculus is needed [5, 6].
▸4Stokes & Mueller Calculus
4.1Stokes Vector
The Stokes vector is a 4×1 real-valued vector that completely describes the polarization state of a light beam, including partially polarized and unpolarized light. Unlike the Jones vector, it is defined in terms of measurable intensities rather than complex amplitudes:
Here is the total intensity, is the preference for horizontal over vertical polarization, is the preference for +45° over −45° polarization, and is the preference for right-circular over left-circular polarization. Each Stokes parameter can be determined by measuring the transmitted intensity through appropriate polarizers and waveplates.
For a fully polarized beam described by field amplitudes , , and phase difference , the Stokes parameters are:
4.2Degree of Polarization
The degree of polarization (DOP), degree of linear polarization (DOLP), and degree of circular polarization (DOCP) are derived from the Stokes parameters:
For fully polarized light, and DOP = 1. For partially polarized light, and 0 < DOP < 1. For completely unpolarized light, and DOP = 0.
A polarimeter measures the Stokes vector of a beam as . Determine the DOP, DOLP, and DOCP.
The beam is partially polarized (DOP = 0.671), with a stronger linear component (DOLP = 0.600) than circular component (DOCP = 0.300). The predominant polarization is horizontal (S₁ = 0.6 > 0), consistent with partial horizontal linear polarization with a small right-circular component (S₃ = 0.3 > 0).
4.3Mueller Matrices
Each optical element in the Mueller formalism is represented by a 4×4 real-valued matrix. The output Stokes vector is obtained by multiplying the input Stokes vector by the Mueller matrix:
Mueller matrix for a horizontal linear polarizer:
Mueller matrix for a quarter-wave plate with fast axis horizontal:
Mueller matrix for a half-wave plate with fast axis horizontal:
Mueller matrix for a linear polarizer at angle :
4.4Jones vs Mueller
The Jones and Mueller formalisms are complementary. The Jones calculus is simpler (2×2 matrices vs 4×4), preserves absolute phase information (important for interference calculations), and is computationally efficient. However, it is limited to fully polarized, coherent light.
The Mueller calculus handles any polarization state including partially polarized and unpolarized light, can represent depolarizing elements (scattering surfaces, integrating spheres, rough surfaces), and works directly with measurable intensities. Every Jones matrix can be converted to an equivalent Mueller matrix, but the reverse is not always possible — Mueller matrices that represent depolarization have no Jones equivalent [5, 6].
In practice, use the Jones calculus for coherent systems (laser optics, interferometers, fiber optics) where all elements are non-depolarizing and the light is fully polarized. Use the Mueller calculus when dealing with partially polarized sources, depolarizing elements, or when the system includes both polarized and unpolarized beams.
4.5Poincaré Sphere
The Poincaré sphere is a geometric representation of all possible polarization states on the surface of a unit sphere. The three Cartesian coordinates of a point on the sphere correspond to the normalized Stokes parameters . Fully polarized states lie on the surface of the sphere, while partially polarized states lie inside the sphere (with unpolarized light at the origin).
On the Poincaré sphere, the equator represents all linear polarization states: horizontal polarization is at (1, 0, 0), vertical at (−1, 0, 0), +45° at (0, 1, 0), and −45° at (0, −1, 0). The north pole (0, 0, 1) represents right-hand circular polarization and the south pole (0, 0, −1) represents left-hand circular polarization. All other points on the surface represent elliptical polarization states.
The Poincaré sphere is particularly useful for visualizing the action of waveplates: a waveplate with retardance and fast axis at angle rotates the polarization state on the sphere by an angle about an axis that passes through the equatorial point at longitude . A quarter-wave plate rotates by 90°, a half-wave plate by 180°, and a full-wave plate by 360° (identity). This geometric interpretation makes it easy to predict the effect of any waveplate on any input polarization state [5, 6].
▸5Polarization by Reflection
5.1Fresnel Equations
When light strikes the interface between two media with refractive indices and , the reflected and transmitted amplitudes depend on the polarization state. The two independent linear polarization states used in reflection analysis are s-polarization (electric field perpendicular to the plane of incidence, from the German “senkrecht”) and p-polarization (electric field parallel to the plane of incidence). The Fresnel amplitude reflection and transmission coefficients are:
Here is the angle of incidence and is the angle of refraction, related by Snell's law: . The power reflectance and transmittance are:
Note that the factor in the transmittance accounts for the change in beam cross-section and wave impedance between the two media. Energy conservation requires for each polarization (in a lossless medium) [1, 2, 4].
5.2Brewster's Angle
Brewster's angle is the angle of incidence at which the p-polarized reflection coefficient vanishes (). At this angle, the reflected beam is entirely s-polarized. Brewster's angle satisfies:
At Brewster's angle, the reflected and refracted rays are perpendicular to each other (). This is exploited in Brewster windows (used in gas lasers to introduce a polarization-selective loss that forces the laser to oscillate in the p-polarization), and in pile-of-plates polarizers (multiple glass plates at Brewster's angle, each reflection removing more s-polarized light) [1, 2].
Calculate Brewster's angle for light incident from air onto a BK7 glass surface at 550 nm, where the refractive index of BK7 is approximately 1.518.
At 56.6° angle of incidence, the p-polarized reflectance is zero. The reflected beam is purely s-polarized, and the refracted angle is .
5.3TIR and Phase Shifts
When light travels from a denser medium to a less dense medium () and the angle of incidence exceeds the critical angle , total internal reflection (TIR) occurs. All incident power is reflected, but the s- and p-components acquire different phase shifts. The phase shifts upon TIR are:
The difference between the p- and s-phase shifts is a function of the angle of incidence and the refractive index ratio. By choosing the angle of incidence appropriately, a specific retardance can be introduced between the s- and p-components. This is the operating principle of the Fresnel rhomb: two TIR reflections at a carefully chosen angle produce a total phase difference of (quarter-wave retardation), converting linearly polarized input to circular polarization. The Fresnel rhomb has the advantage of being achromatic — the retardance varies only slowly with wavelength because it depends on geometry rather than material dispersion [1, 4].
5.4Metallic Surfaces
Metallic surfaces have a complex refractive index , where is the extinction coefficient. The Fresnel equations remain valid with replaced by , but the reflection coefficients become complex even at normal incidence. Both s- and p-polarized light acquire different phase shifts upon reflection, so linearly polarized light reflected from a metal surface generally becomes elliptically polarized.
Metals do not exhibit a true Brewster's angle (since never reaches exactly zero), but they have a pseudo-Brewster angle where the p-reflectance reaches a minimum. The pseudo-Brewster angle and the associated reflectance minimum depend on the optical constants of the metal. For highly reflective metals like silver and aluminum, the p-reflectance minimum is shallow (a few percent dip), while for less reflective metals the minimum is more pronounced. Ellipsometry exploits these polarization-dependent reflection properties to determine the optical constants and of thin films and surfaces [1, 4].
5.5Fresnel Reflectance at 45°
Calculate the s- and p-polarized power reflectance for light at 550 nm incident at 45° on an uncoated BK7 surface (n = 1.518).
At 45°, s-polarized light reflects about 10 times more strongly than p-polarized light. This large polarization-dependent reflectance is why uncoated glass plates at 45° act as partial polarizers, and why anti-reflection coatings must be optimized for both polarizations at non-normal incidence.
▸6Polarizers
6.1Dichroic Polarizers
Dichroic polarizers selectively absorb one polarization component while transmitting the orthogonal component. The most common example is Polaroid film, which uses aligned long-chain polymer molecules (polyvinyl alcohol doped with iodine) that strongly absorb light with its electric field parallel to the polymer chains. Modern dichroic sheet polarizers achieve extinction ratios of 10³ to 10⁴ across the visible spectrum and are available in large formats at low cost [1, 8].
Dichroic polarizers have several practical advantages: they are thin, lightweight, available in large apertures, and work over a wide acceptance angle. Their main limitations are absorption of the rejected polarization (which limits power handling and causes heating), moderate extinction ratio compared to crystalline polarizers, and limited UV and IR performance. They are widely used in displays, photography (polarizing filters), optical instruments, and educational laboratories.
6.2Birefringent Polarizers
Birefringent (or crystal) polarizers exploit the different refractive indices experienced by the ordinary and extraordinary rays in an anisotropic crystal to spatially separate the two polarization components. They achieve very high extinction ratios (10⁵ to 10⁶) and handle high optical powers because the rejected polarization is deflected rather than absorbed.
Glan-Thompson polarizer: Two calcite prisms cemented together with the optic axes parallel. The ordinary ray is totally internally reflected at the cement layer (which has a refractive index between the ordinary and extraordinary indices), while the extraordinary ray is transmitted. Provides the highest extinction ratio (> 10⁵) and largest acceptance angle (up to ±15°) among crystal polarizers, but the cement layer limits the damage threshold and UV transmission.
Glan-Taylor polarizer: Similar geometry to the Glan-Thompson but with an air gap between the prisms instead of cement. The air gap provides much higher damage threshold and UV transmission (down to about 214 nm with calcite), but the acceptance angle is smaller (±5° to ±8°) and the extinction ratio is somewhat lower (10⁴ to 10⁵).
Wollaston prism: Two calcite wedges with perpendicular optic axes cemented together. Both the ordinary and extraordinary rays exit the prism, symmetrically deviated about the input beam axis. This is useful when both polarization components are needed (e.g., in polarimeters or differential measurements). The deviation angle depends on the wedge angle and is typically 15° to 45°.
Rochon prism: Similar to the Wollaston but with a different optic axis arrangement. The ordinary ray passes through undeviated while the extraordinary ray is deviated. This provides a convenient undeviated reference beam [1, 5, 6].
6.3Wire-Grid Polarizers
Wire-grid polarizers consist of a parallel array of thin metallic wires (or lithographically patterned metal strips) on a transparent substrate. The wire spacing must be much smaller than the wavelength. Light polarized parallel to the wires drives conduction currents along the wires, causing reflection (and some absorption). Light polarized perpendicular to the wires cannot drive currents efficiently and is transmitted.
Historically, wire-grid polarizers were limited to the infrared (where wire spacings of a few micrometers suffice), but advances in nanolithography have enabled wire-grid polarizers for the visible and near-UV, with wire pitches below 150 nm. Modern visible wire-grid polarizers achieve extinction ratios of 10³ to 10⁴ with high transmission (80–90%) and work over wide angles of incidence. They are used in projection displays, imaging polarimeters, and applications requiring polarization components on flat substrates [5, 8].
6.4Thin-Film Polarizers
Thin-film polarizers use multilayer dielectric coatings designed to reflect one polarization and transmit the other at a specific angle of incidence (typically 45° or near Brewster's angle for the coating materials). They are commonly implemented as polarizing beam splitter (PBS) cubes and thin-film plate polarizers.
PBS cubes: A dielectric multilayer coating is deposited on the hypotenuse of one right-angle prism, and a second prism is cemented to it. P-polarized light is transmitted through the cube, while s-polarized light is reflected at 90°. PBS cubes provide both output beams in convenient, well-defined directions and are widely used in laser systems, interferometers, and optical instruments. Extinction ratios of 10³ for the reflected beam and 10² to 10³ for the transmitted beam are typical, though high-performance designs can achieve 10⁴ or better in transmission [5, 8].
Plate polarizers: A dielectric multilayer on a flat glass substrate at Brewster's angle (or a design angle near 56°). These are thinner, lighter, and handle higher power than PBS cubes (no cement layer), but produce only one clean output beam (the reflected s-polarized beam exits at an angle that depends on the plate orientation).
6.5Malus's Law
When linearly polarized light with intensity passes through a linear polarizer whose transmission axis makes an angle with the polarization direction, the transmitted intensity is given by Malus's law:
At , all the light is transmitted; at , no light is transmitted (for an ideal polarizer). Malus's law is a direct consequence of the projection of the input electric field onto the polarizer's transmission axis. For unpolarized light incident on an ideal polarizer, the transmitted intensity is (averaging over all angles gives 1/2) [1, 2].
Two crossed polarizers (transmission axes at 0° and 90°) transmit no light. What happens if a third polarizer is inserted between them at 45°?
The output is linearly polarized at 0°.
The output is now linearly polarized at 45°.
Inserting the intermediate polarizer increases the transmitted intensity from zero to . The 45° polarizer projects the horizontal polarization onto its axis, creating a component that can then pass through the final vertical polarizer. This demonstrates that a polarizer is not merely a filter — it redefines the polarization state of the transmitted light.
6.6Extinction Ratio
The extinction ratio (ER) of a polarizer quantifies the contrast between the transmitted intensities for the two orthogonal polarization states. It is defined as the ratio of maximum to minimum transmission:
Extinction ratio is often expressed in decibels: . A higher extinction ratio means better polarization purity. The required extinction ratio depends on the application: visual displays may tolerate 10³ (30 dB), while precision polarimetry may require 10⁵ (50 dB) or better [5, 6].
| Polarizer Type | Extinction Ratio | Transmission (%) | Acceptance Angle | Damage Threshold | Spectral Range |
|---|---|---|---|---|---|
| Dichroic sheet | 10³–10⁴ | 40–85 | Wide (±60°) | Low (< 1 W/cm²) | Visible |
| Glan-Thompson | > 10⁵ | 90–95 | ±15° | Moderate (cement limited) | 350–2300 nm |
| Glan-Taylor | 10⁴–10⁵ | 85–90 | ±5–±8° | High (air-spaced) | 214–2300 nm |
| Wollaston prism | 10⁵ | 90–95 | ±10° | Moderate | 350–2300 nm |
| Wire-grid (visible) | 10³–10⁴ | 80–90 | Wide (±50°) | Moderate | 400–2000 nm |
| Thin-film PBS cube | 10²–10⁴ | 90–95 | ±2–±5° | Moderate (cement limited) | Narrowband |
| Thin-film plate | 10³–10⁴ | 85–95 | ±2° | High | Narrowband |
▸7Waveplates and Retarders
7.1Birefringence
A waveplate (also called a retarder or retardation plate) is a flat optical element made from a birefringent material. Birefringent materials have two principal refractive indices: the ordinary index and the extraordinary index . The axis with the lower refractive index is called the fast axis (light polarized along it travels faster), and the orthogonal axis is the slow axis.
When linearly polarized light enters a waveplate, the component along the fast axis and the component along the slow axis travel at different speeds. After traversing a thickness , the two components accumulate a phase difference (retardance) :
The retardance is typically specified in fractions of a wavelength: (quarter-wave), (half-wave), or (full-wave). The corresponding phase differences are , , and , respectively [1, 2, 8].
7.2Quarter-Wave Plates
A quarter-wave plate (QWP) introduces a retardance of (one quarter of a wavelength). Its key functions are converting linearly polarized light to circularly polarized light (when the input polarization is at 45° to the fast axis) and converting circularly polarized light to linearly polarized light.
When linearly polarized light at an arbitrary angle to the fast axis enters a QWP, the output is generally elliptically polarized. Circular output occurs only when . For , the output is right-hand circular; for , the output is left-hand circular (using the optics convention). Quarter-wave plates are essential components in optical isolators, circular polarimetry, and ellipsometry [1, 8, 9].
7.3Half-Wave Plates
A half-wave plate (HWP) introduces a retardance of (one half of a wavelength). Its primary function is rotating the orientation of linearly polarized light. If linearly polarized light enters the HWP at angle to the fast axis, the output is linearly polarized at angle to the fast axis — effectively a rotation by :
where is the fast-axis angle and is the input polarization angle, both measured from the same reference. The HWP is the most common component for continuously adjustable polarization rotation: rotating the HWP by angle rotates the output polarization by . It is also used to swap the handedness of circularly polarized light (RCP becomes LCP and vice versa) [1, 8, 9].
7.4Full-Wave Plates
A full-wave plate introduces a retardance of (one full wavelength) at the design wavelength. At this wavelength, the full-wave plate has no effect on the polarization state — it is the identity element. However, at other wavelengths, the retardance differs from , and the plate acts as a waveplate with a wavelength-dependent retardance.
Full-wave plates are used as tunable color filters in polarization microscopy (the technique is called “polarization color imaging” or “first-order red plate”). When placed between crossed polarizers, a full-wave plate transmits white light minus the design wavelength, producing a characteristic magenta color. Birefringent specimens then shift the effective retardance, changing the transmitted color and enabling qualitative assessment of birefringence magnitude and sign [1].
7.5Multiple-Order vs Zero-Order vs True Zero-Order
The retardance formula allows multiple solutions for the thickness that gives a quarter-wave or half-wave retardance. These solutions differ by integer multiples of .
Multiple-order waveplates use a single thick plate where the retardance is or for integer . They are inexpensive and easy to manufacture but highly sensitive to wavelength, temperature, and angle of incidence because the fractional change in retardance depends on the total retardance, which is large.
Compound zero-order waveplates consist of two multiple-order plates with their fast axes crossed. The retardance is the difference between the two plates' retardances, which can be made small (quarter-wave or half-wave) even though each individual plate has many waves of retardance. Temperature and wavelength sensitivities largely cancel because both plates are affected equally.
True zero-order waveplates use a single thin plate with thickness or . For quartz ( at 550 nm), the true zero-order QWP thickness is approximately 15 μm, which is too thin to handle as a free-standing plate. True zero-order waveplates are therefore made by depositing birefringent polymer films on a glass substrate. They offer the best wavelength, temperature, and angular performance [8, 9].
Calculate the thickness of a true zero-order quartz quarter-wave plate for 633 nm. The birefringence of quartz is at 633 nm.
A true zero-order quartz QWP at 633 nm is only 17.2 μm thick — too thin to polish as a free-standing element. In practice, quartz waveplates at this thickness are either cemented to a glass substrate, or a compound zero-order design is used instead (two thicker plates with their fast axes crossed, with a thickness difference of 17.2 μm).
7.6Achromatic and Super-Achromatic Waveplates
Standard waveplates provide the design retardance at only one wavelength. The retardance varies inversely with wavelength ( for a fixed thickness), plus the birefringence itself is wavelength-dependent. For broadband applications, achromatic and super-achromatic designs are available.
Achromatic waveplates use two or three plates of different birefringent materials (commonly quartz and magnesium fluoride) with their fast axes at specific angles. The material dispersion of one material partially compensates the other, flattening the retardance vs. wavelength curve. Achromatic QWPs typically maintain λ/4 ± 2% over a 200–300 nm bandwidth.
Super-achromatic waveplates (also called Pancharatnam designs) use three or more birefringent plates at precisely calculated orientations. They achieve nearly constant retardance over very broad wavelength ranges (e.g., 325–1100 nm for a quartz/MgF₂ design). The design principle exploits the geometric (Pancharatnam) phase on the Poincaré sphere [5, 8, 9].
| Waveplate Type | Bandwidth | Temperature Sensitivity | Angular Sensitivity | Typical Cost |
|---|---|---|---|---|
| Multiple-order | ± 1–2 nm | High | High | Low |
| Compound zero-order | ± 10–20 nm | Low | Low | Moderate |
| True zero-order (polymer) | ± 30–50 nm | Very low | Very low | Moderate |
| Achromatic (2-material) | 200–300 nm | Low | Low | High |
| Super-achromatic | 400–1000+ nm | Very low | Low | Very high |
| Fresnel rhomb | Inherently broadband | Very low | N/A (geometry) | Moderate–high |
7.7Fresnel Rhomb Retarders
The Fresnel rhomb is a solid glass prism (typically a parallelogram or rhombus shape) that produces retardation by exploiting the differential phase shift between s- and p-polarized light at total internal reflection. Two TIR events at a carefully chosen internal angle (typically near 54.6° for BK7 glass) produce a total retardance of .
The key advantage of the Fresnel rhomb is its achromatic performance: the retardance depends on the TIR phase shifts, which vary only slowly with wavelength. A well-designed Fresnel rhomb maintains quarter-wave retardance within a few percent over the entire transparent range of the glass substrate (e.g., 350–2000 nm for BK7). The main disadvantages are the beam displacement introduced by the rhomb geometry, the sensitivity to the angle of incidence (which affects the internal reflection angle), and the larger physical size compared to a thin waveplate. Double Fresnel rhombs (two rhombs in series) can produce half-wave retardation with achromatic performance [1, 4].
▸8Optical Activity and Faraday Rotation
8.1Natural Optical Activity
Optical activity (also called optical rotation) is the property of certain materials to rotate the plane of polarization of linearly polarized light as it propagates through the material. Optically active materials include crystalline quartz (along the optic axis), sugar solutions, amino acids, and many organic molecules. The rotation angle is proportional to the path length :
where is the rotary power (degrees per unit length) of the material. For crystalline quartz at 589.3 nm, . Optically active materials exist in two enantiomeric forms: dextrorotatory (d- or +, rotates polarization clockwise when viewed facing the source) and levorotatory (l- or −, rotates counter-clockwise) [1, 4].
Optical activity arises from circular birefringence: the material has different refractive indices for right-hand and left-hand circularly polarized light. Linearly polarized light, which can be decomposed into equal RCP and LCP components, accumulates a differential phase between these components as it propagates, resulting in a net rotation of the linear polarization direction.
8.2Specific Rotation
For optically active solutions, the rotation is proportional to both the path length and the concentration. Biot's law gives the rotation as:
where is the specific rotation (degrees · mL / (g · dm)) at wavelength and temperature , is the concentration (g/mL), and is the path length (dm). The specific rotation is a material property that depends on wavelength (approximately as , known as optical rotatory dispersion) and temperature. Saccharimetry (measuring sugar concentration by optical rotation) is one of the oldest applications of polarimetry in chemistry and food science [1, 4].
Calculate the rotation angle for linearly polarized light at 589.3 nm (sodium D-line) passing through a 10 mm thick quartz plate oriented along the optic axis. The rotary power of quartz at this wavelength is 21.7°/mm.
The polarization is rotated by 217°, which is equivalent to 217° − 180° = 37° beyond a half-turn. This large rotation illustrates why quartz optical rotators are practical in compact packages — significant rotation angles are achieved with short path lengths.
8.3Faraday Effect
The Faraday effect is the rotation of the polarization plane of linearly polarized light when it propagates through a material in the presence of a magnetic field parallel to the propagation direction. The Faraday rotation angle is:
where is the Verdet constant of the material (rad/(T·m) or min/(Oe·cm)), is the magnetic flux density (T), and is the path length through the material (m). The Verdet constant is material- and wavelength-dependent: it is large for heavy glasses (SF-57: rad/(T·m) at 633 nm), terbium gallium garnet (TGG: rad/(T·m) at 633 nm), and terbium-doped glasses [1, 3].
The crucial distinction between Faraday rotation and natural optical activity is that Faraday rotation is non-reciprocal: the rotation direction is determined by the magnetic field direction, not the light propagation direction. A beam that traverses the material forward and then backward accumulates double the rotation (rather than canceling). This non-reciprocal property is the basis of optical isolators, which use a Faraday rotator combined with polarizers to allow light to pass in one direction while blocking light traveling in the opposite direction. Optical isolators are essential for protecting lasers from back-reflections [3, 5].
8.4Connection to Magnetism
The Faraday effect provides a direct link between polarization optics and magnetism. The Verdet constant is proportional to the dispersion of the refractive index (), which means materials with strong dispersion in the visible (heavy flint glasses, rare-earth doped crystals) have the largest Verdet constants and are the most efficient Faraday rotators.
Magneto-optic effects beyond simple Faraday rotation include the magneto-optic Kerr effect (MOKE, polarization change upon reflection from a magnetized surface), used extensively in magnetic domain imaging and magneto-optic data storage readout; and the Cotton-Mouton effect (magnetic linear birefringence), which is typically much weaker than the Faraday effect. For system-level design involving magneto-optic components, including Faraday rotator specifications, magnetic circuit design, and isolator performance prediction, see the 🔧 Magneto-Optic Design Calculator →.
▸9Practical Considerations
9.1Wavelength Dependence
Nearly all polarization components exhibit wavelength dependence. Waveplate retardance scales approximately as (plus the material dispersion of birefringence), so a quarter-wave plate at 633 nm provides more retardance at shorter wavelengths and less at longer wavelengths. Dichroic polarizers have wavelength-dependent extinction ratios and transmission. Thin-film polarizers (PBS cubes, plate polarizers) are designed for specific wavelength bands and can perform poorly outside their design range.
When operating at a wavelength different from the design wavelength, the effective retardance of a waveplate changes. For a zero-order waveplate designed at , the retardance at wavelength is approximately (neglecting the dispersion of birefringence). For a multiple-order waveplate, the deviation is amplified by the order number , making the wavelength sensitivity much worse [8, 9].
9.2Temperature Sensitivity
Temperature affects waveplate retardance through two mechanisms: thermal expansion (changes in thickness ) and the temperature dependence of birefringence (). For quartz, the birefringence temperature coefficient is approximately /°C, which is small. However, for a multiple-order waveplate with total retardance of, say, 30 waves, even this small coefficient produces a significant change in the fractional retardance.
True zero-order and compound zero-order waveplates minimize temperature sensitivity because the total retardance (and therefore the temperature-induced change) is small. Polymer true zero-order waveplates typically achieve retardance stability of over a 50°C temperature range. For cryogenic or high-temperature applications, specify waveplates tested and characterized at the operating temperature [8, 9].
9.3Angle of Incidence Effects
Waveplates are designed for normal incidence. At non-normal incidence, the effective path length through the birefringent material increases, and the projections of the ordinary and extraordinary refractive indices change. The result is an angle-dependent retardance shift. For a zero-order waveplate, the retardance change is approximately proportional to (for small angles ), and is typically a few percent for ±5° tilt.
Crystal polarizers (Glan-Thompson, Glan-Taylor) have specified acceptance half-angles beyond which the polarization purity degrades. Exceeding the acceptance angle causes the rejected ray to leak into the transmitted beam, reducing the extinction ratio. PBS cubes also have limited angular acceptance: the coating design is optimized for a specific angle of incidence (45° inside the cube), and deviations reduce the extinction ratio [5, 8].
9.4Damage Threshold
Polarization components can be damaged by high laser powers or fluences. The damage mechanism depends on the component type: dichroic polarizers fail by thermal damage to the absorbing dye (low LIDT, typically < 1 W/cm² CW); cemented crystal polarizers and PBS cubes are limited by the cement layer (typically 1–5 J/cm² at 10 ns, or 10–100 W/cm² CW); air-spaced crystal polarizers (Glan-Taylor) are limited by the crystal surface damage threshold (typically 5–20 J/cm² at 10 ns); and waveplates are limited by coating damage (AR coatings on the surfaces) or the substrate material.
For high-power applications, use air-spaced Glan-Taylor or Glan-laser polarizers, plate-type thin-film polarizers (no cement), and appropriately coated waveplates. Always verify the LIDT specification against the actual beam parameters (peak fluence, not average power) at the component location [5, 8].
9.5Common Specification Parameters
| Material | Birefringence (nₑ − nₒ) | Transparency Range (nm) | Damage Threshold | Common Use |
|---|---|---|---|---|
| Crystalline quartz | 0.0091 (at 589 nm) | 200–2700 | High | Waveplates, rotators |
| Calcite | 0.172 (at 589 nm) | 350–2300 | Moderate | Polarizers (Glan types) |
| Magnesium fluoride | 0.012 (at 589 nm) | 120–7000 | High | UV/IR achromatic waveplates |
| α-BBO | 0.116 (at 589 nm) | 189–3500 | High | UV polarizers, high-power |
| YVO₄ | 0.204 (at 633 nm) | 400–5000 | High | Compact waveplates |
| Polymer (stretched) | 0.001–0.005 | 400–800 | Low | True zero-order waveplates |
| Mica | 0.005 (at 589 nm) | 350–6000 | Moderate | Low-cost QWPs |
| LiNbO₃ | 0.086 (at 633 nm) | 350–5200 | Moderate | Electro-optic retarders |
▸10Selection Workflow
10.1Identifying the Need
The first step in selecting polarization components is identifying which of three fundamental tasks is required: generating a specific polarization state from unpolarized or arbitrarily polarized light, analyzing an unknown polarization state (measuring its Stokes parameters or degree of polarization), or manipulating a known polarization state (rotating, converting between linear and circular, compensating unwanted polarization effects).
Generating polarized light: Start with a polarizer to produce linear polarization, then add waveplates as needed to produce circular or elliptical states. The polarizer type is selected based on the required extinction ratio, wavelength, aperture, power level, and cost.
Analyzing polarization: A polarimeter requires at minimum a rotating polarizer (for linear Stokes parameters) and a quarter-wave plate (for the circular Stokes parameter). Division-of-amplitude or division-of-time architectures are used depending on speed and accuracy requirements.
Manipulating polarization: Half-wave plates rotate linear polarization; quarter-wave plates convert between linear and circular; Faraday rotators provide non-reciprocal rotation for isolation. The waveplate type (multiple-order, zero-order, achromatic) is selected based on the wavelength range, temperature stability, and angular sensitivity requirements [5, 6].
10.2Selecting Polarizer Type
Step 1 — Determine the extinction ratio requirement. For visual applications and displays, 10³ (30 dB) is usually sufficient; a dichroic sheet polarizer is the most cost-effective choice. For scientific measurements, 10⁴ to 10⁵ is needed; a Glan-Taylor, Glan-Thompson, or high-quality PBS cube is appropriate. For precision polarimetry, 10⁵ to 10⁶ requires a Glan-Thompson or calcite Wollaston prism.
Step 2 — Determine the operating wavelength. For UV below 350 nm, use α-BBO or air-spaced calcite polarizers. For the visible (400–700 nm), all polarizer types are available. For the near-IR (700–2000 nm), calcite, wire-grid, or thin-film polarizers are used. For the mid- and far-IR, wire-grid polarizers on IR-transparent substrates are standard.
Step 3 — Assess the power and damage requirements. For high CW power or high peak fluence, avoid cemented components and dichroic sheet polarizers. Use air-spaced Glan-Taylor polarizers, plate-type thin-film polarizers, or wire-grid polarizers on damage-resistant substrates.
Step 4 — Consider the form factor. Sheet polarizers are thin and available in large sizes. Crystal polarizers are compact but limited to small apertures (typically 5–25 mm). PBS cubes are convenient for beam steering applications (providing two orthogonal output beams). Plate polarizers require operation at near-Brewster angles and produce a displaced beam.
10.3Selecting Waveplate Type
Step 1 — Determine the required retardance. Quarter-wave for linear-to-circular conversion; half-wave for polarization rotation; variable retarders (liquid crystal, electro-optic, photoelastic modulators) for dynamic retardance control.
Step 2 — Determine the wavelength range. For single-wavelength operation (laser line), a zero-order or compound zero-order waveplate is most cost-effective. For broadband operation, achromatic or super-achromatic waveplates, or a Fresnel rhomb, are required. For tunability, a Soleil-Babinet compensator or liquid crystal variable retarder provides continuously adjustable retardance at any wavelength.
Step 3 — Assess environmental requirements. Temperature stability favors true zero-order or compound zero-order designs over multiple-order. For cryogenic or high-temperature operation, verify the waveplate performance at the operating temperature. For high power, verify the LIDT of the AR coatings on the waveplate surfaces.
Step 4 — Consider cost and availability. Multiple-order waveplates are the least expensive but worst performing over wavelength and temperature. True zero-order polymer waveplates offer good performance at moderate cost. Achromatic and super-achromatic waveplates are the most expensive but essential for broadband systems. Fresnel rhombs are moderately expensive, inherently broadband, but bulky [8, 9].
10.4System-Level Management
In a complete optical system, managing polarization requires attention to every optical surface and element. Each uncoated surface at non-normal incidence acts as a partial polarizer (different reflectivities for s and p). Each mirror reflection can introduce a phase shift between s and p, changing the polarization state. Stress birefringence in windows, lenses, and prisms introduces unwanted retardance.
Best practices for system-level polarization management include: designing the optical path to minimize the number of reflections at non-normal incidence; using polarization-maintaining coatings (anti-reflection and mirror coatings optimized for equal s and p performance); orienting birefringent elements with their axes aligned to the system's principal polarization directions; placing polarizers and waveplates as close to the source as practical (before the beam encounters polarization-altering surfaces); and performing a full polarization ray trace (using the Jones or Mueller calculus through the entire system) to predict the polarization state at each point in the system.
For imaging systems, field-dependent polarization effects (variation of the polarization state across the field of view) can limit contrast and resolution. These effects arise from the angle-dependent Fresnel coefficients at each surface and are particularly important in high-numerical-aperture systems such as lithographic projection lenses and microscope objectives [5, 6].
References
- [1]E. Hecht, Optics, 5th ed. Pearson, 2017.
- [2]F. L. Pedrotti, L. M. Pedrotti, and L. S. Pedrotti, Introduction to Optics, 3rd ed. Cambridge University Press, 2017.
- [3]B. E. A. Saleh and M. C. Teich, Fundamentals of Photonics, 3rd ed. Wiley, 2019.
- [4]M. Born and E. Wolf, Principles of Optics, 7th ed. Cambridge University Press, 2019.
- [5]R. A. Chipman, W. S. T. Lam, and G. Young, Polarized Light and Optical Systems. CRC Press, 2018.
- [6]E. Collett, Field Guide to Polarization, SPIE Press, 2005.
- [7]SCHOTT, Optical Glass Data Sheets, 2024.
- [8]Edmund Optics, “Understanding Waveplates,” Technical Application Note.
- [9]Newport/MKS, “Introduction to Waveplates,” Application Note.
- [10]Thorlabs, “Polarization Optics Tutorial,” Web Resource.