14. From Kinetic to MHD¶

Learning Objectives¶

Understand the systematic reduction from 6D kinetic theory to 3D single-fluid MHD
Derive single-fluid MHD equations from two-fluid theory by combining species
Identify the validity conditions and limitations of MHD approximations
Explain the CGL (Chew-Goldberger-Low) double adiabatic model for collisionless plasmas
Understand drift-kinetic and gyrokinetic theories as intermediate reductions
Compare different plasma models and know when to apply each

1. The Hierarchy of Plasma Models¶

1.1 Overview: From Full Kinetic to MHD¶

Plasma physics has a rich hierarchy of models, each with different levels of approximation and computational cost:

Full Kinetic (Vlasov-Maxwell)
    ↓  [average over gyration]
Drift-Kinetic (5D)
    ↓  [average over bounce motion / perturbative expansion]
Gyrokinetic (5D, with FLR)
    ↓  [take moments]
Two-Fluid (3D × 2 species)
    ↓  [combine species]
Extended MHD (Hall, FLR, etc.)
    ↓  [drop small terms]
Single-Fluid MHD (3D)
    ↓  [equilibrium, linearize]
MHD Waves, Instabilities

Each step down the hierarchy: - Reduces dimensionality or number of variables - Simplifies the equations - Loses some physics - Increases computational efficiency

The art of plasma physics is choosing the right model for the problem at hand.

1.2 What Does Each Model Capture?¶

Model	Dimensions	Captures	Misses
Vlasov-Maxwell	6D (r,v,t)	Everything: wave-particle, anisotropy, kinetic instabilities	Computationally prohibitive
Drift-Kinetic	5D (R,v∥,μ,t)	Parallel dynamics, trapped particles, collisionless damping	Cyclotron resonance, gyrophase
Gyrokinetic	5D (R,v∥,μ,t)	FLR, turbulence, microinstabilities	Fast magnetosonic, compressibility
Two-Fluid	3D × 2 species	Hall effect, electron pressure, separate species	Kinetic effects (damping, instabilities)
Hall MHD	3D	Whistler, fast reconnection, dispersive waves	Kinetic damping, pressure anisotropy
Resistive MHD	3D	Reconnection, resistive instabilities	Fast processes at small scales
Ideal MHD	3D	Alfvén/magnetosonic waves, gross equilibria	Reconnection, kinetic physics, small scales

1.3 When to Use Which Model?¶

Use Ideal MHD when: - Large-scale equilibria and stability (tokamak, stellar atmosphere) - Low-frequency waves ($\omega \ll \omega_{ci}$) - Collisional plasmas with isotropic pressure - Magnetic Reynolds number $R_m \gg 1$

Use Resistive MHD when: - Magnetic reconnection (solar flares, substorms) - Resistive instabilities (tearing modes) - Current-driven dynamics

Use Hall MHD when: - Scales approaching $d_i$ (magnetopause, reconnection) - Fast reconnection with whistler outflow - Magnetic field generation (dynamo)

Use Two-Fluid when: - Separate electron and ion dynamics are important - Pressure anisotropy within each species - Kinetic effects are secondary

Use Gyrokinetic when: - Tokamak turbulence (ion-temperature-gradient modes, trapped-electron modes) - Microinstabilities with FLR effects - Collisionless plasmas with weak perturbations

Use Full Kinetic when: - Wave-particle resonances are crucial (Landau damping, cyclotron heating) - Strongly non-Maxwellian distributions (beam-plasma, runaway electrons) - Velocity-space instabilities (two-stream, bump-on-tail)

2. From Two-Fluid to Single-Fluid MHD¶

2.1 Defining Single-Fluid Variables¶

Recall the two-fluid equations for species $s$ (electrons $e$, ions $i$):

Continuity: $$\frac{\partial n_s}{\partial t} + \nabla \cdot (n_s \mathbf{u}_s) = 0$$

Momentum: $$m_s n_s \frac{d \mathbf{u}_s}{dt} = q_s n_s (\mathbf{E} + \mathbf{u}_s \times \mathbf{B}) - \nabla p_s + \mathbf{R}_s$$

Energy (adiabatic closure): $$\frac{d}{dt}\left( \frac{p_s}{n_s^\gamma} \right) = 0$$

To derive single-fluid MHD, we define center-of-mass (fluid) variables:

Mass density: $$\rho = m_i n_i + m_e n_e \approx m_i n$$

(using quasi-neutrality $n_i \approx n_e \equiv n$ and $m_i \gg m_e$)

Fluid velocity (center-of-mass velocity): $$\mathbf{v} = \frac{m_i n_i \mathbf{u}_i + m_e n_e \mathbf{u}_e}{\rho} \approx \mathbf{u}_i$$

Total pressure: $$p = p_i + p_e$$

Current density: $$\mathbf{J} = e(n_i \mathbf{u}_i - n_e \mathbf{u}_e) \approx en(\mathbf{u}_i - \mathbf{u}_e)$$

Charge density (quasi-neutrality): $$\rho_c = e(n_i - n_e) \approx 0$$

2.2 Combining Continuity Equations¶

Add the electron and ion continuity equations:

$$\frac{\partial n_e}{\partial t} + \nabla \cdot (n_e \mathbf{u}_e) = 0$$ $$\frac{\partial n_i}{\partial t} + \nabla \cdot (n_i \mathbf{u}_i) = 0$$

Multiply the electron equation by $m_e$ and the ion equation by $m_i$, then add:

$$\frac{\partial \rho}{\partial t} + \nabla \cdot (m_e n_e \mathbf{u}_e + m_i n_i \mathbf{u}_i) = 0$$

Using $\rho \mathbf{v} = m_i n_i \mathbf{u}_i + m_e n_e \mathbf{u}_e \approx m_i n \mathbf{u}_i$:

$$\boxed{\frac{\partial \rho}{\partial t} + \nabla \cdot (\rho \mathbf{v}) = 0}$$

This is the mass continuity equation for single-fluid MHD.

2.3 Combining Momentum Equations¶

Add the electron and ion momentum equations:

$$m_e n_e \frac{d \mathbf{u}_e}{dt} = -e n_e (\mathbf{E} + \mathbf{u}_e \times \mathbf{B}) - \nabla p_e + \mathbf{R}_e$$ $$m_i n_i \frac{d \mathbf{u}_i}{dt} = +e n_i (\mathbf{E} + \mathbf{u}_i \times \mathbf{B}) - \nabla p_i + \mathbf{R}_i$$

The collision terms cancel: $\mathbf{R}_e + \mathbf{R}_i = 0$ (momentum conservation).

The electric field terms cancel (using quasi-neutrality): $$-e n_e \mathbf{E} + e n_i \mathbf{E} = e(n_i - n_e) \mathbf{E} \approx 0$$

The Lorentz force terms give: $$-e n_e \mathbf{u}_e \times \mathbf{B} + e n_i \mathbf{u}_i \times \mathbf{B} = e n (\mathbf{u}_i - \mathbf{u}_e) \times \mathbf{B} = \mathbf{J} \times \mathbf{B}$$

The inertial terms: $$m_e n_e \frac{d \mathbf{u}_e}{dt} + m_i n_i \frac{d \mathbf{u}_i}{dt} \approx m_i n \frac{d \mathbf{u}_i}{dt} = \rho \frac{d \mathbf{v}}{dt}$$

(neglecting the electron inertia term $m_e n_e d\mathbf{u}_e/dt \ll m_i n_i d\mathbf{u}_i/dt$).

Putting it together:

$$\boxed{\rho \frac{d \mathbf{v}}{dt} = \mathbf{J} \times \mathbf{B} - \nabla p}$$

This is the momentum equation for single-fluid MHD.

2.4 Ideal Ohm's Law¶

The key step to ideal MHD is to derive Ohm's law. In Lesson 13, we derived the generalized Ohm's law:

$$\mathbf{E} + \mathbf{v} \times \mathbf{B} = \eta \mathbf{J} + \frac{1}{en} \mathbf{J} \times \mathbf{B} - \frac{1}{en} \nabla p_e + \frac{m_e}{e^2 n^2} \frac{d \mathbf{J}}{dt}$$

In ideal MHD, we make the following approximations:

High conductivity ($\eta \to 0$): neglect resistive term
Large scales ($L \gg d_i$): neglect Hall term
Slow dynamics: neglect electron inertia
Negligible electron pressure gradient (or isotropic electron pressure): neglect pressure term

This gives the ideal Ohm's law:

$$\boxed{\mathbf{E} + \mathbf{v} \times \mathbf{B} = 0}$$

This is the frozen-in condition: the magnetic field is frozen into the fluid and moves with it.

2.5 Faraday's Law and Induction Equation¶

From Maxwell's equations: $$\nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}$$

Substituting the ideal Ohm's law $\mathbf{E} = -\mathbf{v} \times \mathbf{B}$:

$$\nabla \times (-\mathbf{v} \times \mathbf{B}) = -\frac{\partial \mathbf{B}}{\partial t}$$

Using the vector identity $\nabla \times (\mathbf{A} \times \mathbf{B}) = \mathbf{A}(\nabla \cdot \mathbf{B}) - \mathbf{B}(\nabla \cdot \mathbf{A}) + (\mathbf{B} \cdot \nabla)\mathbf{A} - (\mathbf{A} \cdot \nabla)\mathbf{B}$:

$$\nabla \times (\mathbf{v} \times \mathbf{B}) = \mathbf{v}(\nabla \cdot \mathbf{B}) - \mathbf{B}(\nabla \cdot \mathbf{v}) + (\mathbf{B} \cdot \nabla)\mathbf{v} - (\mathbf{v} \cdot \nabla)\mathbf{B}$$

Since $\nabla \cdot \mathbf{B} = 0$:

$$\frac{\partial \mathbf{B}}{\partial t} = \nabla \times (\mathbf{v} \times \mathbf{B}) = (\mathbf{B} \cdot \nabla)\mathbf{v} - \mathbf{B}(\nabla \cdot \mathbf{v}) - (\mathbf{v} \cdot \nabla)\mathbf{B}$$

Rearranging:

$$\boxed{\frac{\partial \mathbf{B}}{\partial t} = \nabla \times (\mathbf{v} \times \mathbf{B})}$$

Or equivalently:

$$\boxed{\frac{d \mathbf{B}}{dt} = (\mathbf{B} \cdot \nabla)\mathbf{v} - \mathbf{B}(\nabla \cdot \mathbf{v})}$$

where $d/dt = \partial/\partial t + \mathbf{v} \cdot \nabla$ is the convective derivative.

This is the induction equation (or magnetic evolution equation). It describes how the magnetic field evolves as the plasma flows.

2.6 Summary: Ideal MHD Equations¶

The ideal MHD equations are:

Mass continuity: $$\frac{\partial \rho}{\partial t} + \nabla \cdot (\rho \mathbf{v}) = 0$$

Momentum: $$\rho \frac{d \mathbf{v}}{dt} = \mathbf{J} \times \mathbf{B} - \nabla p$$

Induction: $$\frac{\partial \mathbf{B}}{\partial t} = \nabla \times (\mathbf{v} \times \mathbf{B})$$

Energy (adiabatic): $$\frac{d}{dt}\left( \frac{p}{\rho^\gamma} \right) = 0$$

Ampère's law (neglecting displacement current): $$\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$$

No magnetic monopoles: $$\nabla \cdot \mathbf{B} = 0$$

These are 8 equations for 8 unknowns: $\rho$, $\mathbf{v}$ (3 components), $p$, $\mathbf{B}$ (3 components), given the constraint $\nabla \cdot \mathbf{B} = 0$.

(The electric field $\mathbf{E}$ is determined by Ohm's law: $\mathbf{E} = -\mathbf{v} \times \mathbf{B}$.)

3. Validity Conditions for MHD¶

3.1 Low Frequency: $\omega \ll \omega_{ci}$¶

MHD is a low-frequency approximation. The time scale of phenomena must be much longer than the ion cyclotron period:

$$\omega \ll \omega_{ci} = \frac{eB}{m_i}$$

This ensures that ions have time to gyrate and respond to the fields in a fluid-like manner, rather than exhibiting individual particle behavior.

Example: For $B = 1$ T, $\omega_{ci} \approx 10^8$ rad/s ($f \approx 16$ MHz). MHD is valid for phenomena slower than ~10 MHz.

3.2 Large Scale: $L \gg \rho_i$¶

The spatial scale must be much larger than the ion gyroradius:

$$L \gg \rho_i = \frac{v_{th,i}}{\omega_{ci}}$$

At scales $\lesssim \rho_i$, finite Larmor radius (FLR) effects become important, and MHD breaks down.

Example: For $T_i = 10$ keV and $B = 1$ T, $\rho_i \approx 0.5$ cm. MHD is valid for scales $\gg 1$ cm.

3.3 Collisional: $\lambda_{mfp} \ll L$¶

For isotropic pressure (as assumed in ideal MHD), collisions must be frequent enough to isotropize the distribution function:

$$\lambda_{mfp} = v_{th} \tau \ll L$$

where $\tau$ is the collision time.

In collisionless plasmas, the pressure tensor is anisotropic ($p_\parallel \neq p_\perp$), requiring a more general closure (e.g., CGL, discussed below).

Example: In the solar wind, $\lambda_{mfp} \sim 1$ AU $\gg L$ for any reasonable structure. Standard MHD is not valid—CGL or kinetic models are needed.

3.4 Non-Relativistic: $v \ll c$¶

Plasma flows and thermal velocities must be non-relativistic:

$$v, v_{th} \ll c$$

This allows us to neglect displacement current in Ampère's law and use non-relativistic momentum equations.

Example: For $T = 10$ keV, $v_{th,e} \approx 0.04c$ (relativistic corrections ~few percent). For higher temperatures, relativistic MHD is needed.

3.5 Quasi-Neutrality: $n_e \approx n_i$¶

The plasma must be quasi-neutral on the scales of interest:

$$L \gg \lambda_D = \sqrt{\frac{\epsilon_0 k_B T}{n e^2}}$$

This allows us to neglect charge separation and drop the displacement current.

Example: For $n = 10^{19}$ m$^{-3}$ and $T = 10$ eV, $\lambda_D \approx 10$ μm. MHD is valid for $L \gg 10$ μm.

3.6 High Magnetic Reynolds Number: $R_m \gg 1$¶

For ideal MHD (frozen-in), the magnetic Reynolds number must be large:

$$R_m = \frac{\mu_0 V L}{\eta} \gg 1$$

where $V$ is a characteristic flow velocity, $L$ is a length scale, and $\eta$ is the resistivity.

When $R_m \sim 1$, resistivity becomes important → resistive MHD.

Example: In a tokamak, $V \sim 100$ m/s, $L \sim 1$ m, $\eta \sim 10^{-8}$ Ω·m → $R_m \sim 10^{10}$. Ideal MHD is excellent.

3.7 Validity Regime Summary¶

Ideal MHD is valid when ALL of the following hold:

1. ω << ω_ci           (low frequency)
2. L >> ρ_i            (large scale)
3. λ_mfp << L          (collisional, for isotropic p)
4. v << c              (non-relativistic)
5. L >> λ_D            (quasi-neutral)
6. R_m >> 1            (frozen-in)

Violations → need extended MHD or kinetic models.

4. CGL (Double Adiabatic) Model¶

4.1 Motivation: Collisionless Magnetized Plasmas¶

In many astrophysical plasmas (solar wind, magnetosphere, galaxy clusters), the collision mean free path is enormous:

$$\lambda_{mfp} \gg L$$

In such plasmas, particles can travel long distances without colliding. The pressure tensor becomes anisotropic:

$$\overleftrightarrow{P} = p_\perp \overleftrightarrow{I} + (p_\parallel - p_\perp) \hat{\mathbf{b}} \hat{\mathbf{b}}$$

where $\hat{\mathbf{b}} = \mathbf{B}/B$ and: - $p_\parallel$: pressure parallel to $\mathbf{B}$ - $p_\perp$: pressure perpendicular to $\mathbf{B}$

Standard MHD assumes $p_\parallel = p_\perp = p$ (isotropic), which is invalid in collisionless plasmas.

4.2 Chew-Goldberger-Low (1956) Model¶

Chew, Goldberger, and Low (CGL) derived a closure for collisionless, strongly magnetized plasmas by assuming conservation of adiabatic invariants:

First adiabatic invariant (magnetic moment): $$\mu = \frac{m v_\perp^2}{2B} = \text{const}$$

This implies: $$\frac{d}{dt}\left( \frac{p_\perp}{n B} \right) = 0$$

Second adiabatic invariant (longitudinal action): $$J = \oint v_\parallel ds = \text{const}$$

For a local fluid element (not bouncing between mirrors), this becomes: $$\frac{d}{dt}\left( \frac{p_\parallel B^2}{n^3} \right) = 0$$

These are the CGL equations (also called double adiabatic equations).

4.3 CGL Closure Relations¶

The CGL equations are:

$$\boxed{\frac{d}{dt}\left( \frac{p_\perp}{nB} \right) = 0}$$

$$\boxed{\frac{d}{dt}\left( \frac{p_\parallel B^2}{n^3} \right) = 0}$$

These can be rewritten as:

$$\frac{1}{p_\perp} \frac{dp_\perp}{dt} = \frac{1}{n} \frac{dn}{dt} + \frac{1}{B} \frac{dB}{dt}$$

$$\frac{1}{p_\parallel} \frac{dp_\parallel}{dt} = 3 \frac{1}{n} \frac{dn}{dt} - 2 \frac{1}{B} \frac{dB}{dt}$$

Physical interpretation:

When the field increases ($dB/dt > 0$), $p_\perp$ increases (betatron heating), but $p_\parallel$ decreases (magnetic mirror effect).
Compression ($dn/dt > 0$) increases both $p_\perp$ and $p_\parallel$.

4.4 CGL Pressure Tensor¶

The momentum equation with the CGL pressure tensor becomes:

$$\rho \frac{d \mathbf{v}}{dt} = \mathbf{J} \times \mathbf{B} - \nabla \cdot \overleftrightarrow{P}$$

where: $$\nabla \cdot \overleftrightarrow{P} = \nabla p_\perp + (p_\parallel - p_\perp) \left[ \frac{\nabla \cdot \mathbf{B}}{B} \hat{\mathbf{b}} + \frac{(\mathbf{B} \cdot \nabla) \mathbf{B}}{B^2} \right]$$

Using $\nabla \cdot \mathbf{B} = 0$ and $(\mathbf{B} \cdot \nabla)\mathbf{B} = B^2 \boldsymbol{\kappa}$ (where $\boldsymbol{\kappa}$ is the curvature):

$$\nabla \cdot \overleftrightarrow{P} = \nabla p_\perp + (p_\parallel - p_\perp) \boldsymbol{\kappa}$$

So the momentum equation is:

$$\rho \frac{d \mathbf{v}}{dt} = \mathbf{J} \times \mathbf{B} - \nabla p_\perp - (p_\parallel - p_\perp) \boldsymbol{\kappa}$$

The anisotropy creates an extra force $-(p_\parallel - p_\perp) \boldsymbol{\kappa}$ along the field curvature.

4.5 CGL Instabilities¶

The CGL model predicts pressure-anisotropy-driven instabilities when:

Mirror instability: If $p_\perp / p_\parallel$ is too large $$\frac{p_\perp}{p_\parallel} > 1 + \frac{1}{\beta_\perp}$$ where $\beta_\perp = 2\mu_0 p_\perp / B^2$.

The plasma creates local magnetic mirrors (enhanced $B$ regions) to reduce $p_\perp$.

Firehose instability: If $p_\parallel / p_\perp$ is too large $$\frac{p_\parallel}{p_\perp} > 1 + \frac{2}{\beta_\parallel}$$ where $\beta_\parallel = 2\mu_0 p_\parallel / B^2$.

The magnetic field line "kinks" like a firehose under pressure.

These instabilities are observed in the solar wind and Earth's magnetosheath.

4.6 Limitations of CGL¶

No heat flux: CGL assumes no parallel heat conduction. In reality, heat flux is important on long parallel scales.
No collisions: CGL is for collisionless plasmas. Adding even weak collisions modifies the evolution.
Local approximation: CGL assumes the second adiabatic invariant holds locally, which breaks down for trapped particles bouncing on long scales.
Slow dynamics: CGL assumes slow evolution compared to the gyro-period and bounce period.

Despite these limitations, CGL captures essential physics of anisotropic pressure in collisionless plasmas and is widely used in space physics.

5. Beyond MHD: Drift-Kinetic and Gyrokinetic Theory¶

5.1 Drift-Kinetic Theory¶

Drift-kinetic theory reduces the dimensionality from 6D to 5D by averaging over the gyrophase.

The idea: in a magnetized plasma, particles rapidly gyrate around field lines. If we only care about slow dynamics ($\omega \ll \omega_c$), we can average over the fast gyration.

Variables: - $\mathbf{R}$: guiding center position (3D) - $v_\parallel$: parallel velocity (1D) - $\mu$: magnetic moment (adiabatic invariant, parameter) - Time $t$

Distribution function: $F(\mathbf{R}, v_\parallel, \mu, t)$ (5D instead of 6D)

Drift-kinetic equation (simplified): $$\frac{\partial F}{\partial t} + \mathbf{v}_d \cdot \nabla_\mathbf{R} F + \frac{d v_\parallel}{dt} \frac{\partial F}{\partial v_\parallel} = C[F]$$

where $\mathbf{v}_d$ includes the parallel motion and perpendicular drifts: $$\mathbf{v}_d = v_\parallel \hat{\mathbf{b}} + \mathbf{v}_E + \mathbf{v}_{\nabla B} + \mathbf{v}_\kappa + \cdots$$

What it captures: - Parallel motion and bounce dynamics (trapped particles) - All guiding-center drifts - Collisionless (Landau) damping

What it misses: - Cyclotron resonance (averaged out with gyrophase) - Finite Larmor radius (FLR) effects

Applications: - Neoclassical transport (tokamak collisional diffusion) - Bounce-averaged kinetic theory (trapped-particle instabilities) - Radiation belt dynamics (drift-loss-cone)

5.2 Gyrokinetic Theory¶

Gyrokinetic theory is the most sophisticated reduced model, capturing finite Larmor radius (FLR) effects while still averaging over gyrophase.

Key innovation: Expand in small parameters: $$\delta = \frac{\rho_i}{L} \sim \frac{\omega}{\omega_{ci}} \sim \frac{\delta f}{f_0} \ll 1$$

This is the gyrokinetic ordering: slow, small-amplitude, long-wavelength fluctuations.

Variables (same as drift-kinetic): - $\mathbf{R}$: gyrocenter position - $v_\parallel$: parallel velocity - $\mu$: magnetic moment

Gyrokinetic distribution: $g(\mathbf{R}, v_\parallel, \mu, t)$ (perturbed part)

Gyrokinetic equation (schematic): $$\frac{\partial g}{\partial t} + \mathbf{v}_d \cdot \nabla g + \frac{dv_\parallel}{dt} \frac{\partial g}{\partial v_\parallel} = \text{(source terms with FLR)}$$

The key difference from drift-kinetic: FLR effects are retained through: - Gyroaveraged electric field: $\langle \phi \rangle_\alpha$ (averaged over gyro-orbit) - Gyroaveraged magnetic perturbation

What it captures: - FLR effects (ion Landau damping, wave-particle resonances with FLR) - Microinstabilities: ITG (ion temperature gradient), TEM (trapped electron mode), ETG (electron temperature gradient) - Turbulence cascades with FLR

What it misses: - Compressible Alfvén waves (fast magnetosonic) - Low-frequency approximation: $\omega \ll \omega_{ci}$

Applications: - Tokamak turbulence: gyrokinetic simulations (GENE, GS2, GYRO) predict turbulent transport, which limits confinement - Microinstability analysis: determine growth rates of ITG, TEM, ETG modes - Zonal flows: self-generated sheared flows that regulate turbulence

Gyrokinetic simulations are the state-of-the-art for tokamak physics and run on the world's largest supercomputers.

5.3 Comparison: Drift-Kinetic vs. Gyrokinetic¶

Feature	Drift-Kinetic	Gyrokinetic
Dimensions	5D	5D
FLR effects	No	Yes
Gyrophase-averaged	Yes	Yes
Ordering	None (exact gyroaverage)	$\delta \ll 1$ (perturbative)
What it solves	Bounce motion, drifts	Turbulence, microinstabilities
Typical application	Neoclassical, radiation belts	Tokamak turbulence, ITG/TEM
Computational cost	Moderate	Very high

6. Extended MHD Models¶

6.1 Hall MHD¶

Hall MHD includes the Hall term in Ohm's law:

$$\mathbf{E} + \mathbf{v} \times \mathbf{B} = \frac{1}{en} \mathbf{J} \times \mathbf{B}$$

This allows ions and electrons to decouple at scales $\sim d_i$ (ion skin depth).

Key features: - Whistler waves at high $k$ - Fast magnetic reconnection (Petschek rate) - Dispersive Alfvén waves

Applications: - Magnetic reconnection (magnetopause, magnetotail, solar corona) - Dynamo theory (magnetic field generation) - Space plasma turbulence

6.2 Two-Temperature MHD¶

Separate electron and ion temperatures:

$$\frac{d p_e}{dt} + \gamma p_e \nabla \cdot \mathbf{v} = Q_{ei} + Q_e$$ $$\frac{d p_i}{dt} + \gamma p_i \nabla \cdot \mathbf{v} = -Q_{ei} + Q_i$$

where $Q_{ei}$ is electron-ion energy exchange, and $Q_{e,i}$ are external heating.

Applications: - Heating and energy partition (e.g., shocks heat ions more than electrons initially) - Radiative cooling (electrons radiate more efficiently)

6.3 FLR-MHD¶

Include finite Larmor radius corrections to the pressure tensor:

$$\overleftrightarrow{P} = p \overleftrightarrow{I} + \overleftrightarrow{\Pi}^{\text{FLR}}$$

where $\overleftrightarrow{\Pi}^{\text{FLR}}$ includes gyroviscosity and other FLR effects.

Applications: - Kinetic Alfvén waves - FLR stabilization of MHD instabilities

6.4 Inertial MHD (Electron MHD)¶

At very small scales ($d_e$), electron inertia becomes important:

$$\mathbf{E} + \mathbf{v}_e \times \mathbf{B} = \frac{m_e}{e^2 n} \frac{d \mathbf{J}}{dt}$$

This is electron MHD (EMHD), where ions are stationary and only electrons move.

Dispersion relation (whistler in EMHD): $$\omega = k^2 V_A d_e$$

Applications: - Magnetic reconnection diffusion region - Electron-scale turbulence

7. Python Code Examples¶

7.1 Validity Regime Diagram¶

import numpy as np
import matplotlib.pyplot as plt

# Parameter space: length scale vs. frequency
L = np.logspace(-4, 6, 200)  # 0.1 mm to 1000 km
omega = np.logspace(2, 10, 200)  # 100 rad/s to 10 GHz

L_grid, omega_grid = np.meshgrid(L, omega)

# Plasma parameters (typical tokamak)
n = 1e20  # m^-3
B = 2.0   # T
T = 5e3   # eV (5 keV)

e = 1.6e-19
m_i = 1.67e-27
m_e = 9.11e-31
k_B = 1.38e-23

# Characteristic scales and frequencies
omega_ci = e * B / m_i
omega_ce = e * B / m_e
omega_pi = np.sqrt(n * e**2 / (m_i * 8.85e-12))
omega_pe = np.sqrt(n * e**2 / (m_e * 8.85e-12))

v_th_i = np.sqrt(2 * k_B * T * e / m_i)
v_th_e = np.sqrt(2 * k_B * T * e / m_e)

rho_i = v_th_i / omega_ci
rho_e = v_th_e / omega_ce
d_i = 3e8 / omega_pi
d_e = 3e8 / omega_pe
lambda_D = np.sqrt(8.85e-12 * k_B * T * e / (n * e**2))

print("Characteristic scales and frequencies:")
print(f"  Ion gyrofrequency ω_ci = {omega_ci:.2e} rad/s ({omega_ci/(2*np.pi):.2e} Hz)")
print(f"  Ion gyroradius ρ_i = {rho_i*100:.2f} cm")
print(f"  Ion skin depth d_i = {d_i:.2f} m")
print(f"  Electron skin depth d_e = {d_e*100:.2f} cm")
print(f"  Debye length λ_D = {lambda_D*1e6:.2f} μm")
print()

# Define validity regions
# 1. MHD: ω << ω_ci, L >> ρ_i
MHD = (omega_grid < 0.1 * omega_ci) & (L_grid > 10 * rho_i)

# 2. Hall MHD: ω << ω_ci, L ~ d_i
Hall_MHD = (omega_grid < 0.1 * omega_ci) & (L_grid > 10 * rho_i) & (L_grid < 100 * d_i)

# 3. Two-fluid: ω << ω_ce, L > d_e
Two_Fluid = (omega_grid < 0.1 * omega_ce) & (L_grid > 10 * d_e)

# 4. Gyrokinetic: ω ~ ω_ci, L ~ ρ_i
Gyrokinetic = (omega_grid > 0.01 * omega_ci) & (omega_grid < omega_ci) & \
              (L_grid > rho_i) & (L_grid < 100 * rho_i)

# 5. Full kinetic: always valid (but expensive)
Full_Kinetic = np.ones_like(L_grid, dtype=bool)

# Plot
fig, ax = plt.subplots(figsize=(11, 8))

# Color regions
ax.contourf(L_grid, omega_grid, MHD.astype(int), levels=[0.5, 1.5],
            colors=['lightblue'], alpha=0.6)
ax.contourf(L_grid, omega_grid, Hall_MHD.astype(int), levels=[0.5, 1.5],
            colors=['lightcoral'], alpha=0.6)
ax.contourf(L_grid, omega_grid, Gyrokinetic.astype(int), levels=[0.5, 1.5],
            colors=['lightgreen'], alpha=0.6)

# Boundary lines
ax.axhline(omega_ci, color='r', linestyle='--', linewidth=2, label=f'$\omega_{{ci}}$ = {omega_ci:.2e} rad/s')
ax.axhline(omega_ce, color='m', linestyle='--', linewidth=1.5, label=f'$\omega_{{ce}}$ = {omega_ce:.2e} rad/s')

ax.axvline(rho_i, color='b', linestyle='--', linewidth=2, label=f'$\\rho_i$ = {rho_i*100:.1f} cm')
ax.axvline(d_i, color='g', linestyle='--', linewidth=2, label=f'$d_i$ = {d_i:.1f} m')
ax.axvline(d_e, color='orange', linestyle='--', linewidth=1.5, label=f'$d_e$ = {d_e*100:.1f} cm')

# Labels for regions
ax.text(1e0, 1e3, 'MHD', fontsize=16, weight='bold', color='blue')
ax.text(1e-1, 1e4, 'Hall MHD', fontsize=14, weight='bold', color='red')
ax.text(1e-2, 1e7, 'Gyrokinetic', fontsize=14, weight='bold', color='green')
ax.text(1e-3, 1e9, 'Full Kinetic', fontsize=14, weight='bold', color='black')

ax.set_xscale('log')
ax.set_yscale('log')
ax.set_xlabel('Length scale L (m)', fontsize=13)
ax.set_ylabel('Frequency ω (rad/s)', fontsize=13)
ax.set_title('Plasma Model Validity Regimes (n=$10^{20}$ m$^{-3}$, B=2 T, T=5 keV)', fontsize=14)
ax.legend(fontsize=10, loc='upper left')
ax.grid(True, which='both', alpha=0.3)
ax.set_xlim(1e-4, 1e6)
ax.set_ylim(1e2, 1e10)

plt.tight_layout()
plt.savefig('validity_regimes.png', dpi=150)
plt.show()

7.2 CGL vs. Isotropic MHD: Mirror Instability¶

import numpy as np
import matplotlib.pyplot as plt

def mirror_instability_threshold(beta_perp):
    """
    Mirror instability threshold: p_perp/p_parallel > 1 + 1/beta_perp
    """
    return 1 + 1/beta_perp

# Beta range
beta_perp = np.logspace(-2, 2, 200)

# Threshold
threshold = mirror_instability_threshold(beta_perp)

# Plot
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(13, 5))

# Threshold curve
ax1.plot(beta_perp, threshold, 'r-', linewidth=3, label='Mirror instability threshold')
ax1.fill_between(beta_perp, 1, threshold, alpha=0.3, color='red', label='Unstable')
ax1.fill_between(beta_perp, threshold, 10, alpha=0.3, color='green', label='Stable')

ax1.set_xscale('log')
ax1.set_xlabel(r'$\beta_\perp = 2\mu_0 p_\perp / B^2$', fontsize=12)
ax1.set_ylabel(r'$p_\perp / p_\parallel$', fontsize=12)
ax1.set_title('Mirror Instability Threshold', fontsize=13)
ax1.set_ylim(1, 10)
ax1.legend(fontsize=11)
ax1.grid(alpha=0.3)

# Growth rate (simplified)
# γ/Ω_i ~ sqrt(β_perp) * (p_perp/p_parallel - 1 - 1/β_perp) for unstable
beta_example = 1.0
anisotropy = np.linspace(1, 5, 100)
threshold_value = mirror_instability_threshold(beta_example)

gamma_normalized = np.where(anisotropy > threshold_value,
                             np.sqrt(beta_example) * (anisotropy - threshold_value),
                             0)

ax2.plot(anisotropy, gamma_normalized, 'b-', linewidth=3)
ax2.axvline(threshold_value, color='r', linestyle='--', linewidth=2,
            label=f'Threshold at $\\beta_\\perp$ = {beta_example}')
ax2.fill_between(anisotropy, 0, gamma_normalized, alpha=0.3, color='blue')

ax2.set_xlabel(r'$p_\perp / p_\parallel$', fontsize=12)
ax2.set_ylabel(r'Growth rate $\gamma / \Omega_i$', fontsize=12)
ax2.set_title(f'Mirror Instability Growth Rate ($\\beta_\\perp$ = {beta_example})', fontsize=13)
ax2.legend(fontsize=11)
ax2.grid(alpha=0.3)

plt.tight_layout()
plt.savefig('mirror_instability.png', dpi=150)
plt.show()

print(f"Mirror instability:")
print(f"  At β_perp = 0.1: threshold p_perp/p_parallel > {mirror_instability_threshold(0.1):.2f}")
print(f"  At β_perp = 1.0: threshold p_perp/p_parallel > {mirror_instability_threshold(1.0):.2f}")
print(f"  At β_perp = 10:  threshold p_perp/p_parallel > {mirror_instability_threshold(10):.2f}")
print()
print("Physical interpretation:")
print("  High β_perp (strong pressure): easier to go unstable (lower threshold)")
print("  Low β_perp (weak pressure): harder to go unstable (higher threshold)")

7.3 Dispersion Comparison: MHD vs. Hall MHD vs. Kinetic¶

import numpy as np
import matplotlib.pyplot as plt

# Plasma parameters
n = 1e19
B = 0.1
T_e = 10  # eV
T_i = 10

e = 1.6e-19
m_i = 1.67e-27
m_e = 9.11e-31
mu_0 = 4e-7 * np.pi
k_B = 1.38e-23

# Derived quantities
omega_ci = e * B / m_i
omega_ce = e * B / m_e
omega_pi = np.sqrt(n * e**2 / (m_i * 8.85e-12))

v_A = B / np.sqrt(mu_0 * n * m_i)
c_s = np.sqrt(k_B * (T_e + T_i) * e / m_i)
d_i = 3e8 / omega_pi

v_th_e = np.sqrt(2 * k_B * T_e * e / m_e)
v_th_i = np.sqrt(2 * k_B * T_i * e / m_i)

print("Plasma parameters:")
print(f"  V_A = {v_A:.2e} m/s")
print(f"  c_s = {c_s:.2e} m/s")
print(f"  d_i = {d_i:.2e} m")
print(f"  ω_ci = {omega_ci:.2e} rad/s")
print()

# Wavenumber range
k = np.logspace(-3, 3, 500) / d_i  # normalized to d_i

# 1. MHD Alfvén wave
omega_MHD = k * v_A / omega_ci * (k * d_i)  # normalized to omega_ci

# 2. Hall MHD (whistler)
omega_Hall = k * v_A / omega_ci * (k * d_i) * np.sqrt(1 + (k * d_i)**2)

# 3. Kinetic Alfvén wave (warm plasma, with electron Landau damping)
# Approximate dispersion (electrostatic limit)
k_perp = k / 2  # assume oblique
rho_s = c_s / omega_ci
omega_KAW = k * v_A / omega_ci * (k * d_i) * np.sqrt(1 + (k_perp * d_i * rho_s / d_i)**2)

# 4. Ion acoustic wave
omega_ion_acoustic = k * c_s / omega_ci * (k * d_i)

# Plot
fig, ax = plt.subplots(figsize=(11, 7))

ax.loglog(k * d_i, omega_MHD, 'b-', linewidth=3, label='MHD Alfvén: $\omega = k_\parallel V_A$')
ax.loglog(k * d_i, omega_Hall, 'r--', linewidth=3, label='Hall MHD (whistler): $\omega = k_\parallel V_A \sqrt{1+(kd_i)^2}$')
ax.loglog(k * d_i, omega_KAW, 'g-.', linewidth=3, label='Kinetic Alfvén (warm)')
ax.loglog(k * d_i, omega_ion_acoustic, 'm:', linewidth=3, label='Ion acoustic: $\omega = k c_s$')

# Reference lines
ax.axvline(1, color='k', linestyle=':', alpha=0.5, linewidth=2, label='$k d_i = 1$')
ax.axhline(1, color='gray', linestyle=':', alpha=0.5, linewidth=2, label='$\omega = \omega_{ci}$')

# Asymptotic slopes
k_ref = np.logspace(-2, 0, 50)
ax.loglog(k_ref * d_i, (k_ref * d_i)**1 * 0.01, 'k--', alpha=0.4, label='slope = 1')
ax.loglog(k_ref * d_i * 10, (k_ref * d_i * 10)**2 * 0.001, 'k-.', alpha=0.4, label='slope = 2')

ax.set_xlabel(r'$k d_i$ (normalized wavenumber)', fontsize=13)
ax.set_ylabel(r'$\omega / \omega_{ci}$ (normalized frequency)', fontsize=13)
ax.set_title('Dispersion Relations: MHD vs. Hall MHD vs. Kinetic', fontsize=14)
ax.legend(fontsize=10, loc='upper left')
ax.grid(True, which='both', alpha=0.3)
ax.set_xlim(1e-3, 1e3)
ax.set_ylim(1e-4, 1e2)

plt.tight_layout()
plt.savefig('dispersion_comparison.png', dpi=150)
plt.show()

print("Dispersion relations:")
print("  MHD: ω ∝ k (linear, non-dispersive)")
print("  Hall MHD: ω ∝ k² at k d_i >> 1 (whistler, dispersive)")
print("  Kinetic: includes Landau damping (not shown, requires complex ω)")

Summary¶

In this lesson, we traced the systematic reduction from kinetic theory to MHD:

Two-fluid to single-fluid: By combining electron and ion equations, we obtain the MHD momentum and continuity equations. The key step is deriving the ideal Ohm's law $\mathbf{E} + \mathbf{v} \times \mathbf{B} = 0$ from the generalized Ohm's law by dropping resistive, Hall, pressure, and inertia terms.
Validity conditions: MHD is valid for low-frequency ($\omega \ll \omega_{ci}$), large-scale ($L \gg \rho_i$), collisional ($\lambda_{mfp} \ll L$), non-relativistic ($v \ll c$), quasi-neutral ($L \gg \lambda_D$), high-$R_m$ plasmas. Violations require extended MHD or kinetic models.
CGL model: For collisionless plasmas, the pressure is anisotropic ($p_\parallel \neq p_\perp$). The CGL (double adiabatic) closure uses conservation of adiabatic invariants: $p_\perp/(nB) = \text{const}$ and $p_\parallel B^2 / n^3 = \text{const}$. This predicts mirror and firehose instabilities.
Drift-kinetic and gyrokinetic: These 5D models average over gyrophase while retaining kinetic effects. Drift-kinetic captures bounce dynamics; gyrokinetic includes FLR effects and is used for tokamak turbulence simulations.
Extended MHD: Hall MHD, two-temperature MHD, FLR-MHD, and electron MHD extend standard MHD to capture additional physics at the cost of increased complexity.
Model comparison: Each model has a regime of validity. The choice depends on the scales, frequencies, and physics of interest. MHD is simple and captures large-scale dynamics; kinetic theory is comprehensive but computationally expensive.

Understanding the hierarchy of plasma models is essential for choosing the appropriate level of description for a given problem.

Practice Problems¶

Problem 1: Ideal MHD from Generalized Ohm's Law¶

Starting from the generalized Ohm's law: $$\mathbf{E} + \mathbf{v} \times \mathbf{B} = \eta \mathbf{J} + \frac{1}{en} \mathbf{J} \times \mathbf{B} - \frac{1}{en} \nabla p_e + \frac{m_e}{e^2 n^2} \frac{d \mathbf{J}}{dt}$$ For a tokamak plasma with $n = 10^{20}$ m$^{-3}$, $T_e = 10$ keV, $B = 5$ T, $L = 1$ m, $V = 100$ m/s: (a) Calculate the characteristic time scale $\tau = L/V$. (b) Estimate the magnitude of each term on the RHS relative to the LHS. (c) Which terms can be neglected to obtain ideal MHD? Justify your answer.

Problem 2: CGL Pressure Evolution¶

A collisionless plasma is compressed adiabatically by increasing the magnetic field from $B_0$ to $2B_0$ while keeping the density constant ($n = n_0$). (a) Using the CGL equations, find the final values of $p_\perp$ and $p_\parallel$ in terms of the initial values. (b) If initially $p_{\perp 0} = p_{\parallel 0} = p_0$, what is the anisotropy ratio $p_\perp / p_\parallel$ after compression? (c) For $\beta_{\perp 0} = 0.5$, does the compressed plasma exceed the mirror instability threshold?

Problem 3: Frozen-In Flux¶

In ideal MHD, the magnetic flux through any closed loop moving with the fluid is conserved: $$\frac{d\Phi}{dt} = 0, \quad \text{where } \Phi = \int_S \mathbf{B} \cdot d\mathbf{A}$$ (a) Prove this frozen-in theorem using the ideal Ohm's law and the induction equation. (b) A circular flux tube of initial radius $r_0 = 10$ cm has magnetic field $B_0 = 0.1$ T. The plasma is compressed radially to $r = 5$ cm. What is the final magnetic field (assuming incompressible flow)? (c) What is the physical meaning of "frozen-in"? Can field lines reconnect in ideal MHD?

Problem 4: Gyrokinetic Ordering¶

In gyrokinetic theory, the ordering is: $$\frac{\rho_i}{L} \sim \frac{\omega}{\omega_{ci}} \sim \frac{\delta f}{f_0} \sim \delta \ll 1$$ (a) For a tokamak with $L = 1$ m, $\rho_i = 5$ mm, what is $\delta$? (b) If $\omega_{ci} = 10^8$ rad/s, what is the maximum frequency resolved by gyrokinetics? (c) The fast magnetosonic wave has $\omega \sim k V_A$ with no upper frequency limit. Why can't gyrokinetics capture this wave?

Problem 5: Hall MHD Reconnection¶

In resistive MHD, the Sweet-Parker reconnection rate is: $$V_{in} \sim \frac{\eta}{L} \sim \frac{V_A}{S^{1/2}}$$ where $S = L V_A / \eta$ is the Lundquist number.

In Hall MHD, the reconnection rate becomes (Petschek): $$V_{in} \sim 0.1 V_A$$ independent of resistivity!

(a) For a solar flare with $B = 0.01$ T, $n = 10^{16}$ m$^{-3}$, $L = 10^4$ km, $T_e = 10^6$ K, calculate the Alfvén speed and the ion skin depth. (b) Estimate the Sweet-Parker reconnection time $\tau_{SP} \sim L / V_{in}$ (use Spitzer resistivity). (c) Estimate the Hall MHD reconnection time $\tau_{Hall}$. (d) Solar flares release energy on time scales of minutes. Which model is consistent with observations?

Previous: Two-Fluid Model | Next: Plasma Diagnostics