{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "(chapter.GeomOptics)=\n", "# Geometrical Optics\n", "\n", "\n", "```{admonition} What you should know and be able to do after studying this chapter\n", "- Principle of Fermat.\n", "- Understand the approximation made in Gaussian geometrical optics.\n", "- Know how to work with the sign convention of the Lens Maker's Formula (not the derivation of the formula).\n", "- Understand how the Lens Maker's Formula of a single lens follows from the formula for a single interface.\n", "- Understand how the image of two and more lenses is derived from that of a single lens by construction and by computing the intermediate images. You do not need to know the imaging equation and the formulae for the focal distances of two thin lenses.\n", "- Understand the matrix method (you do not need to know the matrices by hart).\n", "- Understand the modification of the lens model to incorporate a thick lens.\n", "- Understand the limitations of geometrical optics, in particular when diffraction optics is needed.\n", "```\n", "**Nice software for practicing geometrical optics**:\n", "\n", "[https://www.geogebra.org/m/X8RuneVy](https://www.geogebra.org/m/X8RuneVy)\n" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "tags": [ "thebe-remove-input-init" ] }, "outputs": [], "source": [ "import micropip\n", "await micropip.install(\"plotly\")\n", "\n", "import numpy as np\n", "import plotly.graph_objects as go\n", "from ipywidgets import interact, FloatSlider\n", "from IPython.display import HTML" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Introduction\n", "Geometrical optics is an old subject but it is still essential to understand and design optical instruments such as camera's, microscopes, telescopes etc. Geometrical optics started long before light was described as a wave as is done in wave optics, and long before it was discovered that light is an electromagnetic wave and that optics is part of electromagnetism.\n", "\n", "In this chapter we go back in history and treat geometrical optics. That may seem strange now that we have a much more accurate and better theory at our disposal. However, the predictions of geometrical optics are under quite common circumstances very useful and also very accurate. In fact, for many optical systems and practical instruments there is no alternative for geometrical optics because more accurate theories are much too complicated to use.\n", "\n", "When a material is illuminated, its molecules start to radiate spherical waves (more precisely, they radiate like tiny electric dipoles) and the total wave scattered by the material is the sum of all these spherical waves. A time-harmonic wave has at every point in space and at every instant of time a well defined phase.\n", "A **wave front** is a set of space-time points where the phase has the same value. At any fixed time, the wave front is called a surface of constant phase. This surface moves with the phase velocity in the direction of its local normal.\n", "\n", "For plane waves we have shown in the previous chapter that the surfaces of constant phase are planes and that the normal to these surfaces is in the direction of the wave vector which coincides with the direction of the phase velocity as well as with the direction of the flow of energy (the direction of the Poynting vector). For general waves, the local direction of energy flow is given by the direction of the Poynting vector. Provided that the radius of curvature of the surfaces is much larger than the wavelength, the normal to the surfaces of constant phase may still be considered to be in the direction of the local flow of energy. Such waves behave locally as plane waves and their effect can be accurately described by the methods of geometrical optics.\n", "\n", "Geometrical optics is based on the intuitive idea that light consists of a bundle of rays. But what is a ray?\n", "\n", "\n", "```{note}\n", "A ray is an oriented curve which is everywhere perpendicular to the surfaces of constant phase and points in the direction of the flow of energy.\n", "```\n", "\n", "Consider a point source at some distance before an opaque screen with an aperture. According to the ray picture, the light distribution on a second screen further away from the source and parallel to the first screen is simply an enlarged copy of the aperture (see {numref}`Fig_2_01_GeomDiffr`). The copy is enlarged due to the fanning out of the rays. However, this description is only accurate when the wavelength of the light is very small compared to the diameter of the aperture. If the aperture is only ten times the wavelength, the pattern is much broader due to the bending of the rays around the edge of the aperture. This phenomenon is called **diffraction**. Diffraction can not be explained by geometrical optics and will be studied in [](chapter.diffraction).\n", "\n", "\n", "```{figure} Images/Chapter_2/2_01_Figgeom.png\n", ":name: Fig_2_01_GeomDiffr\n", "Light distribution on a screen due to a rectangular aperture. Left: for a large aperture, we get an enlarged copy of the aperture. Right: for an aperture that is of the order of the wavelength there is strong bending (diffraction) of the light. \n", "```\n" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "tags": [ "remove-output", "thebe-init", "hide-input" ] }, "outputs": [ { "data": { "application/vnd.jupyter.widget-view+json": { "model_id": "4010dfc22e984d1a913a2a61b6f75d0a", "version_major": 2, "version_minor": 0 }, "text/plain": [ "interactive(children=(FloatSlider(value=1.0, description='Aperture size', max=2.0, min=0.1), FloatSlider(value…" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/plain": [ "" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "def interactive_diffraction(aperture_size, distance):\n", " x = np.linspace(-2, 2, 20)\n", " y = np.linspace(-2, 2, 20)\n", " X, Y = np.meshgrid(x, y)\n", " Z1 = np.zeros_like(X) # aperture plane\n", " \n", " # aperture region\n", " mask = (np.abs(X) < aperture_size / 2) & (np.abs(Y) < aperture_size / 2)\n", " light_through = np.where(mask, 1, 0) # light passes only through the aperture\n", " \n", " # simulate light spreading\n", " spread_factor = (distance / 2) + 1 # approximate spread\n", " X_spread, Y_spread = np.meshgrid(np.linspace(-spread_factor, spread_factor, 20),\n", " np.linspace(-spread_factor, spread_factor, 20))\n", " Z2 = np.full_like(X_spread, distance)\n", " \n", " intensity = np.exp(-((X_spread**2 + Y_spread**2) / (aperture_size**2)))\n", " \n", " fig = go.Figure()\n", " \n", " # Add aperture surface\n", " fig.add_trace(\n", " go.Surface(\n", " y=Y, x=Z1, z=X, # rotated orientation\n", " surfacecolor=np.ones_like(X) * light_through,\n", " colorscale='Blues',\n", " showscale=False,\n", " opacity=0.7\n", " )\n", " )\n", " \n", " # Add screen surface with diffraction pattern\n", " fig.add_trace(\n", " go.Surface(\n", " y=Y_spread, x=Z2, z=X_spread, # rotated orientation\n", " surfacecolor=intensity,\n", " colorscale='Inferno',\n", " showscale=True,\n", " opacity=0.8\n", " )\n", " )\n", " \n", " fig.update_layout(\n", " scene=dict(\n", " xaxis_title=\"Distance\",\n", " yaxis_title=\"\",\n", " zaxis_title=\"\",\n", " aspectratio=dict(x=1, y=1, z=1),\n", " camera=dict(\n", " eye=dict(x=1.5, y=1.5, z=1)\n", " )\n", " ),\n", " margin=dict(l=0, r=0, t=30, b=0)\n", " )\n", " \n", " \n", " html = fig.to_html(include_plotlyjs='cdn', full_html=False)\n", " return HTML(html)\n", "\n", "interact(interactive_diffraction, \n", " aperture_size=FloatSlider(min=0.1, max=2, step=0.1, value=1, description=\"Aperture size\"),\n", " distance=FloatSlider(min=1, max=10, step=0.5, value=5, description=\"Distance\"))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "Geometrical optics is accurate when the sizes of the objects in the system are large compared to the wavelength. It is possible to derive geometrical optics from Maxwell's equations by formally expanding the electromagnetic field in a power series in the wavelength and retaining only the first term of this expansion [^1]. However, this derivation is not rigorous because the power series generally does not converge (it is a so-called asymptotic series).\n", "\n", "Although it is possible to incorporate polarisation into geometrical optics [^2], this is not standard theory and we will not consider polarisation effects in this chapter\n", "\n", "## Principle of Fermat\n", "\n", "The starting point of the treatment of geometrical optics is the \n", "\n", "```{note}\n", "**Principle of Fermat (1657)**. The path followed by a light ray between two points is the one that takes the least amount of time.\n", "```\n", "\n", "The speed of light in a material with refractive index $n$, is $c/n$, where $c=3\\times 10^8$ m/s is the speed of light in vacuum. At the time of Fermat, the conviction was that the speed of light must be finite, but nobody could suspect how incredibly large it actually is. In 1676 the Danish astronomer Ole Römer computed the speed from inspecting the eclipses of a moon of Jupiter and arrived at an estimate that was only 30\\% too low.\n", "\n", "Let $\\mathbf{r}(s)$, be a ray with $s$ the length parameter. The ray links two points $S$ and $P$.\n", "Suppose that the refractive index varies with position: $n(\\mathbf{r})$. Over the infinitesimal distance from $s$ to\n", "$s+\\mathrm{d}s$, the speed of the light is\n", "\n", "```{math}\n", ":label: eq.defcn\n", "\\begin{align*}\n", "\\frac{c}{n(\\mathbf{r}(s))}.\n", "\\end{align*}\n", "```\n", "Hence the time it takes for light to go from $\\mathbf{r}(s)$ to $\\mathbf{r}(s+\\mathrm{d}s)$ is:\n", "\n", "```{math}\n", ":label: eq.defdt\n", "\\begin{align*}\n", "\\mathrm{d}t = \\frac{n(\\mathbf{r}(s))}{c} \\mathrm{d}s,\n", "\\end{align*}\n", "```\n", "and the total total time to go from $S$ to $P$ is:\n", "\n", "```{math}\n", ":label: eq.ttot\n", "\\begin{align*}\n", "t_{S \\rightarrow P} = \\int_0^{s_P} \\frac{n(\\mathbf{r}(s))}{c} \\mathrm{d}s,\n", "\\end{align*}\n", "```\n", "where $s_P$ is the distance along the ray from S to P.\n", "The **optical path length** [m] of the ray between S and P is defined by:\n", "\n", "\n", "\n", "```{math}\n", ":label: eq.defOPL\n", "\\boxed{\\begin{align*}\n", "\\text{OPL} = \\int_0^{s_P} n(\\mathbf{r}(s)) \\mathrm{d}s,\n", "\\end{align*}}\n", "```\n", "\n", "So the OPL is the distance weighted by the refractive index. \n", "\n", "```{note}\n", "Fermat's principle is thus equivalent to the statement that a ray follows the path with shortest OPL.\n", "```\n", "\n", "\n", "\n", "```{figure} Images/Chapter_2/2_02_Theory_of_mirage.jpg\n", ":name: Fig_2_02_Theory_of_mirage\n", "Because the temperature close to the ground is higher, the refractive index is lower there. Therefore the rays bend upwards, creating a mirror image of the tree below the ground. (From Popular Science Monthly Volume 5, Public Domain, [link](https://commons.wikimedia.org/w/index.php?curid=10770493)).\n", "```\n", "\n", "**Remark.**\n", "Actually, Fermat's principle as formulated above is not complete. There are circumstances that a ray can take two paths between two points that have different travel times. Each of these paths then corresponds to a minimum travel time compared to nearby paths, so the travel time is in general a *local minimum*. An example is the reflection by a mirror discussed in the following section.\n", "\n", "## Some Consequences of Fermat's Principle\n", "- **Homogeneous matter**\n", "\n", "In homogenous matter, the refractive index is constant and therefore paths of shortest OPL are straight lines. Hence in homogeneous matter rays are straight lines.\n", "- **Inhomogeneous matter**\n", "\n", "When the refractive index is a function of position such as air with a temperature gradient, the rays bend towards regions of higher refractive index. In the case of {numref}`Fig_2_02_Theory_of_mirage` for example, the ray from the top of the tree to the eye of the observer passes on a warm day close to the ground because there the temperature is higher and hence the refractive index is smaller. Although the curved path is longer than the straight path, the total travel time of the light is less because near the ground the light speed is higher (since the refractive index is smaller). The observer gets the impression that the tree is upside down under the ground.\n", "\n", "- **Law of reflection**\n", "\n", "\n", "Consider the mirror shown in {numref}`Fig_2_03_Descartes_Reflection`. Since the medium above th mirror is homogeneous, a ray from point $P$ can end up in $Q$ in two ways: by going along a straight line directly form $P$ to $Q$ or alternatively by straight lines via the mirror. Both possibilities have different path lengths and hence different travel times and hence both are local minima mentioned at the end of the previous section. We consider here the path by means of reflection by the mirror.\n", "Let the $x$-axis be the intersection of the mirror and the plane through the points $P$ and $Q$ and perpendicular to the mirror. Let the $y$-axis be normal to the mirror. Let $(x_P, y_P)$ and $(x_Q,y_Q)$ be the coordinates of $P$ and $Q$, respectively. If $(x,0)$ is the point where a ray from $P$ to $Q$ hits the mirror, the travel time of that ray is\n", "\n", "```{math}\n", ":label: eq.mirror1\n", "\\begin{align*}\n", "\\frac{n}{c}d_1(x) + \\frac{n}{c}d_2(x) = \\frac{n}{c}\\sqrt{ (x-x_P)^2 + y_P^2} +\\frac{n}{c} \\sqrt{ (x_Q-x)^2 + y_Q^2},\n", "\\end{align*}\n", "```\n", "where $n$ is the refractive index of the medium in $y>0$. According to Fermat's Principle, the point $(x,0)$ should be such that the travel time is minimum, i.e.\n", "\n", "```{math}\n", ":label: eq.mirror2\n", "\\begin{align*}\n", "\\frac{d }{d x} [d_1(x) + d_2(x)] = \\frac{(x-x_P)}{d_1(x)} - \\frac{(x_Q-x)}{d_2(x)} =0.\n", "\\end{align*}\n", "```\n", "Hence\n", "\n", "```{math}\n", ":label: eq.mirror3\n", "\\begin{align*}\n", "\\sin \\theta_i = \\sin \\theta_r,\n", "\\end{align*}\n", "```\n", "or\n", "\n", "```{math}\n", ":label: eq.mirror4\n", "\\begin{align*}\n", "\\theta_r = \\theta_i.\n", "\\end{align*}\n", "```\n", "where $\\theta_i$ and $\\theta_r$ are the angles of incidence and reflection as shown in {numref}`Fig_2_03_Descartes_Reflection`.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_03_Descartes_Reflection.png\n", ":name: Fig_2_03_Descartes_Reflection\n", "Ray from $P$ to $Q$ via the mirror.\n", "```\n", "\n", "\n", "- **Snell's law of refraction**\n", "\n", "\n", "Next we consider refraction at an interface. Let $y=0$ be the interface between a medium with refractive index $n_i$ in $y>0$ and a medium with refractive index $n_t$ in $y<0$. We use the same coordinate system as in the case of reflection above. Let $(x_P,y_P)$ and $(x_Q,y_Q)$ with $y_P>0$ and $y_Q<0$ be the coordinates of two points $P$ and $Q$ are shown in {numref}`Fig_2_04_Descartes_Refraction`. What path will a ray follow that goes from $P$ to $Q$? Since the refractive index is constant in both half spaces, the ray is a straight line in both media. Let $(x,0)$ be the coordinate of the intersection point of the ray with the interface. Then the travel time is\n", "\n", "```{math}\n", ":label: eq.refrac1\n", "\\begin{align*}\n", "\\frac{n_i}{c} d_1(x) + \\frac{n_t}{c} d_2(x) = \\frac{n_i}{c} \\sqrt{(x-x_P)^2 + y_P^2} +\n", "\\frac{n_t}{c} \\sqrt{(x_Q-x)^2 + y_Q^2}.\n", "\\end{align*}\n", "```\n", "The travel time must be minimum, hence there must hold\n", "\n", "```{math}\n", ":label: eq.refrac2\n", "\\begin{align*}\n", "\\frac{d}{d x} \\left[ n_i d_1(x) + n_t d_2(x)\\right] = n_i \\frac{(x-x_P)}{d_1(x)} - n_t \\frac{(x_Q-x)}{d_2(x)}=0.\n", "\\end{align*}\n", "```\n", "where the travel time has been multiplied by the speed of light in vacuum. Eq. {eq}`eq.refrac2` implies\n", "\n", "```{math}\n", ":label: eq.refrac3\n", "\\begin{align*}\n", "n_i \\sin \\theta_i = n_t \\sin \\theta_t,\n", "\\end{align*}\n", "```\n", "where $\\theta_i$ and $\\theta_t$ are the angles between the ray and the normal to the surface in the upper half space and the lower half space, respectively ({numref}`Fig_2_04_Descartes_Refraction`).\n", "\n", "```{figure} Images/Chapter_2/2_04_Descartes_Refraction.png\n", ":name: Fig_2_04_Descartes_Refraction\n", "Ray from $P$ to $Q$ refracted by an interface.\n", "```" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "tags": [ "remove-output", "thebe-remove-input-init", "hide-input" ] }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ "" ] }, "execution_count": 7, "metadata": {}, "output_type": "execute_result" } ], "source": [ "import numpy as np\n", "import plotly.graph_objects as go\n", "from IPython.display import HTML\n", "\n", "def create_snells_law_visualization():\n", " fig = go.Figure()\n", "\n", " n1 = 1.0\n", " n2_range = np.linspace(1, 2, 11)\n", " a = 0\n", " base_traces = 0 # number of traces that are always visible\n", " \n", " # Add traces, one for each slider step\n", " for n2 in n2_range:\n", " aone = np.arctan(a + 1)\n", " atwo = np.arctan(a - 1)\n", "\n", " # Draw polygons and lines\n", " fig.add_trace(go.Scatter(visible=False, x=[-1, 1, 1, -1, -1], y=[0, 0, 1, 1, 0],\n", " fill='toself', fillcolor='rgba(0,0,255,0.33)', line=dict(color='blue'), name='Polygon 1', showlegend=False))\n", " fig.add_trace(go.Scatter(visible=False, x=[-1, 1, 1, -1, -1], y=[0, 0, -1, -1, 0],\n", " fill='toself', fillcolor='rgba(0,0,255,0.33)', line=dict(color='blue'), name='Polygon 2', showlegend=False))\n", " fig.add_trace(go.Scatter(visible=False, x=[-1, 1], y=[0, 0], line=dict(color='blue'), name='Line 1'))\n", "\n", " x = np.linspace(0, 2, 2000)\n", " y = 1\n", " t = np.sqrt(x**2 + y**2) * n1 + np.sqrt((2 - x)**2 + y**2) * n2\n", " __x = x[np.argmin(t)] - 1\n", "\n", " fig.add_trace(go.Scatter(visible=False, x=[__x, __x], y=[-1, 1], line=dict(color='blue', dash='dash'),\n", " name='Dashed Line', showlegend=False))\n", " fig.add_trace(go.Scatter(visible=False, x=[-1, __x], y=[1, 0], line=dict(color='red', width=2), name='Line 2', showlegend=False))\n", " fig.add_trace(go.Scatter(visible=False, x=[__x, 1], y=[0, -1], line=dict(color='purple', width=2), name='Line 3', showlegend=False))\n", " \n", " # Add labels\n", " fig.add_trace(go.Scatter(visible=False, x=[__x - 0.25], y=[0.5], text=[str(np.round(np.degrees(np.arctan((__x + 1) / y)), 1)) + \"°\"],\n", " mode='text', showlegend=False, name='Label 1'))\n", " fig.add_trace(go.Scatter(visible=False, x=[__x + (1 - __x) / 4], y=[-0.5],\n", " text=[str(np.round(np.degrees(np.arctan((2 - __x - 1) / y)), 1)) + \"°\"],\n", " mode='text', showlegend=False, name='Label 2'))\n", " fig.add_trace(go.Scatter(visible=False, x=[-0.75], y=[0.1], text=[\"n1 = \" + str(np.round(n1, 1))],\n", " mode='text', showlegend=False, name='Label 3'))\n", " fig.add_trace(go.Scatter(visible=False, x=[-0.75], y=[-0.1], text=[\"n2 = \" + str(np.round(n2, 1))],\n", " mode='text', showlegend=False, name='Label 4'))\n", " \n", " fig.update_layout(\n", " title=\"Fermat's Principle and Snell's Law\",\n", " showlegend=False,\n", " title_x=0.5, # center the title horizontally\n", " title_y=0.95 # adjust the vertical position of the title\n", " )\n", " \n", " traces_per_step = 10 # number of traces per value of n2\n", " active_n2_index = 0\n", " for i in range(traces_per_step):\n", " curr_idx = int(base_traces + active_n2_index * traces_per_step + i)\n", " fig.data[curr_idx].visible = True\n", " \n", " steps = []\n", " for i in range(0, n2_range.shape[0]):\n", " visarray = [False] * len(fig.data)\n", " curr_idx = int(i * traces_per_step)\n", " next_idx = int((i + 1) * traces_per_step)\n", " visarray[curr_idx:next_idx] = [True] * traces_per_step\n", " step = dict(\n", " method=\"update\",\n", " args=[{\"visible\": visarray}],\n", " label=round(n2_range[i], 1)\n", " )\n", " steps.append(step)\n", " \n", " sliders = [dict(\n", " active=active_n2_index,\n", " currentvalue={\"prefix\": \"n2 = \"},\n", " steps=steps\n", " )]\n", " \n", " fig.update_layout(\n", " sliders=sliders,\n", " legend_title=\"Legend\",\n", " )\n", " \n", " html = fig.to_html(include_plotlyjs='cdn', full_html=False)\n", " return HTML(html)\n", "\n", "create_snells_law_visualization()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Hence we have derived the law of reflection and Snell's law from Fermat's principle. In [](chapter.basics) the reflection law and Snell's law have been derived by a different method, namely from the continuity of the tangential electromagnetic field components at the interface.\n", "\n", "## Perfect Imaging by Conic Sections\n", "In this section the conic sections ellipse, hyperbole and parabola are important. In {numref}`Fig_2_05_ConicSection` their definitions are shown as a quick reminder[^3].\n", "\n", "```{figure} Images/Chapter_2/2_05a_ConicSection.png\n", "```\n", "```{figure} Images/Chapter_2/2_05b_ConicSection.png\n", ":name: Fig_2_05_ConicSection\n", "Overview of conic sections. The lower figure shows a definition that unifies the three definitions in the figure above by introducing a parameter called the eccentricity $e$. The point $F$ is the focus and the line $e=\\infty$ is the directrix of the conic sections.\n", "```\n", "\n", "We start with explaining what in geometrical optics is meant by **perfect imaging**.\n", "Let $S$ be a point source. The rays perpendicular to the spherical wave fronts emitted by $S$ radially fan out from $S$. Due to objects such as lenses etc. the spherical wave fronts are deformed and the direction of the ray are made to deviate from the radial propagation direction.\n", "When there is a point $P$ and a cone of rays coming from point $S$ and all rays in that cone intersect in point $P$, then by Fermat's principle, all these rays have traversed paths of minimum travel time. In particular, their travel times are equal and therefore they **all add up in phase** when they arrive in $P$. Hence at $P$ there is a high light intensity. Hence, if there is a cone of rays from point $S$ which all intersect in a point $P$ as shown in {numref}`Fig_2_06_Perfect_Imaging`, point $P$ is called the **perfect image** of $S$.\n", "By reversing the direction of the rays, $S$ is similarly a perfect image of $P$. The optical system in which this happens is called **stigmatic for the two points $S$ and $P$**. \n", "\n", "\n", "```{figure} Images/Chapter_2/2_06_Perfect_Imaging.png\n", ":name: Fig_2_06_Perfect_Imaging\n", "Perfect imaging: a cone of rays which diverge from $S$ and all intersect in point $P$. The rays continue after $P$.\n", "```\n", "\n", "\n", "**Remark**. The concept of a perfect image point exists only in geometrical optics. In reality finite apertures of lenses and other imaging systems cause diffraction due to which image points are never perfect but blurred.\n", "\n", "We summarise the main examples of stigmatic systems.\n", "\n", "**1.**\n", "**Perfect focusing and imaging by refraction.** A parallel bundle of rays propagating in a medium with refractive index $n_2$ can be focused into a point $F$ in a medium $n_1$. If $n_2>n_1$, the interface between the media should be a hyperbole with focus $F$, whereas if $n_20$ in {numref}`Fig_2_10_Spherical_interface`.\n", "\n", "```{figure} Images/Chapter_2/2_10_Spherical_interface.png\n", ":name: Fig_2_10_Spherical_interface\n", "Imaging by a spherical interface between two media with refractive indices $n_2>n_1$.\n", "```\n", "\n", "\n", "\n", "\n", "\n", "*Proof*. \n", "\n", "(Note: the proof is **not** part of the exam).\n", "It suffices to show that $P$ is independent of the ray, i.e. of $A$. We will do this by expressing $s_i$ into $s_o$ and showing that the result is independent of $A$. Let $\\alpha_1$ and $\\alpha_2$ be the angles of the rays $SA$ and $AP$ with the $z$-axis as shown in {numref}`Fig_2_10_Spherical_interface`.\n", "Let $\\theta_i$ be the angle of incidence of ray $SA$ with the local normal $CA$ on the surface and $\\theta_t$ be the angle of refraction. By considering the angles in triangle $\\Delta \\text{SCA}$ we find\n", "\n", "\n", "```{math}\n", ":label: eq.alpha1\n", "\\begin{align*}\n", "\\theta_i = \\alpha_1 + \\varphi.\n", "\\end{align*}\n", "```\n", "Similarly, from $\\Delta \\,\\text{CPA}$ we find\n", "\n", "```{math}\n", ":label: eq.lpha2\n", "\\begin{align*}\n", "\\theta_t=-\\alpha_2 + \\varphi.\n", "\\end{align*}\n", "```\n", "By substitution into the paraxial version of Snell's Law {eq}`eq.Snell3`, we obtain\n", "\n", "```{math}\n", ":label: eq.n1al1n2al2\n", "\\begin{align*}\n", "n_1 \\alpha_1 + n_2 \\alpha_2 = (n_2-n_1)\n", "\\varphi.\n", "\\end{align*}\n", "```\n", "Let $y_A$ and $z_A$ be the coordinates of point $A$. Since $s_o<0$ and $s_i>0$ we have\n", "\n", "```{math}\n", ":label: eq.alpha\n", "\\begin{align*}\n", "\\alpha_1 \\approx \\tan(\\alpha_1) = \\frac{y_A}{z_A-s_o}, \\;\\; \\;\\; \\alpha_2\\approx \\tan(\\alpha_2)= \\frac{y_A}{s_i-z_A}.\n", "\\end{align*}\n", "```\n", "Furthermore,\n", "\n", "```{math}\n", ":label: eq.varphi\n", "\\begin{align*}\n", "\\varphi \\approx \\sin \\varphi \\approx \\frac{y_A}{R}.\n", "\\end{align*}\n", "```\n", "which is small for paraxial rays.\n", "Hence,\n", "\n", "```{math}\n", ":label: eq.xA\n", "\\begin{align*}\n", "z_A=R-R\\cos\\varphi = R - R\\left(1-\\frac{\\varphi^2}{2}\\right)= \\frac{R}{2}\\varphi^2 \\approx 0,\n", "\\end{align*}\n", "```\n", "because it is second order in $y_A$ and therefore is neglected in the paraxial approximation. Then, {eq}`eq.alpha`\n", "becomes\n", "\n", "```{math}\n", ":label: eq.alpha22\n", "\\begin{align*}\n", "\\alpha_1 = -\\frac{y_A}{s_o}, \\quad \\alpha_2 =\\frac{y_A}{s_i}.\n", "\\end{align*}\n", "```\n", "By substituting {eq}`eq.alpha22` and {eq}`eq.varphi` into {eq}`eq.n1al1n2al2` we find\n", "\n", "```{math}\n", "\\begin{align*}\n", "-\\frac{n_1}{s_o} y_A + \\frac{n_2}{z_i} y_A = \\frac{n_2-n_1}{R} y_A,\n", "\\end{align*}\n", "```\n", "or\n", "\n", "```{math}\n", "\\begin{align*}\n", "-\\frac{n_1}{s_o } + \\frac{n_2}{s_i} = \\frac{n_2-n_1}{R},\n", "\\end{align*}\n", "```\n", "which is {eq}`eq.one_surface`.\n", "It implies that $s_i$, and hence $P$, is independent of $y_A$, i.e. of the ray chosen.\n", "Therefore, $P$ is a perfect image within the approximation of Gaussian geometrical optics.\n", "\n", "\n", "\n", "When\n", "$s_o \\rightarrow -\\infty$, the incident rays are parallel to the $z$-axis in medium 1 and the corresponding image point $F_i$ is called the **second focal point** or **image focal point**.\n", "Its $z$-coordinate is given by:\n", "\n", "\n", "\n", "```{math}\n", ":label: eq.def_fi\n", "\\boxed{\\begin{align*}\n", "\tf_i = \\frac{n_2}{{\\cal P}}=\\frac{n_2 R}{n_2-n_1},\n", "\t\\end{align*}}\n", "```\n", "\n", "and its absolute value (it is negative when $n_20 and $n_1>n_2$, or\n", "2) $R$<0 and $n_10$ and $f_<0$, which means that the object and image focal points are to the right and left, respectively, of the surface.\n", "\n", "Note that also when the power is positive, a virtual image can occur, namely when the object $S$ is in between the object focal point $F_o$ and the surface. Then the bundle of rays from S is so strongly diverging that the surface can not convert it into a convergent bundle and hence again the rays in image space seem to come from a point $P$ to the left of the surface. This agrees with the fact that when ${\\cal P}>0$ and $f_o< s_o<0$,\n", "{eq}`eq.one_surface2` implies that $s_i<0$.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_12_Concave_Surface_Real_object_1.png\n", ":name: Fig_2_12_Surf_Concave\n", "Imaging by a concave surface ($R<0$) with $n_2>n_1$. All image points are to the left of the surface, i.e. are virtual ($s_i<0$).\n", "```\n", "\n", "\n", "Finally we look at a case that there is a bundle of convergent rays incident from the left on the surface which when extended into the right medium without refraction at the surface, would intersect in a point $S$. Since this point is not actually present, it is called a **virtual object point**, in contrast to **real object points** which are to the left of the surface. The coordinate of a virtual object point is positive: $s_o>0$.\n", "One may wonder why we look at this case. The reason is that if we have several spherical surfaces behind each other, we can compute the image of an object point by first determining the intermediate image by the most left surface and then use this intermediate image as object for the next surface and so on. In such a case it can easily happen that an intermediate image is to the right of the next surface and hence is a virtual object for that surface. In the case of {numref}`Fig_2_13_Convex_Virtual_Object` at the left, the power is positive, hence the convergent bundle of incident rays is made even more convergent which leads to a real image point. Indeed when $s_o>0$ and ${\\cal P}>0$ then {eq}`eq.one_surface` implies that always $s_i>0$. At the right of {numref}`Fig_2_13_Convex_Virtual_Object` the power is negative but is not sufficiently strong to turn the convergent incident bundle into a divergent bundle. So the image is still real. However, the image will be virtual when the virtual object $S$ is to the right of $F_o$ (which in this case is to the right of the surface) since then the bundle of rays converges so weakly that the surface turns is into a divergent bundle.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_13_Spherical_Interface_Concave.png\n", ":name: Fig_2_13_Convex_Virtual_Object\n", "Imaging of a virtual object $S$ by a spherical interface with $R>0$ between two media with refractive indices $n_1>n_2$ (left) and $n_2>n_1$ (right).\n", "```\n", "\n", "\n", "In conclusion: provided the sign convention listed in {numref}`table_signconv` is used, formula {eq}`eq.one_surface`\n", "can always be used to determine the image of a given object by a spherical surface.\n", "\n", "```{table} Sign convention for spherical surfaces and thin lenses. The convention for $s_o$, $f_o$, $s_i$, $f_i$ follows from the fact that these are $z$-coordinates with the origin at vertex $V$ of the spherical surface (or the centre of the thin lens) and the positive $z$-axis is pointing to the right. The convention for the $y$-coordinate follows from the fact that the $y$-axis is positive upwards.\n", ":name: table_signconv\n", "| **quantity** | **positive** | **negative** |\n", "| :--: | :--: | :--: |\n", "| $s_o$, $s_i$. $f_0$, $f_i$ | corresponding point is to the right of vertex | corresponding point is to left of vertex |\n", "| $y_o$, $y_i$ | object, image point above optical axis | object, image point below optical axis |\n", "| $R$ | centre of curvature right of vertex | centre of curvature left of vertex |\n", "| Refractive index $n$ ambient medium of a mirror | before reflection | after reflection |\n", "```\n", "\n", "\n", "\n", "### Ray Vectors and Ray Matrices\n", "Now that we know that within Gaussian geometrical optics a single spherical surface images every object point to a perfect, real or virtual, image point it is easy to see that any row of spherical surfaces separated by homogeneous materials will also image any point perfectly. We first determine the intermediate image of the object point under the most left spherical surface as if the other surfaces are not present and use this intermediate image point as object point for imaging by the next spherical surface and so on. Of course, the intermediate image and object points can be virtual.\n", "\n", "Although this procedure is in principle simple, it is nevertheless convenient in Gaussian geometrical optics to introduce the concept of ray vectors and ray matrices to deal with optical system consisting of several spherical surfaces.\n", "With ray matrices it is easy to derive how the distance of a given ray to the optical axis and its direction change during propagation through an optical system. This in turn can be used to determine the image plane in an optical system for a given object plane.\n", "\n", "In any plane perpendicular to the $z$-axis, a ray is determined by the $y$-coordinate of the point of intersection of the ray with the plane and the angle $\\alpha$ with the optical ($z$)-axis. This angle has a sign and is defined as follows. Let $(y_1,z_1)$ and $(y_2,z_2)$ be the coordinates of two points on the ray and let the light propagate from point 1 to point 2. Then we define\n", "\n", "$$\n", "\\alpha = \\frac{ y_2-y_1}{z_2-z_1}.\n", "$$ (eq.defalpha)\n", "\n", "Examples of positive and negative $\\alpha$ are given in {numref}`Fig.alpha`. The case $z_2-z_1<0$ occurs when a ray propagates in the negative $z$-direction after it has been reflected by a mirror.\n", "According to {numref}`table_signconv` the refractive index of the ambient medium should after the reflection be taken negative. After a second reflection due to which the ray propagates again in the positive $z$-direction the refractive index should be chosen positive again.\n", "\n", "```{figure} Images/Chapter_2/2_14_Angle_definition.png\n", ":name: Fig.alpha\n", "Sign convention for the ray angle. In the upper two figures $\\alpha>0$ while in the lower two figures $\\alpha<0$.\n", "```\n", "\n", "\n", "We define the ray vector\n", "\n", "```{math}\n", ":label: eq.rayvector\n", "\\begin{align*}\n", "\\left( \\begin{array}{c}n\\alpha \\\\y\n", "\\end{array} \\right),\n", "\\end{align*}\n", "```\n", "where $n$ is the local refractive index. The definition with the refractive index as factor in the first element of the ray vector turns out to be convenient.\n", "The ray vectors of a ray in any two planes $z=z_1$, $z=z_2$, with $z_2>z_1$, are related by a so-called ray matrix:\n", "\n", "```{math}\n", ":label: eq.matgen\n", "\\begin{align*}\n", "\\left( \\begin{array}{c}n_2\\alpha_2 \\\\y_2\n", "\\end{array}\\right) = {\\cal M}\n", "\\left( \\begin{array}{c}n_1 \\alpha_1 \\\\y_1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "where\n", "\n", "```{math}\n", ":label: eq.defM2\n", "\\begin{align*}\n", "{\\cal M} =\\left( \\begin{array}{cc}A & B \\\\C & D\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "The elements of matrix ${\\cal M}$ depend on the optical components and materials between the planes $z=z_1$ and $z=z_2$.\n", "\n", "As an example consider the ray matrix that relates a ray vector in the plane immediately before the spherical surface in {numref}`Fig_2_10_Spherical_interface` to the corresponding ray vector in the plane immediately behind that surface.\n", "Using {eq}`eq.n1al1n2al2` and {eq}`eq.varphi` it follows\n", "\n", "```{math}\n", ":label: eq.alphas\n", "\\begin{align*}\n", "n_1 \\alpha_1 - n_2 \\alpha_2 = \\frac{(n_2-n_1)y_1}{R},\n", "\\end{align*}\n", "```\n", "where we have replaced $\\alpha_2$ by $-\\alpha_2$ in {eq}`eq.n1al1n2al2`, because according to the sign convention, the angle $\\alpha_2$ in {numref}`Fig_2_10_Spherical_interface` should be taken negative.\n", "Because furthermore $y_2=y_1$, we conclude\n", " \n", "```{math}\n", ":label: eq.matsph0\n", "\\begin{align*}\n", "\\left( \\begin{array}{c}n_2\\alpha_2 \\\\ y_2\n", "\\end{array}\\right) &= \\left( \\begin{array}{c}n_1 \\alpha_1 - \\frac{(n_2-n_1)y_1}{R} \\\\ y_1\n", "\\end{array}\\right) \\\\\n", " &= \\left( \\begin{array}{cc}1 & -P \\\\ 0 & 1\n", "\\end{array}\\right)\\left(\n", " \\begin{array}{c}n_1 \\alpha_1 \\\\ y_1\n", "\\end{array}\\right), \\quad \\textbf{spherical surface,}\n", "\\end{align*}\n", "```\n", "where\n", "\n", "```{math}\n", ":label: eq.defP\n", "\\begin{align*}\n", "{\\cal P}= \\frac{n_2-n_1}{R},\n", "\\end{align*}\n", "```\n", "is as before the **power** of the surface.\n", "\n", "Next we consider a spherical mirror with radius of curvature $R$.\n", "We will show that the ray matrix between the planes just before and after the mirror is given by:\n", "\n", "```{math}\n", ":label: eq.sphmirror\n", "\\begin{align*}\n", "\\left( \\begin{array}{c}n_2\\alpha_2 \\\\y_2\n", "\\end{array}\\right)\n", "&= \\left( \\begin{array}{cc}1 & -{\\cal P} \\\\0 & 1\n", "\\end{array}\\right)\\left(\n", "\\begin{array}{c}n_1 \\alpha_1 \\\\y_1\n", "\\end{array}\\right), \\quad \\textbf{spherical reflector,}\n", "\\end{align*}\n", "```\n", "where\n", "\n", "```{math}\n", ":label: eq.defPrefl\n", "\\begin{align*}\n", "{\\cal P}= \\frac{2n}{R},\n", "\\end{align*}\n", "```\n", "is the power of the mirror, $n_1=n$ but $n_2=-n$, because the convention is used that if a ray propagates from **right to left** (i.e. in the negative $z$-direction), the refractive index in the ray vectors and ray matrices is chosen **negative**. Note that when the mirror is flat: $R=\\infty$, the ray matrix of the reflector implies\n", "\n", "```{math}\n", "\\begin{align*}\n", "n_2\\alpha_2 = n_1 \\alpha_1,\n", "\\end{align*}\n", "```\n", "which agrees with the fact that $n_2=-n_1$ and according to {eq}`eq.defalpha` $\\alpha_2$ and $\\alpha_1$ have opposite sign for a mirror.\n", "\n", "```{figure} Images/Chapter_2/2_15_Concave_mirror.png\n", ":name: Fig_2_15_Mirror\n", "Reflection by a mirror.\n", "```\n", "\n", "\n", "With all angles positive for the moment, it follows from {numref}`Fig_2_15_Mirror`\n", "\n", "```{math}\n", ":label: eq.refl1\n", "\\begin{align*}\n", "\\alpha_1&= \\theta_i +\\varphi, \\end{align*}\n", "```\n", "```{math}\n", ":label: eq.refl2\n", "\\begin{align*}\n", "\\\\\n", "\\alpha_2 &= \\varphi-\\theta_r= \\varphi-\\theta_i.\\end{align*}\n", "```\n", "Hence,\n", "\n", "```{math}\n", ":label: eq.refl3\n", "\\begin{align*}\n", "\\alpha_2= -\\alpha_1 + 2\\varphi.\n", "\\end{align*}\n", "```\n", "Now\n", "\n", "$$\n", "\\varphi\\approx \\frac{y_1}{R}\n", "$$ (eq.varphi3)\n", "\n", "In the situation drawn in {numref}`Fig_2_15_Mirror`,\n", "{eq}`eq.defalpha` implies that both $\\alpha_2$ and $\\alpha _1$ are positive. By choosing the refractive index negative after reflection,\n", "we conclude from {eq}`eq.refl3` and {eq}`eq.varphi3`:\n", "\n", "```{math}\n", ":label: eq.n2alpha2\n", "\\begin{align*}\n", "n_2\\alpha_2 = -n \\alpha_2 = n \\alpha_1 - \\frac{2n}{R} y_1 = n_1\\alpha_1 - \\frac{2n}{R}.\n", "\\end{align*}\n", "```\n", "This proves Eq. {eq}`eq.sphmirror`.\n", "\n", "\n", "We now consider the ray matrix when a ray propagates from a plane $z_1$ to a plane $z_2$ through a medium with with refractive index $n$.\n", "In that case we have\n", "$\\alpha_2=\\alpha_1$ and $y_2=y_1 + \\alpha_1(z_2-z_1)$, hence\n", "\n", "```{math}\n", ":label: eq.mathom\n", "\\begin{align*}\n", "{\\cal M}=\\left( \\begin{array}{cc}1 & 0 \\\\\\frac{z_2-z_1}{n} & 1\n", "\\end{array}\\right), \\quad \\textbf{homogeneous space}.\n", "\\end{align*}\n", "```\n", "Note that if the light propagates from the left to the right: $z_2>z_1$ and hence $z_2-z_1$ in the first column and second row of the matrix is positive, i.e. it is the distance between the planes.\n", "\n", "For two planes between which there are a number of optical components, possibly separated by regions with homogeneous material (e.g. air), the ray matrix can be obtained by multiplying the matrices of the individual components and of the homogeneous regions. The order of the multiplication of the matrices is such that the **right-most matrix corresponds to the first component that is encountered while propagating**, and so on.\n", "\n", "In the ray matrix approach all rays stay in the same plane, namely the plane through the ray and the $z$-axis. These rays are called **meridional rays**. By considering only meridional rays, the imaging by optical systems is restricted to two dimensions. Non-meridional rays are called **skew rays**. Skew rays do not pass through the optical axis and are not considered in the paraxial theory.\n", "\n", "**Remarks**. \n", "\n", "1. In matrix {eq}`eq.mathom` $z_1$ and $z_2$ are **coordinates**, i.e. they have a sign. \n", "\n", "2. Instead of choosing the refractive index negative in ray vectors of rays that propagate from right to left, one can reverse the direction of the positive $z$-axis after every reflection. The convention to make the refractive index negative is however more convenient in ray tracing software. \n", "\n", "3. The determinant of the ray matrices {eq}`eq.matsph0`, {eq}`eq.sphmirror` and\n", "{eq}`eq.mathom` are all 1. Since all ray matrices considered below are products of these elementary matrices, the determinant of every ray matrix considered is unity.\n", "\n", "### The Lens Matrix\n", "We apply ray matrices to a lens.\n", "{numref}`Fig_2_16_Spherical_Lens_simplified` shows a lens with two spherical surfaces. The refractive index of the lens is $n_l$ and that of the media to the left and to the right of the lens is $n_1$ and $n_2$, respectively. Let the distance between the vertices be $d$.\n", "\n", "```{figure} Images/Chapter_2/2_16_Spherical_Lens_simplified.png\n", ":name: Fig_2_16_Spherical_Lens_simplified\n", "A lens with thickness $d$. The ray matrix is defined between the planes immediately before and after the lens.\n", "```\n" ] }, { "cell_type": "code", "execution_count": 8, "metadata": { "tags": [ "remove-input" ] }, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "%%HTML\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
\n", "\n", "\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We will first derive the matrix which maps the ray vector in the plane **immediately in front** of the lens to that in the plane **immediately behind** the lens. Let\n", "\n", "```{math}\n", ":label: eq.rayvectors\n", "\\begin{align*}\n", "\\left( \\begin{array}{c}n_1 \\alpha_1 \\\\y_1\n", "\\end{array}\\right)\n", "\\;\\;\\; \\text{ and }\n", "\\left( \\begin{array}{c}n_2 \\alpha_2 \\\\y_2\n", "\\end{array}\\right)\n", "\\end{align*}\n", "```\n", "be two vectors in the two planes which correspond to the same ray. The ray is first refracted by the spherical surface with radius $R_1$ and centre $C_1$. Using {eq}`eq.matsph0` and {eq}`eq.defP` it follows that the matrix between the ray vectors just before and just behind the spherical surface with radius $R_1$ and centre $C_1$ is given by\n", "\n", "```{math}\n", ":label: eq.matsph1\n", "\\begin{align*}\n", "{\\cal M}_1= \\left( \\begin{array}{cc}1 & - {\\cal P}_1 \\\\0 & 1\n", "\\end{array}\\right)\n", "\\end{align*}\n", "```\n", ",\n", "where\n", "\n", "$$\n", "{\\cal P}_1 = \\frac{n_l-n_1}{R_1}.\n", "$$ (eq.defP1)\n", "\n", "The ray propagates then over the distance $d$ through the material of which the lens is made. The matrix that maps ray vectors from the plane inside the lens immediately behind the left spherical surface to a ray vector in the plane immediately before the right spherical surface follows from {eq}`eq.mathom`:\n", "\n", "```{math}\n", ":label: eq.matglass\n", "\\begin{align*}\n", "{\\cal M}_2=\\left( \\begin{array}{cc}1 & 0 \\\\\\frac{d}{n_l} & 1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "Finally, the matrix that maps ray vectors from the plane in the lens immediately before the second spherical surface to vectors in the plane immediately behind it is\n", "\n", "```{math}\n", ":label: eq.matsph2\n", "\\begin{align*}\n", "{\\cal M}_3= \\left( \\begin{array}{cc}1 & -{\\cal P}_2 \\\\0 & 1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "with\n", "\n", "$$\n", "{\\cal P}_2 = \\frac{n_2-n_l}{R_2}.\n", "$$ (eq.defP2)\n", "\n", "Hence the matrix that maps ray vectors in the plane immediately before the lens to ray vectors in the plane immediately behind the lens is given by the matrix product:\n", "\n", "```{math}\n", ":label: eq.matlens\n", "\\begin{align*}\n", "{\\cal M}&= {\\cal M}_3 {\\cal M}_2 {\\cal M}_1 \\\\\n", "&= \\left( \\begin{array}{cc}1 - \\frac{d}{n_l}P_2 & -P_1 - P_2 + \\frac{d}{n_l} P_1P_2 \\\\\\frac{d}{n_l} & 1 -\\frac{d}{n_l}P_1\n", "\\end{array}\\right), \\quad \\textbf{lens}.\n", "\\end{align*}\n", "```\n", "The quantity\n", "\n", "```{math}\n", ":label: eq.powerlens\n", "\\begin{align*}\n", "{\\cal P}={\\cal P}_1+{\\cal P}_2 - \\frac{d}{n_l}{\\cal P}_1{\\cal P}_2\n", "\\end{align*}\n", "```\n", "is called the **power** of the lens. It has dimension 1/length and is given in diopter (${\\cal D}$), where $1 \\,\\, {\\cal D}=\\text{m}^{-1}$. The power can be positive and negative.\n", "The space to the left of the lens is called the **object space** and that to the right of the lens is called the **image space**.\n", "\n", "```{index} Focusing with a Thin Lens\n", ":name: subsection.focthin\n", "```\n", "### Focusing with a Thin Lens\n", "\n", "For a thin lens the vertices $V_1$ and $V_2$ coincide and $d=0$, hence {eq}`eq.matlens` becomes\n", "\n", "```{math}\n", ":label: eq.matthinlens\n", "\\begin{align*}\n", "{\\cal M} = \\left( \\begin{array}{cc}1 & -P\\\\0 & 1\n", "\\end{array}\\right), \\quad \\textbf{thin lens},\n", "\\end{align*}\n", "```\n", "where\n", "\n", "$$\n", "P=P_1+P_2 = \\left( \\frac{n_l-n_1}{R_1}-\\frac{n_2-n_l}{R_2}\\right),\n", "$$ (eq.Pthinlens)\n", "\n", "The origin of the coordinate system is chosen in the common vertex $V_1=V_2$.\n", "\n", "By considering a ray in medium 1 which is parallel to the optical axis ($\\alpha_1=0$) and at height $y_1$, we get $n_2 \\alpha_2= - Py_1$ and $y_2=y_1$. Hence, when $P>0$, the angle $\\alpha_2$ of the ray has sign opposite to $y_2$ and therefore the ray in image space is bent back to the optical axis, yielding a **second focal point** or **image focal point** $F_i$. Its\n", "$z$-coordinate $f_i$ s:\n", "\n", "```{math}\n", ":label: eq.thinlensfo\n", "\\begin{align*}\n", "f_i = \\frac{\\alpha_2}{y_2} = \\frac{n_2}{{\\cal P}}.\n", "\\end{align*}\n", "```\n", "For a ray emerging in image space at height $y_2$ and parallel to the optical axis: $\\alpha_2=0$, we have $y_1=y_2$ and \n", "```{math}\n", "\\begin{align*}\n", "n_1\\alpha_1 = P y_1.\n", "\\end{align*}\n", "```\n", "If the power is positive: ${\\cal P}>0$, the angle $\\alpha_1$ has the same sign as $y_1$, which implies that the ray in object space has intersected the optical axis in a point $F_o$ with $z$-coordinate: $z=f_o$\n", "\n", "```{math}\n", ":label: eq.thinlensfi\n", "\\begin{align*}\n", "f_o = -\\frac{y_1}{\\alpha_1} = -\\frac{n_1}{{\\cal P}}.\n", "\\end{align*}\n", "```\n", "The point $F_o$ is called the **first focal point** or **object focal point**.\n", "\n", "We conclude that when the power ${\\cal P}$ of the lens is positive, $f_i>0$ and $-f_o>0$, which means that the image and object focal points are in the image and object space, respectively, hence they are both real. A lens with positive power is called **convergent** or **positive**. It makes incident bundles of rays convergent or less divergent.\n", "\n", "A lens with negative power is called divergent and has $f_i<0$, $-f_o<0$. It makes incident rays more divergent or less convergent.\n", "Incident rays which are parallel to the optical axis are refracted away from the optical axis and seem to come from a point in front of the lens with $z$-coordinate $f_i<0$. Hence the image focal point does not correspond to a location where there is an actual concentration of light intensity, i.e. it is virtual. The object focal point is a virtual object point, because only a bundle of incident rays that are converging to a certain point behind the negative lens can be turned into a bundle of rays parallel to the optical axis.\n", "\n", "With the results obtained for the focal coordinates we can rewrite the lens matrix of a thin lens as\n", "\n", "```{math}\n", ":label: eq.matthinlens2\n", "\\begin{align*}\n", "{\\cal M} = \\left( \\begin{array}{cc}1 & -\\frac{n_2}{f_i} \\\\0 & 1\n", "\\end{array}\\right), \\quad \\textbf{thin lens}.\n", "\\end{align*}\n", "```\n", "\n", "\n", "```{index} Imaging with a Thin Lens\n", ":name: subsection.imagingthinlens\n", "```\n", "### Imaging with a Thin Lens\n", "\n", "We first consider a general ray matrix {eq}`eq.matgen`, {eq}`eq.defM2` between two planes $z=z_1$ and $z=z_2$ and ask the following question:\n", "what are the properties of the ray matrix such that the two planes are images of each other, or (as this is also called) are each other's conjugate?\n", "Clearly for these planes to be each other's image, we should have that for every point coordinate $y_1$ in the plane $z=z_1$ there is a point with some coordinate $y_2$ in the plane $z=z_2$ such that any ray through $(y_1,z_1)$ (within some cone of rays) will pass through point $(y_2,z_2)$.\n", "Hence for any angle $\\alpha_1$ (in some interval of angles) there is an angle $\\alpha_2$ such that {eq}`eq.matgen` is valid.\n", "This means that for any $y_1$ there is a $y_2$ such that for all angles $\\alpha_1$:\n", "\n", "```{math}\n", ":label: eq.eq\n", "\\begin{align*}\n", "y_2=C n_1\\alpha_1 + D y_1,\n", "\\end{align*}\n", "```\n", "This requires that\n", "\n", "\n", "```{math}\n", ":label: eq.condimage\n", "\\boxed{\\begin{align*}\n", "C=0, \\quad \\textbf{condition for imaging}.\n", "\\end{align*}}\n", "```\n", "\n", "The ratio of $y_2$ and $y_1$ IS the magnification $M$. Hence,\n", "\n", "```{math}\n", ":label: eq.magn\n", "\\begin{align*}\n", "M=\\frac{y_2}{y_1} = D,\n", "\\end{align*}\n", "```\n", "is the **magnification** of the image (this quantity has sign).\n", "\n", "To determine the image by a thin lens we first derive the ray matrix between two planes $z=z_1<0$ and $z=z_2>0$ on either side of the thin lens. The origin of the coordinate system is again at the vertex of the thin lens.\n", "This ray matrix is the product of the matrix for propagation from $z=z_1$ to the plane immediately in front of the lens, the matrix of the thin lens and the matrix for propagation from the plane immediately behind the lens to the plane $z=z_2$:\n", "\n", "```{math}\n", ":label: eq.matz1z2\n", "\\begin{align*}\n", "{\\cal M} &= \\left( \\begin{array}{cc}1 & 0 \\\\\\frac{z_2}{n_2} & 1\n", "\\end{array}\\right) \\left( \\begin{array}{cc}1 & - {\\cal P} \\\\0 & 1\n", "\\end{array}\\right) \\left( \\begin{array}{cc}1 & 0 \\\\\\frac{-z_1}{n_1} & 1\n", "\\end{array}\\right) \\\\\n", "&=\n", "\\left( \\begin{array}{cc}1+\\frac{z_1}{n_1}{\\cal P} & -{\\cal P} \\\\-\\frac{z_1}{n_1} + \\frac{z_2}{n_2} + \\frac{z_1z_2}{n_1 n_2}{\\cal P} & 1-\\frac{z_2}{n_2} {\\cal P}\n", "\\end{array}\\right)\n", "\\end{align*}\n", "```\n", "The imaging condition {eq}`eq.condimage` implies:\n", "\n", "\n", "```{math}\n", ":label: eq.lensmaker\n", "\\boxed{\\begin{align*}\n", "-\\frac{n_1}{s_o} + \\frac{n_2}{s_i}={\\cal P},\n", "\\quad \\bf{Lensmaker's \\;\\; Formula},\n", "\\end{align*}}\n", "```\n", " where we have written $s_o=z_1$ and $s_i=z_2$ for the $z$-coordinates of the object and the image.\n", "Because for the thin lens matrix {eq}`eq.matz1z2`: $D=1-z_2/f_i$, it follows by using {eq}`eq.lensmaker` that the magnification {eq}`eq.magn` is given by\n", "\n", "```{math}\n", ":label: eq.Mlens\n", "\\begin{align*}\n", "M = \\frac{y_i}{y_o}= 1-\\frac{s_i}{f_i}= \\frac{s_i}{s_o},\n", "\\end{align*}\n", "```\n", "where we have written now $y_o$ and $y_i$ instead of $y_1$ and $y_2$, respectively.\n", "\n", "**Remark.**\n", "The Lensmaker's formula for imaging by a thin lens can alternatively be derived by using the imaging formula {eq}`eq.one_surface` of the two spherical surfaces of the lens. We first image a given point $S$ by the left spherical surface using {eq}`eq.one_surface` as if the second surface were absent. The obtained intermediate image $P'$ is then imaged by the second spherical surface as if the first surface were absent. $P'$ can be a real or virtual object for the second surface. The derivation is carried out in Problem 2.5.\n", "\n", "\n", "Analogous to the case of a single spherical surface, an image is called a **real image** if it is to the right of the lens ($s_i>0$) and is called a **virtual image** when it seems to be to the left of the lens ($s_i<0$). An object is called a **real object** if it is to the left of the lens ($s_o<0$) and is a **virtual object** if it seems to be right of the lens ($s_o>0$).\n", "For a positive lens: ${\\cal P}>0$ and hence {eq}`eq.lensmaker` implies that $s_i>0$ provided $|s_o|>|f_o|$, which means that the image by a convergent lens is real if the object is further from the lens than the object focal point $F_o$.\n", "The case $s_o>0$ corresponds to a virtual object, i.e. to the case of a converging bundle of incident rays, which for an observer in object space seems to converge to a point at distance $s_o$ behind the lens.\n", "A convergent lens ($f_i>0$) will then make an image between the lens and the second focal point. In contrast, a diverging lens ($f_i<0$) can turn the incident converging bundle into a real image only if the virtual object point is between the lens and the focal point. If the virtual object point has larger distance to the lens, the convergence of the incident bundle is too weak and the diverging lens then refracts this bundle into a diverging bundle of rays which seem to come from a virtual image point in front of the lens ($s_i<0$).\n", "\n", "Instead of using ray matrices, one can construct the image with a ruler.\n", "Consider the imaging of a finite object $S_1S_2$ as shown in {numref}`Fig_2_17_Real_Image` for the case that the media to the left and right lens are the same. Let $y_o$ be the y-coordinate of $S_2$. We have $y_o>0$ when the object is above the optical axis.\n", "\n", "```{figure} Images/Chapter_2/2_17_Real_Image.png\n", ":name: Fig_2_17_Real_Image\n", "Object and image for a thin lens.\n", "```\n", "\n", "Draw the ray through the focal point $F_o$ in object space and the ray through the centre $V$ of the lens. The first ray becomes parallel in image space. The latter intersects both surfaces of the lens almost in their (almost coinciding) vertices and therefore the refraction is opposite at both surfaces and the ray exits the lens parallel to its direction of incidence. Furthermore, its lateral displacement can be neglected because the lens is thin. (Of course, this is not correct when the refractive indices to the left and right of the lens are different). Hence, **the ray through the centre of a thin lens is not refracted**. The intersection in image space of the two rays gives the location of the image point $P_2$ of $S_2$. The image is real if the intersection occurs in image space and is virtual otherwise.\n", "For the case of a convergent lens with a real object with $y_o>0$ as shown in {numref}`Fig_2_17_Real_Image`, it follows from the similar triangles\n", "$\\Delta\\,\\text{BV}\\text{F}_i$ and $\\Delta\\, \\text{P}_2\\text{P}_1\\text{F}_i$ that\n", "\n", "```{math}\n", ":label: eq.ratio1\n", "\\begin{align*}\n", "\\frac{y_o}{|y_i|} = \\frac{f_i}{s_i -f_i},\n", "\\end{align*}\n", "```\n", ".\n", "From the similar triangles $\\Delta\\, \\text{S}_2\\text{S}_1\\text{F}_o$ and $\\Delta\\, \\text{AVF}_o$:\n", "\n", "```{math}\n", ":label: eq.ratio2\n", "\\begin{align*}\n", "\\frac{|y_i|}{y_o}=\\frac{f_i}{f_o-s_o}.\n", "\\end{align*}\n", "```\n", "here we used $|f_o|=f_i$.\n", "(the absolute value of $y_i$ is taken because according to our sign convention $y_i$ in {numref}`Fig_2_17_Real_Image` is negative whereas {eq}`eq.ratio2` is a ratio of lengths).\n", "By multiplying these two equations we get the **Newtonian form** of the lens equation (valid when $n_2=n_1$):\n", "\n", "\n", "```{math}\n", ":label: eq.newton\n", "\\boxed{\\begin{align*}\n", "x_o x_i =- f_i^2=- f_o^2,\n", "\\end{align*}}\n", "```\n", "\n", "where $x_o$ and $x_i$ are the $z$-coordinates of the object and image relative to those of the first and second focal point, respectively:\n", "\n", "```{math}\n", ":label: eq.defxoxi\n", "\\begin{align*}\n", "x_o = s_o-f_o, \\;\\;\\; x_i = s_i-f_i.\n", "\\end{align*}\n", "```\n", "Hence $x_o$ is negative if the object is to the left of $F_o$ and $x_i$ is positive if the image is to the right of $F_i$.\n", "\n", "The **transverse magnification** is\n", "\n", "```{math}\n", ":label: eq.defM1\n", "\\begin{align*}\n", "M=\\frac{y_i}{y_o} = \\frac{s_i}{s_o} = -\\frac{x_i}{f_i},\n", "\\end{align*}\n", "```\n", "where the second identity follows from considering the similar triangles $\\Delta \\text{P}_2\\text{P}_1\\text{F}_i$ and $\\Delta \\text{BVF}_i$ in {numref}`Fig_2_17_Real_Image`.\n", "A positive $M$ means that the image is erect, a negative $M$ means that the image is inverted.\n", "\n", "All equations are also valid for a thin negative lens and for virtual objects and images.\n", "Examples of real and virtual object and image points for a positive and a negative lens are shown in {numref}`Fig_2_18_Positive_Lens` and {numref}`Fig_2_19_Negative_Lens`.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_18_Positive_Lens.png\n", ":name: Fig_2_18_Positive_Lens\n", "Real and virtual objects and images for a convergent thin lens with the same refractive index left and right of the lens, i.e. $-f_o=f_i>0$. In (a) the object is real with $s_o0$). In (b) the object is between the front focal point and the lens: $f_o< s_o<0$. Then the rays from the object are too divergent for the lens to make them convergent in image space and hence the image is virtual: $s_i<0$. In \\(c\\) there is a cone of converging rays incident on the lens from the left which, in the absence of the lens, would converge to point $S$ behind the lens. Therefore $S$ is a virtual object ($s_0>0$). The image is real and can be constructed with the two rays shown.\n", "\t\tIn (d) $s_i$ is shown as function of $s_o$ for a convergent lens (see Eq. {eq}`eq.lensmaker`).\n", "```\n", "\n", "\n", "\n", "```{figure} Images/Chapter_2/2_19_Negative_Lens.png\n", ":name: Fig_2_19_Negative_Lens\n", "Real and virtual objects and images for a divergent thin lens with the same refractive index to the left and right of the lens, i.e. $-f_o=f_i<0$. In (a) the object is real, i.e. $s_o<0$. The diverging lens makes the cone of rays from the object more divergent so that the image is virtual: $s_i<0$. When the object is virtual, there is a cone of converging rays incident from the left which after extension to the right of the lens (as if the lens is not present) intersect in the virtual object S ($s_o>0$). It depends on how strong the convergence is whether the diverging lens turns this cone into converging rays or whether the rays keep diverging. In (b) $0-f_i$ and the image is virtual ($s_i<0$). In (d) $s_i$ is shown as function of $s_o$ for a divergent lens ($f_i<0$ (see Eq. {eq}`eq.lensmaker`).\n", "```\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "tags": [ "remove-input" ] }, "outputs": [], "source": [ "from IPython.display import HTML\n", "\n", "HTML('''\n", "
\n", "\n", "\n", "''')" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "tags": [ "remove-input" ] }, "outputs": [], "source": [ "HTML('''\n", "
\n", "\n", "\n", "''')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "```{index} Two Thin Lenses\n", ":name: subsection.twolenses\n", "```\n", "### Two Thin Lenses\n", "\n", "The ray matrix is a suitable method to study the imaging of a system consisting of several thin lenses. For two lenses however, the imaging can still easily be obtained by construction.\n", "We simply construct the image obtained by the first lens as if the second lens were not present and use this image as (possibly virtual) object for the second lens.\n", "In {numref}`Fig_2_20_Double_Lens` an example is shown where the distance between the lenses is larger than the sum of their focal lengths.\n", "First the image $P'$ of $S$ is constructed as obtained by $L_1$ as if $L_2$ were not present.\n", "We construct the intermediate image $P'$ due to lens $L_1$ using ray 2 and 3. $P'$ is a real image for lens $L_1$ and also a real object for lens $L_2$. Ray 3 is parallel to the optical axis between the two lenses and is thus refracted by lens $L_2$ through its back focal point $F_{2i}$. Ray 4 is the ray from $P'$ through the centre of lens $L_2$. The image point $P$ is the intersection of ray 3 and 4.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_20_Two_Thin_Lenses_Separated.png\n", ":name: Fig_2_20_Double_Lens\n", "Two thin lenses separated by a distance that is larger than the sum of their focal lengths. \n", "```\n", "\n", "\n", "In the case of {numref}`Fig_2_21_Two_Thin_Lenses_close` the distance $d$ between the two positive lenses is smaller than their focal lengths.\n", "The intermediate image $P'$ is a real image for $L_1$ obtained as the intersection of rays 2 and 4 passing through the object and image focal points $F_{o1}$ and $F_{i1}$ of lens $L_1$. $P'$ is now a virtual object for lens $L_2$. To find its image by $L_2$, draw ray 3 from $P'$ through the centre of lens $L_2$ back to $S$ (this ray is refracted by lens $L_1$ but not by $L_2$) and draw ray 4 as refracted by lens $L_2$. Since ray 4 is parallel to the optical axis between the lenses, it passes through the back focal point $F_{2i}$ of lens $L_2$. The intersection point of ray 3 and 4 is the final image point $P$.\n", "\n", "```{figure} Images/Chapter_2/2_21_Two_Thin_Lenses_close.png\n", ":name: Fig_2_21_Two_Thin_Lenses_close\n", "Two thin lenses at a distance smaller than their focal lengths.\n", "```\n", "\n", "\n", "It is easy to express the $z$-coordinate $s_i$ with respect to the coordinate system with origin at the vertex of $L_2$ of the final image point, in the $z$-component $s_o$ with respect to the origin at the vertex of lens $L_1$ of the object point. We use the Lensmaker's Formula for each lens while taking care that the proper local coordinate systems are used.\n", "The intermediate image $P'$ due to lens $L_1$ has $z$-coordinate $s_{1i}$ with respect to the coordinate system with origin at the vertex $V_1$, which satisfies:\n", "\n", "```{math}\n", ":label: eq.L1\n", "\\begin{align*}\n", "-\\frac{1}{s_o} + \\frac{1}{s_{1i}}=\\frac{1}{f_{1i}}.\n", "\\end{align*}\n", "```\n", "As object for lens $L_2$, $P'$ has $z$-coordinate with respect to the coordinate system with origin at $V_2$ given by:\n", "$s_{2o}=s_{1i}-d$, where $d$ is the distance between the lenses. Hence, with $s_i=s_{2i}$ the Lensmaker's Formula for lens $L_2$ implies:\n", "\n", "```{math}\n", ":label: eq.L2\n", "\\begin{align*}\n", "-\\frac{1}{s_{1i}-d} + \\frac{1}{s_i} = \\frac{1}{f_{2i}}.\n", "\\end{align*}\n", "```\n", "By solving {eq}`eq.L1` for $s_{1i}$ and substituting the result into {eq}`eq.L2`, we find\n", "\n", "```{math}\n", ":label: eq.L1L2\n", "\\begin{align*}\n", "s_i = \\frac{ -d f_{1i}f_{2i} + f_{2i}(f_{i1}-d)s_o }{f_{1i}(f_{2i}-d) + (f_{1i}+f_{2i}-d) s_o}, \\;\\;\\; \\quad \\textbf{two thin lenses}.\n", "\\end{align*}\n", "```\n", "By taking the limit $s_o \\rightarrow -\\infty$, we obtain the $z$-coordinate $f_i$ of the image focal point of the two lenses, while $s_i\\rightarrow \\infty$ gives the $z$-coordinate $f_o$ of the object focal point:\n", "\n", "```{math}\n", ":label: eq.2fi\n", "\\begin{align*}\n", "f_i&= \\frac{ (f_{1i}-d) f_{2i}}{f_{1i}+f_{2i}-d},\n", "\\end{align*}\n", "```\n", "\n", "```{math}\n", ":label: eq.2fo\n", "\\begin{align*}\n", "f_o &= -\\frac{(f_{2i}-d)f_{1i}}{f_{1i}+f_{2i} - d},\n", "\\end{align*}\n", "```\n", "We found in [](subsection.focthin) that when the refractive indices of the media before and after the lens are the same, the object and image focal lengths of a thin lens are the identical. However, as follows from {eq}`eq.2fi` and {eq}`eq.2fo` the object and image focal lengths are in general different when there are several lenses.\n", "\n", "By construction using the intermediate image, it is clear that the magnification of the two-lens system is the product of the magnifications of the two lenses:\n", "\n", "```{math}\n", ":label: eq.M\n", "\\begin{align*}\n", "M = M_1 M_2.\n", "\\end{align*}\n", "```\n", "**Remarks**. \n", "\n", "1. When $f_{1i}+f_{2i}=d$ the focal points are at infinity. Such a system is called **telecentric**. \n", "\n", "2. In the limit where the lenses are very close together: $d\\rightarrow 0$, {eq}`eq.L1L2` becomes\n", "\n", "```{math}\n", ":label: eq.L1L2d0\n", "\\begin{align*}\n", "-\\frac{1}{s_o } + \\frac{1}{s_i} = \\frac{1}{f_{1i}} + \\frac{1}{f_{2i}}.\n", "\\end{align*}\n", "```\n", "The focal length $ f_i$ of the system of two lenses in contact thus satisfies:\n", "\n", "```{math}\n", ":label: eq.ftwo\n", "\\begin{align*}\n", "\\frac{1}{f_i} = \\frac{1}{f_{1i}} + \\frac{1}{f_{2i}}.\n", "\\end{align*}\n", "```\n", "In particular, by the using two identical lenses in contact, the focal length is halved. \n", "\n", "3. Although for two lenses the image coordinate can still be expressed relatively easily in the object distance, for systems with more lenses finding the overall ray matrix and then using the image condition {eq}`eq.condimage` is a much better strategy.\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### The Thick Lens\n", "\n", "At the left of {numref}`Fig_2_22_Thick_Lens_Principle_plane` a thick lens is shown. The object focal point is defined as the point whose rays are refracted such that the emerging rays are parallel to the optical axis. By extending the incident and emerging rays by straight segments, the points of intersection are found to be on a curved surface, which close to the optical axis, i.e. in the paraxial approximation, is in good approximation a plane perpendicular to the optical axis. This plane is called the **primary principal plane** and its intersection with the optical axis is called the primary principal point $H_1$.\n", "\n", "```{figure} Images/Chapter_2/2_22_Thick_Lens_Principle_plane.png\n", ":name: Fig_2_22_Thick_Lens_Principle_plane\n", "Principal planes of a thick lens, with front and back focal lengths: f.f.l and b.f.l.\n", "```\n", "\n", "By considering incident rays which are parallel to the optical axis and therefore focused in the image focal point, the **secondary principal plane** and secondary principal point $H_2$ are defined in a similar way (see the drawing at the right in {numref}`Fig_2_22_Thick_Lens_Principle_plane`).\n", "The principal planes can be outside the lens. For meniscus lenses, this is usually the case as shown in {numref}`Fig_2_23_Principle_planes`.\n", "It can be seen from {numref}`Fig_2_22_Thick_Lens_Principle_plane`\n", "that the principal planes are images of each other, with unit magnification. Hence, if an object is placed in the primary principal plane (hypothetically if this plane is inside the lens), its image is in the secondary principal plane. The image is erect and has unit magnification.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_23_Principle_planes.png\n", ":name: Fig_2_23_Principle_planes\n", "Position of the principal planes for several lenses. \n", "```\n", "\n", "\n", "Now, if the object coordinates and object focal point are defined with respect to the origin at $H_1$ and the image coordinates and image focal point are defined with respect to the origin in $H_2$, the Lensmaker's formula {eq}`eq.lensmaker` can also be used for a thick lens.\n", "\n", "\n", "\n", "\n", "*Proof* \n", "\n", "We recall the result {eq}`eq.matlens` for the ray matrix between the planes through the front and back vertices $V_1$, $V_2$ of a thick lens with refractive index $n_l$ and thickness $d$:\n", "\n", "```{math}\n", ":label: eq.matlens_b\n", "\\begin{align*}\n", "{\\cal M}_{V_1V_2}\n", "&= \\left( \\begin{array}{cc}1 - \\frac{d}{n_l}P_2 & -P \\\\\\frac{d}{n_l} & 1 -\\frac{d}{n_l}P_1\n", "\\end{array}\\right), \\quad \\textbf{thick lens},\n", "\\end{align*}\n", "```\n", "where\n", "\n", "```{math}\n", ":label: eq.P1P2_b\n", "\\begin{align*}\n", "P_1= \\frac{n_l-n_1}{R_1}, \\quad P_2=\\frac{n_2-n_l}{R_2},\n", "\\end{align*}\n", "```\n", "and $n_1$, $n_2$ are the refractive indices to the left and the right of the lens, respectively, and where\n", "\n", "```{math}\n", ":label: eq.powerlens_b\n", "\\begin{align*}\n", "P=P_1+P_2 - \\frac{d}{n_l}P_1P_2.\n", "\\end{align*}\n", "```\n", "If $h_1$ is the $z$-coordinate of the first principal point $H_1$ with respect to the coordinate system with origin at vertex $V_1$, we have according to {eq}`eq.mathom` for the ray matrix between the primary principal plane and the plane through vertex $V_1$\n", "\n", "```{math}\n", ":label: eq.mathom_b\n", "\\begin{align*}\n", "{\\cal M}_1=\\left( \\begin{array}{cc}1 & 0 \\\\\\frac{h_1}{n_1} & 1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "Similarly, if $h_2$ is the coordinate of the secondary principal point $H_2$ with respect to the coordinate system with $V_2$ as origin, the ray matrix between the plane through vertex $V_2$ and the secondary principal plane is\n", "\n", "```{math}\n", ":label: eq.mathom_c\n", "\\begin{align*}\n", "{\\cal M}_2=\\left( \\begin{array}{cc}1 & 0 \\\\\\frac{h_2}{n_2} & 1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "The ray matrix between the two principle planes is then\n", "\n", "```{math}\n", ":label: eq.matH1H2\n", "\\begin{align*}\n", "{\\cal M}_{H_1H_2}= {\\cal M}_2 {\\cal M}_{V_1V_2}{\\cal M}_1.\n", "\\end{align*}\n", "```\n", "The coordinates $h_1$ and $h_2$ can be found by imposing to the resulting matrix the imaging condition\n", "{eq}`eq.condimage`: $C=0$ and the condition that the magnification should be unity: $D=1$, which follows from {eq}`eq.magn`.\n", "We omit the details and only give the resulting expressions here:\n", "\n", "```{math}\n", ":label: eq.V1H1\n", "\\begin{align*}\n", "h_1 &= \\frac{n_1}{n_l} \\frac{P_2}{P} d, \\end{align*}\n", "```\n", "```{math}\n", ":label: eq.V2H2\n", "\\begin{align*}\n", "\\\\\n", "h_2 &= -\\frac{n_2}{n_l} \\frac{P_1}{P} d.\\end{align*}\n", "```\n", "With these results, {eq}`eq.matH1H2` becomes\n", "\n", "```{math}\n", ":label: eq.matH1H2_b\n", "\\begin{align*}\n", "{\\cal M}_{H_1H_2}= \\left( \\begin{array}{cc}1 & -P \\\\0 & 1\n", "\\end{array}\\right).\n", "\\end{align*}\n", "```\n", "We see that **the ray matrix between the principal planes is identical to the ray matrix of a thin lens** {eq}`eq.matthinlens`.\n", "We therefore conclude that if the coordinates in object space are chosen with respect to the origin in the primary principal point $H_1$, and the coordinates in image space are chosen with respect to the origin in the secondary principal point $H_2$, the expressions for the first and second focal points and for the coordinates of the image point in terms of that of the object point are identical to that for a thin lens. An example of imaging by a thick lens is shown in {numref}`Fig_2_24_Thick_Lens_Imaging`.\n", "\n", "```{figure} Images/Chapter_2/2_24_Thick_Lens_Imaging.png\n", ":name: Fig_2_24_Thick_Lens_Imaging\n", "Thick-lens geometry. There holds\n", "\t $f_i=f_o$ if the ambient medium left of the lens is the same as to the right of the lens. All coordinates in object and image space are with respect to the origin in $H_1$ and $H_2$, respectively.\n", "```\n", "\n", "\n", "```{index} Stops\n", ":name: sec.stops\n", "```\n", "\n", "### Stops\n", "\n", "An element such as the rim of a lens or a diaphragm which determines the set of rays that can contribute to the image, is called the **aperture stop**. An ordinary camera has a variable diaphragm.\n", "\n", "The **entrance pupil** is the image of the aperture stop by all elements to the left of the aperture stop. In constructing the entrance pupil, rays are used which propagate from the right to the left. The image can be real or virtual. If there are no lenses between object and aperture stop, the aperture stop itself is the entrance pupil. Similarly, the **exit pupil** is the image of the aperture stop by all elements to the right of it. This image can be real or virtual. The entrance pupil determines for a given object the cone of rays in object space that contribute to the image, while the cone of rays leaving the exit pupil are those taking part in the image formation pupil (see {numref}`Fig_2_25_Aperture_Stop`).\n", "\n", "For any object point, the **chief ray** is the ray in the cone that passes through the centre of the entrance pupil, and hence also through the centres of the aperture stop and the exit pupil. A marginal ray is the ray that for an object point on the optical axis passes through the rim of the entrance pupil (and hence also through the rims of the aperture stop and the exit pupil).\n", "\n", "For a fixed diameter $D$ of the exit pupil and for given $x_o$, the magnification of the system is according to {eq}`eq.defM1` and {eq}`eq.newton` given by $M=-x_i/f_i=f_i/x_o$. It follows that when $f_i$ is increased, the magnification increases.\n", "A larger magnification means a lower energy density, hence a longer exposure time, i.e. **the speed of the lens is reduced**. Camera lenses are usually specified by two numbers: the focal length $f$, measured with respect to the exit pupil and the diameter $D$ of the exit pupil. The **$f$-number** is the ratio of the focal length to this diameter:\n", "\n", "\n", "```{math}\n", ":label: eq.fnumber\n", "\\boxed{\\begin{align*}\n", "\\text{f-number}=f/D.\n", "\\end{align*}}\n", "```\n", "\n", "For example, f-number$=2$ means $f = 2D$. Since the exposure time is proportional to the square of the f-number, a lens with f-number 1.4 is twice as fast as a lens with f-number 2.\n", "\n", "\n", "```{figure} Images/Chapter_2/2_25_Aperture_Stop.png\n", ":name: Fig_2_25_Aperture_Stop\n", "Aperture stop (A.S.) between the second and third lens, with entrance pupil and exit pupil (in this case these pupils are virtual images of the aperture stop). Also shown are the chief ray and the marginal ray. \n", "```\n", "\n", "\n", "## Beyond Gaussian Geometrical Optics\n", "\n", "### Aberrations\n", "For designing advanced optical systems Gaussian geometrical optics is not sufficient.\n", "Instead non-paraxial rays, and among them also non-meridional rays, must be traced using software based on Snell's Law with the sine of the angles of incidence and refraction. Often many thousands of rays are traced to evaluate the quality of an image.\n", "It is then found that in general the non-paraxial rays do not intersect at the ideal Gaussian image point. Instead of a single spot, a spot diagram is found which is more or less confined. The deviation from an ideal point image is quantified in terms of **aberrations**. One distinguishes between monochromatic and chromatic aberrations. The latter are caused by the fact that the refractive index depends on wavelength.\n", "Recall that in paraxial geometrical optics Snell's Law {eq}`eq.refrac3` is replaced by: $n_i \\theta_i = n_t \\theta_t$, i.e. $\\sin \\theta_i$ and $\\sin \\theta_t$ are replaced by the linear terms. If instead one retains the first two terms of the Taylor series of the sine, the errors in the image can be quantified by five monochromatic aberrations, the so-called **primary** or **Seidel aberrations**. The best known is **spherical aberration**, which is caused by the fact that for a convergent spherical lens, the rays that makes a large angle with the optical axis are focused closer to the lens than the paraxial rays (see {numref}`Fig_2_26_Aberration_Lens`).\n", "\n", "```{figure} Images/Chapter_2/2_26_Aberration_Lens.png\n", ":name: Fig_2_26_Aberration_Lens\n", "Spherical aberration of a planar-convex lens. \n", "```\n", "\n", "**Distortion** is one of the five primary aberrations. It causes deformation of images due to the fact that the magnification depends on the distance of the object point to the optical axis.\n", "\n", "For high-quality imaging the aberrations have to be reduced by adding more lenses and optimising the curvatures of the surfaces, the thicknesses of the lenses and the distances between them. For high quality systems, a lens with an aspherical surface is sometimes used. Systems with very small aberrations are extremely expensive, in particular if the field of view is large, as is the case in lithographic imaging systems used in the manufacturing of integrated circuits as shown in the lithographic system in {numref}`Fig_2_27_ASML_EUV`.\n", "\n", "A comprehensive treatment of aberration theory can be found in Braat et al.[^4].\n", "\n", "\n", "```{figure} Images/Chapter_2/2_27_ASML_EUV.png\n", ":name: Fig_2_27_ASML_EUV\n", "The EUV stepper TWINSCAN NXE:3400B.Lithographic lens system for DUV (192 nm), costing more than € 500.000. Ray paths are shown in purple. The optical system consists of mirrors because there are no suitable lenses for this wavelength (Courtesy of [ASML](https://www.asml.com/en/news/media-library)).\n", "```\n", "\n", "\n", "### Diffraction\n", "\n", "According to a generally accepted criterion formulated first by Rayleigh, aberrations start to deteriorate images considerably if the they cause path length differences of more than a quarter of the wavelength.\n", "When the aberrations are less than this, the system is called **diffraction limited**..\n", "\n", "```{figure} Images/Chapter_2/2_28_AiryDisk_210308.png\n", ":name: Fig_2_28_AirySpot\n", "Left: cross section of the field of the Airy pattern. Right: intensity pattern of the Airy pattern.\n", "```\n", "\n", "\n", "Even if the wave transmitted by the exit pupil would be perfectly spherical (no aberrations), the wave front consists of only a circular section of a sphere since the field is limited by the aperture. An aperture causes **diffraction**, i.e. bending and spreading of the light. When one images a point object on the optical axis, diffraction causes inevitable blurring given by the so-called Airy spot, as shown in {numref}`Fig_2_28_AirySpot`. The Airy spot has full-width at half maximum:\n", "\n", "```{math}\n", ":label: eq.Airy_res\n", "\\begin{align*}\n", "\\text{FWHM} = 0.6 \\frac{ \\lambda}{\\text{NA}},\n", "\\end{align*}\n", "```\n", "where NA$=\\arcsin(a/s_i)$ is the numerical aperture (i.e. 0