Geometrical optics

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

<templatestyles src="Module:Hatnote/styles.css"></templatestyles>

Geometrical optics, or ray optics, describes light propagation in terms of rays. The ray in geometric optics is an abstraction, or instrument, useful in approximating the paths along which light propagates in certain classes of circumstances.

The simplifying assumptions of geometrical optics include that light rays:

  • propagate in rectilinear paths as they travel in a homogeneous medium
  • bend, and in particular circumstances may split in two, at the interface between two dissimilar media
  • follow curved paths in a medium in which the refractive index changes
  • may be absorbed or reflected.

Geometrical optics does not account for certain optical effects such as diffraction and interference. This simplification is useful in practice; it is an excellent approximation when the wavelength is small compared to the size of structures with which the light interacts. The techniques are particularly useful in describing geometrical aspects of imaging, including optical aberrations.

Explanation

As light travels through space, it oscillates in amplitude. In this image, each maximum amplitude crest is marked with a plane to illustrate the wavefront. The ray is the arrow perpendicular to these parallel surfaces.

A light ray is a line or curve that is perpendicular to the light's wavefronts (and is therefore collinear with the wave vector).

A slightly more rigorous definition of a light ray follows from Fermat's principle, which states that the path taken between two points by a ray of light is the path that can be traversed in the least time.[1]

Geometrical optics is often simplified by making the paraxial approximation, or "small angle approximation." The mathematical behavior then becomes linear, allowing optical components and systems to be described by simple matrices. This leads to the techniques of Gaussian optics and paraxial ray tracing, which are used to find basic properties of optical systems, such as approximate image and object positions and magnifications.[2]

Reflection

<templatestyles src="Module:Hatnote/styles.css"></templatestyles>

Glossy surfaces such as mirrors reflect light in a simple, predictable way. This allows for production of reflected images that can be associated with an actual (real) or extrapolated (virtual) location in space.

With such surfaces, the direction of the reflected ray is determined by the angle the incident ray makes with the surface normal, a line perpendicular to the surface at the point where the ray hits. The incident and reflected rays lie in a single plane, and the angle between the reflected ray and the surface normal is the same as that between the incident ray and the normal.[3] This is known as the Law of Reflection.

For flat mirrors, the law of reflection implies that images of objects are upright and the same distance behind the mirror as the objects are in front of the mirror. The image size is the same as the object size. (The magnification of a flat mirror is equal to one.) The law also implies that mirror images are parity inverted, which is perceived as a left-right inversion.

Mirrors with curved surfaces can be modeled by ray tracing and using the law of reflection at each point on the surface. For mirrors with parabolic surfaces, parallel rays incident on the mirror produce reflected rays that converge at a common focus. Other curved surfaces may also focus light, but with aberrations due to the diverging shape causing the focus to be smeared out in space. In particular, spherical mirrors exhibit spherical aberration. Curved mirrors can form images with magnification greater than or less than one, and the image can be upright or inverted. An upright image formed by reflection in a mirror is always virtual, while an inverted image is real and can be projected onto a screen.[3]

Refraction

Lua error in package.lua at line 80: module 'strict' not found.

<templatestyles src="Module:Hatnote/styles.css"></templatestyles>

Illustration of Snell's Law

Refraction occurs when light travels through an area of space that has a changing index of refraction. The simplest case of refraction occurs when there is an interface between a uniform medium with index of refraction n_1 and another medium with index of refraction n_2. In such situations, Snell's Law describes the resulting deflection of the light ray:

n_1\sin\theta_1 = n_2\sin\theta_2\

where \theta_1 and \theta_2 are the angles between the normal (to the interface) and the incident and refracted waves, respectively. This phenomenon is also associated with a changing speed of light as seen from the definition of index of refraction provided above which implies:

v_1\sin\theta_2\ = v_2\sin\theta_1

where v_1 and v_2 are the wave velocities through the respective media.[3]

Various consequences of Snell's Law include the fact that for light rays traveling from a material with a high index of refraction to a material with a low index of refraction, it is possible for the interaction with the interface to result in zero transmission. This phenomenon is called total internal reflection and allows for fiber optics technology. As light signals travel down a fiber optic cable, it undergoes total internal reflection allowing for essentially no light lost over the length of the cable. It is also possible to produce polarized light rays using a combination of reflection and refraction: When a refracted ray and the reflected ray form a right angle, the reflected ray has the property of "plane polarization". The angle of incidence required for such a scenario is known as Brewster's angle.[3]

Snell's Law can be used to predict the deflection of light rays as they pass through "linear media" as long as the indexes of refraction and the geometry of the media are known. For example, the propagation of light through a prism results in the light ray being deflected depending on the shape and orientation of the prism. Additionally, since different frequencies of light have slightly different indexes of refraction in most materials, refraction can be used to produce dispersion spectra that appear as rainbows. The discovery of this phenomenon when passing light through a prism is famously attributed to Isaac Newton.[3]

Some media have an index of refraction which varies gradually with position and, thus, light rays curve through the medium rather than travel in straight lines. This effect is what is responsible for mirages seen on hot days where the changing index of refraction of the air causes the light rays to bend creating the appearance of specular reflections in the distance (as if on the surface of a pool of water). Material that has a varying index of refraction is called a gradient-index (GRIN) material and has many useful properties used in modern optical scanning technologies including photocopiers and scanners. The phenomenon is studied in the field of gradient-index optics.[4]

A ray tracing diagram for a simple converging lens.

A device which produces converging or diverging light rays due to refraction is known as a lens. Thin lenses produce focal points on either side that can be modeled using the lensmaker's equation.[5] In general, two types of lenses exist: convex lenses, which cause parallel light rays to converge, and concave lenses, which cause parallel light rays to diverge. The detailed prediction of how images are produced by these lenses can be made using ray-tracing similar to curved mirrors. Similarly to curved mirrors, thin lenses follow a simple equation that determines the location of the images given a particular focal length (f) and object distance (S_1):

\frac{1}{S_1} + \frac{1}{S_2} = \frac{1}{f}

where S_2 is the distance associated with the image and is considered by convention to be negative if on the same side of the lens as the object and positive if on the opposite side of the lens.[5] The focal length f is considered negative for concave lenses.

Incoming parallel rays are focused by a convex lens into an inverted real image one focal length from the lens, on the far side of the lens.

Incoming parallel rays are focused by a convex lens into an inverted real image one focal length from the lens, on the far side of the lens

Rays from an object at finite distance are focused further from the lens than the focal distance; the closer the object is to the lens, the further the image is from the lens. With concave lenses, incoming parallel rays diverge after going through the lens, in such a way that they seem to have originated at an upright virtual image one focal length from the lens, on the same side of the lens that the parallel rays are approaching on.

With concave lenses, incoming parallel rays diverge after going through the lens, in such a way that they seem to have originated at an upright virtual image one focal length from the lens, on the same side of the lens that the parallel rays are approaching on.

Rays from an object at finite distance are associated with a virtual image that is closer to the lens than the focal length, and on the same side of the lens as the object. The closer the object is to the lens, the closer the virtual image is to the lens.

Rays from an object at finite distance are associated with a virtual image that is closer to the lens than the focal length, and on the same side of the lens as the object.

Likewise, the magnification of a lens is given by

 M = - \frac{S_2}{S_1} = \frac{f}{f - S_1}

where the negative sign is given, by convention, to indicate an upright object for positive values and an inverted object for negative values. Similar to mirrors, upright images produced by single lenses are virtual while inverted images are real.[3]

Lenses suffer from aberrations that distort images and focal points. These are due to both to geometrical imperfections and due to the changing index of refraction for different wavelengths of light (chromatic aberration).[3]

Underlying mathematics

Lua error in package.lua at line 80: module 'strict' not found. As a mathematical study, geometrical optics emerges as a short-wavelength limit for solutions to hyperbolic partial differential equations. In this short-wavelength limit, it is possible to approximate the solution locally by

u(t,x) \approx a(t,x)e^{i(k\cdot x - \omega t)}

where k, \omega satisfy a dispersion relation, and the amplitude a(t,x) varies slowly. More precisely, the leading order solution takes the form

a_0(t,x) e^{i\varphi(t,x)/\varepsilon}.

The phase \varphi(t,x)/\varepsilon can be linearized to recover large wavenumber k:= \nabla_x \varphi, and frequency \omega := -\partial_t \varphi. The amplitude a_0 satisfies a transport equation. The small parameter \varepsilon\, enters the scene due to highly oscillatory initial conditions. Thus, when initial conditions oscillate much faster than the coefficients of the differential equation, solutions will be highly oscillatory, and transported along rays. Assuming coefficients in the differential equation are smooth, the rays will be too. In other words, refraction does not take place. The motivation for this technique comes from studying the typical scenario of light propagation where short wavelength light travels along rays that minimize (more or less) its travel time. Its full application requires tools from microlocal analysis.

A simple example

Starting with the wave equation for (t,x) \in \mathbb{R}\times\mathbb{R}^n

L(\partial_t, \nabla_x) u := \left( \frac{\partial^2}{\partial t^2} - c(x)^2 \Delta \right)u(t,x) = 0, \;\; u(0,x) = u_0(x),\;\; u_t(0,x) = 0

assume an asymptotic series solution of the form

u(t,x) \sim a_\varepsilon(t,x)e^{i\varphi(t,x)/\varepsilon} = \sum_{j=0}^\infty i^j \varepsilon^j a_j(t,x) e^{i\varphi(t,x)/\varepsilon}.

Check that

L(\partial_t,\nabla_x)(e^{i\varphi(t,x)/\varepsilon}) a_\varepsilon(t,x) = e^{i\varphi(t,x)/\varepsilon}
\left( \left(\frac{i}{\varepsilon} \right)^2  L(\varphi_t, \nabla_x\varphi)a_\varepsilon + \frac{2i}{\varepsilon} V(\partial_t,\nabla_x)a_\varepsilon + \frac{i}{\varepsilon} (a_\varepsilon L(\partial_t,\nabla_x)\varphi) + L(\partial_t,\nabla_x)a_\varepsilon   \right)

with

V(\partial_t,\nabla_x) := \frac{\partial \varphi}{\partial t} \frac{\partial}{\partial t} - c^2(x)\sum_j \frac{\partial \varphi}{\partial x_j} \frac{\partial}{\partial x_j}

Plugging the series into this equation, and equating powers of \varepsilon, the most singular term O(\varepsilon^{-2}) satisfies the eikonal equation (in this case called a dispersion relation),

0 = L(\varphi_t,\nabla_x\varphi) = (\varphi_t)^2 - c(x)^2(\nabla_x \varphi)^2.

To order \varepsilon^{-1}, the leading-order amplitude must satisfy a transport equation

2V a_0 + (L\varphi)a_0 = 0

With the definition k : = \nabla_x \varphi, \omega := -\varphi_t, the eikonal equation is precisely the dispersion relation that results by plugging the plane wave solution e^{i(k\cdot x - \omega t)} into the wave equation. The value of this more complicated expansion is that plane waves cannot be solutions when the wavespeed c is non-constant. However, it can be shown that the amplitude a_0 and phase \varphi are smooth, so that on a local scale there are plane waves.

To justify this technique, the remaining terms must be shown to be small in some sense. This can be done using energy estimates, and an assumption of rapidly oscillating initial conditions. It also must be shown that the series converges in some sense.

See also

References

  1. Arthur Schuster, An Introduction to the Theory of Optics, London: Edward Arnold, 1904 online.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. 3.0 3.1 3.2 3.3 3.4 3.5 3.6 Lua error in package.lua at line 80: module 'strict' not found.Chapter 35
  4. E. W. Marchand, Gradient Index Optics, New York, NY, Academic Press, 1978.
  5. 5.0 5.1 Lua error in package.lua at line 80: module 'strict' not found. Chapters 5 & 6.

Further reading

English translations of some early books and papers:

External links