Basis (linear algebra)
A set of vectors in a vector space V is called a basis, or a set of basis vectors, if the vectors are linearly independent and every vector in the vector space is a linear combination of this set.^{[1]} In more general terms, a basis is a linearly independent spanning set.
Given a basis of a vector space V, every element of V can be expressed uniquely as a linear combination of basis vectors, whose coefficients are referred to as vector coordinates or components. A vector space can have several distinct sets of basis vectors; however each such set has the same number of elements, with this number being the dimension of the vector space.
Contents
Definition
File:Basis graph (no label).svg
A basis B of a vector space V over a field F is a linearly independent subset of V that spans V.
In more detail, suppose that B = { v_{1}, …, v_{n} } is a finite subset of a vector space V over a field F (such as the real or complex numbers R or C). Then B is a basis if it satisfies the following conditions:
 the linear independence property,

 for all a_{1}, …, a_{n} ∈ F, if a_{1}v_{1} + … + a_{n}v_{n} = 0, then necessarily a_{1} = … = a_{n} = 0; and
 the spanning property,

 for every x in V it is possible to choose a_{1}, …, a_{n} ∈ F such that x = a_{1}v_{1} + … + a_{n}v_{n}.
The numbers a_{i} are called the coordinates of the vector x with respect to the basis B, and by the first property they are uniquely determined.
A vector space that has a finite basis is called finitedimensional. To deal with infinitedimensional spaces, we must generalize the above definition to include infinite basis sets. We therefore say that a set (finite or infinite) B ⊂ V is a basis, if
 every finite subset B_{0} ⊆ B obeys the independence property shown above; and
 for every x in V it is possible to choose a_{1}, …, a_{n} ∈ F and v_{1}, …, v_{n} ∈ B such that x = a_{1}v_{1} + … + a_{n}v_{n}.
The sums in the above definition are all finite because without additional structure the axioms of a vector space do not permit us to meaningfully speak about an infinite sum of vectors. Settings that permit infinite linear combinations allow alternative definitions of the basis concept: see Related notions below.
It is often convenient to list the basis vectors in a specific order, for example, when considering the transformation matrix of a linear map with respect to a basis. We then speak of an ordered basis, which we define to be a sequence (rather than a set) of linearly independent vectors that span V: see Ordered bases and coordinates below.
Properties
Again, B denotes a subset of a vector space V. Then, B is a basis if and only if any of the following equivalent conditions are met:
 B is a minimal generating set of V, i.e., it is a generating set and no proper subset of B is also a generating set.
 B is a maximal set of linearly independent vectors, i.e., it is a linearly independent set but no other linearly independent set contains it as a proper subset.
 Every vector in V can be expressed as a linear combination of vectors in B in a unique way. If the basis is ordered (see Ordered bases and coordinates below) then the coefficients in this linear combination provide coordinates of the vector relative to the basis.
Every vector space has a basis. The proof of this requires the axiom of choice. All bases of a vector space have the same cardinality (number of elements), called the dimension of the vector space. This result is known as the dimension theorem, and requires the ultrafilter lemma, a strictly weaker form of the axiom of choice.
Also many vector sets can be attributed a standard basis which comprises both spanning and linearly independent vectors.
Standard bases for example:
In R^{n}, {e_{1}, ..., e_{n}}, where e_{i} is the ith column of the identity matrix.
In P_{2}, where P_{2} is the set of all polynomials of degree at most 2, {1, x, x^{2}} is the standard basis.
In M_{22}, {M_{1,1}, M_{1,2}, M_{2,1}, M_{2,2}}, where M_{22} is the set of all 2×2 matrices. and M_{m,n} is the 2×2 matrix with a 1 in the m,n position and zeros everywhere else.
Change of basis
Given a vector space V over a field F and suppose that {v_{1}, ..., v_{n}} and {α_{1}, ..., α_{n}} are two bases for V. By definition, if ξ is a vector in V then ξ = x_{1}α_{1} + ... + x_{n}α_{n} for a unique choice of scalars x_{1}, ..., x_{n} in F called the coordinates of ξ relative to the ordered basis {α_{1}, ..., α_{n}}. The vector x = (x_{1}, ..., x_{n}) in F^{n} is called the coordinate tuple of ξ (relative to this basis). The unique linear map φ : F^{n} → V with φ(v_{j}) = α_{j} for j = 1, ..., n is called the coordinate isomorphism for V and the basis {α_{1}, ..., α_{n}}. Thus φ(x) = ξ if and only if ξ = x_{1}α_{1} + ... + x_{n}α_{n}.
A set of vectors can be represented by a matrix of which each column consists of the components of the corresponding vector of the set. As a basis is a set of vectors, a basis can be given by a matrix of this kind. The change of basis of any object of the space is related to this matrix. For example, coordinate tuples change with its inverse.
Examples
 Consider R^{2}, the vector space of all coordinates (a, b) where both a and b are real numbers. Then a very natural and simple basis is simply the vectors e_{1} = (1,0) and e_{2} = (0,1): suppose that v = (a, b) is a vector in R^{2}, then v = a(1,0) + b(0,1). But any two linearly independent vectors, like (1,1) and (−1,2), will also form a basis of R^{2}.
 More generally, the vectors e_{1}, e_{2}, ..., e_{n} are linearly independent and generate R^{n}. Therefore, they form a basis for R^{n} and the dimension of R^{n} is n. This basis is called the standard basis.
 Let V be the real vector space generated by the functions e^{t} and e^{2t}. These two functions are linearly independent, so they form a basis for V.
 Let R[x] denote the vector space of real polynomials; then (1, x, x^{2}, ...) is a basis of R[x]. The dimension of R[x] is therefore equal to aleph0.
Extending to a basis
Let S be a subset of a vector space V. To extend S to a basis means to find a basis B that contains S as a subset. This can be done if and only if S is linearly independent. Almost always, there is more than one such B, except in rather special circumstances (i.e. S is already a basis, or S is empty and V has two elements).
A similar question is when does a subset S contain a basis. This occurs if and only if S spans V. In this case, S will usually contain several different bases.
Example of alternative proofs
Often, a mathematical result can be proven in more than one way. Here, using three different proofs, we show that the vectors (1,1) and (−1,2) form a basis for R^{2}.
From the definition of basis
We have to prove that these two vectors are linearly independent and that they generate R^{2}.
Part I: If two vectors v,w are linearly independent, then (a and b scalars) implies
To prove that they are linearly independent, suppose that there are numbers a,b such that:
(i.e., they are linearly dependent). Then:
 and and
Subtracting the first equation from the second, we obtain:
 so
Adding this equation to the first equation then:
Hence we have linear independence.
Part II: To prove that these two vectors generate R^{2}, we have to let (a,b) be an arbitrary element of R^{2}, and show that there exist numbers r,s ∈ R such that:
Then we have to solve the equations:
Subtracting the first equation from the second, we get:
 and then
 and finally
By the dimension theorem
Since (−1,2) is clearly not a multiple of (1,1) and since (1,1) is not the zero vector, these two vectors are linearly independent. Since the dimension of R^{2} is 2, the two vectors already form a basis of R^{2} without needing any extension.
By the invertible matrix theorem
Simply compute the determinant
Since the above matrix has a nonzero determinant, its columns form a basis of R^{2}. See: invertible matrix.
Ordered bases and coordinates
A basis is just a linearly independent set of vectors with or without a given ordering. For many purposes it is convenient to work with an ordered basis. For example, when working with a coordinate representation of a vector it is customary to speak of the "first" or "second" coordinate, which makes sense only if an ordering is specified for the basis. For finitedimensional vector spaces one typically indexes a basis {v_{i}} by the first n integers. An ordered basis is also called a frame.
Suppose V is an ndimensional vector space over a field F. A choice of an ordered basis for V is equivalent to a choice of a linear isomorphism φ from the coordinate space F^{n} to V.
Proof. The proof makes use of the fact that the standard basis of F^{n} is an ordered basis.
Suppose first that
 φ : F^{n} → V
is a linear isomorphism. Define an ordered basis {v_{i}} for V by
 v_{i} = φ(e_{i}) for 1 ≤ i ≤ n
where {e_{i}} is the standard basis for F^{n}.
Conversely, given an ordered basis, consider the map defined by
 φ(x) = x_{1}v_{1} + x_{2}v_{2} + ... + x_{n}v_{n},
where x = x_{1}e_{1} + x_{2}e_{2} + ... + x_{n}e_{n} is an element of F^{n}. It is not hard to check that φ is a linear isomorphism.
These two constructions are clearly inverse to each other. Thus ordered bases for V are in 11 correspondence with linear isomorphisms F^{n} → V.
The inverse of the linear isomorphism φ determined by an ordered basis {v_{i}} equips V with coordinates: if, for a vector v ∈ V, φ^{−1}(v) = (a_{1}, a_{2},...,a_{n}) ∈ F^{n}, then the components a_{j} = a_{j}(v) are the coordinates of v in the sense that v = a_{1}(v) v_{1} + a_{2}(v) v_{2} + ... + a_{n}(v) v_{n}.
The maps sending a vector v to the components a_{j}(v) are linear maps from V to F, because of φ^{−1} is linear. Hence they are linear functionals. They form a basis for the dual space of V, called the dual basis.
Related notions
Analysis
In the context of infinitedimensional vector spaces over the real or complex numbers, the term Hamel basis (named after Georg Hamel) or algebraic basis can be used to refer to a basis as defined in this article. This is to make a distinction with other notions of "basis" that exist when infinitedimensional vector spaces are endowed with extra structure. The most important alternatives are orthogonal bases on Hilbert spaces, Schauder bases and Markushevich bases on normed linear spaces. The term Hamel basis is also commonly used to mean a basis for the real numbers R as a vector space over the field Q of rational numbers. (In this case, the dimension of R over Q is uncountable, specifically the continuum, the cardinal number 2^{ℵ0}.)
The common feature of the other notions is that they permit the taking of infinite linear combinations of the basis vectors in order to generate the space. This, of course, requires that infinite sums are meaningfully defined on these spaces, as is the case for topological vector spaces – a large class of vector spaces including e.g. Hilbert spaces, Banach spaces or Fréchet spaces.
The preference of other types of bases for infinitedimensional spaces is justified by the fact that the Hamel basis becomes "too big" in Banach spaces: If X is an infinitedimensional normed vector space which is complete (i.e. X is a Banach space), then any Hamel basis of X is necessarily uncountable. This is a consequence of the Baire category theorem. The completeness as well as infinite dimension are crucial assumptions in the previous claim. Indeed, finitedimensional spaces have by definition finite bases and there are infinitedimensional (noncomplete) normed spaces which have countable Hamel bases. Consider , the space of the sequences of real numbers which have only finitely many nonzero elements, with the norm Its standard basis, consisting of the sequences having only one nonzero element, which is equal to 1, is a countable Hamel basis.
Example
In the study of Fourier series, one learns that the functions {1} ∪ { sin(nx), cos(nx) : n = 1, 2, 3, ... } are an "orthogonal basis" of the (real or complex) vector space of all (real or complex valued) functions on the interval [0, 2π] that are squareintegrable on this interval, i.e., functions f satisfying
The functions {1} ∪ { sin(nx), cos(nx) : n = 1, 2, 3, ... } are linearly independent, and every function f that is squareintegrable on [0, 2π] is an "infinite linear combination" of them, in the sense that
for suitable (real or complex) coefficients a_{k}, b_{k}. But many^{[2]} squareintegrable functions cannot be represented as finite linear combinations of these basis functions, which therefore do not comprise a Hamel basis. Every Hamel basis of this space is much bigger than this merely countably infinite set of functions. Hamel bases of spaces of this kind are typically not useful, whereas orthonormal bases of these spaces are essential in Fourier analysis.
Geometry
The geometric notions of an affine space, projective space, convex set, and cone have related notions of basis'.^{[3]} An affine basis for an ndimensional affine space is points in general linear position). A projective basis is points in general position, in a projective space of dimension n). A convex basis of a polytope is the set of the vertices of its convex hull. A cone basis^{[4]} consists of one point by edge of a polygonal cone. See also a Hilbert basis (linear programming).
Proof that every vector space has a basis
Let V be any vector space over some field F. Every vector space must contain at least one element: the zero vector 0.
Note that if V = {0}, then the empty set is a basis for V. Now we consider the case where V contains at least one nonzero element, say v.
Define the set X as all linear independent subsets of V. Note that since V contains the nonzero element v, the singleton subset L = {v} of V is necessarily linearly independent.
Hence the set X contains at least the subset L = {v}, and so X is nonempty.
We let X be partially ordered by inclusion: If L_{1} and L_{2} belong to X, we say that L_{1} ≤ L_{2} when L_{1} ⊂ L_{2}. It is easy to check that (X, ≤) satisfies the definition of a partially ordered set.
We now note that if Y is a subset of X that is totally ordered by ≤, then the union L_{Y} of all the elements of Y (which are themselves certain subsets of V) is an upper bound for Y. To show this, it is necessary to verify both that a) L_{Y} belong to X, and that b) every element L of Y satisfies L ≤ L_{Y}. Both a) and b) are easy to check.
Now we apply Zorn's lemma, which asserts that because X is nonempty, and every totally ordered subset of the partially ordered set (X, ≤) has an upper bound, it follows that X has a maximal element. (In other words, there exists some element L_{max} of X satisfying the condition that whenever L_{max} ≤ L for some element L of X, then L = L_{max}.)
Finally we claim that L_{max} is a basis for V. Since L_{max} belong to X, we already know that L_{max} is a linearly independent subset of V.
Now suppose L_{max} does not span V. Then there exists some vector w of V that cannot be expressed as a linearly combination of elements of L_{max} (with coefficients in the field F). Note that such a vector w cannot be an element of L_{max}.
Now consider the subset L_{w} of V defined by L_{w} = L_{max} ∪ {w}. It is easy to see that a) L_{max} ≤ L_{w} (since L_{max} is a subset of L_{w}), and that b) L_{max} ≠ L_{w} (because L_{w} contains the vector w that is not contained in L_{max}).
But the combination of a) and b) above contradict the fact that L_{max} is a maximal element of X, which we have already proved. This contradiction shows that the assumption that L_{max} does not span V was not true.
Hence L_{max} does span V. Since we also know that L_{max} is linearly independent over the field F, this verifies that L_{max} is a basis for V. Which proves that the arbitrary vector space V has a basis.
Note: This proof relies on Zorn's lemma, which is logically equivalent to the Axiom of Choice. It turns out that, conversely, the assumption that every vector space has a basis can be used to prove the Axiom of Choice. Thus the two assertions are logically equivalent.
See also
Notes
 ↑ Halmos, Paul Richard (1987) Finitedimensional vector spaces (4th edition) SpringerVerlag, New York, page 10, ISBN 0387900934
 ↑ Note that one cannot say "most" because the cardinalities of the two sets (functions that can and cannot be represented with a finite number of basis functions) are the same.
 ↑ Notes on geometry, by Elmer G. Rees, p. 7
 ↑ Some remarks about additive functions on cones, Marek Kuczma
References
General references
 Blass, Andreas (1984), "Existence of bases implies the axiom of choice", Axiomatic set theory, Contemporary Mathematics volume 31, Providence, R.I.: American Mathematical Society, pp. 31–33, ISBN 0821850261, MR 0763890<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Brown, William A. (1991), Matrices and vector spaces, New York: M. Dekker, ISBN 9780824784195<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Lang, Serge (1987), Linear algebra, Berlin, New York: SpringerVerlag, ISBN 9780387964126<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
Historical references
 Banach, Stefan (1922), "Sur les opérations dans les ensembles abstraits et leur application aux équations intégrales (On operations in abstract sets and their application to integral equations)" (PDF), Fundamenta Mathematicae (in français), 3, ISSN 00162736<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Bolzano, Bernard (1804), Betrachtungen über einige Gegenstände der Elementargeometrie (Considerations of some aspects of elementary geometry) (in Deutsch)<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Bourbaki, Nicolas (1969), Éléments d'histoire des mathématiques (Elements of history of mathematics) (in français), Paris: Hermann<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Dorier, JeanLuc (1995), "A general outline of the genesis of vector space theory", Historia Mathematica, 22 (3): 227–261, doi:10.1006/hmat.1995.1024, MR 1347828<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Fourier, Jean Baptiste Joseph (1822), Théorie analytique de la chaleur (in français), Chez Firmin Didot, père et fils<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Grassmann, Hermann (1844), Die Lineale Ausdehnungslehre  Ein neuer Zweig der Mathematik (in Deutsch)<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>, reprint: Hermann Grassmann. Translated by Lloyd C. Kannenberg. (2000), Extension Theory, Kannenberg, L.C., Providence, R.I.: American Mathematical Society, ISBN 9780821820315<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Hamilton, William Rowan (1853), Lectures on Quaternions, Royal Irish Academy<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Möbius, August Ferdinand (1827), Der Barycentrische Calcul : ein neues Hülfsmittel zur analytischen Behandlung der Geometrie (Barycentric calculus: a new utility for an analytic treatment of geometry) (in Deutsch)<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Moore, Gregory H. (1995), "The axiomatization of linear algebra: 1875–1940", Historia Mathematica, 22 (3): 262–303, doi:10.1006/hmat.1995.1025<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
 Peano, Giuseppe (1888), Calcolo Geometrico secondo l'Ausdehnungslehre di H. Grassmann preceduto dalle Operazioni della Logica Deduttiva (in italiano), Turin<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>
External links
 Instructional videos from Khan Academy
 Hazewinkel, Michiel, ed. (2001), "Basis", Encyclopedia of Mathematics, Springer, ISBN 9781556080104<templatestyles src="Module:Citation/CS1/styles.css"></templatestyles>