Linear form
In linear algebra, a linear form (also known as a linear functional, a one-form, or a covector) is a linear map from a vector space to its field of scalars. In ℝn, if vectors are represented as column vectors, then linear functionals are represented as row vectors, and their action on vectors is given by the matrix product with the row vector on the left and the column vector on the right. In general, if V is a vector space over a field k, then a linear functional f is a function from V to k that is linear:
- for all
- for all
The set of all linear functionals from V to k, denoted by Homk(V,k), forms a vector space over k with the operations of addition and scalar multiplication defined pointwise. This space is called the dual space of V, or sometimes the algebraic dual space, to distinguish it from the continuous dual space. It is often written V∗, V′, or Vᐯ when the field k is understood.
Linear functionals on real or complex vector spaces
We assume throughout that all vector spaces that we consider are either vector spaces over the set of real numbers ℝ or vector spaces over the set of complex numbers ℂ.
We assume that X is a vector space over 𝕂 where 𝕂 is either ℝ or ℂ.
Basic definitions
- Definition: If X is a vector space over a field 𝕂 then 𝕂 is called X 's underlying (scalar) field and any element of 𝕂 is called a scalar.
- Definition: A linear map is a map F : X → Y between two vector spaces X and Y that have the same underlying scalar field, such that F(x + sy) = F(x) + sF(y) for all x, y ∈ X and all scalars s. If X and Y do not necessarily have the same underlying scalar field then we say that F is ℝ-linear and is a linear map over ℝ if it is a linear map when X and Y are both considered as vector spaces over ℝ (i.e. if F(x + ry) = F(x) + rF(y) for all x, y ∈ X and all real r).
- Definition: The kernel of a map F on X is the set Ker F := { x ∈ X : F(x) = 0}. We say that a map is trivial if it is identically equal to 0.
- Definition: A functional on a vector space X is a map from X into X's underlying field. A linear functional on X is a functional that is also a linear map.
Observe that if X is a vector space over ℝ then a "linear functional on X" is a linear map of the form f : X → ℝ (valued in ℝ), while if X is a vector space over ℂ then a "linear functional on X" is a linear map of the form f : X → ℂ (valued in ℂ).
Recall that ℝ (resp. ℂ) is a vector space over ℝ (resp. ℂ) so we can ask what are the linear functionals on ℝ? A function f : ℝ → ℝ is a linear functional on X = ℝ if and only if it is of the form f(x) = rx for some real number r ∈ ℝ. Note in particular that a function f having the equation of a line f(x) = a + rx with a ≠ 0 (e.g. f(x) = 1 + 2x) is not a linear functional on ℝ (since for instance, f(1 + 1) = a + 2r ≠ 2a + 2r = f(1) + f(1)). It is however, a type of function known as an affine linear functional.
A linear functional f is non-trivial if and only if it is surjective (i.e. its range is all of 𝕂).[1]
- Definition: The algebraic dual space, or simply the dual space, is the vector space over 𝕂 consisting of all linear functionals on X. It will be denoted by X#.
- Definition: If f and g are two real-valued functions and if S is a set that belongs to both of their domains, then we say that g dominates f on S and write f ≤ g on S if f(s) ≤ g(s) for all s ∈ S. We say that g extends f if every x in the domain of f belongs to the domain of g and f(x) = g(x). If g extends f then we call g a linear extension of f if g is a linear map.
Relationships with other maps
Relationship between real and complex linear functionals
Suppose that X is a vector space over ℂ. Let Xℝ denote X when it is considered as a vector space over ℝ. Note that every linear functional on X is, by definition, complex-valued while every linear functional on Xℝ is real-valued.
- Definition: By a real linear functional on X, we mean a linear functional on Xℝ (i.e. a linear map of the form f : Xℝ → ℝ, or more explicitly, a map f : X → ℝ such that f (x + y) = f (x) + f (y) and f (rx) = r f (x) for all x, y ∈ X and all real r ∈ ℝ).
If g is a real linear functional on X then g is a linear functional on X if and only if g is trivial (i.e. if g ∈ Xℝ# then g ∈ X# if and only if g = 0) (see footnote for an explanation).[2] Thus, we note the following important technicality:
- WARNING: Any non-trivial linear functional on a complex vector space X is not a real linear functional on X. And conversely, any non-trivial real linear functional on a complex vector space X is not a linear functional on X. However, on a real vector space, linear functionals and real linear functionals are one and the same.
However, a real linear functional g on X does induce a canonical linear functional Lg ∈ X# defined by Lg(x) := g(x) - i g(ix) for all x ∈ X, where i := √-1.
Now suppose that f ∈ X# and let R := Re f (resp. I := Im f) denote the real (resp. imaginary) part of f so that f(x) = R(x) + i I(x). Then for all x ∈ X, I(x) = - R(ix) and R(x) = I(ix) so that
- f(x) = R(x) - i R(ix) = I(ix) + i I(x).
This shows that f, R, and I each completely determine one another[3] and it follows that R and I are real linear functionals on X and the canonical linear functional on X induced by R is f (i.e. LR = f). Furthermore, for all x ∈ X,
- |f(x)|2 = |R(ix)|2 + |R(x)|2 = |I(ix)|2 + |I(x)|2
- = |R(ix)|2 + |I(ix)|2 = |R(x)|2 + |I(x)|2.
Thus the map g ↦ Lg, denoted by L •, defines a one-to-one correspondence from Xℝ# onto X# whose inverse is the map f ↦ Re f. Furthermore, L • is linear as a map over ℝ (i.e. Lg+h = Lg + Lh and Lrg = r Lg for all r ∈ ℝ and g, h ∈ Xℝ#). Similarly, the inverse of the surjective map X# → Xℝ# defined by f ↦ Im f is the map Xℝ# → X# that sends I ∈ Xℝ# to the linear functional x ↦ I(ix) + i I(x).
This relationship was discovered by Henry Löwig in 1934 (although it is usually credited to F. Murray).[4]
If f is a linear functional on a real or complex vector space X and if p is a seminorm on X, then |f| ≤ p on X if and only if Re f ≤ p on X (see footnote for proof).[5][6]
- Topological consequences
If X is a complex topological vector space (TVS), then either all three of f, Re f, and Im f are continuous (resp. bounded), or else all three are discontinuous (resp. unbounded). Moreover, if X is a complex normed space then ||f|| = ||Re f||[7] (where in particular, one side is infinite if and only if the other side is infinite).
Relationships with seminorms and sublinear functions
A sublinear function on a vector space X is a function p : X → ℝ that satisfies the following two properties:
- Subadditivity: p(x+y) ≤ p(x) + p(y) for all x, y ∈ X;
- Positive homogeneity: p(rx) = r p(x) for any positive real r > 0 and any x ∈ X.
A seminorm on X is a sublinear function p : X → ℝ that satisfies the following additional property:
- Absolute homogeneity: p(sx) = |s| p(x) for all x ∈ X and all scalars s;
Note that every linear functional on a real vector space is a sublinear function, although there are sublinear functions that are not linear functionals. Unlike linear functionals, a seminorm p is valued in the non-negative real numbers (i.e. p(x) is a real number and p(x) ≥ 0), so the only linear function that is also a seminorm is the trivial (identically) 0 map. However, if f is a linear functional on a vector space X, then its absolute value is a seminorm on X (i.e. the map on X defined by pf (x) := |f(x)| for all x ∈ X is a seminorm on X)
If f is a linear functional on a real vector space X and p is a seminorm on X, then f ≤ p if and only if |f| ≤ p.[8]
Hahn-Banach theorem
The Hahn-Banach theorem is considered one of the most important results of the subfield of mathematics called functional analysis (as the name suggests, linear functionals play an important role in functional analysis). Due to its importance, the Hahn-Banach theorem has been generalized many times and today "Hahn-Banach theorem" refers to any one of a collection of theorems. The general idea behind a Hahn-Banach theorem is that it gives conditions under which a linear functional on a vector subspace M of X (satisfying a certain condition) can be extended to a linear functional on the whole of X (that continues to satisfy that condition). The following is one of many results known collectively as "Hahn-Banach theorems."
Hahn–Banach dominated extension theorem[3](Rudin 1991, Th. 3.2) — If p : X → ℝ is a sublinear function, and f : M → ℝ is a linear functional on a linear subspace M ⊆ X which is dominated by p on M, then there exists a linear extension F : X → ℝ of f to the whole space X that is dominated by p, i.e., there exists a linear functional F such that
- F(m) = f(m) for all m ∈ M,
- |F(x)| ≤ p(x) for all x ∈ X.
Relationships between multiple linear functionals
Any two linear functionals with the same kernel are proportional (i.e. scalar multiples of each other). This fact can be generalized to the following theorem.
Theorem[9][10] — If f, g1, ..., gn are linear functionals on X, then the following are equivalent:
- f can be written as a linear combination of g1, ..., gn (i.e. there exist scalars s1, ..., sn such that f = s1 g1 + ⋅⋅⋅ + sn gn);
- ∩n
i=1 Ker gi ⊆ Ker f; - there exists a real number r such that |f(x)| ≤ r |gi(x)| for all x ∈ X and all i.
If f is a non-trivial linear functional on X with kernel N, x ∈ X satisfies f(x) = 1, and U is a balanced subset of X, then N ∩ (x + U) = ∅ if and only if |f(u)| < 1 for all u ∈ U.[7]
Hyperplanes and maximal subspaces
- Definition:[4] A vector subspace M of a vector space X is called proper if M ≠ X and it is called maximal in X if it is proper and the only vector subspace of X that contains M is X itself.
- Definition:[4] A hyperplane in X is a translate of a maximal vector subspace (i.e. it is a set of the form x + M := { x + m : m ∈ M} where M is a maximal vector subspace of X and x is any element of X.
A vector subspace M of X is maximal in X if and only if it is the kernel of some non-trivial linear functional on X (i.e. M = ker f for some non-trivial linear functional f on X).[4]
A vsubset H of X is a hyperplane in X if and only if there exists some non-trivial linear functional f on X and some scalar a such that H = { x ∈ X : f(x) = a}, or equivalently, if and only if there exists some non-trivial linear functional f on X such that H = { x ∈ X : f(x) = 1}.[4]
Continuous linear functionals
Functional analysis is a field of mathematics dedicated to studying vector spaces over ℝ or ℂ when they are endowed with a topology making addition and scalar multiplication continuous. Such objects are call topological vector spaces (TVSs). Prominent examples of TVSs include Euclidean space, normed spaces, Banach spaces, and Hilbert spaces.
If X is a topological vector space over 𝕂 then the continuous dual space or simply the dual space is the vector space over 𝕂 consisting of all continuous linear functionals on X. If X is a Banach space, then so is its (continuous) dual space. To distinguish the ordinary dual space from the continuous dual space, the former is sometimes called the algebraic dual space. In finite dimensions, every linear functional is continuous, so the continuous dual is the same as the algebraic dual, but in infinite dimensional locally convex space, the continuous dual is a proper subspace of the algebraic dual.
A linear functional f on a (not necessarily locally convex) topological vector space X is continuous if and only if there exists a continuous seminorm p on X such that |f| ≤ p.[8]
Every non-trivial continuous linear functional on a TVS X is an open map.[7]
A linear functional on a complex TVS is bounded (resp. continuous) if and only if its real part is bounded (resp. continuous).[3]
A linear functional is continuous if and only if its kernel is closed. [11]
If f is a linear functional on a topological vector space (TVS) X (e.g. a normed space) and if p is a continuous sublinear function on X then |f| ≤ p}} implies that f is continuous.
Equicontinuity of families of linear functionals
Let X be a topological vector space (TVS) with continuous dual space X'.
For any subset H of X', the following are equivalent:[12]
- H is equicontinuous;
- H is contained in the polar of some neighborhood of 0 in X;
- the (pre)polar of H is a neighborhood of 0 in X;
If H is an equicontinuous subset of X' then the following sets are also equicontinuous: the weak-* closure, the balanced hull, the convex hull, and the convex balanced hull.[12] Moreover, Alaoglu's theorem implies that the weak-* closure of an equicontinuous subset of X' is weak-* compact (and thus that every equicontinuous subset weak-* relatively compact[13]).[12]
Examples and applications
Linear functionals in Rn
Suppose that vectors in the real coordinate space Rn are represented as column vectors
For each row vector [a1 … an] there is a linear functional f defined by
and each linear functional can be expressed in this form.
This can be interpreted as either the matrix product or the dot product of the row vector [a1 ... an] and the column vector :
(Definite) Integration
Linear functionals first appeared in functional analysis, the study of vector spaces of functions. A typical example of a linear functional is integration: the linear transformation defined by the Riemann integral
is a linear functional from the vector space C[a, b] of continuous functions on the interval [a, b] to the real numbers. The linearity of I follows from the standard facts about the integral:
Evaluation
Let Pn denote the vector space of real-valued polynomial functions of degree ≤n defined on an interval [a, b]. If c ∈ [a, b], then let evc : Pn → R be the evaluation functional
The mapping f → f(c) is linear since
If x0, ..., xn are n + 1 distinct points in [a, b], then the evaluation functionals evxi, i = 0, 1, ..., n form a basis of the dual space of Pn. (Lax (1996) proves this last fact using Lagrange interpolation.)
Application to quadrature
The integration functional I defined above defines a linear functional on the subspace Pn of polynomials of degree ≤ n. If x0, ..., xn are n + 1 distinct points in [a, b], then there are coefficients a0, ..., an for which
for all f ∈ Pn. This forms the foundation of the theory of numerical quadrature.
This follows from the fact that the linear functionals evxi : f → f(xi) defined above form a basis of the dual space of Pn.[14]
Linear functionals in quantum mechanics
Linear functionals are particularly important in quantum mechanics. Quantum mechanical systems are represented by Hilbert spaces, which are anti–isomorphic to their own dual spaces. A state of a quantum mechanical system can be identified with a linear functional. For more information see bra–ket notation.
Distributions
In the theory of generalized functions, certain kinds of generalized functions called distributions can be realized as linear functionals on spaces of test functions.
Visualizing linear functionals
In finite dimensions, a linear functional can be visualized in terms of its level sets, the sets of vectors which map to a given value. In three dimensions, the level sets of a linear functional are a family of mutually parallel planes; in higher dimensions, they are parallel hyperplanes. This method of visualizing linear functionals is sometimes introduced in general relativity texts, such as Gravitation by Misner, Thorne & Wheeler (1973).
Dual vectors and bilinear forms
Every non-degenerate bilinear form on a finite-dimensional vector space V induces an isomorphism V → V∗ : v ↦ v∗ such that
where the bilinear form on V is denoted ⟨ , ⟩ (for instance, in Euclidean space ⟨v, w⟩ = v ⋅ w is the dot product of v and w).
The inverse isomorphism is V∗ → V : v∗ ↦ v, where v is the unique element of V such that
The above defined vector v∗ ∈ V∗ is said to be the dual vector of v ∈ V.
In an infinite dimensional Hilbert space, analogous results hold by the Riesz representation theorem. There is a mapping V → V∗ into the continuous dual space V∗. However, this mapping is antilinear rather than linear.
Bases in finite dimensions
Basis of the dual space in finite dimensions
Let the vector space V have a basis , not necessarily orthogonal. Then the dual space V* has a basis called the dual basis defined by the special property that
Or, more succinctly,
where δ is the Kronecker delta. Here the superscripts of the basis functionals are not exponents but are instead contravariant indices.
A linear functional belonging to the dual space can be expressed as a linear combination of basis functionals, with coefficients ("components") ui,
Then, applying the functional to a basis vector ej yields
due to linearity of scalar multiples of functionals and pointwise linearity of sums of functionals. Then
So each component of a linear functional can be extracted by applying the functional to the corresponding basis vector.
The dual basis and inner product
When the space V carries an inner product, then it is possible to write explicitly a formula for the dual basis of a given basis. Let V have (not necessarily orthogonal) basis . In three dimensions (n = 3), the dual basis can be written explicitly
for i = 1, 2, 3, where ε is the Levi-Civita symbol and the inner product (or dot product) on V.
In higher dimensions, this generalizes as follows
where is the Hodge star operator.
See also
- Discontinuous linear map
- Locally convex topological vector space – Type of topological vector space
- Positive linear functional
- Multilinear form – Map from multiple vectors to an underlying field of scalars, linear in each argument
- Topological vector space – Vector space with a notion of continuity
Notes
- This follows since just as the image of a vector subspace under a linear transformation is a vector subspace, so is the image of X under f. However, the only vector subspaces (that is, 𝕂-subspaces) of 𝕂 are { 0 } and 𝕂 itself.
- If g ∈ Xℝ# is non-trivial then the range of g is ℝ; but then g can't belong to X# because if it did, then its range would have to be ℂ rather than ℝ.
- Narici 2011, pp. 177-220.
- Narici 2011, pp. 10-11.
- Obvious if X is a real vector space. For the non-trivial direction, assume that Re f ≤ p on X and let x ∈ X. Let r ≥ 0 and t be real numbers such that f(x) = reit. Then |f(x)| = r = f(e-itx) = Re (f(e-itx)) ≤ p(e-itx) = p(x).
- Wilansky 2013, p. 20.
- Narici 2011, p. 128.
- Narici 2011, p. 126.
- Rudin 1991, pp. 63-64.
- Narici 2011, pp. 1-18.
- Rudin 1991, Theorem 1.18
- Narici 2011, pp. 225-273.
- Schaefer, Corollary 4.3
- Lax 1996
- J.A. Wheeler; C. Misner; K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. p. 57. ISBN 0-7167-0344-0.
References
- Bishop, Richard; Goldberg, Samuel (1980), "Chapter 4", Tensor Analysis on Manifolds, Dover Publications, ISBN 0-486-64039-6
- Conway, John B. (1990). A Course in Functional Analysis. Graduate Texts in Mathematics. 96 (2nd ed.). Springer. ISBN 0-387-97245-5.
- Dunford, Nelson (1988). Linear operators (in Romanian). New York: Interscience Publishers. ISBN 0-471-60848-3. OCLC 18412261.
- Halmos, Paul (1974), Finite dimensional vector spaces, Springer, ISBN 0-387-90093-4
- Lax, Peter (1996), Linear algebra, Wiley-Interscience, ISBN 978-0-471-11111-5
- Misner, Charles W.; Thorne, Kip. S.; Wheeler, John A. (1973), Gravitation, W. H. Freeman, ISBN 0-7167-0344-0
- Narici, Lawrence; Beckenstein, Edward (2011). Topological Vector Spaces. Pure and applied mathematics (Second ed.). Boca Raton, FL: CRC Press. ISBN 978-1584888666. OCLC 144216834.
- Rudin, Walter (January 1, 1991). Functional Analysis. International Series in Pure and Applied Mathematics. 8 (Second ed.). New York, NY: McGraw-Hill Science/Engineering/Math. ISBN 978-0-07-054236-5. OCLC 21163277.CS1 maint: ref=harv (link) CS1 maint: date and year (link)
- Schaefer, Helmut H.; Wolff, Manfred P. (1999). Topological Vector Spaces. GTM. 8 (Second ed.). New York, NY: Springer New York Imprint Springer. ISBN 978-1-4612-7155-0. OCLC 840278135.CS1 maint: ref=harv (link)
- Schutz, Bernard (1985), "Chapter 3", A first course in general relativity, Cambridge, UK: Cambridge University Press, ISBN 0-521-27703-5
- Trèves, François (August 6, 2006) [1967]. Topological Vector Spaces, Distributions and Kernels. Mineola, N.Y.: Dover Publications. ISBN 978-0-486-45352-1. OCLC 853623322.CS1 maint: ref=harv (link) CS1 maint: date and year (link)