- Research
- Open access
- Published:
A first-order adjoint and a second-order hybrid method for an energy output least-squares elastography inverse problem of identifying tumor location
Boundary Value Problems volume 2013, Article number: 263 (2013)
Abstract
In this paper we investigate the elastography inverse problem of identifying cancerous tumors within the human body. From a mathematical standpoint, the elastography inverse problem consists of identifying the variable Lamé parameter μ in a system of linear elasticity where the underlying object exhibits nearly incompressible behavior. This problem is subsequently posed as an optimization problem using an energy output least-squares (EOLS) functional, but the nonlinearity that arises makes the computation of the EOLS functional’s derivatives challenging. We employ an adjoint method for the computation of the gradient, something shown to be an efficient method in recent studies, and also give a parallelizable hybrid method for the computation of the EOLS functional’s second derivative. Detailed discrete formulas and nontrivial computational examples are provided to show the feasibility of both the adjoint and hybrid approaches. Furthermore, all results are given in the framework of a general saddle point problem allowing easy adaptation to numerous other inverse problems.
MSC:35R30, 65N30.
1 Introduction
Consider the following system of partial differential equations describing the response of an isotropic elastic object to certain body forces and traction applied to its boundary:
Here the domain Ω is a subset of or and is its boundary. In (1a)-(1d), the vector-valued function represents the displacement of the elastic object, f is the applied body force, n is the unit outward normal, and
is the linearized strain tensor. The resulting stress tensor σ in the stress-strain law (1b) is obtained under the assumption that the elastic object is isotropic and the displacement is small enough so that a linear relationship holds. The Lamé parameters μ and λ quantify the elastic properties of the material. (In the following, for simplicity we set .)
In this work our objective is to investigate the elastography (also known as elasticity imaging) inverse problem of locating cancerous tumors within the human body. This inverse problem consists of identifying the variable parameter μ in (1a)-(1d) from a measurement of the displacement field u. Conversely, the direct problem for (1a)-(1d) is to find the displacement u when function h, the variable coefficients μ and λ, and the body force f are all known. The underlying idea is that differences in molecular makeup as well as microscopic and macroscopic structure result in significant differences in the stiffness of living soft tissue (see [1]). Moreover, changes in tissue stiffness generally correlate with changes in pathological state, with many cancers appearing as hard nodules within the surrounding softer tissue. In a clinical setting, measurements of displacement in human tissue can be obtained using ultrasound and this can then serve as data in the context of the elastography inverse problem. By solving this inverse problem and recovering μ, tumor locations can be identified using the marked differences in elastic properties between the healthy and unhealthy tissue. Additionally, we note that in the elastography inverse problem the human body is treated as a nearly incompressible object where the parameter λ is significantly large and hence only the parameter μ is sought.
Although numerous authors have contributed to using the elasticity properties of soft tissue as a tool to differentiate between normal and cancerous tissue, Raghavan and Yagle [2] were among the first authors to realize that this study can be best done in an inverse problem framework using measured strains and the equations of equilibrium to recover elasticity (cf. (1a)-(1d)). Since then, many studies have been devoted to investigating various aspects of the elastography inverse problem and the interested reader is referred to [3–8] and the cited reference therein. Additionally, a detailed account of the recent developments in elastography inverse problem can be found in the survey article by Doyley [1]. See also [9–21] and the cited references therein for more details.
One of the main technical challenges in the study of this inverse problem stems from the fact that the human body is treated as a nearly incompressible object. That is, the elasticity modulus λ is significantly large (and particularly ), rendering classical finite element methods ineffective due to the so-called locking effect. In the literature, several approaches have been proposed to overcome the locking effect, and in this work we employ the mixed finite elements strategy.
In the following, we provide the necessary details for the transformation of system (1a)-(1d) into a saddle point problem to which the mixed finite element approach can be applied.
We begin by recalling that the dot product of two tensors A and B can be denoted by . That is, for tensors A and B, we have
Given a sufficiently smooth domain , the -norm of a tensor-valued function is given by
On the other hand, for a vector-valued function , the -norm is given by
whereas the -norm by
In the following discussion, for the sake of simplicity, in (1a)-(1d) we set . For this case, the space of test functions, denoted by , is given by
By using Green’s identity and boundary conditions (1c) and (1d), we obtain the following weak form of elasticity system (1a)-(1d): Find such that
The mixed finite elements strategy, which, in the present context, consists of introducing a pressure term , is as follows:
As , (3) yields the incompressibility limit
The weak formulation of (3) reads
By using relation (3), the weak form (2) can be expressed as follows: Find such that
where the pressure p is also an unknown.
Therefore, the problem of finding , satisfying (2), has now been converted into the saddle point problem of finding such that
where and .
For the saddle point formulation, the Babuška-Brezzi condition provides guidance in the choice of finite element spaces necessary for a stable numerical approximation (see [22]).
The primary objective of this work is to develop an efficient computational framework for the elastography inverse problem. For this we employ an adjoint approach for the derivative computation of a recently proposed energy output least-squares (EOLS) functional [23]. We recall that Oberai et al. [24] used the adjoint approach to compute efficiently the gradient of the output least-squares functional. Inspired by Tortorelli and Michaleris [25], we also devise a hybrid method for an efficient computation of the second-order derivative of the EOLS functional. In this direction, we would also like to draw attention to an interesting paper by Cioacaa, Alexea, and Sandua [26] where a second-order adjoint method is studied. All the results and formulas given are for a general saddle point problem and hence can easily be adapted to a wide range of inverse problems for variational problems (see [27]). In the derivation of the adjoint formulas, we do not include the regularization functional while considering the EOLS functional. However, we use a smooth regularizer for the identification of a smooth parameter and a BV regularizer for the identification of discontinuous coefficients.
2 Optimization approach for inverse problems in saddle point problems
Let and Q be real Hilbert spaces, let B be a real Banach space, and let A be a nonempty, closed, and convex subset of B. Here B is the coefficient/parameter space and A is the set of all admissible coefficients. Let be a trilinear map which we assume to be symmetric with respect to the second and third arguments. That is, for every and for all , we have . Let be a bilinear form, let be a symmetric bilinear form, and let be a linear and continuous map. We assume that there are positive constants , , , , and such that the following inequalities hold:
Remark 2.1 We remark that for the subsequent development of our approach, it suffices to assume that A is a closed and convex set of admissible parameters. Most commonly, it is chosen as the set of box-constraints. In some works, the space in which A resides is required to be compactly embedded in the solution space (see [28–31]). In our discrete examples, we have used linear elements to approximate the imposed box-constraints.
We consider the following saddle point problem: Given , find such that
Given all the data, the direct problem in this setting is to find . However, our focus is on the inverse problem of finding a parameter that makes (8a)-(8b) true for a measurement of .
Evidently, saddle point problem (6a)-(6b) connected to the elastography inverse problem of identifying a variable parameter μ in the system of incompressible linear elasticity can be deduced by setting:
A common approach to solve inverse problems of parameter identification in PDEs is to minimize the output least-squares functional, which, in the present context, can be defined by
where , is the measured data, and is the solution of (8a)-(8b) corresponding to ℓ.
The output least-squares solution to the inverse problem of identifying ℓ is the one that solves the following optimization problem: Find such that
Recently, in [23], the following objective functional was proposed to solve the inverse problem of identifying the variable parameter in saddle point problem (8a)-(8b):
where is the measured data and is the solution of (8a)-(8b) corresponding to ℓ.
Clearly, to solve an optimization problem with the above objective functional, we need to compute its derivative which, in turn, requires us to compute the derivative of the solution map. It is well known that one of the most challenging aspects in the study of inverse problems is in finding an efficient computation of the derivative of the solution map. We will now develop an adjoint method for the computation of the first derivative of the EOLS functional and then a new hybrid method for the computation of the functional’s second derivative.
For every , the map is well defined and single-valued. The following result for the differentiability of S, which was announced in [23] without a proof, will be needed.
Theorem 2.1 For each ℓ in the interior of A, is infinitely differentiable at ℓ.
-
1.
Given u, the first derivative is the unique solution of the saddle point problem:
(12a)(12b) -
2.
The second-order derivative
is the unique solution of the saddle point problem
(13a)(13b)
Proof We define a map by , where and are the duals of and Q, and , , and are the associated dual elements given by the Riesz theorem. Then saddle point problem (8a)-(8b) is equivalent to the following implicit equation:
The differentiability of follows from the implicit function theorem. In fact, the map G is infinitely differentiable and the partial derivative with respect to variable is given by
By [[22], Proposition 4], the map is an isomorphism. Therefore, using the implicit function theorem, the map is infinitely differentiable at any ℓ in the interior of A.
We now compute the first and second derivatives of the coefficient-to-solution map. By using equation (8a), for any and for any sufficiently small , we have
and by manipulating the terms in these two equations, we obtain
which, by passing the above equation to the limit when , yields (12a)
Analogously, using equation (8b), for any and for every sufficiently small , we have
and by manipulating the above two equations, we obtain
which, by passing the above equation to limit , gives
which is (12b). Consequently, (12a) and (12b) characterize the first derivative.
The same arguments can be used to compute the form of the second derivative. From (12a), for any and for any sufficiently small , we have
and by rearranging the above set of equations, we obtain
Since the solution map is twice Fréchet differentiable, by passing to the limit , we get
which, after a rearrangement of terms, yields (13a)
From (12b), for any and any sufficiently small , we have
and by rearranging the above two equations, we get
By passing to limit , we finally deduce
which in conjunction with (13b) forms the corresponding saddle point whose unique solution characterizes the second derivative . □
3 An adjoint and a hybrid method for the energy output least squares
The developed adjoint method for the EOLS functional,
is based on the key observation that the underlying saddle point problem can equivalently be posed as a variational equation of finding such that
where
By a direct computation, we have
We define
and, by using (16), notice that
Therefore, for any ‘test function’ , we have
where stands for the partial derivative with respect to ℓ.
The key idea behind the adjoint method is to choose a particular v to avoid the computation of δu. By a direct computation and taking into account (18), we obtain
Now, let be the unique solution of the saddle point problem
which exists, by standard arguments, since the above problem is just (8a)-(8b) with .
By setting in (20), we obtain
where we have used the symmetry of the trilinear form T, , and (21a)-(21b). Since , we obtain
Therefore, using (19), we have
Summarizing, we have the following scheme to compute the derivative given :
-
1.
Compute u by (16).
-
2.
Compute w by (21a)-(21b).
-
3.
Compute by (22).
Let us now develop the hybrid method for the computation of the second-order derivative. In the hybrid method proposed below, the derivative δu is computed directly while the computation of the second derivative is avoided by using an adjoint method. We will follow the same general scheme that was used above, but here we will use derivative formula (12a)-(12b).
Let be a fixed direction. Then, for any , we define
By the construction of H, for every , we have
By a simple calculation, we have
Let be the unique solution of the saddle point problem (cf. (12a)-(12b)):
By setting in (24), we have
Recall that by derivative formula (13a)-(13b), we have
which implies
Consequently, from (23), we get
and, in particular,
Summarizing, we propose the following scheme to compute the derivative given , .
-
1.
Compute by (16).
-
2.
Compute by (12a)-(12b).
-
3.
Compute by (25a)-(25b).
-
4.
Compute by (26).
4 Discretization formulas for the adjoint and the hybrid method
In this section, we collect discrete formulae for saddle point problem (8a)-(8b) and the associated inverse problem. We begin, therefore, with a triangulation on Ω, is the space of all piecewise continuous polynomials of degree relative to , is the space of all piecewise continuous polynomials of degree relative to , and is the space of all piecewise continuous polynomials of degree relative to .
In order to represent the discrete saddle point problem in a computable form, we proceed as follows. We represent bases for , , and by , , and , respectively. The space is then isomorphic to and for any , we define by for , where the nodal basis corresponds to the nodes . Conversely, each corresponds to defined by . Similarly, will correspond to , where , , and , where are the nodes of the mesh defining . Finally, will correspond to , where , , and , where are the nodes of the mesh defining . (The spaces , , and are defined relative to the same elements, but the nodes will be different if .)
Recall that the discrete saddle point problem seeks, for each , the unique with
We define to be the finite element solution operator that assigns to each coefficient the unique approximate solution . Then , where U is defined by
and where the stiffness matrix and the load vector are given by
with
For future reference, it will be useful to note that
where the summation convention is used and T is the tensor defined by
Let us now compute the discrete analogue of energy least-squares objective functional. By using the above notations, the discrete form of
is given by
In order to get an operative expression for the gradient, we need to consider the so-called adjoint stiffness matrix defined by the following condition:
4.1 Computation of the gradient by using the adjoint method
Using the above notation, we have the following discrete adjoint method for the computation of gradient of .
-
1.
We compute by solving the linear system
(30) -
2.
We compute by solving the linear system
(31) -
3.
The gradient can be calculated by using the adjoint stiffness matrix. From (22), we have
(32)a direct discretization gives the following:
and therefore the gradient is given by
(33)
4.2 Computation of the Hessian by using a hybrid method
Recall that we have established the following:
By the standard discretization scheme, we have
-
1.
,
-
2.
,
-
3.
,
-
4.
.
Consequently, we have the following explicit formula for the Hessian:
Summarizing, we have the following scheme for the computation of the second derivative of the EOLS:
-
1.
Compute by solving linear system (30).
-
2.
Compute by solving linear system (31).
-
3.
Compute by solving m linear systems.
-
4.
Compute by using formula (35).
We note that to compute the Hessian using the hybrid method requires the solution of linear systems.
5 Numerical experiments
We consider here two representative examples of elastography inverse problems for the recovery of a variable μ on a two-dimensional isotropic domain with boundary . In the first example, a smooth coefficient is recovered using both the adjoint and hybrid gradient calculation methods. For the second example, we examine the recovery of a discontinuous coefficient using the adjoint method.
All examples are solved on a quadrangular mesh with 5,476 quadrangles and 16,576 total degrees of freedom. Example 1 uses a smooth Tikhonov-type regularization method, whereas the discontinuities in Example 2 necessitate the use of a BV-regularization scheme (see [23] for a more thorough discussion of regularization).
5.1 Example 1
In this example we consider the recovery of a smooth coefficient in which the left and right domain boundaries () are fixed with static condition and the top and bottom boundaries have Neumann condition . The functions defining the coefficient, load, and boundary conditions are as follows:
For this example, the underlying optimization problem was solved using both a first-order Newton-CG-Trust Region algorithm as well as a second-order quasi-Newton method, using the adjoint and hybrid gradient calculations outlined in the preceding sections, respectively. Comparatively, the hybrid method converges faster to the solution in only 9 algorithm iterations compared to 13 iterations for the adjoint method when both are started from the same initial point and under the same stopping criteria (). This can be seen qualitatively in Figures 1 and 2 through the comparison of the computed μ at selected intermediary algorithm steps (subfigures (a) and (b)).
5.2 Example 2
For the discontinuous example, the top of the region is taken as and fixed with (constant) Dirichlet condition . The remaining edges of the region are taken as with Neumann condition . The functions defining the coefficient, load, and boundary conditions are as follows:
where and .
6 Concluding remarks
In this work we have presented a detailed application of the adjoint method for efficiently computing the gradient of the energy output least-squares functional as well as a hybrid method for calculating the functional’s second derivative. We have also provided two numerical examples of elastography inverse problems to demonstrate the overall feasibility of implementation and establish the relative effectiveness of these methods when coupled with the appropriate first-order and second-order optimization algorithms. See Figure 3.
One issue not addressed in depth was the comparative performance of these methods, measured both against existing schemes and against one other. In short, we note that the hybrid method requires the solution of linear systems with m scaling along with the size of the mesh. However, the m systems remain entirely independent, allowing for the parallelization of parts of the computation and thus granting significant performance gains and potential advantages over other strategies. In a future work, we look to extend our study here into just such a thorough analysis and carefully consider the performance of the adjoint and hybrid derivative computation methods.
References
Doyley MM: Model-based elastography: a survey of approaches to the inverse elasticity problem. Phys. Med. Biol. 2012., 57: Article ID R35 10.1088/0031-9155/57/3/R35
Raghavan KR, Yagle AE: Forward and inverse problems in elasticity imaging of soft tissues. IEEE Trans. Nucl. Sci. 1994, 41: 1639-1648.
Aguilo MA, Aquino W, Brigham JC, Fatemi M: An inverse problem approach for elasticity imaging through vibroacoustics. IEEE Trans. Med. Imaging 2010, 29: 1012-1021.
Ammari H, Garapon P, Jouve F: Separation of scales in elasticity imaging: a numerical study. J. Comput. Math. 2010, 28: 354-370.
Arnold A, Reichling S, Bruhns O, Mosler J: Efficient computation of the elastography inverse problem by combining variational mesh adaption and clustering technique. Phys. Med. Biol. 2010, 55: 2035-2056.
Beretta E, Bonnetier E, Francini E, Mazzucato A: Small volume asymptotics for anisotropic elastic inclusions. Inverse Probl. Imaging 2012, 6: 1-23.
Ji L, McLaughlin J: Recovery of Lamé parameter μ in biological tissues. Inverse Probl. 2004, 20: 1-24.
Kallel F, Bertrand M: Tissue elasticity reconstruction using linear perturbation method. IEEE Trans. Med. Imaging 1996, 15: 299-313.
Barbone PE, Bamber JC: Quantitative elasticity imaging: what can and cannot be inferred from strain images. Phys. Med. Biol. 2002, 47: 2147-2164.
Barbone PE, Gokhale NH: Elastic modulus imaging: on the uniqueness and nonuniqueness of the elastography inverse problem in two dimensions. Inverse Probl. 2004, 20: 283-296.
Braess D: Finite Elements: Theory, Fast Solvers, and Applications in Solid Mechanics. 3rd edition. Cambridge University Press, Cambridge; 2007.
Chan TF, Tai XC: Identification of discontinuous coefficients in elliptic problems using total variation regularization. SIAM J. Sci. Comput. 2003, 25: 881-904.
Gockenbach MS, Khan AA: Identification of Lamé parameters in linear elasticity: a fixed point approach. J. Ind. Manag. Optim. 2005, 1: 487-497.
Gockenbach MS, Jadamba B, Khan AA: Numerical estimation of discontinuous coefficients by the method of equation error. Int. J. Math. Comput. Sci. 2006, 1: 343-359.
Gockenbach MS, Jadamba B, Khan AA: Equation error approach for elliptic inverse problems with an application to the identification of Lamé parameters. Inverse Probl. Sci. Eng. 2008, 16: 349-367.
Harrigan T, Konofagou EE: Estimation of material elastic moduli in elastography: a local method, and an investigation of Poisson ratio sensitivity. J. Biomech. 2004, 37: 1215-1221.
Jadamba B, Khan AA, Raciti F: On the inverse problem of identifying Lamé coefficients in linear elasticity. Comput. Math. Appl. 2008, 56: 431-443.
Jadamba B, Khan AA, Sama M: Inverse problems of parameter identification in partial differential equations. In Mathematics in Science and Technology. World Scientific, Hackensack; 2011:228-258.
Konofagou E, Harrigan T, Ophir J, Krouskop T: Poroelastography: estimation and imaging of the poroelastic properties of tissues. IEEE Proceedings of the Symposium in Ultrasonics, Ferroelectrics and Frequency Control 1999, 1627-1630, Lake Tahoe, NV
McLaughlin J, Yoon JR: Unique identifiability of elastic parameters from time-dependent interior displacement measurement. Inverse Probl. 2004, 20: 25-45.
Mehrabian H, Campbell G, Samani A: A constrained reconstruction technique of hyperelasticity parameters for breast cancer assessment. Phys. Med. Biol. 2012, 53: 7489-7508.
Brezzi F, Fortin M: Mixed and Hybrid Finite Element Methods. Springer, New York; 1991.
Doyley, MM, Jadamba, B, Khan, AA, Sama, M, Winkler, B: A new energy inversion for parameter identification in saddle point problems with an application to the elasticity imaging inverse problem of predicting tumor location (2013, submitted)
Oberai AA, Gokhale NH, Feijóo GR: Solution of inverse problems in elasticity imaging using the adjoint method. Inverse Probl. 2003, 19: 297-313.
Tortorelli DA, Michaleris P: Design sensitivity analysis: overview and review. Inverse Probl. Eng. 1994, 1: 71-105.
Cioacaa A, Alexea M, Sandua A: Second-order adjoints for solving PDE-constrained optimization problems. Optim. Methods Softw. 2012, 27: 625-653.
Goeleven D, Motreanu D 2. In Variational and Hemivariational Inequalities - Theory, Methods and Applications. Springer, Berlin; 2003.
Bush, N, Jadamba, B, Khan, AA, Raciti, F: Identification of a parameter in fourth-order partial differential equations by an equation error approach (2014, to appear)
Crossen, E, Gockenbach, MS, Jadamba, B, Khan, AA, Winkler, B: An equation error approach for the elasticity imaging inverse problem for predicting tumor location. Comput. Math. Appl. (2013, to appear)
Gockenbach MS, Khan AA: An abstract framework for elliptic inverse problems. Part 1: an output least-squares approach. Math. Mech. Solids 2007, 12: 259-276.
Gockenbach MS, Khan AA: An abstract framework for elliptic inverse problems. Part 2: an augmented Lagrangian approach. Math. Mech. Solids 2009, 14: 517-539.
Acknowledgements
The work of AA Khan is supported by RIT’s COS D-RIG Acceleration Research Funding Program 2012-2013 and a grant from the Simons Foundation (#210443 to Akhtar Khan). The work of M Sama is partially supported by Ministerio de Ciencia (Spain), project (MTM2012-30942).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
This research was carried out during Prof. Miguel Sama’s visit at RIT and all the work was done at that time in a collaborative manner. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Cahill, N.D., Jadamba, B., Khan, A.A. et al. A first-order adjoint and a second-order hybrid method for an energy output least-squares elastography inverse problem of identifying tumor location. Bound Value Probl 2013, 263 (2013). https://doi.org/10.1186/1687-2770-2013-263
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1687-2770-2013-263