byuflowlab / implicitad.jl Goto Github PK

Automates adjoints. Forward and reverse mode algorithmic differentiation around implicit functions (not propagating AD through), as well as custom rules to allow for mixed-mode AD or calling external (non-AD compatible) functions within an AD chain.

License: MIT License

Julia 100.00%

algorithmic-differentiation automatic-differentiation implicit-functions

implicitad.jl's Introduction

ImplicitAD.jl

Summary: Automate steady and unsteady adjoints.

Make implicit functions compatible with algorithmic differentiation (AD) without differentiating inside the solvers (discrete adjoint). Even though one can sometimes propagate AD through a solver, this is typically inefficient and less accurate. Instead, one should use adjoints or direct (forward) methods. However, implementing adjoints is often cumbersome. This package allows for a one-line change to automate this process. End-users can then use your package with AD normally, and utilize adjoints automatically.

We've also enabled methods to efficiently compute derivatives through explicit and implicit ODE solvers (unsteady discrete adjoint). For the implicit solve at each time step we can apply the same methodology. However, both still face memory challenges for long time-based simulations. We analytically propagate derivatives between time steps so that reverse mode AD tapes only need to extend across a single time step. This allows for arbitrarily long time sequences without increasing memory requirements.

As a side benefit the above functionality easily allows one to define custom AD rules. This is perhaps most useful when calling code from another language. We provide fall backs for utilizing finite differencing and complex step efficiently if the external code cannot provide derivatives (ideally via Jacobian vector products). This functionality can also be used for mixed-mode AD.

Author: Andrew Ning and Taylor McDonnell

Features:

Compatible with ForwardDiff and ReverseDiff (or any ChainRules compliant reverse mode AD package)
Compatible with any solver (no differentiation occurs inside the solver)
Simple drop in functionality
Customizable subfunctions to accommodate different use cases (e.g., custom linear solvers, factorizations, matrix-free operators)
Version for ordinary differentiation equations (i.e., discrete unsteady adjoint)
Analytic overrides for linear systems (more efficient)
Analytic overrides for eigenvalue problems (more efficient)
Can provide custom rules to be inserted into the AD chain. Provides finite differencing and complex step defaults for cases where AD is not available (e.g., calling another language).

Documentation:

Start with the tutorial to learn usage.
The API is described in the reference page.
The theory and also some scaling examples in this paper. A supplementary document deriving the linear and eigenvalue cases is available in the theory section.

Run Unit Tests:

pkg> activate .
pkg> test

Citing:

For now, please cite the following preprint. DOI: 10.48550/arXiv.2306.15243

Other Packages:

Nonconvex.jl and ImplicitDifferentiation.jl are other prior implementations of the nonlinear portion of this package. SciML provides support for continuous unsteady adjoints of ODEs. They have also recently added an implementation for the nonlinear case.

implicitad.jl's People

Contributors

Stargazers

Watchers

Forkers

sobhanmp tekajuna olgadoronina jomorlier dingraha jmaack24

implicitad.jl's Issues

Alternative method for providing partial derivatives?

The current method for providing user-defined jacobians could be simplified by allowing users to return the jacobian matrix from the solution procedure instead. Then a new function for the jacobian wouldn't have to be defined since it is often already computed as part of the solution procedure. The relevant modifications to make this happen are in the solve-drdy branch.

Partial derivative matrix cannot be safely re-used.

Currently, this package computes the partial derivative matrix A = drdy(residual, y, x, p) as part of the forward pass for use in the reverse pass. This approach appears to yield incorrect results when used as part of an iterative process.

Here's a minimum working example:

    function residual!(r, y, x, p)
        r[1] = (y[1] + x[1])*(y[2]^3-x[2])+x[3]
        r[2] = sin(y[2]*exp(y[1])-1)*x[4]
        return r
    end

    function solve(x, p)
        rwrap(r, y) = residual!(r, y, x[1:4], p)
        res = nlsolve(rwrap, [0.1; 1.2], autodiff=:forward)
        return res.zero
    end

    A = zeros(2, 2)

    function drdy(residual, y, x, p)
        A[1, 1] = y[2]^3-x[2]
        A[1, 2] = 3*y[2]^2*(y[1]+x[1])
        u = exp(y[1])*cos(y[2]*exp(y[1])-1)*x[4]
        A[2, 1] = y[2]*u
        A[2, 2] = u
        return A
    end

    function modprogram(x)
        z = 2.0*x
        w = z + x.^2
        y = implicit(solve, residual!, w, (), drdy=drdy) # first iteration
        y = implicit(solve, residual!, [y[1], y[2], w[3], w[4]], (), drdy=drdy) # second iteration
        return y[1] .+ w*y[2]
    end

    x = [1.0; 2.0; 3.0; 4.0; 5.0]

    J1 = ForwardDiff.jacobian(modprogram, x)
#      5×5 Matrix{Float64}:
#     8.68311  -0.709511   1.14925   7.45103e-16   0.0
#    -2.85187  12.4013     3.37157   1.4703e-15    0.0
#    -5.48354  -4.00227   25.7932    2.48556e-15   0.0
#    -8.86712  -6.47184   10.483    24.138         0.0
#   -13.0026   -9.4902    15.3721    5.38633e-15  28.9657

    J2 = ReverseDiff.jacobian(modprogram, x)
#     5×5 Matrix{Float64}:
#  10.0056    0.255723   0.67288   7.41178e-16   0.0
#   1.02787  15.233      1.97403   1.45878e-15   0.0
#   1.97638   1.4425    23.1061    2.46343e-15   0.0
#   3.19589   2.33259    6.1377   24.138         0.0
#   4.68641   3.42047    9.00023   5.33384e-15  28.9657

The solution is to compute the partial derivative matrix A = drdy(residual, y, x, p) as part of the reverse-pass. The corrected rule would look like this:

# Provide a ChainRule rule for reverse mode
function ChainRulesCore.rrule(::typeof(implicit), solve, residual, x, p, drdy, lsolve)

    # evaluate solver
    y = solve(x, p)

    function pullback(ybar)
        A = drdy(residual, y, x, p)
        u = lsolve(A', ybar)
        xbar = vjp(residual, y, x, p, -u)
        return NoTangent(), NoTangent(), NoTangent(), xbar, NoTangent(), NoTangent(), NoTangent()
    end

    return y, pullback
end

TagBot trigger issue

This issue is used to trigger TagBot; feel free to unsubscribe.

If you haven't already, you should update your TagBot.yml to include issue comment triggers.
Please see this post on Discourse for instructions and more details.

If you'd like for me to do this for you, comment TagBot fix on this issue.
I'll open a PR within a few hours, please be patient!

Differences with ImplicitDifferentiation.jl?

Hey there, and congrats on the package!
Could we take some time to reflect on the differences between your work and https://github.com/gdalle/ImplicitDifferentiation.jl, which I recently developed? I feel like they have similar goals, and maybe we could work together to avoid duplicates?

Keep getting error with provide_rule

I have a very similar function that gets me matrices and vectors from an external geometry module (here using Python and returns a sample output) for which I plan to finite difference when needed in the AD chain.

`using PyCall
py"""
def demopython(r,p):
return [r,r^3-4*r^2,r^2/2]
"""

demojulia(r,p) = [r,r^3-4*r^2,r^2/2]

function trysolve(r)
p = ()
meshdata = provide_rule(py"demopython",r,p;mode="ffd")
return meshdata
end

r = 1.0
J1 = ForwardDiff.derivative(trysolve,r)
print(J1)`

When run, this gives the stacktrace below

`ERROR: MethodError: no method matching Float64(::ForwardDiff.Dual{ForwardDiff.Tag{typeof(trysolve), Float64}, Float64, 1})

Closest candidates are:
(::Type{T})(::Real, ::RoundingMode) where T<:AbstractFloat
@ Base rounding.jl:207
(::Type{T})(::T) where T<:Number
@ Core boot.jl:792
Float64(::IrrationalConstants.Log4π)
@ IrrationalConstants ~/.julia/packages/IrrationalConstants/vp5v4/src/macro.jl:112
...

Stacktrace:
[1] convert(::Type{Float64}, x::ForwardDiff.Dual{ForwardDiff.Tag{typeof(trysolve), Float64}, Float64, 1})
@ Base ./number.jl:7
[2] cconvert(T::Type, x::ForwardDiff.Dual{ForwardDiff.Tag{typeof(trysolve), Float64}, Float64, 1})
@ Base ./essentials.jl:543
[3] macro expansion
@ ~/.julia/packages/PyCall/1gn3u/src/exception.jl:108 [inlined]
[4] PyObject(r::ForwardDiff.Dual{ForwardDiff.Tag{typeof(trysolve), Float64}, Float64, 1})
@ PyCall ~/.julia/packages/PyCall/1gn3u/src/conversions.jl:23
[5] _pycall!(ret::PyObject, o::PyObject, args::Tuple{ForwardDiff.Dual{…}, Tuple{}}, nargs::Int64, kw::Ptr{Nothing})
@ PyCall ~/.julia/packages/PyCall/1gn3u/src/pyfncall.jl:24
[6] _pycall!
@ ~/.julia/packages/PyCall/1gn3u/src/pyfncall.jl:11 [inlined]
[7] PyObject
@ ~/.julia/packages/PyCall/1gn3u/src/pyfncall.jl:86 [inlined]
[8] _provide_rule
@ ~/.julia/packages/ImplicitAD/bF0uI/src/external.jl:21 [inlined]
[9] #provide_rule#80
@ ~/.julia/packages/ImplicitAD/bF0uI/src/external.jl:19 [inlined]
[10] trysolve(r::ForwardDiff.Dual{ForwardDiff.Tag{typeof(trysolve), Float64}, Float64, 1})
@ Main ~/Desktop/BEM.jl/rankineAD.jl:66
[11] derivative(f::typeof(trysolve), x::Float64)
@ ForwardDiff ~/.julia/packages/ForwardDiff/PcZ48/src/derivative.jl:14
[12] top-level scope
@ ~/Desktop/BEM.jl/rankineAD.jl:72`

Dual Numbers in (Constant) Parameters

While it is implied that dual numbers shouldn't show up in the (constant) parameters, it could still happen in practice. It might be worth adding a note to the documentation to ensure users don't try this.

byuflowlab / implicitad.jl Goto Github PK

implicitad.jl's Introduction

ImplicitAD.jl

implicitad.jl's People

Contributors

Stargazers

Watchers

Forkers

implicitad.jl's Issues

Alternative method for providing partial derivatives?

Partial derivative matrix cannot be safely re-used.

TagBot trigger issue

Differences with ImplicitDifferentiation.jl?

Keep getting error with provide_rule

Dual Numbers in (Constant) Parameters

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent