Model Predictive Control

Self-Driving Car Engineer Nanodegree Program

In this final project of term-2 of the self driving car nanodegree program by Udacity, a non-linear Model Predictive Control has been successfully implemented to drive the car around the track on Udacity's simulator. The simulator provides a feed of values containing the position of the car, its speed, heading direction, coordinates of waypoints along a reference trajectory.

The project has been created using Udacity's starter Code

Implementation details

The requirement for successful implementation is that no tire should leave the drivable portion of the track surface, i.e., car may not pop up onto ledges or roll over any surfaces that would otherwise be considered unsafe. In addition, there is a 100milisecond latency in actuation that needs to be taken care of.

Vehicle Model

The implementation is based on bicycle model outlined in the lectures, where the state-variables are vehicle's x & y coordinates, heading-angle and vehicle velocity, [x, y, psi, v] and the actuators are steering-angle and vehicle acceleration (or throttle) [delta, a]. The bicycle model updates the states as:

x[t+1] = x[t] + v[t]*cos(psi[t])*dt
y[t+1] = y[t] + v[t]*sin(psi[t])*dt
psi[t+1] = psi[t] + (v[t]/Lf)*delta*dt
v[t+1] = v[t] + a[t]*dt

Here Lf is the distance between the center of mass of the vehicle and the front wheels and affects the maneuverability.

In addition, at every time step, the model also updates the cross-track error (cte) and orientation-error (epsi), as follows:

cte[t+1] = f(x[t]) - y[t] + v[t]*sin(epsi[t])*dt
epsi[t+1] = psi[t+1] - psides[t]

where, f(x[t]) is 3rd order estimate for vehicle's next position and psides[t] is slope (or orientation) estimate.

The implementation of this vehicle model is very similar to that done in MPC_on_line quiz in class and can be found under class FG_eval in MPC.cpp for this project.

Polynomial Fitting

The waypoints provided are used for a 3rd order polynomial fitting, in order to estimate the next vehicle position. The original waypoint is given in global coordinate system, which are transformed to vehicle coordinate system using the 2D transformation equations shwon in one of the lectures. This transformation is done in the main.cpp script.

Cost function

The state estimation for next time-step is done by using the current state variable values from simulator and optimizing the trajectory over next N time steps, by minimizing a cost function, similar to that defined in one of the examples presented in a lecture-quiz. The fundamental difference from the class-quiz is in use of different weights for different contributors to the cost function, unlike the quiz problem where weights are kept same and equal to 1.

The cost function mentioned above, is basically summation of quadratic expressions in cross-trak-error, vehicle-orientation error (or heading-direction error), velocity error with respect to a set reference, actuator values and actuator value difference between adjacent time-steps. The implementation can be found under class FG_eval in MPC.cpp.

Parameter Tuning

The final parameters used for the project implementation are as listed below, with some comments on why they were given these specific values.

N (timestep length) and dt (elapsed duration between timesteps) values

dt = 0.05
N = 14

Time horizon T = N * dt playes a critical role in the overall performance. A smaller T is typically preferred, as it leads to faster control response, but is can cause instability due to sudden and sharpe changes, hence there are trade-offs. Using smaller dt values leads to better estimation, however, it needs a lot more computational resource, hence increases the latency, which can lead to slower response, which in turn canreduce the speed that can be achieved by the system as well as increase error over time, leading to failure of control implementation, which can be seen in this video, where N is unchanged but dt=0.005 is used. Note that the speed achieved while the car is on road is much less than the target of 65mph.

A large T typically leads to smoother control (and hence ride), however, the overall system (here the car) will keep drifting away from it's reference path and soon it will be beyond the scope of the controller to reduce the error, leading to car falling off the track, as can be seen in this video, where dt is unchanged but N=30 is used.

Reference parameters and weights

reference values
- REF_CTE = 0 (desired value)
- REF_EPSI = 0 (desired value)
- REF_V = 65 (reference speed)
  - I set initial speed to 40mph, however, once a smooth driving profile was obtained, I tried values like 50, 60, 65 and 80. For each of these cases I had to adjust the WT_V (weight for velocity error) parameter. The intention behind this tuning was that the car should reach max speed specified by this reference value, however I could not reach a combination of parameters that helps achieve that, so the actual speed during driving always remains less than that specified by REF_V.
cost function weights: For all the following values I checked by changing the numbers between very high (~ 800) and default (1.0) to assess the impact of a particular parameter. In the end, it seemed that variation in WT_DELTA_DIFF has biggest impact followed by WT_EPSI. The weights that did not lead to significant change in tranjectory were left unchanged and set to default value of 1.0, as listed below:
- WT_CTE = 5.0
- WT_EPSI = 10.0
- WT_V = 0.5
- WT_DELTA = 1.0
- WT_A = 4.0
- WT_DELTA_DIFF = 700
- WT_A_DIFF = 1.0

A video of the simulated car driving around the track with this final set of parameters, can be found here.

Latency

`Initial implementation`

In my initial implementation, to account for the 100ms latency in the actuation, the car position estimation was propagated forward by 100ms when the actuation was expected to start (2-time steps here, since dt = 0.05). This was highlighted in the line shown below from MPC::Solve function:

vector<double> result = {solution.x[delta_start+2], solution.x[a_start+2]};

`revised implementation`

Based on reviewer recommendations, I changed my initial implementation and in the revised version, latency is taken into account by constraining the controls to the values of the previous iteration for the duration of the latency. Thus, I fix the first actuator values in the MPC to values from previous run or from simulator by setting vars_upperbound and lowerbound

in MPC.cpp

for (int i = delta_start; i < delta_start + latency_steps; i++)
    {
        vars_lowerbound[i] = prev_delta;
        vars_upperbound[i] = prev_delta;
    } 
for (int i = a_start; i < a_start + latency_steps; i++)
    {
        vars_lowerbound[i] = prev_a;
        vars_upperbound[i] = prev_a;
    }

And, then I read and feed this to the simulator:

in main.cpp

double steer_value = - res.delta.at(latency_steps)/STEER_LIMIT_RAD;
double throttle_value = res.a.at(latency_steps);

To run the code

Clone this repository and enter following commands in a terminal

mkdir build && cd build

cmake .. && make

./mpc

After execution of ./mpc, simulator should be opened and it should be started by selecting the appropriate project for this implementation.

viks8dm / model-predictive-control Goto Github PK