![]() ![]() |
Hybrid
Controls for Integrated Planning and Control
| Research Description: | |||||
A discrete transition
relation, which can be represented by a graph, is induced by the transition
between the domain of one policy to the domain of a second policy containing
the goal set of the first policy. On-line planning, and re-planning
under changing conditions, becomes more tractable on the graph, allowing
us to bring numerous discrete planning tools to bear on this essentially
continuous problem. By sequencing the local policies according to
the ordering determined by a discrete planner, the closed loop dynamics
induce the discrete transitions desired by the discrete plan. The
overall hybrid (switched) control policy responds to system perturbations
without the need for re-planning. In the face of changing environmental
conditions, the discrete graph allows for fast online re-planning, while
continuing to respect the system constraints.
By coupling
planning and control in this way, the hybrid control system plans in the
discrete space of control policies; thus, the
|
|||||
| Simple Systems: | |||||
| Our early work focused on simple systems where the robot was considered
a fully actuated point in its configuration space. Two basic policy
types were developed over convex polytopes. The first type
of policy generated a vector field whose integral curves flowed through
a designated goal set, as shown in the left-hand figure below. The
second policy type used a vector field that converged to a single point
contained in the interior. For kinematic systems, the
control law simply followed the vector field. For second-order systems,
the control law converged to the vector field flow in a way that guaranteed
the system did not exit the cell except via the designated goal set.
By properly scaling the vector field, the control policies could be applied
to systems with velocity and/or acceleration constraints.
Given a convex decomposition of the free configuration space, these
two types of policies can be composed to generate a globally convergent
switched control policy as shown on the right-hand figure below.
By allowing overlapping policy cells, the global policy domain in state
space can be increased for second-order systems. While a small region
may not be able to steer a high speed initial condition, a larger region
may be able to capture and steer the initial condition. Consider
the cases shown in the three figures below. The same policy
deployment and ordering is used for three different initial conditions.
In the left-hand figure, the system starts from rest and switches through
three policies before converging to the goal. For the middle figure,
the system starts with a non-zero velocity that requires a different initial
policy; this results in a different induced path. Finally, the right-hand
figure starts with high initial speed. The policy defined over the
large rectangle to the left of the central obstacle applies braking and
steering to capture the system and bring it into the domain of another
policy. At this point, the system takes the path around the central
obstacle. This policy-based switching is based entirely on
the state of the system, and the ordering of the policies. The system
is reactive and does not require re-planning.
|
|||||
| Nonholonomic Convex-bodied Systems: | |||||
Our current work seeks to extend the results to wheeled mobile robots,
where the wheel-to-ground contact induces nonholonomic velocity constraints
on the system. Realistic wheeled mobile robots also have constraints
on speed and acceleration that must be considered by the control policies.
Finally, many robots are non-circular; therefore, the control policies
must consider the robot shape and orientation in cluttered environments.
Non-circular robot used in experiments As in the simple case, the local feedback control policies are defined over local regions of the robot configuration space, which we term cells. Each cell has a designated goal set; the cells are deployed such that the goal set of one policy is contained in the interior of another cell. Over each cell, the local control policy induces a vector field whose flow leads to the designated goal set. In order to realize the sequential composition approach for these nonholonomic systems with multiple constraints, each policy must satisfy four requirements. First, the cell must have simple representations that allow fast inclusion tests. That is, the test to see if a particular policy is valid must be fast. Second, the cell must be contained in the free configuration space.
This can be tested by expanding the cell to account for the extent of the
robot body at each orientation on the cell boundary, and projecting this
expanded cell into the workspace. The projected set, which represents
the swept volume of the robot body over the configurations in the cell,
is then tested for intersection with a workspace obstacle. We have
developed a non-conservative analytic mapping for determining the expanded
cell given a piecewise analytic cell boundary and analytic definition of
the robot body; see CMU-RI-TR-06-34
for more details.
Third, given the input constraints and nonholonomic velocity constraints, it must be possible to generate a velocity that keeps the system within the cell on the cell boundary, with the exception of points contained in the goal set. This guarantees that the control system can keep the robot configuration within the cell. Coupled with the free space requirement in the second condition, this guarantees that the policy can be executed in a way that avoids collision. Finally, the control policy must guarantee convergence to some configuration in the designated goal set in finite time for any point on the cell interior. This requirement guarantees that the ordered policies will transition to a higher priority policy, and thus eventually reach the overall goal. This research has developed policies that satisfy these four requirements,
enabling the implementation of the approach on the robot pictured above.
The policies have been deployed in the robot environment shown below, and
the discrete policy graph that represents transitions between policies
specified. A partial deployment of the policies is shown below.
Given the policy graph encoding the discrete transition relations among
policies, a discrete planner determines an ordering of the policies given
an overall goal. Given an initial condition for the robot, the control
system searches for the highest priority policy that contains the initial
condition. From then on, the policies are executed according to the
specified ordering. An explicit path is never planned; the resulting
closed loop dynamics induce a path that satisfies the system constraints
and reaches the overall goal. Results for four initial conditions,
executed on the non-circular laboratory mobile robot, are shown at the
top of this page. Two additional experimental runs are shown below.
The policies and policy graph are also amenable to use with discrete planners using temporal logic to specify multiple goals, as detailed in CMU-RI-TR-06-34. |
| Personnel: |
| David
C. Conner
Howie Choset Alfred A. Rizzi |
| Related Topics: |
Last updated
August 31, 2006
© Copyright
2006 Biorobotics Lab, Carnegie Mellon University. All Rights Reserved.