Abstract

For successful push recovery in response to perturbations, a humanoid robot must select an appropriate stabilizing action. Existing approaches are limited because they are often derived from reduced-order models that ignore system-specific aspects such as swing leg dynamics or kinematic and actuation limits. In this study, the formulation of capturability for whole-body humanoid robots is introduced as a partition-based approach in the augmented center-of-mass (COM)-state space. The 1-step capturable boundary is computed from an optimization-based method that incorporates whole-body system properties with full-order nonlinear system dynamics in the sagittal plane including contact interactions with the ground and conditions for achieving a complete stop after stepping. The 1-step capturable boundary, along with the balanced state boundaries, are used to quantify the relative contributions of different strategies and contacts in maintaining or recovering balance in push recovery. The computed boundaries are also incorporated as explicit criteria into a partition-aware push recovery controller that monitors the robot’s COM state to selectively exploit the ankle, hip, or captured stepping strategies. The push recovery simulation experiments demonstrated the validity of the stability boundaries in fully exploiting a humanoid robot’s balancing capability through appropriate balancing actions in response to perturbations. Overall, the system-specific capturability with the whole-body system properties and dynamics outperformed that derived from a typical reduced-order model.

1 Introduction

Balance stability control in biped robots presents a critical but difficult problem because of their high dimensionality, nonlinearity, and contact interactions with their environment [1]. Existing approaches for balance criteria can be categorized into data-driven, limit cycle, reference points, or space partition. Stability evaluation based on data-driven learning [25] is practically useful but lacks interpretability and requires a large number of tests for each system. In the limit cycle approaches [68], the system’s local stability in response to infinitesimal perturbations is evaluated with respect to specified trajectories such as periodic gait. Generally, these approaches are not suited for standing push recovery due to the presence of non-infinitesimal perturbations and aperiodicity. Reference point approaches consider bounding a defined point within a range on the ground plane, such as the zero-moment point (ZMP) [9] or its modifications [1012] within the base of support (BOS) or its subsets, or in 3D space as thresholds for fall detection and prevention. Reference points are frequently based on reduced-order models or sensor measurements and can be implemented in real-time control with low computational effort. However, due to their general lack of complete system characteristics (e.g., kinematic and actuation limits) or contact interactions with the environment, most of them are neither necessary nor sufficient conditions to distinguish balance and fall [1316].

Among the reference points, the capture point is used to indicate the proper foot placement on the ground required to come to a complete stop [17]. The instantaneous capture point is a special case of the capture point evaluated for a linear inverted pendulum model (LIPM) without consideration of swing leg dynamics, and is equivalent to the ground projection of the divergent component of motion [18,19] and the extrapolated center of mass [20]. The concept was enhanced with the allowance of offsets [21], extensions to 3D [19], and considerations of external forces and the limitations of the LIPM [22].

By contrast, space partition approaches derive a region of the state space of the system or its projection, such as center-of-mass (COM) state (position and velocity) or its augmentations [1,13,14,23,24], as a criterion for evaluating the balance stability of a given state for control. In particular, these are typically based on viability kernel, which is the set of all initial states of the system from which there exists a trajectory that never evolves to a failed or non-viable state [25]. Thus, a system state is viable if and only if it remains inside the system’s viability kernel. However, for a general biped robot, evaluation of the viability kernel is computationally intractable because of issues such as nonlinearity, high dimensionality, and redundancy [17,25]. To address this, the concept of capturability, which represents the ability of a robot to come to a complete stop (within a specified number of steps) without falling [17,26], provides a tractable form of the viability kernel. The N-step viable capture basin, defined as the set of all states that are capturable with N or fewer steps, is a partition of the system state space derived from the notion of the capture point. However, this and similar partitions [27] typically depend on simplifying assumptions or reduced-order models in order to avoid the issue of high dimensionality. While reduced-order models are often used to implement real-time control, they do not reflect whole-body system properties and dynamics, such as the full effects of linear and angular momenta, and thus cannot fully exploit the balance capability of the robots and may overly restrict the robot’s ability to perform general tasks.

Push recovery is an elementary task that seeks to fully explore and exploit a legged system’s balancing capability, in response to not only external disturbances but also internal or desired perturbations such as in gait. For this reason, it has been used to demonstrate several existing criteria [24,28], including the instantaneous capture point and its extensions [23,29] and optimal selection between ZMP feedback and footstep adjustment [30] or among strategies [31]. Push recovery strategies can be categorized into non-stepping and stepping. The non-stepping strategies can be divided according to their controlled joints: ankle, hip, and whole body. The ankle strategy only adjusts the ankle joint to modulate the COM and the center of pressure (COP), like in a simple inverted pendulum with a base link [1]. The hip and whole-body strategies include the contribution of the ankle joint, while also regulating the linear and angular momenta of the upper body through adjusting only the leg joints and all joints, respectively [32], like in an inverted-pendulum-based model with a flywheel [17,24,28] or multiple degrees-of-freedom (DOFs) [1,33]. The stepping strategy relies on changing the contact through foot placement to reduce the COM velocity and/or adjust the size or location of BOS. The effect of these strategies is to modulate the ground reaction forces [34]. For small perturbations, the limited COM or COP modulation of the ankle strategy is sufficient, while the hip or stepping strategy is needed for large perturbations. In general, the sequential employment of ankle, hip, and stepping strategies is used to maintain or recover balance and is superior to the employment of a single strategy [24,31].

In a prior work [14], partitions of augmented COM-state space as bipedal balance stability criteria during the single (SS) and double support (DS) contacts and stepping transitions were derived for whole-body systems based on the relevant definitions. From a balanced state, there exists a (controlled) trajectory such that the system does not ever need to change its contacts. From a steppable unbalanced state, the non-stance foot can reach a desired step before undesired contact(s) are made. While a balanced state is approximately 0-step capturable, a steppable state does not imply capturability, i.e., the subsequent recovery of balance, once that step is made. In this study, the formulation of the 1-step capturable boundary for whole-body humanoid robots is introduced. The boundary, along with those of balanced basins, is incorporated as explicit criteria to select among the ankle, hip, and captured stepping strategies in an advanced partition-aware push recovery controller (Fig. 1). Unlike those from the literature that are mainly based on reduced-order models [17,26,27,29], the balance maintenance and recovery criteria are obtained for a whole-body model with full-order nonlinear dynamics to accurately characterize the system-specific properties and capabilities. Standing push recovery from a zero step length is used to establish the criteria and evaluate the corresponding control. The significance of using the whole-body models when computing the stability boundaries is validated not only against a typical reduced-order model but also across the whole-body stability boundaries with different strategies and contacts. The use of the capturable and balanced state boundaries as criteria is verified in simulation experiments where the proposed control design enables the robot to fully exploit its balance capability through appropriate balancing actions in response to perturbations.

Fig. 1
Schematic of the proposed partition-aware control for push recovery. The balancing action is selected among the ankle, hip, and stepping strategies by comparing their corresponding pre-computed stability boundaries with the current system state.
Fig. 1
Schematic of the proposed partition-aware control for push recovery. The balancing action is selected among the ankle, hip, and stepping strategies by comparing their corresponding pre-computed stability boundaries with the current system state.
Close modal

2 Whole-Body Humanoid System Models

The DARwIn-OP humanoid robot (ROBOTIS Co., Ltd, South Korea) is chosen as the platform for implementation. The sagittal plane is selected as the plane of perturbation application during standing push recovery on level ground.

2.1 Kinematic and Actuation Models.

The humanoid robot is modeled in the sagittal plane as a planar whole-body system with a floating base attached at the torso. The total DOFs include the joint variables of the robot for in-plane rotation and additional fictitious joint variables, including two translational and one rotational DOFs, for the position and orientation of the floating base. The index p = 1, 2 identifies the stance and swing foot in SS and the rear and front stance foot in DS, respectively, and rp = [rp,xrp,y]T denotes the position vector of the origin of the pth foot-link local frame at the foot sole center with respect to the global frame {X, Y} with units in meters (Fig. 2). For notational simplicity without loss of generality, r1 = 0 is assumed and Y = 0 represents level ground. The BOS lengths during SS and DS are the foot length fl and the sum of fl and the desired step length Δrxdesired, respectively.

Fig. 2
Whole-body kinematic model in the sagittal plane with contact wrenches for DS (left) and actuation limits (top right) of the DARwIn-OP humanoid robot. The DS BOS (bottom right) is also shown.
Fig. 2
Whole-body kinematic model in the sagittal plane with contact wrenches for DS (left) and actuation limits (top right) of the DARwIn-OP humanoid robot. The DS BOS (bottom right) is also shown.
Close modal

The range of motion of each joint was obtained as the most extreme positions of the servomotor measured by its encoder and verified against existing data [35]. The torque and velocity limits of the servomotor are modeled as a hexagonal region [13,14] (Fig. 2), denoting its desired operating range.

2.2 Dynamics With Contact Wrench Distribution.

A recursive Lagrangian dynamics algorithm is used to implement the joint-space dynamics for a whole-body system subject to contact constraints with respect to the vectors of joint variables q, velocities q˙, and accelerations q¨, where dot denotes time-derivative [14]. In addition, the Cartesian-space centroidal dynamics is formulated for linear and angular momenta as
(1)
where m is the total mass of the robot, r¯=[x¯y¯]T is the COM position with respect to the global frame, g is the gravitational acceleration vector, and Fp is the applied force at the pth foot. In the rotational dynamics, Mc is the resultant applied moment about the COM of the system, k is the link index, ρk is the link COM position vector relative to the COM of the system, ωk is the absolute angular velocity of the link, and Ik is the link inertia tensor written in the nonrotating link-centroidal local frame [14]. All the vectors related to rotation are normal to the sagittal plane and have only Z-components according to the right-hand rule.
The foot-ground contact interactions are included through the formulation of individual contact wrenches {Fp, Mp} consisting of the applied force and moment at the pth foot position rp. In the sagittal plane, the contact wrench distributions across both feet under full foot-ground contact and no-slip conditions during DS can be represented by using the time-varying parameters αR3 with each element normalized to between 0 and 1.
(2)
(3)
(4)
where the upper-hat indicates the unit vector along a principal axis, μ is the coefficient of static friction, and (fl/2)(2α31)x^ is the COP position relative to the origin of the local frame of the pth foot link. The transformation of 2α2 − 1 ∈ [−1, 1] allows for bidirectional friction. Then, F2 and M2 can be obtained from F1 and M1 as follows:
(5)
where the resultant effect of {Fp, Mp} is incorporated into Mc. For SS, a contact wrench is applied only at the stance foot position r1. These contact wrenches are incorporated into the joint-space dynamics subject to zero forces and torques for the floating base.

3 Construction of Double-Support Balanced State Boundaries

The formulations of balanced state boundary construction from prior works [13,14] are briefly revisited here for DS in the sagittal plane and refined for balancing with ankle and hip strategies. The boundaries are computed over the sampled domain of the task-augmented COM state space with an optimization-based framework. The domain is a set of the sampled COM positions r¯sampled augmented with desired foot contacts.

The objective of the optimization problem at each sampled position is to maximize the initial COM velocity r¯˙(0) in a given direction subject to constraints that represent the system properties, system and interaction dynamics, and stability state definitions. The constrained optimization problem is solved by a sequential quadratic programming algorithm [36] with a direct collocation scheme that parameterizes the joint kinematics and contact wrenches with third-order B-spline control vertices v [37] and time-discretized non-dimensional variables α, respectively. The maximized COM velocities at all sampled COM positions, which form the stability boundaries in the COM state space, are the solutions to the problem formulated as
(6)
where b is the vector of the constraint functions and the superscripts LB and UB represent the lower and upper bounds of the constraints, respectively. The optimization algorithm includes the analytical gradients of the cost and constraint functions. The terminal time T of the solution time interval [0, T] is manually tuned through numerical experiments.

The general constraints with desired contact(s) from the prior works [13,14] are used, representing ground penetration avoidance of the feet, system properties (i.e., joint kinematic and actuation limits including the zero forces and torques for the floating base), whole-body dynamics, and contact interactions in the absence of any external wrenches except for the desired foot contacts at all times. Only full foot-ground contact is considered here.

Additional constraints are imposed according to the definition of a balanced state and the action of each strategy in DS. Each sampled COM position is imposed as an initial condition
(7)
The DS contact configuration is represented by the desired position and orientation constraints imposed at both feet p = 1, 2 (Fig. 2). In particular, both feet are fixed at all times (t[0,T]) and the COM static equilibrium is imposed as a final condition, under the assumption that a typical fully-controlled robot can reach a full (i.e., for all joints) static equilibrium from the COM static equilibrium with a desired position within sufficient time
(8)
(9)
where Φp is the angle positive in counterclockwise from the ground to the pth foot sole with Φp = 0 indicating full foot contact.
For balancing with zero step length (Δrxdesired=0) in standing push recovery, all joints except for the ankle (denoted by the subscript non-ankle) and upper-body joints (denoted by the subscript upper) are fixed at all times (t[0,T]) for the ankle and hip strategy actions, respectively:
(10)
(11)

These constraints are implemented by fixing the corresponding joint angles to those of the reference pose with r¯=[0.00.2]T, Δrxdesired=0, arms straight down, bent knees, and upright torso. The initial COM positions r¯sampled are sampled from the arc formed by the COM workspace with all joints fixed except for the ankle at the reference pose.

4 Construction of Whole-Body 1-Step Capturable Boundaries

The aforementioned optimization-based framework is modified for the construction of the whole-body 1-step capturable boundary. While the objective function and the general constraints with desired contacts remain identical as above, additional constraints are required according to the definition of the capturable state and the action of stepping, which are distinct from those for balanced states. The sampled positions are assigned to be within the COM X-position range of the BOS for zero step length and the COM Y-position of the reference pose for the ankle and hip strategies. The joint angles qsampled corresponding to the sampled COM positions, which are obtained by adjusting the pelvis position, are imposed as initial conditions
(12)

Initial joint angular velocity conditions, which would fully determine the initial COM velocity, are not imposed.

In order to represent the SS contact configuration during the swing phase, the stance foot is fixed at all times (t[0,T])
(13)
The orientation of the swing foot is constrained to remain within specified bounds at all times (t[0,T]) in order to ensure that the foot sole faces downward during the swing phase such that a full-foot contact is made eventually after stepping
(14)
For a given Δrxdesired, the following stepping constraints on the swing foot position are imposed as final conditions
(15)
where Δr2,yUB denotes the upper bound of the final height for the swing foot. Satisfaction of these constraints guarantees that the desired step length and direction are achieved within [0, T]. These constraints are imposed as conditional constraints [14,38] to avoid pre-specifying the time instant tstep at which the step is made. With this approach, the satisfaction of the stepping constraints is evaluated throughout [0, T] within each optimization iteration. All of the general constraints are released after tstep ∈ [0, T], when the stepping constraints are first satisfied, such that the effective solution time interval is truncated to [0, tstep]. At stepping, the COM velocity r¯˙(tstep) is reduced due to the swing foot-ground collision. In this preliminary study, the reduction is derived using a simple rimless wheel model under the assumptions of perfectly inelastic impact without slip [39].
From the definition of 1-step capturable state, a final full static equilibrium should be achieved after making a desired step. In the proposed approach, the pre-computed DS whole-body-strategy balanced basin with the desired step length is imposed as a COM state constraint (Fig. 3) at t = tstep to ensure a captured step
(16)
Fig. 3
Illustration of the proposed approach to compute whole-body 1-step capturable boundary
Fig. 3
Illustration of the proposed approach to compute whole-body 1-step capturable boundary
Close modal

The initial and final COM Y-positions for the pre-computed DS balanced basin are that of the reference pose for the ankle and hip strategies. Since a balanced state of the robot is also 0-step capturable, the satisfaction of this condition at t = tstep ensures that full static equilibrium can be eventually reached without further stepping or other contact changes.

5 Partition-Aware Stability Controller

A partition-aware controller for balanced basins and steppability introduced in a prior work [14] is modified in this study for recovery of balance by incorporating the captured stepping strategy. The ankle- and hip-strategy balanced state boundaries are actively used as explicit criteria to select among the ankle, hip, and captured-stepping strategies given the current estimated COM state of the robot (Fig. 4).

Fig. 4
Block diagram of the proposed partition-aware controller. The superscripts represent the source of the variables and the subscripts represent the body portion, joint, and Cartesian component in that order.
Fig. 4
Block diagram of the proposed partition-aware controller. The superscripts represent the source of the variables and the subscripts represent the body portion, joint, and Cartesian component in that order.
Close modal

Forward kinematics is used to estimate the current COM state and foot orientations through the measured joint angles qmeasured and global orientation of the torso obtained from an inertial measurement unit (IMU). The controller consists of ankle, hip, and captured stepping subcontrollers, which represent the respective strategies. The ankle subcontroller is always engaged, but the hip and captured stepping subcontrollers are triggered only when the estimated COM state exits the ankle- and hip-strategy balanced basins, respectively.

5.1 Ankle and Hip Subcontrollers.

The ankle subcontroller adjusts the pth ankle pitch angle bias Δqp, ankle through a PD control scheme and is always engaged. It regulates the COM state as well as the foot orientation [14] for consistency with the modeling assumption of full contact with the ground:
(17)
where KD,ankle is the derivative gain for the COM X-velocity x¯˙, KP,ankleIMU is the proportional gain for foot orientation, and the superscript error denotes the difference between the desired and estimated values. The desired values of x¯˙ and Φp are set as zero, to be consistent with standing on level ground.

The proposed hip subcontroller is triggered when the estimated COM state exits the ankle-strategy balanced basin. It generates bang-bang-like motion profiles for the hip joint control [3,14] that regulates the ground reaction force through the angular momentum of the upper body, and for the pelvis X-position control that keeps the COM X-position within the BOS during balancing. The pth hip pitch angle bias Δqp,hip and the pelvis X-position bias Δrpelvis,x with respect to the stance foot position are regulated as follows:

For 0 ≤ t < 2TH1
(18)
For 2TH1t < 2TH1 + TH2
(19)
where the time t is from the instant when the hip subcontroller is triggered, Δqhipbb (for both left and right hip joints) and Δrpelvis,xbb are the pre-assigned maximum values of the respective bias, and 2TH1 and TH2 are the durations of the motion away from and then back to its initial position, respectively [3,14]. The biases are zero at all other times. The left and right leg joint angles are identical due to the bilateral symmetry of the standing pose with zero step length, until the stepping subcontroller is triggered. All the gains and parameters for the ankle and hip subcontrollers were manually tuned through simulation experiments to fully exploit their respective balanced basins.

5.2 Captured Stepping Subcontroller.

The captured stepping subcontroller achieves the desired step length Δrxdesired as quickly as possible with a bang-bang-like motion profile to adjust the swing foot and pelvis position with respect to the stance foot position in two swing phases (S1 and S2):

For 0 ≤ t < TS1
(20)
For TS1t < TS1 + TS2
(21)
For tTS1 + TS2
(22)
where the time t is from the instant when the captured stepping subcontroller is triggered, r2bb and Φ2bb are the pre-assigned parameters for the stride with sufficient foot clearance during S1 and to ensure foot-ground contact during S2, and TS1 and TS2 are the duration of each swing phase.

In order to supplement the captured stepping action, the ankle and hip subcontrollers remain engaged with gains and parameters that are different from those without stepping. All the gains and parameters of the ankle, hip, and captured stepping subcontrollers were manually tuned through simulation experiments to minimize foot tipping, avoid self-collision during stepping, and ensure full foot-ground contact at the end of the step.

6 Results and Discussion

The ankle- and hip-strategy balanced state and whole-body 1-step capturable boundaries were obtained for the humanoid robot in the sagittal plane and incorporated into the proposed partition-aware controller. Due to the consideration of standing push recovery in response to forward and backward perturbations, the objective function is the X-component of the COM velocity and only the X-component of the COM-state space is of interest. For the COM state constraint at t = tstep of the whole-body 1-step capturable boundary, a pre-computed DS whole-body-strategy balanced boundary from a prior work [13] was incorporated into the optimization algorithm as a linearly approximated form and 3.98% of the reduction ratio in the COM X-velocity was used for the step impact according to the simple collision model [39]. For captured stepping, the desired step length is Δrxdesired=0.057m and the range of allowable swing foot angle is [Φ2LB,Φ2UB]=[π/5,π/5]. Control simulations were conducted to demonstrate the proposed controller and validate the use of the stability boundaries as criteria.

6.1 Boundaries for Whole-Body Push Recovery: Balanced and 1-Step Capturable.

The ankle- and hip-strategy balanced state boundaries were computed for a DS standing pose with a zero step length (Fig. 5). Within the COM X-position range of the BOS for zero step length, the balanced basin area of the hip strategy is 44.5% greater than that of the ankle strategy due to the added balance capability of the knee and hip joints.

Fig. 5
Computed ankle- and hip-strategy balanced state and whole-body 1-step capturable (with free and fixed arms) boundaries. The LIPM-based capturable boundary is also included for comparison.
Fig. 5
Computed ankle- and hip-strategy balanced state and whole-body 1-step capturable (with free and fixed arms) boundaries. The LIPM-based capturable boundary is also included for comparison.
Close modal

The whole-body 1-step capturable boundary was computed within the BOS for zero step length (Fig. 5). Additionally, another capturable boundary was computed with a constraint that disallowed all upper-body motion, by imposing qupper(t) = 0 (i.e., at the reference configuration for each joint) at all times, to quantify the contribution of the upper-body DOFs in recovering balance. The increase in the capturable boundaries relative to that of the hip strategy demonstrates the greater contribution of the stepping versus hip strategy due to the improved mobility and extended BOS dimension. The relatively low gap between the two capturable boundaries is due to the low mass of the arms relative to the total robot mass. The time to step decreases as the COM X-position shifts forward, which limits the effect of upper-body momenta prior to stepping and causes the gap to narrow. On the other hand, while stepping is generally understood as the dominant method of balancing [27], this expected effect is not fully reflected in the observed increase of the computed capturable boundary relative to the balanced state boundary of the hip strategy or the DS balanced with the desired step length due to several factors. The desired step length is relatively short compared to the COM Y-position, which limits the COM velocity reduction according to the simple collision model used in this study. Thus, the increase in stability from stepping is primarily from the extended BOS for the desired step length and not from step impact. Non-stepping strategies also benefit disproportionately from DARwIn-OP’s long feet relative to its height and desired step length; it results in a relatively large BOS for zero step length during standing while interfering with swing-foot ground clearance for stepping. The robot’s uniform torque limits across all joints including the ankle, which is typically weaker in humans and other humanoid robots, directly increases the balance capability of the ankle and hip strategies. Another factor is the difference in the initial poses to achieve the initial COM X-positions, specifically, by adjusting the ankle joint angle for the balanced state boundaries versus the pelvis position for the capturable boundary. Nonetheless, the overall attributes of the computed capturable boundary and its relationship with the balanced boundaries demonstrate the validity of the proposed approach as proof of concept.

For comparison, the common LIPM-based capturability [17] with the same desired step length was also calculated from the 1-step instantaneous capture velocity x˙cap=(Δrxdesired+fl/2x¯)g/y¯ that corresponds to the instantaneous capture point at the forward edge Δrxdesired+fl/2 of the stepped BOS (Fig. 2). Here, x˙cap can be regarded as the maximum velocity threshold for achieving a captured state within one step [17]. On average, the LIPM-based capturable boundary is 59.5% greater than that of the whole body (Fig. 5). This implies that the LIPM does not accurately represent the actual robot’s capturability due to its modeling simplifications, including the lack of realistic kinematic and actuation limits, swing leg dynamics, and consideration of foot clearance. In particular, the typical LIPM-based capturability [16,17] ignores foot scuffing collisions through the assumption of zero swing foot velocity at the instant of stepping. These complete system properties and dynamics are incorporated within the constraints used to compute the boundaries in the proposed whole-body approach, providing a proof of concept of a more accurate threshold than that based on the reduced-order model.

6.2 Control Simulations: Maintaining and Recovering Balance.

The proposed partition-aware controller is verified and demonstrated by applying impulsive force perturbations to the torso of the robot in the Webots simulation environment (Cyberbotics Ltd., Switzerland) to reach various initial COM states from which the robot can recover or maintain balance through different strategies. In each test, the force perturbations were applied with the simulation time-step duration of 8 ms and had manually-tuned magnitudes such that the initial (i.e., post-impact) COM X-velocities approached each stability boundary at r¯=[0.00.2]T, corresponding to the reference joint configurations, for the ankle and hip strategies and at slightly varied r¯=[0.020.2]T for the stepping strategy. In order to evaluate the stability only in the sagittal plane, the bounding objects of both feet were widened laterally, and the torque and velocity limits of out-of-plane DOFs (hip yaw, hip roll, and ankle roll) were substantially increased in the simulation environment. The control parameter values were KD,ankle = 0.5 and KP,ankleIMU=1.0 for the ankle subcontroller; Δqhipbb=0.7rad, Δrpelvis,xbb=0.07m, TH1 = 0.2 s, and TH2 = 0.4 s for the hip subcontroller; and rbb2 = [0.1 0.04]T, Φ2,S1bb=π/6rad, Φ2,S2bb=π/12rad, TS1 = 0.2 s, and TS2 = 0.1 s for the captured stepping subcontroller.

When the perturbed initial COM state was within the ankle-strategy balanced basin (Fig. 6: Point A), only the ankle subcontroller was engaged to maintain balance. For the perturbed initial COM state outside of the ankle-strategy but within the hip-strategy balanced basin (Fig. 6: Point B), the hip subcontroller was activated and successfully maintained the balance of the robot. This comparison shows the benefit of actively incorporating the balanced state boundaries for different strategies into control.

Fig. 6
Simulation COM state trajectories (left) and motion snapshots (right) for standing push recovery at various initial COM states. Balanced trajectories obtained with the ankle (A) and hip (B) subcontrollers and a captured (C) and a falling (D) trajectory obtained with the captured stepping subcontroller of the partition-aware controller are shown. In the captured trajectory (C), the COM state at the stepped instant is also indicated.
Fig. 6
Simulation COM state trajectories (left) and motion snapshots (right) for standing push recovery at various initial COM states. Balanced trajectories obtained with the ankle (A) and hip (B) subcontrollers and a captured (C) and a falling (D) trajectory obtained with the captured stepping subcontroller of the partition-aware controller are shown. In the captured trajectory (C), the COM state at the stepped instant is also indicated.
Close modal

The captured stepping-strategy control was demonstrated when the initial COM state with x¯˙=0.31m/s was between the hip-strategy balanced state and whole-body 1-step capturable (with qupper(t) = 0t) boundaries (Fig. 6: Point C). The ankle and hip subcontrollers remained engaged with the gains of the ankle subcontroller set to KD,ankle = 0.5 and KP,ankleIMU=0.0 and the parameters of the hip subcontroller set to Δqhipbb=0.56rad and Δrpelvis,xbb=0.02m. In this case, the hip subcontroller was triggered twice before achieving a captured state. From this initially unbalanced COM state, the robot successfully came to a complete stop (i.e., captured) at the pelvis X-position of Δrxdesired/2 after making the desired step, indicating recovery of balance. This again verifies that the captured stepping strategy has an increased stabilizing capability than that of the hip strategy alone [27], in consistency with the difference in size between their respective basins. Note that the exit of the trajectory outside of the whole-body capturable boundary for some duration does not reflect the falling of those states due to their conditions such as joint and system (especially the swing foot) states and contact status (SS to DS) that are different from those of the boundary states. The COM states at and after the stepped instant lie within the DS balanced basin for the desired step length, verifying that the imposition of the corresponding COM state constraint ensures that the robot eventually achieves a captured state.

On the other hand, the robot failed to recover balance from the initial COM state (Fig. 6: Point D) that is outside of the whole-body 1-step capturable boundary with the same controller action, demonstrating the validity of the computed boundary. Similar to the captured case, the brief penetration of the trajectory inside the whole-body capturable boundary does not reflect the capturability of those states due to their system and contact conditions that are different from those of the boundary states. It should also be noted that this COM state is well within the LIPM-based capturable boundary, which indicates the inadequacy of balance criteria based on reduced-order dynamics. These results show the merit of the whole-body formulations in the accuracy of the boundary, due to the incorporation of system-specific aspects such as kinematic and actuation limits, linear and angular momenta, and swing leg dynamics.

Assumptions and conditions in the simulation environment, such as the allowance of minimal foot tipping and the use of constant torque and velocity limits, may have increased the balance capability of the robot in simulations relative to that in the computed stability basin results. In particular, due to the difficulty of maintaining full foot contact in the simulation environment, some degree of foot tipping occurred and was followed by small oscillations that converged to a static equilibrium with full contact at the end (Fig. 6). However, in the computed balanced basins, full foot-ground contact was assumed for all times. On the other hand, other factors, such as feedback characteristics (e.g., settling time), may have reduced the capability of the robot in simulations. These modeling discrepancies were kept to a minimum and monitored to limit their effects on the push recovery experiments, such that the simulated results reasonably demonstrate and validate the proposed controller as well as the computed boundaries. Moreover, the foot tipping during balancing illustrates that the ZMP condition is insufficient as a balance criterion because it was violated even when the state was balanced or 0-step capturable.

7 Conclusions and Future Work

The capturability formulated with whole-body system properties and dynamics of a humanoid robot provided valid criteria for balance recovery by stepping, and the computed boundaries were used to quantify the relative contributions of different balancing strategies and contacts. The incorporation of the 1-step capturable boundary, along with the balanced state boundaries, as explicit criteria into the proposed partition-aware controller demonstrated their validity in fully exploiting a humanoid robot’s balancing capability in push recovery simulations. Although the boundary may be relatively conservative due to the simplified step impact dynamics used for this preliminary result, it outperformed that derived from the reduced-order LIPM overall. The proposed approach can be extended to balance control for general bipedal tasks beyond standing push recovery. For instance, the 1-step capturable boundary can be applied iteratively to analyze or control N steps in walking. The relatively high computational cost due to the whole-body system will be addressed in a future study by approximating the stability boundaries from pre-computed results for selected configurations.

Acknowledgment

This work was supported in part by the U.S. National Science Foundation (CMMI-1436636 and IIS-1427193) and a Mitsui USA Foundation scholarship.

Conflict of Interest

There are no conflicts of interest.

Data Availability Statement

The authors attest that all data for this study are included in the paper.

References

1.
Mummolo
,
C.
,
Mangialardi
,
L.
, and
Kim
,
J. H.
,
2017
, “
Numerical Estimation of Balanced and Falling States for Constrained Legged Systems
,”
J. Nonlinear Sci.
,
27
(
4
), pp.
1291
1323
.
2.
Ghorbani
,
M.
,
Kakavandi
,
F.
, and
Masouleh
,
M. T.
,
2017
, “
An Experimental Study on a Learning-Based Approach for the Push Recovery of NAO Humanoid Robot
,”
Artificial Intelligence and Signal Processing Conference (AISP)
,
Shiraz, Iran
,
Oct. 25–27
, pp.
358
363
.
3.
Yi
,
S. J.
,
Zhang
,
B. T.
,
Hong
,
D.
, and
Lee
,
D. D.
,
2013
, “
Online Learning of Low Dimensional Strategies for High-Level Push Recovery in Bipedal Humanoid Robots
,”
IEEE International Conference on Robotics and Automation
,
Karlsruhe, Germany
,
May 6–10
, pp.
1649
1655
.
4.
Kalyanakrishnan
,
S.
, and
Goswami
,
A.
,
2011
, “
Learning to Predict Humanoid Fall
,”
Int. J. Humanoid Rob.
,
8
(
2
), pp.
245
273
.
5.
Hohn
,
O.
, and
Gerth
,
W.
,
2009
, “
Probabilistic Balance Monitoring for Bipedal Robots
,”
Int. J. Rob. Res.
,
28
(
2
), pp.
245
256
.
6.
Choi
,
J. J.
,
Agrawal
,
A.
,
Sreenath
,
K.
,
Tomlin
,
C. J.
, and
Bansal
,
S.
,
2022
, “
Computation of Regions of Attraction for Hybrid Limit Cycles Using Reachability: An Application to Walking Robots
,”
IEEE Rob. Autom. Lett.
,
7
(
2
), pp.
4504
4511
.
7.
Ames
,
A. D.
,
Galloway
,
K.
,
Sreenath
,
K.
, and
Grizzle
,
J. W.
,
2014
, “
Rapidly Exponentially Stabilizing Control Lyapunov Functions and Hybrid Zero Dynamics
,”
IEEE Trans. Autom. Control
,
59
(
4
), pp.
876
891
.
8.
Hobbelen
,
D. G.
, and
Wisse
,
M.
,
2007
, “
A Disturbance Rejection Measure for Limit Cycle Walkers: The Gait Sensitivity Norm
,”
IEEE Trans. Rob.
,
23
(
6
), pp.
1213
1224
.
9.
Vukobratović
,
M.
, and
Borovac
,
B.
,
2004
, “
Zero-Moment Point—Thirty Five Years of Its Life
,”
Int. J. Humanoid Rob.
,
1
(
1
), pp.
157
173
.
10.
Hirukawa
,
H.
,
Hattori
,
S.
,
Harada
,
K.
,
Kajita
,
S.
,
Kaneko
,
K.
,
Kanehiro
,
F.
,
Fujiwara
,
K.
, and
Morisawa
,
M.
,
2006
, “
A Universal Stability Criterion of the Foot Contact of Legged Robots-Adios ZMP
,”
Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006
,
Orlando, FL
,
May 15–19
, pp.
1976
1983
.
11.
Kim
,
J. H.
,
Lee
,
J.
, and
Oh
,
Y.
,
2018
, “
A Theoretical Framework for Stability Regions for Standing Balance of Humanoids Based on Their LIPM Treatment
,”
IEEE Trans. Syst. Man, Cybern. Syst.
,
50
(
1
), pp.
1
18
.
12.
Caron
,
S.
,
Pham
,
Q.-C.
, and
Nakamura
,
Y.
,
2017
, “
ZMP Support Areas for Multi-Contact Mobility Under Frictional Constraints
,”
IEEE Trans. Rob.
,
33
(
1
), pp.
67
80
.
13.
Peng
,
W. Z.
,
Song
,
H.
, and
Kim
,
J. H.
,
2021
, “
Stability Region-Based Analysis of Walking and Push Recovery Control
,”
ASME J. Mech. Rob.
,
13
(
3
), p.
031005
.
14.
Peng
,
W. Z.
,
Mummolo
,
C.
,
Song
,
H.
, and
Kim
,
J. H.
,
2022
, “
Whole-Body Balance Stability Regions for Multi-Level Momentum and Stepping Strategies
,”
Mech. Mach. Theory
,
174
, p.
104880
.
15.
Firmani
,
F.
, and
Park
,
E. J.
,
2013
, “
Theoretical Analysis of the States of Balance in Bipedal Walking
,”
ASME J. Biomech. Eng.
,
135
(
4
), p.
041003
.
16.
Pratt
,
J. E.
, and
Tedrake
,
R.
,
2006
, “Velocity-Based Stability Margins for Fast Bipedal Walking,”
Fast Motions in Biomechanics and Robotics: Optimization and Feedback Control
,
M.
Diehl
, and
K.
Mombaur
, eds.,
Springer
,
Berlin/Heidelberg
, pp.
299
324
.
17.
Koolen
,
T.
,
De Boer
,
T.
,
Rebula
,
J.
,
Goswami
,
A.
, and
Pratt
,
J.
,
2012
, “
Capturability-Based Analysis and Control of Legged Locomotion, Part 1: Theory and Application to Three Simple Gait Models
,”
Int. J. Rob. Res.
,
31
(
9
), pp.
1094
1113
.
18.
Takenaka
,
T.
,
Matsumoto
,
T.
, and
Yoshiike
,
T.
,
2009
, “
Real Time Motion Generation and Control for Biped Robot—1st Report: Walking Gait Pattern Generation
,”
IEEE/RSJ International Conference on Intelligent Robots and Systems
,
St. Louis, MO
,
Oct. 10–15
, pp.
1084
1091
.
19.
Englsberger
,
J.
,
Ott
,
C.
, and
Albu-Schäffer
,
A.
,
2015
, “
Three-Dimensional Bipedal Walking Control Based on Divergent Component of Motion
,”
IEEE Trans. Rob.
,
31
(
2
), pp.
355
368
.
20.
Hof
,
A. L.
,
2008
, “
The ‘Extrapolated Center of Mass’ Concept Suggests a Simple Control of Balance in Walking
,”
Hum. Mov. Sci.
,
27
(
1
), pp.
112
125
.
21.
Khadiv
,
M.
,
Herzog
,
A.
,
Moosavian
,
S. A. A.
, and
Righetti
,
L.
,
2016
, “
Step Timing Adjustment: A Step Toward Generating Robust Gaits
,”
IEEE/RAS International Conference on Humanoid Robots
,
Cancun, Mexico
,
Nov. 15–17
, pp.
35
42
.
22.
Xinjilefu
,
X.
,
Feng
,
S.
, and
Atkeson
,
C. G.
,
2015
, “
Center of Mass Estimator for Humanoids and Its Application in Modeling Error Compensation, Fall Detection and Prevention
,”
IEEE/RAS International Conference on Humanoid Robots
,
Seoul, South Korea
,
Dec. 3–5
, pp.
67
73
.
23.
Stephens
,
B. J.
, and
Atkeson
,
C. G.
,
2010
, “
Push Recovery by Stepping for Humanoid Robots With Force Controlled Joints
,”
IEEE/RAS International Conference on Humanoid Robots
,
Nashville, TN
,
Dec. 6–8
, pp.
52
59
.
24.
Stephens
,
B.
,
2007
, “
Humanoid Push Recovery
,”
IEEE/RAS International Conference on Humanoid Robots
,
Pittsburgh, PA
,
Nov. 29–Dec. 1
, pp.
589
595
.
25.
Wieber
,
P. B.
,
2002
, “
On the Stability of Walking Systems
,”
Proceedings of the International Workshop on Humanoid and Human Friendly Robotics
,
Tsukuba, Japan
,
Dec. 11–12
.
26.
Zaytsev
,
P.
,
Hasaneini
,
S. J.
, and
Ruina
,
A.
,
2015
, “
Two Steps is Enough: No Need to Plan Far Ahead for Walking Balance
,”
IEEE International Conference on Robotics and Automation
,
Seattle, WA
,
May 26–30
, pp.
6295
6300
.
27.
Zaytsev
,
P.
,
Wolfslag
,
W.
, and
Ruina
,
A.
,
2018
, “
The Boundaries of Walking Stability: Viability and Controllability of Simple Models
,”
IEEE Trans. Rob.
,
34
(
2
), pp.
336
352
.
28.
Pratt
,
J.
,
Carff
,
J.
,
Drakunov
,
S.
, and
Goswami
,
A.
,
2006
, “
Capture Point: A Step Toward Humanoid Push Recovery
,”
IEEE/RAS International Conference on Humanoid Robots
,
Genova, Italy
,
Dec. 4–6
, pp.
200
207
.
29.
Pratt
,
J.
,
Koolen
,
T.
,
de Boer
,
T.
,
Rebula
,
J.
,
Cotton
,
S.
,
Carff
,
J.
,
Johnson
,
M.
, and
Neuhaus
,
P.
,
2012
, “
Capturability-Based Analysis and Control of Legged Locomotion, Part 2: Application to M2V2, A Lower Body Humanoid
,”
Int. J. Rob. Res.
,
31
(
10
), pp.
1117
1133
.
30.
Griffin
,
R. J.
,
Wiedebach
,
G.
,
Bertrand
,
S.
,
Leonessa
,
A.
, and
Pratt
,
J.
,
2017
, “
Walking Stabilization Using Step Timing and Location Adjustment on the Humanoid Robot, Atlas
,”
IEEE International Conference on Intelligent Robots and Systems
,
Vancouver, BC, Canada
,
Sept. 24–28
, pp.
667
673
.
31.
Jeong
,
H.
,
Lee
,
I.
,
Oh
,
J.
,
Lee
,
K. K.
, and
Oh
,
J.-H.
,
2019
, “
A Robust Walking Controller Based on Online Optimization of Ankle, Hip, and Stepping Strategies
,”
IEEE Trans. Rob.
,
35
(
6
), pp.
1367
1386
.
32.
Maki
,
B. E.
, and
McIlroy
,
W. E.
,
1997
, “
The Role of Limb Movements in Maintaining Upright Stance: The ‘Change-In-Support’ Strategy
,”
Phys. Ther.
,
77
(
5
), pp.
488
507
.
33.
Lee
,
S. H.
, and
Goswami
,
A.
,
2007
, “
Reaction Mass Pendulum (RMP): An Explicit Model for Centroidal Angular Momentum of Humanoid Robots
,”
IEEE International Conference on Robotics and Automation
,
Rome, Italy
,
Apr. 10–14
, pp.
4667
4672
.
34.
Hofmann
,
A.
,
2006
, “
Robust Execution of Bipedal Walking Tasks From Biomechanical Principles
,”
Ph.D. dissertation
,
Massachusetts Institute of Technology
,
Cambridge, MA
.
35.
Williams
,
R. L.
, II
,
2012
, “
Darwin-OP Humanoid Robot Kinematics
,”
ASME International Design Engineering Technical Conferences and Computers and Information in Engineering Conference
,
Chicago, IL
,
Aug. 12–15
, pp.
1187
1196
.
36.
Gill
,
P. E.
,
Murray
,
W.
, and
Saunders
,
M. A.
,
2002
, “
SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization
,”
SIAM J. Optim.
,
12
(
4
), pp.
979
1006
.
37.
Kim
,
J. H.
,
2011
, “
Optimization of Throwing Motion Planning for Whole-Body Humanoid Mechanism: Sidearm and Maximum Distance
,”
Mech. Mach. Theory
,
46
(
4
), pp.
438
453
.
38.
Peng
,
W. Z.
,
Song
,
H.
, and
Kim
,
J. H.
,
2021
, “
Reduced-Order Model With Foot Tipping Allowance for Legged Balancing
,”
ASME International Design Engineering Technical Conferences and Computers and Information in Engineering Conference
,
Virtual, Online
, p.
V08BT08A011
.
39.
Gamus
,
B.
, and
Or
,
Y.
,
2015
, “
Dynamic Bipedal Walking Under Stick-Slip Transitions
,”
SIAM J. Appl. Dyn. Syst.
,
14
(
2
), pp.
609
642
.