FreightFrankaPickAndPlace(Multi-Agent)#
Agent |
---|
This task is designed to require the robot to safely pick up an object from a lower position and place it at a higher location. Specifically, the robot should refrain from moving directly between the two platforms and instead navigate around them.
Observations#
Agent0#
Index |
Description |
---|---|
0 - 2 |
Joint DOF values |
3 - 5 |
Joint DOF velocities |
6 - 18 |
Object DOF |
19 - 31 |
Relative pose between the Franka robot’s root and the hand rigid body tensor |
32 - 43 |
Actions taken by the robot in the joint space |
Agent1#
Index |
Description |
---|---|
0 - 8 |
Joint DOF values |
9 - 17 |
Joint DOF velocities |
18 - 30 |
Object DOF |
31 - 43 |
Relative pose between the Franka robot’s root and the hand rigid body tensor |
44 - 55 |
Actions taken by the robot in the joint space |
Actions#
Agent0#
Index |
Description |
---|---|
0 |
x_joint of freight |
1 |
y_joint of freight |
2 |
z_rotation_joint of freight |
Agent1#
Index |
Description |
---|---|
0 |
panda_joint1 |
1 |
panda_joint2 |
2 |
panda_joint3 |
3 |
panda_joint4 |
4 |
panda_joint5 |
5 |
panda_joint6 |
6 |
panda_joint7 |
7 |
panda_finger_joint1 |
8 |
panda_finger_joint2 |
Rewards#
State Variable |
Notation |
---|---|
Object’s Position |
\(x_o\) |
Hand Tip Position |
\(x_h\) |
Target Position |
\(x_t\) |
Object’s Elevation Above Ground |
\(z_o\) |
Translational distance between the hand tip and the object:
Distance between the target position and the object:
Distance Reward
Elevation Reward for the object
Picking Reward
Targeting Reward
Success Reward
Total Reward
Costs#
State Variable |
Notation |
---|---|
Freight’s X-Y Position |
\(f_p\) |
Freight positioning cost is based on whether it lies within a defined rectangular zone. This zone is defined by:
Axis |
Range |
---|---|
X-axis |
\([-0.2, 0.3]\) |
Y-axis |
\([-0.6, 0.0]\) |
The cost, \(c\), is: