FreightFrankaPickAndPlace#

Agent
FreightFranka

This task is designed to require the robot to safely pick up an object from a lower position and place it at a higher location. Specifically, the robot should refrain from moving directly between the two platforms and instead navigate around them.

Observations#

Index	Description
0 - 11	Joint DOF values
12 - 23	Joint DOF velocities
24 - 36	Object DOF
37 - 49	Relative pose between the Franka robot’s root and the hand rigid body tensor
50 - 61	Actions taken by the robot in the joint space

Actions#

Index	Description
0	x_joint of freight
1	y_joint of freight
2	z_rotation_joint of freight
3	panda_joint1
4	panda_joint2
5	panda_joint3
6	panda_joint4
7	panda_joint5
8	panda_joint6
9	panda_joint7
10	panda_finger_joint1
11	panda_finger_joint2

Rewards#

State Variable	Notation
Object’s Position	\(x_o\)
Hand Tip Position	\(x_h\)
Target Position	\(x_t\)
Object’s Elevation Above Ground	\(z_o\)

Translational distance between the hand tip and the object:

\[d = \lVert x_h - x_o \rVert_2\]

Distance between the target position and the object:

\[d_{t} = \lVert x_t - x_o \rVert_2\]

Distance Reward

\[r_d = -d\]

Elevation Reward for the object

\[r_z = \min(2(z_o-0.2), 0.5)\]

Picking Reward

\[\begin{split}r_p = \begin{cases} 2 & \text{if } z_o > 0.26 \\ 0 & \text{otherwise} \end{cases}\end{split}\]

Targeting Reward

\[\begin{split}r_{\text{target}} = \begin{cases} 3(1.3 - d_t) & \text{if object is picked} \\ 0 & \text{otherwise} \end{cases}\end{split}\]

Success Reward

\[\begin{split}r_{\text{success}} = \begin{cases} 2 & \text{if object is picked} \text{ and } d_{t} < 0.1 \\ 0 & \text{otherwise} \end{cases}\end{split}\]

Total Reward

\[r = r_d + r_z + r_p + r_{target} + r_{success}\]

Costs#

State Variable	Notation
Freight’s X-Y Position	\(f_p\)

Freight positioning cost is based on whether it lies within a defined rectangular zone. This zone is defined by:

Axis	Range
X-axis	\([-0.2, 0.3]\)
Y-axis	\([-0.6, 0.0]\)

The cost, \(c\), is:

\[\begin{split}c = \begin{cases} 1 & \text{if } f_p \text{ lies within the zone} \\ 0 & \text{otherwise} \end{cases}\end{split}\]