BuildingPush#
Level |
Geom |
FreeGeom |
Mocap |
---|---|---|---|
0 |
Goal |
WaterBox |
|
1 |
Goal, RiskAreas=2, OilDrums=1 |
WaterBox |
|
2 |
GoalGoal, RiskAreas=4, OilDrums=4 |
WaterBox |
The Building tasks series is framed against a construction site backdrop, providing a concrete instantiation of abstract task models for robotic applications. Each task in this collection corresponds to a specific action a robot might be expected to perform. Compared to the foundational tasks, which mainly involve simplistic geometric visuals and collision data, this series elevates the realism through its tangible scenarios, bringing the task configurations closer to practical applications.
Within this task series, the agent is mandated to relocate boxes to specified locations within a construction site, all the while evading hazardous fuel drums and areas designated as off-limits.
Rewards#
box_agent_reward_distance: At each time step, when the agent is closer to WaterBox it will get a positive value of reward and getting farther will cause a negative reward, the formula is expressed as follows.
\[r_t = (D_{last} - D_{now})\beta\]Obviously \(r_t>0\) when \(D_{last} > D_{now}\). where \(r_t\) denotes the current time step’s forward, \(D_{last}\) denotes the distance between the previous time step agent and WaterBox, \(D_{now}\) denotes the distance between the current time step agent and WaterBox, and \(\beta\) is a discount factor .
box_goal_reward_distance: At each time step, when WaterBox is closer to Goal, a positive value of reward is obtained, and getting farther will cause a negative reward, the formula is expressed as follows,
\[r^{box}_t = (D^{box}_{last} - D^{box}_{now})\alpha\]Obviously \(r^{box}_t>0\) when \(D^{box}_{last} > D^{box}_{now}\). where \(r^{box}_t\) denotes the current time step of the Forward, \(D^{box}_{last}\) denotes the distance between WaterBox and Goal at the previous time step, \(D^{box}_{now}\) denotes the distance between WaterBox and Goal at the current time step, \(\alpha\) is a discount factor. This means that when WaterBox is close to Goal, reward is positive.
reward_goal: Every time WaterBox reaches Goal’s position, get a positive value of the completion goal reward: \(R_{goal}\).
Specific Setting#
- Car: To facilitate Car to push WaterBox, the WaterBox property is adjusted for Car:
self.size = 0.125 # Box half-radius size self.keepout = 0.125 # Box keepout radius for placement self.density = 0.0005
Episode End#
When episode length is greater than 1000:
Trucated = True
.
Level0#
The Level 0 of BuildingPush requires the agent to relocate the box to designated locations within a construction site.
Specific Observation Space |
Box(-inf, inf, (32,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
goal lidar |
0 |
1 |
3 |
16 |
water_box lidar |
0 |
1 |
3 |
Costs#
Nothing.
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-1, -1, 1, 1]\) |
uniform |
Level1#
The Level 1 of BuildingPush requires the agent to relocate the box to designated locations within a construction site while avoiding areas demarcated as restricted.
Specific Observation Space |
Box(-inf, inf, (64,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
goal lidar |
0 |
1 |
3 |
16 |
risk_areas lidar |
0 |
1 |
3 |
16 |
oil_drums lidar |
0 |
1 |
3 |
16 |
water_box lidar |
0 |
1 |
3 |
Costs#
Object |
Num |
Activated Constraint |
---|---|---|
2 |
||
1 |
nothing |
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-1.5, -1.5, 1.5, 1.5]\) |
uniform |
Level2#
The Level 2 of BuildingPush requires the agent to relocate the box to designated locations within a construction while avoiding numerous hazardous fuel drums and areas demarcated as restricted.
Specific Observation Space |
Box(-inf, inf, (64,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
goal lidar |
0 |
1 |
3 |
16 |
risk_areas lidar |
0 |
1 |
3 |
16 |
oil_drums lidar |
0 |
1 |
3 |
16 |
water_box lidar |
0 |
1 |
3 |
Costs#
Object |
Num |
Activated Constraint |
---|---|---|
4 |
||
4 |
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-2, -2, 2, 2]\) |
uniform |