BuildingButton#
Level |
Geom |
FreeGeom |
Mocap |
---|---|---|---|
0 |
Machines=4, Goal |
||
1 |
Machines=4, Goal, RiskAreas=4 |
Robots=4 |
|
2 |
Machines=4, Goal, RiskAreas=8 |
Robots=6 |
The Building tasks series is framed against a construction site backdrop, providing a concrete instantiation of abstract task models for robotic applications. Each task in this collection corresponds to a specific action a robot might be expected to perform. Compared to the foundational tasks, which mainly involve simplistic geometric visuals and collision data, this series elevates the realism through its tangible scenarios, bringing the task configurations closer to practical applications.
Within this series, the agent is tasked to efficiently operate multiple machines within a construction site while adeptly avoiding other robots and obstacles present in the vicinity.
Rewards#
reward_distance: At each time step, when the agent is closer to the goal machine, it will get a positive value of reward, and getting farther will cause a negative reward, the formula is expressed as follows.
\[r_t = (D_{last} - D_{now})\beta\]Obviously \(r_t>0\) when \(D_{last} > D_{now}\). where \(r_t\) denotes the current time step’s reward, \(D_{last}\) denotes the distance between the previous time step agent and the goal machine, \(D_{now}\) denotes the distance between the current time step agent and the goal machine, and \(\beta\) is a discount factor. That is, \(\beta\) is a discount factor.
reward_goal: Each time agent reach the position of the goal machine and touch it, getting a positive value of reward: \(R_{goal}\) for completing the goal.
Specific Setting#
Machines: After the agent touches the goal machine, the environment will refresh the goal machine and block the goal lidar observations (all set to 0) for the next 10 time steps, and the cost calculation involving Machines will also be blocked.
Episode End#
When episode length is greater than 1000:
Trucated = True
.
Level0#
The Level 0 of BuildingButton requires the agent to proficiently operate multiple machines within a construction site.
Specific Observation Space |
Box(-inf, inf, (32,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
machines lidar |
0 |
1 |
3 |
16 |
goal lidar |
0 |
1 |
3 |
Costs#
Nothing.
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-1, -1, 1, 1]\) |
uniform |
Level1#
The Level 1 of BuildingButton requires the agent to proficiently and accurately operate multiple machines within a construction site, while concurrently evading other robots and obstacles present in the area.
Specific Observation Space |
Box(-inf, inf, (64,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
machines lidar |
0 |
1 |
3 |
16 |
goal lidar |
0 |
1 |
3 |
16 |
robots lidar |
0 |
1 |
3 |
16 |
risk_areas lidar |
0 |
1 |
3 |
Costs#
Object |
Num |
Activated Constraint |
---|---|---|
4 |
||
4 |
||
4 |
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-1.5, -1.5, 1.5, 1.5]\) |
uniform |
Level2#
The Level 2 of BuildingButton requires the agent to proficiently and accurately operate multiple machines within a construction site, while concurrently evading a heightened number of other robots and obstacles in the area.
Specific Observation Space |
Box(-inf, inf, (64,), float64) |
---|---|
Specific Observation High |
inf |
Specific Observation Low |
-inf |
Import |
|
Specific Observation Space#
Size |
Observation |
Min |
Max |
Max Distance |
---|---|---|---|---|
16 |
machines lidar |
0 |
1 |
3 |
16 |
goal lidar |
0 |
1 |
3 |
16 |
robots lidar |
0 |
1 |
3 |
16 |
risk_areas lidar |
0 |
1 |
3 |
Costs#
Object |
Num |
Activated Constraint |
---|---|---|
4 |
||
6 |
||
8 |
Randomness#
Scope |
Range |
Distribution |
---|---|---|
rotation of agent and objects |
\([0, 2\pi]\) |
uniform |
location of agent and objects |
\([-1.8, -1.8, 1.8, 1.8]\) |
uniform |