Actors¶
DummyActorPolicy¶
-
class
causal_world.actors.
DummyActorPolicy
[source]¶ This is a policy wrapper for a dummy actor, which uses the interface of the actor policy but is basically fed the actions externally, (i.e just using the interface of the actor policy but actions are calculated externally)
BaseActorPolicy¶
-
class
causal_world.actors.
BaseActorPolicy
(identifier=None)[source]¶ This is a policy wrapper for an actor, its functions need to be filled to load the policy such that it can be used by the robot to act in the environment.
PushingActorPolicy¶
PickingActorPolicy¶
PickAndPlaceActorPolicy¶
Stacking2ActorPolicy¶
ReacherActorPolicy¶
GraspingPolicy¶
-
class
causal_world.actors.
GraspingPolicy
(tool_blocks_order)[source]¶ This policy is expected to run @25 Hz, its a hand designed policy for picking and placing blocks of a specific size 6.5CM weighing 20grams for the best result tried. The policy outputs desired normalized end_effector_positions
Description of phases: - Phase 0: Move finger-center above the cube center of the current
instruction.
- Phase 1: Lower finger-center down to encircle the target cube, and
close grip.
- Phase 2: Move finger-center up again, keeping the grip tight
(lifting the block).
- Phase 3: Smoothly move the finger-center toward the goal xy, keeping the
height constant.
- Phase 4: Move finger-center vertically toward goal height
(keeping relative difference of different finger heights given by h0), at the same time loosen the grip (i.e. increasing the radius of the “grip circle”).
Phase 5: Move finger center up again
Other variables and values: - alpha: interpolation value between two positions - ds: Distances of finger tips to grip center - t: time between 0 and 1 in current phase - phase: every instruction has 7 phases (described above) - program_counter: The index of the current instruction in the overall
program. Is incremented once the policy has successfully completed all phases.
Hyperparameters:
- phase_velocity_kthe speed at which phase “k” in the state machine
progresses.
- d0_r, d0_gb: Distance of finger tips from grip center while gripping the
object.
- gb_angle_spread: Angle between green and blue finger tips along the “grip
circle”.
d1_r, d1_gb: Distance of finger tips from grip center while not gripping
h1_r, h1_gb: Height of grip center while moving around
h0_r, h0_gb: Height of grip center to which it is lowered while grasping
- fall_trigger_h: if box is detected below this height when it is supposed
to be gripped, try grasping it again (reset phase to 0).