With policy-controlled robots, it is always difficult to ensure that an updated policy can still solve every challenge the previous model could. This is where automated testing becomes important. In this post, we implement tests for a Unitree Go2 robot that uses an RL policy to execute waypoint missions.

We will focus only on testing here, but you can find the full project source at [https://github.com/art-e-fact/go2-example](https://github.com/art-e-fact/go2-example). Feel free to jump right in. We tried to make spinning up the demo and getting the robot walking as quick and simple as possible.

TODO: video here

The stack we will use:

- **dora-rs** is a Rust-based robotics framework. Its simplicity, low latency, and ease of integration make it a great choice for many robotics projects.
- **Isaac Sim** allows us to accurately simulate the robot in photo-realistic environments and use the same RL policy in simulation and on the real robot.
- **pytest** combines well with the Python API of dora-rs.
- **Artefacts** is perfect for test automation and for reviewing test results with our teammates.

# Unit testing nodes

## Simple input-output test

_See the full source_ [_here_](https://github.com/art-e-fact/go2-example/blob/aab4c57dcb5c86753f30870908a15104bd399780/nodes/policy_controller/tests/test_policy_controller.py)

First, we want to know whether the policy calculates the next action without errors.

The `policy` node takes:

- High-level navigation commands (x and y velocity, and rotation)
- Observations from the robot (velocity, joint positions, etc.)
- The clock time, which tells the node when the next action should be generated

And returns the low-level joint targets for the robot.
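
Before looking at the test, it helps to picture the rough shape of the node's `main()` loop: iterate over incoming dora events, cache the latest command and observations, and run inference whenever a clock message says it is time to act. The snippet below is only an illustrative sketch of that structure — `run_policy` is a made-up name, and the real inference and timing logic live in the linked source:

```python
from dora import Node


def main():
    node = Node()
    latest = {}  # most recent payload per input id

    for event in node:
        if event["type"] != "INPUT":
            continue
        latest[event["id"]] = event["value"]
        # Only act on clock ticks, and only once a command and observations have arrived.
        if event["id"] != "clock" or "command_2d" not in latest or "observations" not in latest:
            continue
        # `run_policy` stands in for the real RL inference that turns the cached
        # inputs into 12 joint position targets.
        joint_commands = run_policy(latest["command_2d"], latest["observations"])
        node.send_output("joint_commands", joint_commands.to_arrow())
```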



This is what a test case looks like:

```python
def test_generates_commands():
    """Check if the policy runs and generates joint commands."""
    with patch("policy_controller.main.Node") as MockNode:
        mock_node_instance = MagicMock()
        MockNode.return_value = mock_node_instance

        # Create mock inputs
        command_2d = Twist2D(linear_x=0.5, linear_y=0.0, angular_z=0.0)
        observations = Observations(
            lin_vel=np.zeros(3),
            ang_vel=np.zeros(3),
            gravity=np.array([0.0, 0.0, -9.81]),
            joint_positions=np.zeros(12),
            joint_velocities=np.zeros(12),
            height_scan=np.zeros(154),
        )
        clock = Timestamp.now()

        # The mocked node will yield these events when iterated.
        mock_node_instance.__iter__.return_value = [
            {"type": "INPUT", "id": "command_2d", "value": command_2d.to_arrow()},
            {"type": "INPUT", "id": "clock", "value": clock.to_arrow()},
            {
                "type": "INPUT",
                "id": "observations",
                "value": observations.to_arrow(),
            },
        ]

        from policy_controller.main import main

        main()

        # Check that send_output was called with joint_commands
        mock_node_instance.send_output.assert_called()
        args, _ = mock_node_instance.send_output.call_args
        assert args[0] == "joint_commands"
        joint_commands = JointCommands.from_arrow(args[1])
        assert joint_commands.positions is not None
        assert len(joint_commands.positions) == 12
```



The most important bits:

- We mock the dora-rs `Node` with a `MagicMock` so we can send inputs directly to the node and assert that it outputs the correct values.
- The input data can be almost anything as long as it is in the correct format; we only test the shape of the output.
- We can imitate the input data flow by mocking the events the node yields via `mock_node_instance.__iter__.return_value`.
- Once the mock is ready, we can import the policy node and execute its `main()` function.
- The policy node calls `send_output` on the mocked dora-rs node, which lets us assert that the expected output was generated using standard pytest assertions.
- We only validate the format of the output; the integration tests will check whether the robot actually behaves correctly.



## Testing multistep exchanges between nodes

_See the full source_ [_here_](https://github.com/art-e-fact/go2-example/blob/aab4c57dcb5c86753f30870908a15104bd399780/nodes/navigator/tests/test_navigator.py)

The `navigation` node takes:

- The pose of the robot. For this demo, we use the ground-truth pose from the simulator.
- The waypoint positions that the robot should reach.
- A tick message that triggers generating a new output.

And it outputs the high-level navigation command that the `policy` node consumes.
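
Internally, the navigator only needs fairly simple geometry: take the vector from the robot to the active waypoint, turn to reduce the heading error, and drive forward. The real node lives in the linked source; the function below is just a self-contained toy version of that idea (the gains and the speed limit are made up), included so the assertions in the test are easier to interpret:

```python
import numpy as np


def navigation_command(robot_xy, robot_yaw, waypoint_xy,
                       linear_gain=0.5, angular_gain=1.0, max_speed=1.0):
    """Toy proportional controller: drive toward the waypoint and turn to face it."""
    delta = np.asarray(waypoint_xy, dtype=float) - np.asarray(robot_xy, dtype=float)
    distance = float(np.linalg.norm(delta))
    heading_to_target = float(np.arctan2(delta[1], delta[0]))
    # Wrap the heading error into [-pi, pi]
    heading_error = (heading_to_target - robot_yaw + np.pi) % (2 * np.pi) - np.pi
    linear_x = min(linear_gain * distance, max_speed)
    angular_z = angular_gain * heading_error
    return linear_x, angular_z


# Robot at the origin facing +x with the waypoint straight ahead: full speed
# forward and no turning -- the situation the first tick in the test checks.
print(navigation_command([0.0, 0.0], 0.0, [10.0, 0.0]))  # -> (1.0, 0.0)
```

With that picture in mind, the stateful test looks like this: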

```python
def test_navigator_logic_stateful():
    """Test that the node sends a command output after every tick input."""
    # A queue to send events to the node
    event_queue = queue.Queue()

    with patch("navigator.main.Node") as MockNode:
        mock_node_instance = MagicMock()
        MockNode.return_value = mock_node_instance
        mock_node_instance.__iter__.return_value = iter(event_queue.get, None)

        from navigator.main import main

        # Run the main function in a separate thread
        main_thread = threading.Thread(target=main, daemon=True)
        main_thread.start()

        # Set initial robot pose and waypoints
        robot_pose = Transform.from_position_and_quaternion(
            np.array([0.0, 0.0, 0.0]),
            np.array([1.0, 0.0, 0.0, 0.0]),  # scalar-first (w, x, y, z)
        )
        waypoints = WaypointList(
            [
                Waypoint(
                    transform=Transform.from_position_and_quaternion(
                        np.array([10.0, 0.0, 0.0]), np.array([1.0, 0.0, 0.0, 0.0])
                    ),
                    status=WaypointStatus.ACTIVE,
                )
            ]
        )
        event_queue.put(
            {"type": "INPUT", "id": "robot_pose", "value": robot_pose.to_arrow()}
        )
        event_queue.put(
            {"type": "INPUT", "id": "waypoints", "value": waypoints.to_arrow()}
        )

        # Send a tick to trigger a command calculation
        event_queue.put({"type": "INPUT", "id": "tick", "value": pa.array([])})

        threading.Event().wait(0.1)  # Wait for async operations
        mock_node_instance.send_output.assert_called_once()
        args, _ = mock_node_instance.send_output.call_args
        assert args[0] == "command_2d"
        command_output = Twist2D.from_arrow(args[1])
        assert command_output.linear_x > 0, (
            f"Expected to go towards X direction, got {command_output}"
        )
        assert command_output.angular_z == 0.0, (
            f"Expected to go straight, got {command_output}"
        )

        # Reset mock to check for the next call
        mock_node_instance.send_output.reset_mock()

        # Set a new robot pose
        new_robot_pose = Transform.from_position_and_quaternion(
            np.array([10.0, 10.0, 0.0]),
            np.array([0.707, 0.0, 0.0, 0.707]),  # Rotated 90 degrees (facing +y)
        )
        event_queue.put(
            {"type": "INPUT", "id": "robot_pose", "value": new_robot_pose.to_arrow()}
        )

        # Send another tick
        event_queue.put({"type": "INPUT", "id": "tick", "value": pa.array([])})

        # Check the new output
        threading.Event().wait(0.1)  # Wait for async operations
        mock_node_instance.send_output.assert_called_once()
        args, _ = mock_node_instance.send_output.call_args
        assert args[0] == "command_2d"
        command_output = Twist2D.from_arrow(args[1])
        assert command_output.angular_z > 0.0, f"Expected to turn, got {command_output}"

        event_queue.put(None)  # Signal the iterator to end
        main_thread.join(timeout=1)  # Wait for the thread to finish
```

The main differences compared to the one-step test:

- The node runs in a separate thread. This allows us to let the node spin and wait for inputs as it would during normal execution.
- The inputs are passed through a `Queue` that is shared with the node. We can use it as the mocked input stream by converting it into an iterator with `iter(event_queue.get, None)`.
- To wait for the node to finish generating the output inside the thread, the test waits with `threading.Event().wait(0.1)` (a more adaptive alternative is sketched after this list).
- After validating the first output, we continue sending new inputs and testing the newly generated outputs.
- Instead of asserting the exact output values, we only check if the node sends the robot in the correct general direction.
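
One caveat with the fixed `threading.Event().wait(0.1)` is that it couples the test to wall-clock timing: too short and the test flakes on a slow machine, too long and the suite drags. A small polling helper makes the wait adaptive — this is not part of the repo, just a sketch of the idea:

```python
import time
from unittest.mock import MagicMock


def wait_for_call(mock_method: MagicMock, timeout: float = 1.0, interval: float = 0.01):
    """Poll until the mocked method has been called at least once, or fail."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if mock_method.call_count > 0:
            return
        time.sleep(interval)
    raise AssertionError(f"{mock_method} was not called within {timeout}s")


# Inside the test, replacing the fixed wait:
#   wait_for_call(mock_node_instance.send_output)
#   mock_node_instance.send_output.assert_called_once()
```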

# Integration testing

Dora is flexible about how the processes implementing the nodes are started. For example, we can use `pytest` to start one of our nodes. This lets us send messages to the rest of the system from inside test cases and validate the inputs we receive back.

The `tester` node is implemented as a pytest test file. For the full source, see [`test_waypoints_poses.py`](https://github.com/art-e-fact/go2-example/blob/49b654743439f872332630a8627bdcae776cd85e/nodes/tester/tester/test_waypoints_poses.py) and [`conftest.py`](https://github.com/art-e-fact/go2-example/blob/49b654743439f872332630a8627bdcae776cd85e/nodes/tester/tester/conftest.py). Here we will only discuss the key parts:

- The dora-rs node is provided to the tests as a fixture. This allows us to reuse the same node between test cases.

```python
@pytest.fixture(scope="session")
def session_node():
    """Create a Dora node for the full test session."""
    node = TestNode()
    yield node
    # Send the stop signal to tear down the rest of the nodes
    node.send_output("stop", pa.array([True]))
```

- The simulation runs at different speeds on faster and slower machines. To make the test timeouts hardware-agnostic, we need to use simulation time in the tests. See the full implementation in [`conftest.py`](https://github.com/art-e-fact/go2-example/blob/49b654743439f872332630a8627bdcae776cd85e/nodes/tester/tester/conftest.py), and the sketch after the fixture below.

```python
@pytest.fixture(scope="function")
def node(request, session_node: TestNode):
    """Provide TestNode that will reset the timeout before and after each test.

    Use `@pytest.mark.clock_timeout(seconds)` to set timeout per test.
    """
    # Reset timeout before and after each test
    session_node.reset_timeout()
    # Try to read the clock_timeout marker from the test function
    clock_timeout = request.node.get_closest_marker("clock_timeout")
    if clock_timeout:
        session_node.set_timeout(clock_timeout.args[0])
    yield session_node
    session_node.reset_timeout()
```
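
`TestNode` itself is defined in the repo's conftest. Conceptually it wraps a dora `Node`, forwards `send_output`, and tracks the simulation clock so a test can fail once the allowed number of simulated seconds has passed. The class below is only an illustration of that idea — the method names match the fixtures above, but the internals are assumptions, not the actual implementation:

```python
from dora import Node


class TestNode:
    """Illustrative wrapper around a dora Node with a simulation-clock timeout."""

    __test__ = False  # keep pytest from trying to collect this class as tests

    def __init__(self):
        self._node = Node()
        self.sim_time = 0.0        # updated whenever a clock message arrives
        self._timeout = None       # allowed *simulated* seconds for the current test
        self._timeout_start = None

    def send_output(self, output_id, data):
        self._node.send_output(output_id, data)

    def set_timeout(self, seconds):
        self._timeout = seconds
        self._timeout_start = self.sim_time

    def reset_timeout(self):
        self._timeout = None
        self._timeout_start = None

    def check_timeout(self):
        """Raise if the simulated clock has advanced past the configured budget."""
        if self._timeout is not None and self.sim_time - self._timeout_start > self._timeout:
            raise TimeoutError("simulation-clock timeout exceeded")
```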

- Isaac Sim can take a long time to start. Instead of restarting it for each scenario, we send a `load_scene` message to the `simulation` node. This will trigger the `simulation` node to reset the world and load a new scene in seconds.

```python
node.send_output(
    "load_scene", msgs.SceneInfo(name=scene, difficulty=difficulty).to_arrow()
)
```

We can use the built-in parametrization feature of pytest to validate the robot behavior in different configurations.

```python
@pytest.mark.parametrize("difficulty", [0.1, 0.7, 1.1])
@pytest.mark.clock_timeout(30)
def test_completes_waypoint_mission_with_variable_height_steps(node, difficulty: float):
    """Test that the waypoint mission completes successfully.

    The pyramid step height is configured via `difficulty`.
    """
    run_waypoint_mission_test(node, scene="generated_pyramid", difficulty=difficulty)


@pytest.mark.parametrize("scene", ["rail_blocks", "stone_stairs", "excavator"])
@pytest.mark.clock_timeout(50)
def test_completes_waypoint_mission_in_photo_realistic_env(node, scene: str):
    """Test that the waypoint mission completes successfully."""
    run_waypoint_mission_test(node, scene, difficulty=1.0)
```
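
Because of the parametrization, pytest expands these into individual test items (for example `test_completes_waypoint_mission_in_photo_realistic_env[stone_stairs]`), so every scene and difficulty shows up as its own pass/fail result, which keeps the test reports easy to review.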