Enhance workspace analyzer computational efficiency by yuecideng · Pull Request #230 · DexForce/EmbodiChain

yuecideng · 2026-04-14T04:48:03Z

Description

This PR optimizes the workspace analyzer for large-scale sampling (10k-100k+ samples) by replacing per-sample Python loops with batched GPU operations and vectorized algorithms.

Changes

Batch FK/IK (P0 - highest impact):

workspace_analyzer.py: Replaced per-sample compute_fk() loop with chunked compute_batch_fk() calls
workspace_analyzer.py: Replaced per-sample compute_ik() loop with chunked compute_batch_ik() calls
Extracted _get_reference_pose() helper for cleaner IK reference pose handling
Added check_constraints() to WorkspaceConstraintChecker combining bounds + collision in one call

Vectorized metrics (P1):

density_metric.py: Replaced O(N²) brute-force density with scipy.spatial.cKDTree (O(N log N)), with brute-force fallback
reachability_metric.py: Replaced dict-based voxel counting with np.unique(return_counts=True)
manipulability_metric.py: Replaced per-sample Jacobian loops with batch np.linalg.det() and np.linalg.svd()

Vectorized sampler (P1):

halton_sampler.py: Replaced nested Python loops with vectorized van der Corput computation using numpy broadcasting

Benchmark script:

scripts/benchmark/workspace_analyzer/benchmark_workspace_analyzer.py: Measures each optimization independently across sample sizes [100, 1000, 10000, 100000]

Benchmark Results (CPU-only metrics)

Component	10k samples	50k-100k samples
Halton sampler	8.5 ms	103 ms (100k)
Density metric	18 ms	135 ms (50k)
Voxelization	18 ms	208 ms (100k)
Manipulability	79 ms	395 ms (50k)

Batch FK/IK benchmarks require live robot/simulation and are expected to show 10-100x speedup.

Type of change

Bug fix (non-breaking change which fixes an issue)
Enhancement (non-breaking change which improves an existing functionality)
New feature (non-breaking change which adds functionality)
Breaking change (existing functionality will not work without user modification)
Documentation update

Screenshots

N/A

Checklist

I have run the black . command to format the code base.
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
Dependencies have been updated, if applicable.

🤖 Generated with Claude Code

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… Corput Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…peedup Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…izations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

This PR focuses on speeding up workspace analysis at large sample counts by batching FK/IK calls and vectorizing several metrics and sampling components.

Changes:

Updated WorkspaceAnalyzer to use batched FK (compute_batch_fk) and batched IK (compute_batch_ik), including a helper for IK reference pose selection.
Vectorized/optimized metrics (density via KDTree, reachability voxelization via np.unique, manipulability via batched det/svd) and Halton sampling.
Added a benchmark script to measure performance of individual optimizations.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
scripts/benchmark/workspace_analyzer/benchmark_workspace_analyzer.py	Adds a standalone benchmarking entrypoint for sampler/metric optimizations and placeholders for FK/IK benchmarks.
embodichain/lab/sim/utility/workspace_analyzer/workspace_analyzer.py	Switches FK/IK computation to batched robot APIs; adds `_get_reference_pose` and updates constraint checking usage.
embodichain/lab/sim/utility/workspace_analyzer/samplers/halton_sampler.py	Replaces loop-based Halton generation with a vectorized van der Corput implementation.
embodichain/lab/sim/utility/workspace_analyzer/metrics/reachability_metric.py	Replaces dict-based voxel counting with `np.unique(..., return_counts=True)`-based counting.
embodichain/lab/sim/utility/workspace_analyzer/metrics/manipulability_metric.py	Uses batched linear algebra for manipulability/condition numbers; handles empty valid-score cases.
embodichain/lab/sim/utility/workspace_analyzer/metrics/density_metric.py	Uses `scipy.spatial.cKDTree` to avoid O(N²) radius-density computation, with brute-force fallback.
embodichain/lab/sim/utility/workspace_analyzer/constraints/workspace_constraint.py	Adds `check_constraints()` convenience method combining bounds + collision checks.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-14T04:53:14Z

@@ -1003,208 +1019,127 @@ def compute_reachability(
        """
        num_samples = len(cartesian_points)


If cartesian_points is empty, batch_size = min(batch_size, num_samples) becomes 0, which will later make range(0, num_samples, batch_size) raise (step cannot be 0). Add an early return for num_samples == 0 before computing/clamping batch_size, or clamp batch_size to at least 1.

Suggested change

num_samples = len(cartesian_points)

num_samples = len(cartesian_points)

if num_samples == 0:

return (

cartesian_points,

cartesian_points[:0],

cartesian_points.new_empty((0,)),

torch.zeros((0,), dtype=torch.bool, device=cartesian_points.device),

cartesian_points.new_empty((0, 0)),

)

Copilot · 2026-04-14T04:53:15Z

        logger.log_info(
            f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "
            f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"


This log line divides by num_samples unconditionally: (valid_cartesian_mask.sum()/num_samples*100). If cartesian_points is empty (num_samples == 0), this will raise a ZeroDivisionError before the early-return fix above. Please guard the percentage calculation when num_samples == 0.

Suggested change

logger.log_info(

f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "

f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"

valid_percentage = (

valid_cartesian_mask.sum() / num_samples * 100 if num_samples > 0 else 0.0

)

logger.log_info(

f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "

f"points pass workspace constraints ({valid_percentage:.1f}%)"

Copilot · 2026-04-14T04:53:15Z

+                    # Update best results: prefer first success, then accumulate
+                    newly_succeeded = success & ~batch_best_success.bool()
+                    batch_best_success[newly_succeeded] = 1.0
+                    batch_best_qpos[newly_succeeded] = qpos[newly_succeeded]

-                    # Count successes
-                    if ret is not None and ret[0]:
-                        success_count += 1
-                        # Store first successful configuration
-                        if best_qpos is None:
-                            best_qpos = qpos[0]  # Extract from batch dimension
+                    # Accumulate success rate
+                    batch_best_success += success.float()


batch_best_success is used as a success counter, but newly-succeeded entries are set to 1.0 and then immediately incremented again via batch_best_success += success.float(), which double-counts the first success for each point (inflating success_rates_batch). Consider tracking (1) a boolean has_best for storing first successful qpos and (2) a separate success_counts tensor for counting successes, or just remove the = 1.0 assignment and only accumulate via += success.float().

Copilot · 2026-04-14T04:53:15Z

+            # Count neighbors within radius for all points at once
+            counts = tree.query_ball_point(points, r=radius, return_length=True)


cKDTree.query_ball_point(..., return_length=True) is not supported in some SciPy versions (it raises TypeError: unexpected keyword argument 'return_length'). Since SciPy isn’t version-pinned in this repo, consider catching TypeError here (in addition to ImportError) and falling back to the brute-force path or to np.fromiter((len(x) for x in tree.query_ball_point(...)), ...) for broader compatibility.

Suggested change

# Count neighbors within radius for all points at once

counts = tree.query_ball_point(points, r=radius, return_length=True)

# Count neighbors within radius for all points at once.

# Older SciPy versions do not support `return_length=True`,

# so fall back to counting the returned neighbor lists.

try:

counts = tree.query_ball_point(points, r=radius, return_length=True)

except TypeError:

neighbors = tree.query_ball_point(points, r=radius)

counts = np.fromiter(

(len(neighbor_list) for neighbor_list in neighbors),

dtype=np.int64,

count=len(points),

)

Copilot · 2026-04-14T04:53:15Z

        num_samples = len(joint_configs)
+        batch_size = batch_size or self.config.sampling.batch_size
+        # Cap batch size to total samples
+        batch_size = min(batch_size, num_samples)
+
+        logger.log_info(
+            f"Computing FK for {num_samples} samples (batch_size={batch_size})..."
+        )

+        # Pre-allocate lists for results
        workspace_points_list = []
        valid_configs_list = []
-
-        logger.log_info(f"Computing FK for {num_samples} samples...")
-
-        # Track valid points for progress bar
        total_valid = 0

-        # Robot expects one configuration at a time (batch_size from robot environments, not samples)
-        # Process each configuration individually
        pbar = self._create_optimized_tqdm(
-            range(num_samples),
-            desc="Forward Kinematics",
-            unit="cfg",
+            range(0, num_samples, batch_size),
+            desc="Forward Kinematics (batched)",


When joint_configs is empty, batch_size = min(batch_size, num_samples) becomes 0 and then range(0, num_samples, batch_size) raises (step cannot be 0). Consider an early return for num_samples == 0 (empty workspace_points / valid_configs) before computing batch_size/creating the tqdm range, or ensure batch_size >= 1.

Copilot

Pull request overview

Copilot reviewed 12 out of 12 changed files in this pull request and generated 7 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-15T10:15:04Z

@@ -99,7 +99,8 @@ def _sample_from_bounds(
        # Calculate samples per dimension if not provided
        if self.samples_per_dim is None:
            # Compute samples_per_dim to approximate the desired num_samples
-            samples_per_dim = max(2, int(np.ceil(num_samples ** (1.0 / n_dims))))
+            # Use floor to ensure actual grid size never exceeds num_samples
+            samples_per_dim = max(2, int(num_samples ** (1.0 / n_dims)))
        else:


The docstring now states the grid size "will not exceed" num_samples, but with samples_per_dim = max(2, int(num_samples ** (1.0 / n_dims))) the actual grid can still exceed num_samples for small requests (e.g., num_samples=3, n_dims=2 -> samples_per_dim=2 -> 4 samples). Either adjust the wording to match the behavior, or change the logic so samples_per_dim**n_dims <= num_samples while still handling small num_samples cleanly.

Copilot · 2026-04-15T10:15:04Z

            valid_mask = manipulability_scores >= self.config.jacobian_threshold
            valid_scores = manipulability_scores[valid_mask]

+            if len(valid_scores) == 0:
+                valid_scores = np.array([0.0])
+
        self.results = {


When no Jacobians pass the threshold, setting valid_scores = np.array([0.0]) makes num_valid_points report 1 and forces min/max to 0, which is misleading (it implies one valid sample). Prefer keeping num_valid_points = 0 and explicitly setting mean/std/min/max to 0.0 (or NaN) for the empty-valid set.

Copilot · 2026-04-15T10:15:05Z

    config = SimulationManagerCfg(
-        headless=False, sim_device="cpu", width=1080, height=1080
+        headless=False, sim_device="cuda", width=1080, height=1080
    )


This example now hard-codes sim_device="cuda", which will fail on CPU-only machines and in CI environments without a GPU. Consider keeping the default as cpu (or adding a simple runtime check / CLI flag) so the example remains runnable everywhere.

Copilot · 2026-04-15T10:15:05Z


    wa_joint = WorkspaceAnalyzer(robot=robot, sim_manager=sim_manager)
-    results_joint = wa_joint.analyze(num_samples=3000, visualize=True)
+    results_joint = wa_joint.analyze(num_samples=30000, visualize=True)


Raising the default num_samples from 3,000 to 30,000 can make this example take significantly longer (and may be too slow for quick-start usage). Consider keeping a smaller default and documenting how to increase it for performance testing, or making it configurable via an argument/env var.

Copilot · 2026-04-15T10:15:05Z

        num_samples = len(joint_configs)
+        batch_size = batch_size or self.config.sampling.batch_size
+        # Cap batch size to total samples
+        batch_size = min(batch_size, num_samples)
+
+        logger.log_info(
+            f"Computing FK for {num_samples} samples (batch_size={batch_size})..."
+        )

+        # Pre-allocate lists for results
        workspace_points_list = []
        valid_configs_list = []
-
-        logger.log_info(f"Computing FK for {num_samples} samples...")
-
-        # Track valid points for progress bar
        total_valid = 0

-        # Robot expects one configuration at a time (batch_size from robot environments, not samples)
-        # Process each configuration individually
        pbar = self._create_optimized_tqdm(
-            range(num_samples),
-            desc="Forward Kinematics",
-            unit="cfg",
+            range(0, num_samples, batch_size),
+            desc="Forward Kinematics (batched)",
+            unit="batch",


batch_size = min(batch_size, num_samples) can produce batch_size == 0 when num_samples is 0 (or config batch_size is 0), which makes range(0, num_samples, batch_size) raise ValueError (step must be non-zero). Add an early return for num_samples == 0 (and/or validate batch_size >= 1) before constructing the progress range.

Copilot · 2026-04-15T10:15:05Z


        logger.log_info(
            f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "
            f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"


This log line computes a percentage as a torch scalar and formats it with :.1f, which raises TypeError: unsupported format string passed to Tensor.__format__. Also, when num_samples == 0 it will divide by zero. Convert the mask sum to a Python number via .item() and guard the percentage computation for the empty-input case.

Suggested change

logger.log_info(

f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "

f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"

num_valid_cartesian_points = valid_cartesian_mask.sum().item()

valid_cartesian_percentage = (

(num_valid_cartesian_points / num_samples * 100) if num_samples else 0.0

)

logger.log_info(

f"Pre-filtered Cartesian points: {num_valid_cartesian_points}/{num_samples} "

f"points pass workspace constraints ({valid_cartesian_percentage:.1f}%)"

Copilot · 2026-04-15T10:15:06Z

        num_samples = len(cartesian_points)
        ik_samples_per_point = self.config.ik_samples_per_point
+        batch_size = batch_size or self.config.sampling.batch_size
+        batch_size = min(batch_size, num_samples)

-        # Pre-filter Cartesian points by workspace constraints
-        # This eliminates points that are outside bounds or in collision zones
-        valid_cartesian_mask = self.constraint_checker.check_bounds(
+        # Pre-filter by workspace constraints (vectorized)
+        valid_cartesian_mask = self.constraint_checker.check_constraints(
            cartesian_points
-        ) & self.constraint_checker.check_collision(cartesian_points)
+        )

        logger.log_info(
            f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "
            f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"
        )

-        # Store results for all points (including invalid ones for consistent indexing)
+        # Get reference end-effector pose for IK target orientation
+        current_ee_pose = self._get_reference_pose()
+
+        # Initialize result arrays
        all_success_rates = torch.zeros(num_samples, device=self.device)
        reachable_points_list = []
        best_configs_list = []
+        total_reachable = 0

-        logger.log_info(
-            f"Computing IK for {num_samples} Cartesian samples "
-            f"({ik_samples_per_point} seeds per point)..."
-        )
-
-        # Create a random sampler for generating IK seeds (avoid UniformSampler issues)
+        # Prepare random seeds for all attempts
        from embodichain.lab.sim.utility.workspace_analyzer.samplers import (
            RandomSampler,
        )

-        random_sampler = RandomSampler(seed=self.config.sampling.seed)
-
-        # Get reference end-effector pose for IK target orientation
-        # Priority: use reference_pose if provided, otherwise compute from current joint configuration
-        if (
-            hasattr(self.config, "reference_pose")
-            and self.config.reference_pose is not None
-        ):
-            # Use provided reference pose (should be 4x4 transformation matrix)
-            reference_pose = self.config.reference_pose
-            if isinstance(reference_pose, np.ndarray):
-                reference_pose = torch.from_numpy(reference_pose).to(self.device)
-            if reference_pose.dim() == 2:  # Shape: (4, 4) -> (1, 4, 4)
-                reference_pose = reference_pose.unsqueeze(0)
-            current_ee_pose = reference_pose  # Shape: (1, 4, 4)
-            logger.log_info("Using provided reference pose for IK target orientation")
-        else:
-            # Fallback: compute current end-effector pose from joint configuration
-            try:
-                # Using first environment (index 0) for qpos retrieval
-                current_qpos = self.robot.get_qpos()[0][
-                    self.robot.get_joint_ids(self.control_part_name)
-                ]
-                current_ee_pose = self.robot.compute_fk(
-                    name=self.control_part_name,
-                    qpos=current_qpos.unsqueeze(0),
-                    to_matrix=True,
-                )  # Shape: (1, 4, 4)
-                logger.log_info(
-                    "Computing reference pose from current robot configuration"
-                )
-            except Exception as e:
-                logger.log_warning(f"Failed to compute current robot pose: {e}")
-                # Create identity pose as fallback
-                current_ee_pose = torch.eye(4, device=self.device).unsqueeze(0)
-                current_ee_pose[0, :3, 3] = torch.tensor(
-                    [0.5, 0.0, 1.0], device=self.device
-                )  # Default position
-                logger.log_info("Using default identity pose as fallback")
-
-            # Print current joint configuration and computed pose
-            pose_np = current_ee_pose[0].cpu().numpy()
-            position = pose_np[:3, 3]
-            rotation_matrix = pose_np[:3, :3]
-
-            # Convert rotation matrix to Euler angles
-            import scipy.spatial.transform as spt
-
-            euler_angles = spt.Rotation.from_matrix(rotation_matrix).as_euler(
-                "xyz", degrees=True
-            )
-
-        # Print detailed reference pose information
-        pose_np = current_ee_pose[0].cpu().numpy()
-        position = pose_np[:3, 3]
-        rotation_matrix = pose_np[:3, :3]
-
-        # Convert rotation matrix to Euler angles (ZYX convention)
-        import scipy.spatial.transform as spt
-
-        euler_angles = spt.Rotation.from_matrix(rotation_matrix).as_euler(
-            "xyz", degrees=True
+        random_sampler = RandomSampler(
+            seed=self.config.sampling.seed, device=self.device
        )

-        # Format matrix with proper indentation
-        matrix_lines = np.array2string(pose_np, precision=4, suppress_small=True).split(
-            "\n"
-        )
-        matrix_str = "\n".join(f"\t   {line}" for line in matrix_lines)
        logger.log_info(
-            f"🎯 Using provided reference pose for IK target orientation:\n"
-            f"\t Position: [{position[0]:.4f}, {position[1]:.4f}, {position[2]:.4f}] m\n"
-            f"\t Rotation (XYZ Euler): [{euler_angles[0]:.2f}°, {euler_angles[1]:.2f}°, {euler_angles[2]:.2f}°]\n"
-            f"\t Matrix:\n{matrix_str}"
+            f"Computing IK for {num_samples} Cartesian samples "
+            f"(batch_size={batch_size}, {ik_samples_per_point} seeds per point)..."
        )

-        # Track statistics for progress bar
-        total_reachable = 0
-
-        # Process each point individually (robot expects batch_size from environments, not samples)
        pbar = self._create_optimized_tqdm(
-            range(num_samples),
-            desc="Inverse Kinematics",
-            unit="pt",
+            range(0, num_samples, batch_size),
+            desc="Inverse Kinematics (batched)",
+            unit="batch",
            color="magenta",


batch_size = min(batch_size, num_samples) can yield batch_size == 0 when num_samples is 0 (or batch_size is misconfigured as 0). That will make range(0, num_samples, batch_size) invalid (step=0). Consider early-returning a fully-empty result tuple for num_samples == 0 and validating batch_size >= 1 before constructing the tqdm range.

Copilot

Pull request overview

Copilot reviewed 12 out of 12 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (1)

embodichain/lab/sim/utility/workspace_analyzer/samplers/iniform_sampler.py:108

The updated docstring/logic claims the generated grid size (samples_per_dim**n_dims) will not exceed num_samples, but samples_per_dim = max(2, int(num_samples ** (1/n_dims))) can still overshoot for small num_samples / high n_dims (e.g., num_samples=2, n_dims=6 yields samples_per_dim=2 => 64 samples). Either relax the docstring, or adjust the computation to ensure samples_per_dim**n_dims <= num_samples (e.g., allow samples_per_dim to be 1, or decrement until the constraint holds).

                Note: The actual number of samples (samples_per_dim^n_dims) will not
                exceed this value, but may be less to maintain a uniform grid.

        Returns:
            Tensor of shape (actual_num_samples, n_dims) containing the sampled points.
            The actual number of samples will be samples_per_dim^n_dims.

        Raises:
            ValueError: If bounds are invalid.

        Examples:
            >>> sampler = UniformSampler(samples_per_dim=3)
            >>> bounds = torch.tensor([[-1, 1], [-1, 1]], dtype=torch.float32)
            >>> samples = sampler.sample(bounds, num_samples=10)
            >>> samples.shape
            torch.Size([9, 2])  # 3^2 = 9 samples
        """
        bounds = self._validate_bounds(bounds)

        n_dims = bounds.shape[0]

        # Calculate samples per dimension if not provided
        if self.samples_per_dim is None:
            # Compute samples_per_dim to approximate the desired num_samples
            # Use floor to ensure actual grid size never exceeds num_samples
            samples_per_dim = max(2, int(num_samples ** (1.0 / n_dims)))
        else:
            samples_per_dim = self.samples_per_dim

        actual_num_samples = samples_per_dim**n_dims

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-15T13:54:38Z

        num_samples = len(cartesian_points)
        ik_samples_per_point = self.config.ik_samples_per_point
+        batch_size = batch_size or self.config.sampling.batch_size
+        batch_size = min(batch_size, num_samples)

-        # Pre-filter Cartesian points by workspace constraints
-        # This eliminates points that are outside bounds or in collision zones
-        valid_cartesian_mask = self.constraint_checker.check_bounds(
+        # Pre-filter by workspace constraints (vectorized)
+        valid_cartesian_mask = self.constraint_checker.check_constraints(
            cartesian_points
-        ) & self.constraint_checker.check_collision(cartesian_points)
+        )

        logger.log_info(
            f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "
            f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"
        )

-        # Store results for all points (including invalid ones for consistent indexing)
+        # Get reference end-effector pose for IK target orientation
+        current_ee_pose = self._get_reference_pose()
+
+        # Initialize result arrays
        all_success_rates = torch.zeros(num_samples, device=self.device)
        reachable_points_list = []
        best_configs_list = []
+        total_reachable = 0

-        logger.log_info(
-            f"Computing IK for {num_samples} Cartesian samples "
-            f"({ik_samples_per_point} seeds per point)..."
-        )
-
-        # Create a random sampler for generating IK seeds (avoid UniformSampler issues)
+        # Prepare random seeds for all attempts
        from embodichain.lab.sim.utility.workspace_analyzer.samplers import (
            RandomSampler,
        )

-        random_sampler = RandomSampler(seed=self.config.sampling.seed)
-
-        # Get reference end-effector pose for IK target orientation
-        # Priority: use reference_pose if provided, otherwise compute from current joint configuration
-        if (
-            hasattr(self.config, "reference_pose")
-            and self.config.reference_pose is not None
-        ):
-            # Use provided reference pose (should be 4x4 transformation matrix)
-            reference_pose = self.config.reference_pose
-            if isinstance(reference_pose, np.ndarray):
-                reference_pose = torch.from_numpy(reference_pose).to(self.device)
-            if reference_pose.dim() == 2:  # Shape: (4, 4) -> (1, 4, 4)
-                reference_pose = reference_pose.unsqueeze(0)
-            current_ee_pose = reference_pose  # Shape: (1, 4, 4)
-            logger.log_info("Using provided reference pose for IK target orientation")
-        else:
-            # Fallback: compute current end-effector pose from joint configuration
-            try:
-                # Using first environment (index 0) for qpos retrieval
-                current_qpos = self.robot.get_qpos()[0][
-                    self.robot.get_joint_ids(self.control_part_name)
-                ]
-                current_ee_pose = self.robot.compute_fk(
-                    name=self.control_part_name,
-                    qpos=current_qpos.unsqueeze(0),
-                    to_matrix=True,
-                )  # Shape: (1, 4, 4)
-                logger.log_info(
-                    "Computing reference pose from current robot configuration"
-                )
-            except Exception as e:
-                logger.log_warning(f"Failed to compute current robot pose: {e}")
-                # Create identity pose as fallback
-                current_ee_pose = torch.eye(4, device=self.device).unsqueeze(0)
-                current_ee_pose[0, :3, 3] = torch.tensor(
-                    [0.5, 0.0, 1.0], device=self.device
-                )  # Default position
-                logger.log_info("Using default identity pose as fallback")
-
-            # Print current joint configuration and computed pose
-            pose_np = current_ee_pose[0].cpu().numpy()
-            position = pose_np[:3, 3]
-            rotation_matrix = pose_np[:3, :3]
-
-            # Convert rotation matrix to Euler angles
-            import scipy.spatial.transform as spt
-
-            euler_angles = spt.Rotation.from_matrix(rotation_matrix).as_euler(
-                "xyz", degrees=True
-            )
-
-        # Print detailed reference pose information
-        pose_np = current_ee_pose[0].cpu().numpy()
-        position = pose_np[:3, 3]
-        rotation_matrix = pose_np[:3, :3]
-
-        # Convert rotation matrix to Euler angles (ZYX convention)
-        import scipy.spatial.transform as spt
-
-        euler_angles = spt.Rotation.from_matrix(rotation_matrix).as_euler(
-            "xyz", degrees=True
+        random_sampler = RandomSampler(
+            seed=self.config.sampling.seed, device=self.device
        )

-        # Format matrix with proper indentation
-        matrix_lines = np.array2string(pose_np, precision=4, suppress_small=True).split(
-            "\n"
-        )
-        matrix_str = "\n".join(f"\t   {line}" for line in matrix_lines)
        logger.log_info(
-            f"🎯 Using provided reference pose for IK target orientation:\n"
-            f"\t Position: [{position[0]:.4f}, {position[1]:.4f}, {position[2]:.4f}] m\n"
-            f"\t Rotation (XYZ Euler): [{euler_angles[0]:.2f}°, {euler_angles[1]:.2f}°, {euler_angles[2]:.2f}°]\n"
-            f"\t Matrix:\n{matrix_str}"
+            f"Computing IK for {num_samples} Cartesian samples "
+            f"(batch_size={batch_size}, {ik_samples_per_point} seeds per point)..."
        )

-        # Track statistics for progress bar
-        total_reachable = 0
-
-        # Process each point individually (robot expects batch_size from environments, not samples)
        pbar = self._create_optimized_tqdm(
-            range(num_samples),
-            desc="Inverse Kinematics",
-            unit="pt",
+            range(0, num_samples, batch_size),
+            desc="Inverse Kinematics (batched)",
+            unit="batch",
            color="magenta",
            emoji="🎯",
        )


compute_reachability() has the same empty-input issue as FK: if cartesian_points is empty then batch_size becomes 0 and range(0, num_samples, batch_size) will throw. Also the pre-filter log computes (valid_cartesian_mask.sum()/num_samples*100) which will divide by zero. Add a num_samples == 0 guard before logging / creating the progress bar and return empty outputs.

Copilot · 2026-04-15T13:54:39Z

+        if (
+            hasattr(self.config, "reference_pose")
+            and self.config.reference_pose is not None
+        ):
+            reference_pose = self.config.reference_pose
+            if isinstance(reference_pose, np.ndarray):
+                reference_pose = torch.from_numpy(reference_pose).to(self.device)
+            if reference_pose.dim() == 2:
+                reference_pose = reference_pose.unsqueeze(0)
+            logger.log_info("Using provided reference pose for IK target orientation")
+            return reference_pose


_get_reference_pose() only moves reference_pose to self.device when it is a NumPy array. If callers provide a torch.Tensor on CPU (or wrong dtype), the returned pose can be on the wrong device and later compute_batch_ik will error due to device mismatch. Normalize reference_pose for both numpy and torch inputs (e.g., reference_pose = torch.as_tensor(..., device=self.device, dtype=torch.float32)), and keep the (1,4,4) shape handling.

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

yuecideng and others added 9 commits April 14, 2026 04:21

perf(workspace_analyzer): vectorize voxelization with np.unique

63ffa3c

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(workspace_analyzer): add check_constraints() combined method

1b3b483

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

perf(workspace_analyzer): batch manipulability with vectorized linalg

4802735

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

perf(workspace_analyzer): vectorize density metric with scipy cKDTree

bfd19b6

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

perf(workspace_analyzer): vectorize Halton sampler with batch van der…

7adb713

… Corput Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

perf(workspace_analyzer): batch FK via compute_batch_fk for 10-100x s…

a825d4f

…peedup Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

perf(workspace_analyzer): batch IK via compute_batch_ik for 10-100x s…

e925980

…peedup Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

bench(workspace_analyzer): add benchmark script for performance optim…

a0cac7a

…izations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

style: run black formatter on workspace_analyzer changes

414189d

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings April 14, 2026 04:48

Copilot started reviewing on behalf of yuecideng April 14, 2026 04:48 View session

Copilot AI reviewed Apr 14, 2026

View reviewed changes

yuecideng added 3 commits April 15, 2026 16:52

wip

756fd90

wip

0110a88

wip

23a747c

yuecideng marked this pull request as ready for review April 15, 2026 10:09

Copilot AI review requested due to automatic review settings April 15, 2026 10:09

Copilot started reviewing on behalf of yuecideng April 15, 2026 10:10 View session

Merge branch 'main' into enhance/workspace-analyzer-perf

3abc086

Copilot AI reviewed Apr 15, 2026

View reviewed changes

yuecideng added 2 commits April 15, 2026 18:58

Merge branch 'main' into enhance/workspace-analyzer-perf

9b5b9de

Merge branch 'main' into enhance/workspace-analyzer-perf

2005021

Copilot AI review requested due to automatic review settings April 15, 2026 13:49

Copilot started reviewing on behalf of yuecideng April 15, 2026 13:49 View session

Copilot AI reviewed Apr 15, 2026

View reviewed changes

yuecideng merged commit d860435 into main Apr 15, 2026
8 checks passed

yuecideng deleted the enhance/workspace-analyzer-perf branch April 15, 2026 14:08

chase6305 pushed a commit that referenced this pull request Apr 16, 2026

Enhance workspace analyzer computational efficiency (#230)

29f7820

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

yangchen73 pushed a commit that referenced this pull request Apr 16, 2026

Enhance workspace analyzer computational efficiency (#230)

5d55524

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

		@@ -1003,208 +1019,127 @@ def compute_reachability(
		"""
		num_samples = len(cartesian_points)

-        num_samples = len(cartesian_points)
+        num_samples = len(cartesian_points)
+        if num_samples == 0:
+            return (
+                cartesian_points,
+                cartesian_points[:0],
+                cartesian_points.new_empty((0,)),
+                torch.zeros((0,), dtype=torch.bool, device=cartesian_points.device),
+                cartesian_points.new_empty((0, 0)),
+            )

		# Count neighbors within radius for all points at once
		counts = tree.query_ball_point(points, r=radius, return_length=True)

-            # Count neighbors within radius for all points at once
-            counts = tree.query_ball_point(points, r=radius, return_length=True)
+            # Count neighbors within radius for all points at once.
+            # Older SciPy versions do not support `return_length=True`,
+            # so fall back to counting the returned neighbor lists.
+            try:
+                counts = tree.query_ball_point(points, r=radius, return_length=True)
+            except TypeError:
+                neighbors = tree.query_ball_point(points, r=radius)
+                counts = np.fromiter(
+                    (len(neighbor_list) for neighbor_list in neighbors),
+                    dtype=np.int64,
+                    count=len(points),
+                )

-        logger.log_info(
-            f"Pre-filtered Cartesian points: {valid_cartesian_mask.sum()}/{num_samples} "
-            f"points pass workspace constraints ({(valid_cartesian_mask.sum()/num_samples*100):.1f}%)"
+        num_valid_cartesian_points = valid_cartesian_mask.sum().item()
+        valid_cartesian_percentage = (
+            (num_valid_cartesian_points / num_samples * 100) if num_samples else 0.0
+        )
+        logger.log_info(
+            f"Pre-filtered Cartesian points: {num_valid_cartesian_points}/{num_samples} "
+            f"points pass workspace constraints ({valid_cartesian_percentage:.1f}%)"

Conversation

yuecideng commented Apr 14, 2026

Description

Changes

Benchmark Results (CPU-only metrics)

Type of change

Screenshots

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants