Multi-Agent Exploration & Dynamic Obstacle Avoidance

In this project we developed a multi-robot mapping pipeline that can be used for manual or autonomous mapping of unknown terrains with multiple robots, without any prior information about the environment. Once the map is prepared, we incorporate dynamic environment changes into the occupancy grid using knowledge gained from camera sensors. This work was done during my research internship at RBBCPS, IISc.

Code · Slides

Manual Multi-Robot Mapping

For manual multi-robot mapping, we use standard LiDAR-based SLAM (Gmapping) for each individual robot to build its own map, then merge them into a global map. The merging uses a feature-matching algorithm that detects overlapping features and combines maps with or without knowing the initial pose of any robot. More details can be found here.

Map merging.

Multi-Robot Exploration

Exploration is preferred while mapping because the robots can autonomously generate a map without human effort. The classical approach finds frontier points on the edges of the occupancy grid and forwards them as goals to the robot. In our approach, we use RRT (Rapidly-exploring Random Trees) exploration, where a modified RRT algorithm detects frontier points and has proven much faster than standard exploration. We run two ROS nodes:

Global Frontier Detector — finds frontier points in the global occupancy grid.
Local Frontier Detector — finds frontier points in each robot’s local occupancy grid (one per robot).

RRT exploration simulation.

The algorithm runs until the loop-closure condition in the global map is satisfied — assuming the environment forms a closed loop, it stops once no open frontiers remain, indicating that all traversable parts of the terrain are mapped. More details about this package are on the Wiki.

Dynamic Obstacle Avoidance

Ground robots are commonly used in warehouses and factories where the environment is never static. We therefore created a system that can dynamically change the global costmap whenever the environment changes, using an object-detection and map-update pipeline.

3D Object Detection

We use 3D object detection instead of standard 2D detection because accurately updating the costmap requires the object’s position and an estimate of its real-world dimensions — a 2D bounding box can’t provide this, but a 3D bounding box in the camera frame can. We use MediaPipe’s Objectron module, which detects objects like chairs, shoes, mugs, and cameras.

3D object-detection samples using Objectron.

After obtaining the nine box points in the camera frame, we transform them into the real world using the Look-At transformation. Since a single camera can only map a point to a 3D line in the real world, we compute multiple such lines using multiple cameras and use gradient-descent optimization to recover the actual coordinates of each point — from which we compute the object’s center and dimensions.

Map Update

For map updates, we added a new layer to the global costmap as a plugin. The plugin receives data from Objectron and updates the cost at the object’s new location as well as its previous location.