Reimplementation of Continuous Space #2584

quaquel · 2024-12-30T20:13:05Z

This reimplements continuous space and adds the basic starting point for an agent-centric API analogous to what is possible with cell spaces.

API examples
This reimplements ContinuousSpace in line with the API design established for cell spaces. A key design choice of the cell spaces is that movements become agent-centric (i.e., agent.cell = some_cell). This PR does the same but for continuous spaces. So, you can do agent.position += speed * heading. Likewise, you can do agent.get_nearest_neighbors(k=5) or agent.get_neigbors_in_radius(radius=3.1415).

space = ContinuousSpace([ [0, 1], [0, 1] ], torus=True, random=model.random)

space.agent_positions  # the numpy array with all agent positions
distances, agents = space.calculate_distances[0.5, 2])  

agents, distances = space.get_agents_in_radius([0.5, 0.5], radius=3.1415)
agents, distances = space.get_k_nearest_agents([0.5, 0.5], k=2)

agent = ContinousAgent(model, space)
agent.position = [agent.random.random(), agent.random.random()]
agent.position += [0.1, 0.05]

nearest_neighbor, distance = agent.get_nearest_neigbors()
neighbors, distances = agent.get_neigbors_in_radius(radius=0.5)

Implementation details
In passing, this PR contains various performance improvements and generalizes continuous spaces to n-d.

agent.position is a view into the numpy array with all agent positions. This is analogous to how cells access their value in a property layer. In contrast to the current implementation, the numpy array with all agent positions is never fully rebuilt. Rather, it is a masked array with possible empty rows. If the numpy array becomes too small, it is expanded by adding 20% more rows to it. Most of this will be fine-tuned further, becoming user controllable.

Regarding performance, I have spent most of last week reading up on r-trees, ball trees, and kd trees. Moreover, I ran various performance tests comparing trees against brute force distance calculations. It seems that brute force wins for MESA's use case. Why? Trees are amazing if you need to do frequent lookups and have comparatively few updates of locations. In MESA, however, we will often have many updates of locations (any agent movement will trigger an update of the location and, thus, an update of the tree). Once I established that brute force was the way to go, Next, I compared various ways of calculating Euclidian distances. I settled on using scipy.spatial.cdist for non-torus and and a for loop over the columns for manual Euclidian distance calculations in case of torus.

for more information, see https://pre-commit.ci

github-actions · 2024-12-30T20:21:10Z

Performance benchmarks:

Model	Size	Init time [95% CI]	Run time [95% CI]
BoltzmannWealth	small	🔵 -1.1% [-2.1%, -0.0%]	🔵 +0.2% [+0.0%, +0.4%]
BoltzmannWealth	large	🔵 -0.3% [-1.5%, +0.5%]	🔵 +0.4% [-0.9%, +1.5%]
Schelling	small	🔵 -0.1% [-0.3%, +0.1%]	🔵 -0.5% [-0.6%, -0.3%]
Schelling	large	🔵 -0.2% [-0.7%, +0.2%]	🔵 -0.7% [-1.5%, +0.1%]
WolfSheep	small	🔵 -0.9% [-1.5%, -0.3%]	🔵 +2.5% [-2.0%, +6.9%]
WolfSheep	large	🔵 -0.9% [-1.9%, +0.3%]	🔵 -0.8% [-1.9%, +0.7%]
BoidFlockers	small	🔵 -1.5% [-2.1%, -0.7%]	🔵 -1.6% [-2.5%, -0.8%]
BoidFlockers	large	🔵 -3.0% [-3.9%, -2.0%]	🔵 -2.4% [-2.7%, -2.0%]

for more information, see https://pre-commit.ci

EwoutH · 2024-12-30T21:28:46Z

Thanks for working on this!

A key design choice of the cell spaces is that movements become agent-centric (i.e., agent.cell = some_cell). This PR does the same but for continuous spaces. So, you can do agent.position += speed * heading. Likewise, you can do agent.get_nearest_neighbors(k=5) or agent.get_neigbors_in_radius(radius=3.1415).

Absolutely awesome, I fully support this design direction.

I will do an API / conceptual level review tomorrow. Let me know when you would like to have a code/implementation level review.

quaquel · 2024-12-30T21:36:15Z

I need to write all the tests, once those are done, it is ready for a code review. API feedback at this point, however, is very much welcome.
One thing I am struggling with is how to get a clean separation between the space and the agent. In cell spaces, we could do this via the cell class, but there is no real equivalent here. So, there is a rather tight coupling between a ContinuousSpaceAgent and the ContinousSpace.
I want to expand the Agent classes so we have a variety of agents with different degrees of movement. For this, I'll look at ABM language #1354 and Agent spatial methods from GaelLucero #2149, and try to develop a logical progression of agent classes with increasing support for movement. Again, further ideas on this are welcome.
I want to redo the boid example and see what the performance difference is.

EwoutH · 2024-12-31T08:15:48Z

Okay, first of all, this is excellent work. While I could probably fledge our the API, your implementations are simply superior because you have considered every angle and detail. Especially the array stuff, I'm impressed.

Let me review the API, comparing how to do stuff in the old and new continuous spaces.

Creating a Space and Agents:

# Current Implementation
space = ContinuousSpace(
    x_max=1.0, 
    y_max=1.0,
    torus=True,
    x_min=0.0,
    y_min=0.0
)

class MyAgent(Agent):
    def __init__(self, unique_id, model):
        super().__init__(unique_id, model)

agent = MyAgent(1, model)

# New Implementation 
space = ContinuousSpace(
    dimensions=[[0, 1], [0, 1]],  # More flexible, supports n-dimensions
    torus=True,
    random=model.random
)

class MyAgent(ContinuousSpaceAgent):
    def __init__(self, space, model):
        super().__init__(space, model)

agent = MyAgent(space, model)

The new dimensions parameter is way more elegant and extensible, and I like the syntax. I'm not sure having to pass the space to the agent constructor is ideal, it might be able to be derived from the model. Also, an Agent might want to be part of multiple spaces (albeit on the same coordinate/position).

Maybe the model can track which spaces there are. If there's one that could be the default, I will loop back to this later.

Also for random, can we use model.random by default?

Placing/Moving Agents:

# Current Implementation
space.place_agent(agent, pos=(0.5, 0.5))
space.move_agent(agent, pos=(0.6, 0.6))
current_pos = agent.pos  # Returns tuple (x,y)

# New Implementation
agent.position = [0.5, 0.5]  # Direct assignment
agent.position += [0.1, 0.1]  # Vector arithmetic
current_pos = agent.position  # Returns numpy array

The new vector-based approach is much more intuitive and powerful, especially for physics-based simulations. The ability to use numpy array operations directly is a big win.

I like the API and syntax, good stuff. The square brackets make it also feel more like an position/coordinate (probably very personal).

Getting Neighbors:

# Current Implementation
neighbors = space.get_neighbors(
    pos=(0.5, 0.5),
    radius=0.2,
    include_center=True
)

# No built-in nearest neighbors functionality

# New Implementation
# From agent's perspective
neighbors = agent.get_neigbors_in_radius(radius=0.2)
nearest = agent.get_nearest_neighbors(k=5)

# From space's perspective
distances = space.calculate_distances([0.5, 0.5])

Great to have this build in. The agent-centric approach is really elegant here. I would go with a single get_neighbors method however. It can have both a radius and at_most argument, just like we have in the AgentSet.select(). We can extend it in the future with filtering by type and Agent property. Or if it returns an AgentSet we can apply an select() over it, or let the user do that. Many possibilities here!

This also might be a place where we want to input the space or list of spaces (optionally). That way an Agent can search in a particular space. If there's only one space, we can default to that (and/or to the first space added to the model).

Accessing All Agents/Positions:

# Current Implementation
# Need to rebuild cache first
space._build_agent_cache()
all_positions = space._agent_points  # numpy array
agents = list(space._agent_to_index.keys())

# New Implementation
all_positions = space.agent_positions  # Direct access to numpy array
agents = space.agents  # Returns AgentSet

The new implementation is much cleaner and more efficient. No need to manually rebuild cache, and proper encapsulation of internal details. The AgentSet integration is a good choice. The direct access to positions through a property is more intuitive.

Great stuff.

Removing Agents:

# Current Implementation
space.remove_agent(agent)
agent.pos = None  # Must manually clear position

# New Implementation
agent.remove()  # Handles everything automatically

The new implementation is clearly superior. This will save us a lot of bug reports in the long term.

Checking Bounds/Torus Adjustments:

# Current Implementation
is_valid = not space.out_of_bounds(pos=(0.5, 1.2))
adjusted_pos = space.torus_adj((1.2, 1.3))

# New Implementation
is_valid = space.in_bounds([0.5, 1.2])
adjusted_pos = space.torus_correct([1.2, 1.3])

I haven't used this feature often, but the new method names feel more intuitive (in_bounds vs not out_of_bounds). The torus_correct name is also clearer than torus_adj, but I would also consider torus_valid.

Really amazing!

quaquel · 2024-12-31T08:38:04Z

Thanks!

Some quick reactions

I'm not sure having to pass the space to the agent constructor is ideal

This is indeed one of the places were I struggled to cleanly seperate everything. However, I don't see any other way of doing it. I prefer explicit over implicit. Thus, I want to avoid having to make guesses about the attribute name of the space. I am skeptical about agents being in multiple spaces at the same time. Regardless, if a user want this, this is still possible by subclassing Agent.

Also for random, can we use model.random by default?

Again, I prefer explicit over implicit. Moreover, this is identical to how it is handled in discrete spaces so it keeps the API consistent. Note that neither DiscreteSpace nor ContinuousSpace have model as an argument for their initialization, so we cannot default to model.random.

I would go with a single get_neighbors method however.

I am open to this suggestion, but not convinced yet. In most libraries I looked at last week, they cleanly seperate both cases because implementation wise they are quite different. For example, ball tree in sklearn has query and query_radius. It also would again involve having to make guesses. For example, you do a neighborhood search for a given radius and you want 5 agents at most. What does this mean? How do you want to select if there are more agents within the radius? Should this be random, should this based on nearnes? With these two methods explicitly seperate, users are free to write their own custom follow up methods.

One option might be to add this as a third method?

but I would also consider torus_valid.

The method changes the values that you pass, while this name suggests a boolean as a return.

EwoutH · 2024-12-31T08:52:56Z

I am skeptical about agents being in multiple spaces at the same time.

Practical use case: I have some layer of fixed agents (trees, intersections, houses) in a cell space, but I want other agents (birds, cars, people) to move between those in continuous space. Then I would like to have a Cell Space and a ContiniousSpace in the same model.

For example, you do a neighborhood search for a given radius and you want 5 agents at most. What does this mean? How do you want to select if there are more agents within the radius? Should this be random, should this based on nearnes?

I would say random by default, with an argument to switch it to nearest. "Get 5 random agents with 100 meters of me" sounds like a common use case for me. The beautiful thing about a good single-method implementation is that you are very flexible with combinations of keyword arguments (again like we do with select).

One option might be to add this as a third method?

And then when we think of a think of another selection criterea the number of functions could double again, of they all need an additional keyword argument. While I see your side of the argument, personally, I really like having a clean function name (select, get_neighbours) with clear arguments in a logical order and with sensible defaults.

In most libraries I looked at last week, they cleanly seperate both cases because implementation wise they are quite different.

Internally it can be different code paths.

quaquel · 2024-12-31T11:34:50Z

Regarding a single neighbors methods, I tried to work it out, but the code just gets confusing and messy very quickly. There are various possible code paths (at least 4 with just radius and k, one of which should raise an exception). I personally think that keeping them separate, at least for now, keeps things simpler. Note that the existing continuous space only has the radius version and not the k nearest neighbors.

However, while playing with this, I thought it might be useful to return not just the agents but also the distances. This makes it possible to do follow-up operations yourself easily:

some_list = agent.get_neigbors_in_radius(radius=3.1415)

# sort by distance assuming some_list is [(agent, distance), (agent, distance)]
some_list.sort(key=lambda element:element[1])

# get k nearest agents within radius
k_agents = [entry[0] for entry in some_list[0:k]]

quaquel · 2024-12-31T11:37:34Z

Practical use case: I have some layer of fixed agents (trees, intersections, houses) in a cell space, but I want other agents (birds, cars, people) to move between those in continuous space. Then I would like to have a Cell Space and a ContiniousSpace in the same model.

I am sorry but I don't see the use case. You can just as easily have all agents with a permanent location inside the same continuous space. There is no need to use a cell space for this. Moreover, it also raises again the spectre of coordinate systems and their alignment across multiple spaces.

for more information, see https://pre-commit.ci

EwoutH · 2025-01-01T14:34:32Z

While this is probably a move in the right direction anyway, and could be worth exploring on its own, it might be worth fully fledging our conceptual view on spaces first. Just to prevent doing double work.

Conceptual model of Space #2585

for more information, see https://pre-commit.ci

jackiekazil · 2025-01-07T14:14:53Z

Discussed during dev meeting -- Related work after to address after this complete #2149, #1354, #1278

EwoutH · 2025-01-07T14:15:19Z

@projectmesa/maintainers to review:

API (see Reimplementation of Continuous Space #2584 (comment))
Code implementation
Example implementation

Future work: #1278, #1354 and #2149.

for more information, see https://pre-commit.ci

EwoutH

A few initial comments and questions on the examples. Main code will be next.

EwoutH · 2025-01-08T16:50:59Z

mesa/examples/basic/boid_flockers/agents.py

@@ -6,10 +6,10 @@

 import numpy as np

-from mesa import Agent
+from mesa.experimental.continuous_space import ContinuousSpaceAgent


What's you long term vision on the different Agent types in Mesa? Will each space need their own Agent class?

I don't see a viable alternative at the moment other than having a set of different agent subclasses designed to work with different spaces.

If I am understanding correctly, related #2585 (which is an awesome discussion BTW) then if a user has say someone in continouspace and part of network, then their agent would use inheritance to get those particular functions?

mesa/examples/basic/boid_flockers/model.py

EwoutH · 2025-01-08T16:59:31Z

mesa/examples/basic/boid_flockers/agents.py

-        vision,
-        separation,
+        space,
+        position=(0, 0),


Wasn't this supposed to be a sequence/array with (like [0, 0])?

position in newstyle is np.typing.ArrayLike, so anything castable to np.array works.

The cleaner solution is to make position a positional argument rather than a keyword argument because this avoids the need for a default value. The nice thing of a kwarg, however, is that create_agents becomes a bit more readable.

mesa/examples/basic/boid_flockers/model.py

EwoutH

A few initial comments/questions on the spaces code.

mesa/experimental/continuous_space/continuous_space.py

EwoutH · 2025-01-08T17:05:03Z

mesa/experimental/continuous_space/continuous_space.py

+        self._agent_positions: np.array = np.empty(
+            (n_agents, self.dimensions.shape[0]), dtype=float
+        )
+        # self._agents: np.array = np.zeros((n_agents,), dtype=object)
+
+        self.active_agents = []
+        self.agent_positions: (
+            np.array
+        )  # a view on _agent_positions containing all active positions
+
+        self._n_agents = 0
+
+        self._index_to_agent: dict[int, Agent] = {}
+        self._agent_to_index: dict[Agent, int | None] = {}


Seems like 2 variables and 4 private variables to track agents and their positions. Could you add some docstring about this, for future reference?

Not sure docstring is the right place for that. I can add some more comments, however.

Maybe we needs some dev docs or something. PR description could also work (for now).

I added some comments, let me know if this is sufficient.

mesa/experimental/continuous_space/continuous_space.py

Co-authored-by: Ewout ter Hoeven <E.M.terHoeven@student.tudelft.nl>

for more information, see https://pre-commit.ci

tpike3

@quaquel Awesome as always! I always enjoy learning from your code. I had a few questions, but fantastic!

tpike3 · 2025-01-09T15:51:16Z

mesa/examples/basic/boid_flockers/agents.py

@@ -6,10 +6,10 @@

 import numpy as np

-from mesa import Agent
+from mesa.experimental.continuous_space import ContinuousSpaceAgent


If I am understanding correctly, related #2585 (which is an awesome discussion BTW) then if a user has say someone in continouspace and part of network, then their agent would use inheritance to get those particular functions?

tpike3 · 2025-01-09T15:52:02Z

mesa/examples/basic/boid_flockers/agents.py

@@ -58,47 +61,31 @@ def __init__(

    def step(self):
        """Get the Boid's neighbors, compute the new vector, and move accordingly."""
-        neighbors = self.model.space.get_neighbors(self.pos, self.vision, True)
+        neighbors, distances = self.get_neighbors_in_radius(radius=self.vision)


This is very clean!

tpike3 · 2025-01-09T16:00:09Z

mesa/experimental/continuous_space/continuous_space.py

+
+import numpy as np
+from numpy.typing import ArrayLike
+from scipy.spatial.distance import cdist


I am conflicted about adding a core dependency for one call, although it is scipy. Is there thoughts or plan to grow this to add more types of calculations (e.g. non Euclidian) or use scpiy in other spots?

Its part of anaconda, the code is roughly 4 times faster than a numpy only version, and you can use other distances with it, so I think it is worth it, but it can be changed if others see this differently.

yeah it's not ideal, but scipy is a common one, We could include it in the recommended (but not core) dependencies, like we do with pandas.

My bias is always for faster so I say leave it in as core or add to recommended (scipy is a part of examples). (I am also tracking numpy, pandas, and tqdm as core

I also proposed adding **kwargs to calculate_distances below so user can exploit some of the other cdist capability

mesa/experimental/continuous_space/continuous_space.py

tpike3 · 2025-01-09T16:06:19Z

mesa/experimental/continuous_space/continuous_space.py

+
+        return delta
+
+    def calculate_distances(


For for these three functions 1- calculate_distances, 2- get_agents_in_radius, 3-get_k_nearest_neighbors it seems the return structure changes - 1- array, list, 2- (list, array), 3-list, array

This seems to violate the principle of least surprise --- is there a reason for this I am just not understanding?

That is fair point; I have been wondering about that as well.

The main difference is that calculate_distances focuses on distances, so I have this as the first return. The other 4 (ContinousSpace.get_agents_in_radius, ContinousSpace.get_k_nearest_agents, and their ContinuousSpaceAgent versions ) all are about getting agents, so I return those first.

My bias would be to keep them all array, list this may be because I have been burned with GIS libraries switching long, lat to Lat, long too many times. However, I will leave it up to you

To optimize the use of the scipy library would you want to add **kwargs so users could change some of the key word arguments like metric?

I added the kwargs idea, but this only works if torus is not true.

quaquel · 2025-01-09T17:03:01Z

If I am understanding correctly, related #2585 (which is an awesome discussion BTW) then if a user has say someone in continouspace and part of network, then their agent would use inheritance to get those particular functions?

You can probably just do that with multiple inheritance at the moment, we might want to flesh that out further in the near future.

EwoutH

Let's get this in, and start building with it.

Thanks for driving this home Jan!

quaquel and others added 3 commits December 30, 2024 20:52

initial draft

2e47f56

Update continuous_space.py

6ef0b13

[pre-commit.ci] auto fixes from pre-commit.com hooks

a8bd211

for more information, see https://pre-commit.ci

quaquel and others added 2 commits December 30, 2024 21:40

ruff

797e681

[pre-commit.ci] auto fixes from pre-commit.com hooks

2782736

for more information, see https://pre-commit.ci

quaquel and others added 9 commits December 31, 2024 14:06

Update continuous_space.py

d0b05ec

first tests

611af52

[pre-commit.ci] auto fixes from pre-commit.com hooks

85c0bc2

for more information, see https://pre-commit.ci

docstring for agent

2e31c49

Update continuous_space_agents.py

322a50e

[pre-commit.ci] auto fixes from pre-commit.com hooks

aee0ff8

for more information, see https://pre-commit.ci

Merge branch 'main' into continuous_space

1e15913

add scipy as dependency

85665c5

[pre-commit.ci] auto fixes from pre-commit.com hooks

cdbeb2a

for more information, see https://pre-commit.ci

quaquel and others added 7 commits January 1, 2025 15:43

Update continuous_space_agents.py

04f66f9

ensure aligntment between agents and distances

a2d113e

[pre-commit.ci] auto fixes from pre-commit.com hooks

76f3ccd

for more information, see https://pre-commit.ci

Update continuous_space.py

d8b62b0

[pre-commit.ci] auto fixes from pre-commit.com hooks

c5ad8bb

for more information, see https://pre-commit.ci

typing fix

5695c3b

tests for distance calculations

3a985d8

pre-commit-ci bot and others added 5 commits January 6, 2025 10:45

[pre-commit.ci] auto fixes from pre-commit.com hooks

21c01ad

for more information, see https://pre-commit.ci

Update test_continuous_space.py

c7df9be

[pre-commit.ci] auto fixes from pre-commit.com hooks

c0782ab

for more information, see https://pre-commit.ci

Merge branch 'main' into continuous_space

6677e03

merge related fixes

6f0a46a

EwoutH mentioned this pull request Jan 7, 2025

Tracking issue: Stabilizing the Cell Space #2519

Open

21 tasks

quaquel and others added 6 commits January 7, 2025 22:16

Merge branch 'main' into continuous_space

36bf20a

typing and tests

7780b3a

[pre-commit.ci] auto fixes from pre-commit.com hooks

c2aa0f2

for more information, see https://pre-commit.ci

more tests

528aab5

Delete for_development.py

e98eadf

[pre-commit.ci] auto fixes from pre-commit.com hooks

983fbb9

for more information, see https://pre-commit.ci

EwoutH reviewed Jan 8, 2025

View reviewed changes

quaquel and others added 5 commits January 8, 2025 18:45

Update mesa/examples/basic/boid_flockers/model.py

c4de8de

Co-authored-by: Ewout ter Hoeven <E.M.terHoeven@student.tudelft.nl>

Update mesa/examples/basic/boid_flockers/model.py

23159bf

Co-authored-by: Ewout ter Hoeven <E.M.terHoeven@student.tudelft.nl>

expanded docstring and comments

0706667

[pre-commit.ci] auto fixes from pre-commit.com hooks

0cbbbb5

for more information, see https://pre-commit.ci

Update continuous_space.py

4187786

tpike3 reviewed Jan 9, 2025

View reviewed changes

tpike3 approved these changes Jan 9, 2025

View reviewed changes

EwoutH approved these changes Jan 9, 2025

View reviewed changes

EwoutH added feature Release notes label experimental Release notes label labels Jan 9, 2025

quaquel added 2 commits January 10, 2025 08:20

final tweaks

1b8937f

Update continuous_space.py

897d62e

quaquel merged commit 29d0f3b into projectmesa:main Jan 10, 2025
11 checks passed

Reimplementation of Continuous Space #2584

Reimplementation of Continuous Space #2584

Conversation

quaquel commented Dec 30, 2024 • edited Loading

github-actions bot commented Dec 30, 2024

EwoutH commented Dec 30, 2024

quaquel commented Dec 30, 2024

EwoutH commented Dec 31, 2024

quaquel commented Dec 31, 2024

EwoutH commented Dec 31, 2024

quaquel commented Dec 31, 2024 • edited Loading

quaquel commented Dec 31, 2024

EwoutH commented Jan 1, 2025

jackiekazil commented Jan 7, 2025 • edited Loading

EwoutH commented Jan 7, 2025 • edited Loading

EwoutH left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

EwoutH left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tpike3 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tpike3 Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

quaquel commented Jan 9, 2025

EwoutH left a comment

Choose a reason for hiding this comment

quaquel commented Dec 30, 2024 •

edited

Loading

quaquel commented Dec 31, 2024 •

edited

Loading

jackiekazil commented Jan 7, 2025 •

edited

Loading

EwoutH commented Jan 7, 2025 •

edited

Loading

tpike3 Jan 9, 2025 •

edited

Loading