On resetting a state to a given state / Extracting image observation from past state? #439

famishedrover · 2023-08-10T18:37:49Z

Suppose I sample several states in the past and store them. Is it possible to reset the environment object to a given past state quickly? I am flexible is storing necessary state information to make the re-store possible. However the solution must have low space complexity. For example, storing the observation (which is a few floats) is okay, but storing the mujoco sim state can be very expensive.

Motivation :

I care about obtaining the image observation from a past state. Since storing images on the go is not feasible for my setup, I wanted to explore solutions where I can store less data intensive information like the observation etc. that is enough to exactly restore the environment state to get the image (or if there are other faster methods to get the image back).

Thank you!

famishedrover · 2023-08-11T19:14:40Z

One other solution that works is to store the sequence of actions taken after calling reset() upto the state I wish to save. Then replaying the same action sequence gets me the correct image representation.

When I try the following :

from metaworld.envs import (ALL_V2_ENVIRONMENTS_GOAL_OBSERVABLE,
                            ALL_V2_ENVIRONMENTS_GOAL_HIDDEN)
                            # these are ordered dicts where the key : value
                            # is env_name : env_constructor

import numpy as np

door_open_goal_observable_cls = ALL_V2_ENVIRONMENTS_GOAL_OBSERVABLE["door-open-v2-goal-observable"]
door_open_goal_hidden_cls = ALL_V2_ENVIRONMENTS_GOAL_HIDDEN["door-open-v2-goal-hidden"]

env1 = door_open_goal_observable_cls(seed=5)
env2 = door_open_goal_observable_cls(seed=5)

env1.reset()
env2.reset()
env1.render_mode = 'rgb_array'
env2.render_mode = 'rgb_array'

acs = []
for ix in range(10):
    acs.append(env1.action_space.sample())
    res = env1.step(acs[-1])

obs = env1.render()   
state = env1.get_env_state() 

for ix in acs : 
    res2 = env2.step(ix)

obs2 = env2.render()

env2.reset()
env2.set_env_state(state)
obs3 = env2.render()

print ((obs == obs2).all())
print ((obs == obs3).all())
print ((obs2 == obs3).all())

I get

True 
False 
False

reginald-mclean · 2023-08-11T23:20:30Z

What version of Meta-World are you using?

famishedrover · 2023-08-14T15:52:40Z

This is the commit I used to install (should be the latest one as I did it 4 days back.)

metaworld @ git+https://github.com/Farama-Foundation/Metaworld.git@d155d0051630bb365ea6a824e02c66c068947439

I have had similar issues with v1 metaworld.

reginald-mclean · 2023-08-14T19:41:43Z

It looks like you are using the Mujoco based (not mujoco-py) Meta-World where get_state and set_state don't function properly because of the change in bindings. As of right now the easiest thing to do would probably be something similar to the code you posted above: seed the environment with a specific seed, store actions, recreate the environment with that seed and apply the stored actions

famishedrover · 2023-08-14T20:20:21Z

I can switch to the mujoco-py based one if it works there & you can share to how init envs through that.

By tinkering some code I could save the mujoco internal state ( env.unwrapped.wrapped_env.__getstate__(), __setstate__() ) & then recover the state but its very large in size ( at par with images so not usable for my usecase )

reginald-mclean · 2023-08-14T20:26:07Z

If you go back to the most recent commit before the bindings were changed, or use the v2.0.0 release zip, the code you posted above will work. There's no API changes

famishedrover · 2023-08-14T20:55:27Z

Ok version 2.0.0 ftw!
I had to change the render line to obs = env1.render('rgb_array') but as you said this works!

Do you have a timeline on when the get_state() etc for the most recent version can be corrected? I would be willing to pitch in if you want!

Quick solution for other readers :
I am using :
pip install git+https://github.com/Farama-Foundation/Metaworld.git@b2a4cbb98e20081412cb4cc7ae3d4afc456a732a
and fixing some mujoco version issue with this solution.

reginald-mclean · 2023-08-14T21:03:07Z

If you want to try and tackle it, create a PR for it when it's complete. We also have an issue with using EZPickle #426 that could be related. Just don't have time to look into it.

famishedrover · 2023-08-24T22:50:29Z

@reginald-mclean it seems that there is some bug either in mujoco bin or metaworld. The above works for me fine on macos (intel) but fails on Ubuntu (20). I can confirm that I'm running the same python version & the same metaworld version in both the cases, but when running on Ubuntu I keep getting all False.

Do you have any idea why?

Additional :
when I reset & obtain rgb_array the image obs is different for two env objects init with same seed.
two subsequent calls to env.render gives different image for the same env object. [This problem exists for mujoco210 i.e. mujoco_py 2.1 and mujoco200 i.e. mujoco_py2.0]

famishedrover · 2023-08-25T02:17:31Z

While a proper fix comes along :
the issue happens because glfw is not being used in headless opengl. The fix is

start a fake screen using

export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so 
xvfb-run -a -s "-screen 0 1400x900x24" bash

Get image observation using

       a = mujoco_py.MjRenderContextOffscreen(self.sim, 0)
       a.render(*resolution)
       x = a.read_pixels(*resolution, depth=False)
       return x
      ```

reginald-mclean · 2023-08-25T02:31:15Z

If it fixes the bug, it's not actually a bug. I think what happens is that your mac has a frame buffer that it can use to render, the headless Ubuntu you're using doesn't. The "xvfb-run -a" command does exactly that, creates a virtual frame buffer you can use. You can also use xvfb-run -a for running Python scripts by adding it to your command (ie xvfb-run -a python myFile.py)

famishedrover · 2023-08-26T01:02:41Z

The key issue is that sim.render() method in mujocopy gives two different images even when run consecutively for the same sim state. This is unexpected afaik. This is resolved when I instead use the mjviewer & grab the image from it ( using xvfb ) instead of rendering in offscreen mode as is the default case in mujocopy.

reginald-mclean closed this as completed Aug 14, 2023

This was referenced Aug 28, 2023

I can use env.render() with V100-32G GPU openai/mujoco-py#724

Open

[Bug Report] mujoco rendering is not deterministic Farama-Foundation/Gymnasium#690

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

On resetting a state to a given state / Extracting image observation from past state? #439

On resetting a state to a given state / Extracting image observation from past state? #439

famishedrover commented Aug 10, 2023

famishedrover commented Aug 11, 2023

reginald-mclean commented Aug 11, 2023

famishedrover commented Aug 14, 2023

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 14, 2023

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 14, 2023 •

edited

Loading

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 24, 2023 •

edited

Loading

famishedrover commented Aug 25, 2023

reginald-mclean commented Aug 25, 2023 •

edited

Loading

famishedrover commented Aug 26, 2023

On resetting a state to a given state / Extracting image observation from past state? #439

On resetting a state to a given state / Extracting image observation from past state? #439

Comments

famishedrover commented Aug 10, 2023

famishedrover commented Aug 11, 2023

reginald-mclean commented Aug 11, 2023

famishedrover commented Aug 14, 2023

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 14, 2023

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 14, 2023 • edited Loading

reginald-mclean commented Aug 14, 2023

famishedrover commented Aug 24, 2023 • edited Loading

famishedrover commented Aug 25, 2023

reginald-mclean commented Aug 25, 2023 • edited Loading

famishedrover commented Aug 26, 2023

famishedrover commented Aug 14, 2023 •

edited

Loading

famishedrover commented Aug 24, 2023 •

edited

Loading

reginald-mclean commented Aug 25, 2023 •

edited

Loading