Simple demo notebook
Exploring NeuroGym tasks¶
NeuroGym is a comprehensive toolkit for training network models on many established neuroscience tasks using reinforcement learning techniques. It includes working-memory tasks, value-based decision tasks, and context-dependent perceptual categorization tasks.
In this notebook we first show how to install the relevant toolbox.
We then show how to access the available tasks and their relevant information.
Finally, we train a small feedforward network on the Go-Nogo task using the A2C algorithm (Mnih et al., 2016) implemented in the stable-baselines3 toolbox, and plot the results.
You can easily change the code to train a network on any other available task, or with a different algorithm (e.g. PPO); a sketch of such a swap appears after the training section below.
Installation¶
Google Colab: Uncomment and execute the cell below when running this notebook on Google Colab.
Local: Follow these instructions when running this notebook locally.
# ! pip install neurogym[rl]
Explore tasks¶
import warnings
warnings.filterwarnings('ignore')
import gymnasium as gym
import neurogym as ngym
from neurogym.utils import info, plotting
info.all_tasks()
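Each task can also be inspected individually. The call below is a sketch that assumes the info.info helper (the counterpart of the info.info_wrapper call used further down); it prints the task's description, timing, and, optionally, its source code. If the helper name differs in your NeuroGym version, check the documentation.
# Sketch: inspect a single task in detail (assumes the `info.info` helper;
# the exact name may differ between NeuroGym versions).
info.info('GoNogo-v0', show_code=True)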
Visualize a single task¶
task = 'GoNogo-v0'
env = gym.make(task)
print(env)
fig = plotting.plot_env(
    env,
    num_steps=100,
    # def_act=0,
    ob_traces=['Fixation cue', 'NoGo', 'Go'],
    # fig_kwargs={'figsize': (12, 12)}
)
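Beyond the plotting helper, you can interact with the task directly through the standard Gymnasium interface that NeuroGym environments expose. The loop below is a minimal sketch: it resets the environment, takes a few random actions, and prints the reward at each step (the variable names are illustrative).
# Minimal sketch: step through the task with random actions using the
# standard Gymnasium reset/step interface.
obs, _ = env.reset()
print('Observation space:', env.observation_space)
print('Action space:', env.action_space)
for t in range(5):
    action = env.action_space.sample()  # random action
    obs, reward, terminated, truncated, step_info = env.step(action)
    print(f'step {t}: action={action}, reward={reward}')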
Explore wrappers¶
info.all_wrappers()
info.info_wrapper('TrialHistoryV2-v0', show_code=True)
Train a network¶
Here, we train a simple neural network on the task at hand. We use a configuration file to load the parameters for the monitor. You can refer to the documentation for more information about how to use the configuration system.
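The exact schema of config.toml is defined by NeuroGym's configuration system, so consult the documentation for the authoritative list of keys. Purely as a hypothetical illustration, the cell below writes a minimal file whose single entry mirrors the env.config.agent.training.value attribute read by the training code further down; the key names and the timestep count are assumptions, not the documented schema.
# Hypothetical sketch only: write a minimal config file for the monitor.
# The [agent.training] key and its value are inferred from how the config
# is read below (env.config.agent.training.value); check the NeuroGym
# documentation for the actual schema.
config_text = """
[agent.training]
value = 100000
"""
with open("config.toml", "w") as f:
    f.write(config_text)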
import warnings
warnings.filterwarnings('ignore')
import numpy as np
from neurogym.wrappers import monitor, TrialHistoryV2
from stable_baselines3.common.vec_env import DummyVecEnv
from stable_baselines3 import A2C  # or e.g. PPO
# task parameters
timing = {'fixation': ('constant', 300),
          'stimulus': ('constant', 700),
          'decision': ('constant', 300)}
kwargs = {'dt': 100, 'timing': timing}
# wrapper parameters
n_ch = 2
p = 0.8
num_blocks = 2
probs = np.array([[p, 1-p], [1-p, p]]) # repeating block
# Build the task
env = gym.make(task, **kwargs)
# Apply the wrapper.
env = TrialHistoryV2(env, probs=probs)
env = monitor.Monitor(env, config="config.toml")
# The environment is automatically wrapped in a vectorized env when passed to the constructor
model = A2C("MlpPolicy", env, verbose=1, policy_kwargs={'net_arch': [64, 64]})
model.learn(total_timesteps=env.config.agent.training.value)
env.close()
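As noted in the introduction, the same setup can be trained with a different stable-baselines3 algorithm. The cell below is a minimal sketch of swapping A2C for PPO on a freshly built environment; the variable names and the total_timesteps value are illustrative.
# Sketch: train the same task with PPO instead of A2C.
from stable_baselines3 import PPO

env_ppo = gym.make(task, **kwargs)
env_ppo = TrialHistoryV2(env_ppo, probs=probs)
model_ppo = PPO("MlpPolicy", env_ppo, verbose=0, policy_kwargs={'net_arch': [64, 64]})
model_ppo.learn(total_timesteps=20_000)  # illustrative training budget
env_ppo.close()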
Visualize the results¶
import numpy as np
import matplotlib.pyplot as plt
# Create task
env = gym.make(task, **kwargs)
# Apply the wrapper
env = TrialHistoryV2(env, probs=probs)
env = DummyVecEnv([lambda: env])
fig = plotting.plot_env(
    env,
    num_steps=100,
    # def_act=0,
    ob_traces=['Fixation cue', 'NoGo', 'Go'],
    # fig_kwargs={'figsize': (12, 12)},
    model=model,
)
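To quantify performance rather than only visualize it, you can roll the trained model out on the vectorized environment and accumulate the reward it collects. The cell below is a minimal sketch using stable-baselines3's predict/step interface on the DummyVecEnv created above; the number of steps is arbitrary.
# Sketch: accumulate reward obtained by the trained model over a fixed
# number of steps on the vectorized environment (it auto-resets between trials).
obs = env.reset()
total_reward = 0.0
n_steps = 1000
for _ in range(n_steps):
    action, _states = model.predict(obs, deterministic=True)
    obs, rewards, dones, infos = env.step(action)
    total_reward += rewards[0]  # single environment inside the DummyVecEnv
print(f'Average reward per step over {n_steps} steps: {total_reward / n_steps:.3f}')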