Q-Mamba

Code of our ICML 2025 paper: Meta-Black-Box-Optimization through Offline Q-function Learning.

Preparations

Create and activate conda environment

First, create the q_mamba environment.

conda create --name q_mamba python=3.10
conda activate q_mamba

Then install Pytorch with version 1.12+ and CUDA 11.6+ (see https://pytorch.org/ for more details). The cuda-toolkit is also required conda install nvidia::cuda-toolkit=12.1.

Next, install the mamba-ssm using pip install mamba-ssm (see https://github.com/state-spaces/mamba.git for more details).

Finally, install the other necessary libraries.

pip install -r requirements.txt

Other requirements:

Linux
NVIDIA GPU
PyTorch 1.12+
CUDA 11.6+

Train

To quickly start training, firstly, download the training trajectories from here. The directory could be set like this basic structure:

├── /trajectory_files/
│  ├── trajectory_set_0_Rand.pkl
│  ├── trajectory_set_0_CfgX.pkl  
│  ├── trajectory_set_0_Unit.pkl   
│  ├── trajectory_set_1_Unit.pkl   
│  ├── trajectory_set_2_Unit.pkl

Then we can train the main Q-Mamba agent using:

# train q_mamba with conservative_reg_loss 
python run.py --train --trajectory_file_path './trajectory_files/trajectory_set_0_Unit.pkl' --has_conservative_reg_loss 

# train q_mamba without conservative_reg_loss
python run.py --train --trajectory_file_path './trajectory_files/trajectory_set_0_Unit.pkl'

Taking the training on Alg0 as an example, the models in the training is saved at ./model/trajectory_set_0_Unit/YYMMDDTHHmmSS/ (where YYMMDDTHHmmSS is the time stamp of the run) and the tensorboard log is stored at ./log/trajectory_set_0_Unit/YYMMDDTHHmmSS

Test

To test the trained model on the BBOB testing problems (Section 5.2, Table 1):

# test q_mamba 
python run.py --test --algorithm_id 0 --load_path [MODEL_PATH]

For example, we have a pre-trained Q-Mamba model in ./model/qmamba.pth, which is trained with default settings in the paper on Alg0, its testing command is

# test q_mamba 
python run.py --test --algorithm_id 0 --load_path ./model/qmamba.pth

The reward results are stored in ./log/test/q_mamba/YYMMDDTHHmmSS/test_rewards.pkl where YYMMDDTHHmmSS is the time stamp of the test.

If you find this repository useful, please cite it in your publications or projects as follows.

@inproceedings{
qmamba,
title={Meta-Black-Box-Optimization through Offline Q-function Learning},
author={Zeyuan Ma, Zhiguang Cao, Zhou Jiang, Hongshu Guo, Yue-Jiao Gong},
booktitle={Forty-second International Conference on Machine Learning},
year={2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
components		components
env		env
model		model
src		src
EE_data_ratio.py		EE_data_ratio.py
LICENSE		LICENSE
README.md		README.md
config.py		config.py
dataset.py		dataset.py
network.py		network.py
options.py		options.py
q_mamba.py		q_mamba.py
requirements.txt		requirements.txt
run.py		run.py
task_set_for_mamba.pkl		task_set_for_mamba.pkl
task_set_for_mamba_test.pkl		task_set_for_mamba_test.pkl
temp_test.py		temp_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Q-Mamba

Code of our ICML 2025 paper: Meta-Black-Box-Optimization through Offline Q-function Learning.

Preparations

Create and activate conda environment

Train

Test

If you find this repository useful, please cite it in your publications or projects as follows.

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

MetaEvo/Q-Mamba

Folders and files

Latest commit

History

Repository files navigation

Q-Mamba

Code of our ICML 2025 paper: Meta-Black-Box-Optimization through Offline Q-function Learning.

Preparations

Create and activate conda environment

Train

Test

If you find this repository useful, please cite it in your publications or projects as follows.

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages