OptoGPT++

Meet OptoGPT++ — an enhanced implementation of the OptoGPT, a decoder-only transformer that aims to solve inverse design of multi-layer thin film structures.

Key Enhancements

Inclusion of an absorption feature in the model ➕📈
Increased the maximum wave length to 2,000nm 💡
Longer training time for better predictive performance 🤯

Supporting Material

OptoGPT++: https://github.com/jnitzz/OptoLlama
OptoGPT: https://github.com/taigaoma1997/optogpt
ArXiV: 📝 https://arxiv.org/abs/2304.10294

Usage

Install Dependencies

python -m pip install torch
python -m pip install safetensors

Load Model Checkpoint

from safetensors.torch import load_file

model = OptoGPT()

safetensors_path = "optogpt-model.safetensors"
state_dict = load_file(safetensors_path)
model.load_state_dict(state_dict)

Useful Information

Stat	Value
#Parameters	108,381,113
Best validation MAE	0.0408
Epochs trained	1,000
Best epoch.	996
Batch size	256
n_blocks	6
n_heads	8
d_model	1,024
max_seq_length	20

Acknowledgements

This work is supported by the Helmholtz Association Initiative and Networking Fund through the Helmholtz AI platform, and the HAICORE@KIT grant.

Citations

If you find our work helpful, please feel free to cite as following:

@article{ma2024optogpt,
  title={OptoGPT: a foundation model for inverse design in optical multilayer thin film structures},
  author={Ma, Taigao and Wang, Haozhu and Guo, L Jay},
  journal={Opto-Electronic Advances},
  volume={7},
  number={7},
  year={2024},
  publisher={Opto-Electronic Advance},
  doi={10.29026/oea.2024.240062}
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train HZBSolarOptics/OptoGPT

Collection including HZBSolarOptics/OptoGPT

OptoLlama

Collection

All model checkpoints and datasets for OptoLlama • 3 items • Updated Oct 22, 2025

Paper for HZBSolarOptics/OptoGPT

OptoGPT: A Foundation Model for Inverse Design in Optical Multilayer Thin Film Structures

Paper • 2304.10294 • Published Apr 20, 2023