Tools for applying circuits-style interpretability techniques to RL agents.
ulissemini / circrl Goto Github PK
View Code? Open in Web Editor NEWThis project forked from montemac/circrl
Tools for applying circuits-style interpretability techniques to RL agents.
License: MIT License