Number of items: 2. Conference or workshop publication
|
Karvonen, Adam ; Wright, Benjamin ; Rager, Can ; Angell, Rico ; Brinkmann, Jannik ; Smith, Logan Riggs ; Verdun, Claudio Mayrink ; Bau, David ; Marks, Samuel
Measuring progress in dictionary learning for language model interpretability with board game models.
1-17
In: ICML 2024 Workshop on Mechanistic Interpretability
(2024)
ICML 2024 Workshop on Mechanistic Interpretability
(Wien, Austria)
[Conference or workshop publication]
|
|
Conference presentation
|
Karvonen, Adam ; Wright, Benjamin ; Rager, Can ; Angell, Rico ; Brinkmann, Jannik ; Smith, Logan Riggs ; Verdun, Claudio Mayrink ; Bau, David ; Marks, Samuel
Measuring progress in cictionary learning for language model interpretability with board game models.
(2024)
NeurIPS 2024
(Vancouver, Canada)
[Conference presentation]
|
|
This list was created automatically on Fri Jan 23 06:25:04 2026 CET
|