Number of entries: 4.
2024
Karvonen, Adam ; Wright, Benjamin ; Rager, Can ; Angell, Rico ; Brinkmann, Jannik ; Smith, Logan Riggs ; Verdun, Claudio Mayrink ; Bau, David ; Marks, Samuel
Measuring progress in dictionary learning for language model interpretability with board game models.
1-17
In: ICML 2024 Workshop on Mechanistic Interpretability
(2024)
ICML 2024 Workshop on Mechanistic Interpretability
(Vienna, Austria)
[Conference publication]
2023
Brinkmann, Jannik ; Swoboda, Paul ; Bartelt, Christian
A multidimensional analysis of social biases in vision transformers.
4914-4923
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)
(2023)
Paris, France
2023 IEEE/CVF International Conference on Computer Vision, ICCV
(Paris, France)
[Conference publication]
Ernst, Jasmina S. ; Marton, Sascha ORCID: 0000-0001-8151-9223 ; Brinkmann, Jannik ; Vellasques, Eduardo ; Foucard, Damien ; Kraemer, Martin ; Lambert, Marian
Bias mitigation for large language models using adversarial learning.
Calegari, Roberta ; Tubella, Andrea Aler ; González Castañe, Gabriel ; Dignum, Virginia ; Milano, Michaela
CEUR Workshop Proceedings
3523
1-14
In: Proceedings of the 1st Workshop on Fairness and Bias in AI co-located with 26th European Conference on Artificial Intelligence (ECAI 2023), Kraków, Poland, October 1st, 2023
(2023)
Aachen, Germany
1st Workshop on Fairness and Bias in AI
(Kraków, Poland)
[Conference publication]
This list was generated automatically on Thu Nov 21 01:38:19 2024 CET.