Philosophy Meets Machine Learning: What Counts As Trustworthy?
Workshop at ICML 2026, Seoul, South Korea
11th July, 2026
Workshop at ICML 2026, Seoul, South Korea
11th July, 2026
Philosophers have long thought deeply about many concepts that are used colloquially in the machine learning (ML) community such as epistemology, counterfactuals, explainability, reliability, uncertainty and causality. As ML systems are now embedded in high-stakes decisions across science, industry, and public life, it is urgent that when ML researchers claim properties such as "explainability", "reliability", "intelligence" or "cognition", these claims are made with awareness of what practitioners, policymakers, and affected users mean by those terms. In particular, we argue that the ML community needs to take a step back and review whether the mathematical objectives used in optimisation and evaluation procedures truly take into account how philosophers have analysed them—analyses that explicitly aim to connect notions like explanation, evidence, and uncertainty to human understanding, justification, and use.
Philosophers of science and psychologists are more actively engaged than ever in such questions; however, their interaction with ML researchers remains sparse and fragmented. The goal of the proposed workshop is to facilitate a lively dialogue between the two otherwise largely separate communities, to promote more principled and grounded advances in ML and artificial intelligence.
11th May: Paper submission deadline
31st May: Notification of decision
11th July: Workshop
Been Kim is a director (principal scientist) at Google DeepMind. She is working to ensure that as machines get smarter, humans do too.
Her work empowers humans to maintain agency by extracting and teaching useful knowledge from AI to humans. She gets AI involved in this teaching (agentic interpretability) and this may involve new language (neologism). She proved that this is possible; using AlphaZero knowledge to teach grandmasters, one of whom became the youngest World Chess Champion (PNAS).
Raphaël Millière is an Associate Professor at the University of Oxford and a Fellow of Jesus College, with an affiliation at the Institute for Ethics in AI. He also holds an AI2050 Fellowship from Schmidt Sciences.
Raphaël's research mainly focuses on understanding modern artificial neural networks, such as large language models, through theoretical analysis, behavioural evaluation, and interpretability methods.
Mikhail Belkin is HDSI Endowed Chair Professor in AI at Halicioglu Data Science Institute and Computer Science and Engineering Department at UCSD.
His research interests are broadly in theory and applications of Artificial Intelligence, deep learning and data analysis. One of his key findings has been the "double descent" risk curve that extends the textbook U-shaped bias-variance trade-off curve beyond the point of interpolation. His recent work focusses on understanding feature learning and over-parameterization in deep learning.
Naftali Weinberger is a postdoc at the Munich Center for Mathematical Philosophy, working on general philosophy of science with a focus on causation and causal modelling. His research addresses a wide range of questions concerning the nature and scope of causal explanation, as well as epistemological questions about how to choose among the theories compatible with one’s evidence. He is also interested in the use of causal concepts in particular sciences, and have engaged with fields as diverse as population genetics, psychometrics, cognitive science, and neuroscience.
His current projects are on causally modelling dynamical systems and causal issues related to discrimination.
Alice Huang is an assistant professor and the Duncanson Chair in Ethics and Technology at the University of Western Ontario, jointly appointed in computer science and in philosophy. She is also a faculty affiliate at the Schwartz Reisman Institute for Technology & Society.
Her projects fall into two broad categories. The first connects formal results in artificial intelligence research to ethical issues related to interpretability, collaboration and fairness. The second uses formal and computational models to investigate pressing issues in our social discourse today, such as questions about misinformation, scientific practices and polarization.
Cameron Buckner is a Professor and Donald F. Cronin Endowed Chair in the Humanities at the University of Florida.
In his current work, he focuses on the relationship between learning and meaning, by offering approaches to mental content, cognition, and knowledge representation that take the latest empirical theories of learning as their starting point. While his main focus remains on cognitive science (especially animal cognition and artificial intelligence), these insights also ground solutions to more traditional philosophical problems.
The workshop will take place from 8am to 5pm on Saturday 11th July, 2026 at Coex, Seoul, South Korea.
8:20-8:30 Opening remarks
8:30-9:05 Invited talk: Naftali Weinberger
9:05-9:40 Invited talk: Misha Belkin
9:40-10:00 Coffee break
10:00-10:35 Invited talk: Raphaël Millière
10:35-11:10 Invited talk: Been Kim
11:10-12:00 Get lunch!
12:00-13:10 Poster session (bring your own lunch)
13:10-13:25 Break
13:25-14:00 Invited talk: Cameron Buckner
14:00-14:50 Oral presentations (6 x 7 minutes, followed by 7 minutes of joint Q&A)
1. Phongsakon Mark Konrad, Toygar Tanyel, Serkan Ayvaz
Self-Reports Do Not Identify Self-Models: An Identifiability Test for Counterfactual Reports
2. Kola Ayonrinde, Raphaël Millière
Getting Monosemantic About Monosemanticity
3. Amine M'Charrak, Thong Pham, Thomas Lukasiewicz, Yuxiao Dong, Shohei Shimizu
Before Normative and Moral Alignment: Causal Contract Faithfulness as a Precondition for Trustworthy AI
4. Gilad Landau, Aviv Keren
The Concept of Representation in ML: Beyond Plato and Aristotle
5. Louis Mahon, Elliot Ford, Callum Hackett
A Definition of Good Explanations and the Challenges Explaining LLM Outputs
6. Joseph Keshet
Why Sampling Is Not Choosing: Intentionality, Agency, and Moral Responsibility in Large Language Models
14:50-15:25 Coffee break
15:25-16:00 Invited talk: Alison Huang
16:00-17:00 Panel discussion & Closing
ETH Zürich
ETH Zürich
Stanford
TU Nuremberg, Helmholtz AI
Submissions are now closed. Thank you for your interest in PhilML workshop!
We invite short paper submissions (up to 4 pages, excluding references and appendix) from both philosophers and ML researchers on the following topics:
Epistemology of learning systems: knowledge, belief, evidence, justification, understanding, etc.
Uncertainty: interpretations of probability and credence; confidence, ignorance, ambiguity, etc.
Counterfactual reasoning: when counterfactual questions are well-posed, and what makes counterfactual answers meaningful.
Foundations of causal modelling: in particular, links between causal formalisms used in ML and philosophical accounts of causation.
Explainability and interpretability: explanation vs. prediction; understanding as a cognitive and social achievement; what counts as an explanation for whom, and why.
Reliability, robustness, and generalisation: principled notions of “reliability” beyond accuracy, statistical/philosophical perspectives on “reliable” scientific or societal use.
Submissions should be made by 11th May (anywhere on earth) on openreview.
Format: All submissions must be in PDF format. Submissions are limited to four content pages. Unlimited additional pages are allowed for references and supplementary materials. Reviewers may choose to read the supplementary materials but will not be required to. Camera-ready versions may go up to five content pages.
Style file: You must format your submission using the ICML 2026 LaTeX style file. Please include the references and supplementary materials in the same PDF as the main paper.
Double-blind reviewing: The reviewing process will be double blind. As an author, you are responsible for anonymizing your submission. In particular, you should not include author names, author affiliations, or acknowledgements in your submission and you should avoid providing any other identifying information (even in the supplementary material).
LLM policy: The use of LLMs are permitted only as a writing assistance tool.
Dual-submission policy: We welcome ongoing and unpublished work. We will also accept papers that are under review at the time of submission, or that have been recently accepted for publication at a non-ML venue (i.e., any venue that is not ICML, NeurIPS, ICLR, or a similar conference or journal). Submissions published in venues for related fields (in particular, philosophy) are welcome.
Non-archival: The workshop is a non-archival venue and will not have official proceedings. Workshop submissions can be subsequently or concurrently submitted to other venues.
Visibility: Submissions and reviews will not be public. Only accepted papers will be made public.
Reciprocal reviewing: Authors of submitted works are encouraged to volunteer as reviewers for other submissions, to ensure a fair and high-quality review process.
For questions, please contact philml.icml26@gmail.com.
Florian Dorner (MPI Tübingen)
Liang Wendong (MPI Tübingen)
Cheongwoong Kang (KAIST)
Haksoo Lim (KAIST)
Won Jo (KAIST)
Junho Choi (KAIST)
Annika Schneider (Helmholtz Munich)
Nikos Papanikolaou (MPI Tübingen)
Javier Abad (ETH Zürich)
Sarah Martinson (ETH Zürich)
Hanti Lin (UC Davis)
Raphaël Millière (Oxford)
Moritz Miller (MPI Tübingen)
Sergio Hernan Garrido Mejia (MPI Tübingen)
Seongun Kim (KAIST)