Skip to main content

Showing 1–36 of 36 results for author: Johansson, F D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2510.24385  [pdf, ps, other

    cs.CV

    When are radiology reports useful for training medical image classifiers?

    Authors: Herman Bergström, Zhongqi Yue, Fredrik D. Johansson

    Abstract: Medical images used to train machine learning models are often accompanied by radiology reports containing rich expert annotations. However, relying on these reports as inputs for clinical prediction requires the timely manual work of a trained radiologist. This raises a natural question: when can radiology reports be leveraged during training to improve image-only classification? Prior works are… ▽ More

    Submitted 29 October, 2025; v1 submitted 28 October, 2025; originally announced October 2025.

  2. arXiv:2510.07581  [pdf, ps, other

    cs.LG

    Expanding the Action Space of LLMs to Reason Beyond Language

    Authors: Zhongqi Yue, Weishi Wang, Yundaichuan Zhan, Juncheng Li, Daniel Dahlmeier, Fredrik D. Johansson

    Abstract: Large Language Models (LLMs) are powerful reasoners in natural language, but their actions are typically confined to outputting vocabulary tokens. As a result, interactions with external environments -- such as symbolic operators or simulators -- must be expressed through text in predefined formats, parsed, and routed to external interfaces. This overloads the model's language with both reasoning… ▽ More

    Submitted 17 October, 2025; v1 submitted 8 October, 2025; originally announced October 2025.

  3. arXiv:2508.18774  [pdf, ps, other

    cs.LG stat.ML

    Federated Learning with Heterogeneous and Private Label Sets

    Authors: Adam Breitholtz, Edvin Listo Zec, Fredrik D. Johansson

    Abstract: Although common in real-world applications, heterogeneous client label sets are rarely investigated in federated learning (FL). Furthermore, in the cases they are, clients are assumed to be willing to share their entire label sets with other clients. Federated learning with private label sets, shared only with the central server, adds further constraints on learning algorithms and is, in general,… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

  4. arXiv:2508.05367  [pdf, ps, other

    cs.LG

    Latent Preference Bandits

    Authors: Newton Mwai, Emil Carlsson, Fredrik D. Johansson

    Abstract: Bandit algorithms are guaranteed to solve diverse sequential decision-making problems, provided that a sufficient exploration budget is available. However, learning from scratch is often too costly for personalization tasks where a single individual faces only a small number of decision points. Latent bandits offer substantially reduced exploration times for such problems, given that the joint dis… ▽ More

    Submitted 7 August, 2025; originally announced August 2025.

    Comments: 25 pages, 9 figures

  5. arXiv:2507.17056  [pdf, ps, other

    cs.LG cs.AI

    Pragmatic Policy Development via Interpretable Behavior Cloning

    Authors: Anton Matsson, Yaochen Rao, Heather J. Litman, Fredrik D. Johansson

    Abstract: Offline reinforcement learning (RL) holds great promise for deriving optimal policies from observational data, but challenges related to interpretability and evaluation limit its practical use in safety-critical domains. Interpretability is hindered by the black-box nature of unconstrained RL policies, while evaluation -- typically performed off-policy -- is sensitive to large deviations from the… ▽ More

    Submitted 22 July, 2025; originally announced July 2025.

  6. arXiv:2505.03393  [pdf, other

    cs.LG

    Prediction Models That Learn to Avoid Missing Values

    Authors: Lena Stempfle, Anton Matsson, Newton Mwai, Fredrik D. Johansson

    Abstract: Handling missing values at test time is challenging for machine learning models, especially when aiming for both high accuracy and interpretability. Established approaches often add bias through imputation or excessive model complexity via missingness indicators. Moreover, either method can obscure interpretability, making it harder to understand how the model utilizes the observed variables in pr… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  7. arXiv:2412.07895  [pdf, other

    cs.LG cs.AI stat.AP

    How Should We Represent History in Interpretable Models of Clinical Policies?

    Authors: Anton Matsson, Lena Stempfle, Yaochen Rao, Zachary R. Margolin, Heather J. Litman, Fredrik D. Johansson

    Abstract: Modeling policies for sequential clinical decision-making based on observational data is useful for describing treatment practices, standardizing frequent patterns in treatment, and evaluating alternative policies. For each task, it is essential that the policy model is interpretable. Learning accurate models requires effectively capturing the state of a patient, either through sequence representa… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  8. arXiv:2411.09591  [pdf, other

    cs.LG

    Handling missing values in clinical machine learning: Insights from an expert study

    Authors: Lena Stempfle, Arthur James, Julie Josse, Tobias Gauss, Fredrik D. Johansson

    Abstract: Inherently interpretable machine learning (IML) models offer valuable support for clinical decision-making but face challenges when features contain missing values. Traditional approaches, such as imputation or discarding incomplete records, are often impractical in scenarios where data is missing at test time. We surveyed 55 clinicians from 29 French trauma centers, collecting 20 complete respons… ▽ More

    Submitted 11 February, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 8 pages, 5 figures, restructured writing from previous version and additional results

  9. arXiv:2411.03799  [pdf, ps, other

    cs.LG cs.AI

    Overcoming label shift with target-aware federated learning

    Authors: Edvin Listo Zec, Adam Breitholtz, Fredrik D. Johansson

    Abstract: Federated learning enables multiple actors to collaboratively train models without sharing private data. Existing algorithms are successful and well-justified in this task when the intended target domain, where the trained model will be used, shares data distribution with the aggregate of clients, but this is often violated in practice. A common reason is label shift -- that the label distribution… ▽ More

    Submitted 26 August, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

  10. arXiv:2407.16239  [pdf, ps, other

    cs.LG stat.ML

    Identifiable Latent Bandits: Leveraging observational data for personalized decision-making

    Authors: Ahmet Zahid Balcıoğlu, Newton Mwai, Emil Carlsson, Fredrik D. Johansson

    Abstract: Sequential decision-making algorithms such as multi-armed bandits can find optimal personalized decisions, but are notoriously sample-hungry. In personalized medicine, for example, training a bandit from scratch for every patient is typically infeasible, as the number of trials required is much larger than the number of decision points for a single patient. To combat this, latent bandits offer rap… ▽ More

    Submitted 20 October, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 29 pages, 17 figures

  11. arXiv:2405.16069  [pdf, other

    cs.LG stat.ME

    IncomeSCM: From tabular data set to time-series simulator and causal estimation benchmark

    Authors: Fredrik D. Johansson

    Abstract: Evaluating observational estimators of causal effects demands information that is rarely available: unconfounded interventions and outcomes from the population of interest, created either by randomization or adjustment. As a result, it is customary to fall back on simulators when creating benchmark tasks. Simulators offer great control but are often too simplistic to make challenging tasks, either… ▽ More

    Submitted 28 October, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  12. arXiv:2405.03059  [pdf, other

    cs.LG stat.ML

    Active Preference Learning for Ordering Items In- and Out-of-sample

    Authors: Herman Bergström, Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

    Abstract: Learning an ordering of items based on pairwise comparisons is useful when items are difficult to rate consistently on an absolute scale, for example, when annotators have to make subjective assessments. When exhaustive comparison is infeasible, actively sampling item pairs can reduce the number of annotations necessary for learning an accurate ordering. However, many algorithms ignore shared stru… ▽ More

    Submitted 27 October, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  13. arXiv:2311.14108  [pdf, other

    cs.LG

    MINTY: Rule-based Models that Minimize the Need for Imputing Features with Missing Values

    Authors: Lena Stempfle, Fredrik D. Johansson

    Abstract: Rule models are often preferred in prediction tasks with tabular inputs as they can be easily interpreted using natural language and provide predictive performance on par with more complex models. However, most rule models' predictions are undefined or ambiguous when some inputs are missing, forcing users to rely on statistical imputation models or heuristics like zero imputation, undermining the… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  14. arXiv:2306.12774  [pdf, other

    cs.LG

    Pure Exploration in Bandits with Linear Constraints

    Authors: Emil Carlsson, Debabrota Basu, Fredrik D. Johansson, Devdatt Dubhashi

    Abstract: We address the problem of identifying the optimal policy with a fixed confidence level in a multi-armed bandit setup, when \emph{the arms are subject to linear constraints}. Unlike the standard best-arm identification problem which is well studied, the optimal policy in this case may not be deterministic and could mix between several arms. This changes the geometry of the problem which we characte… ▽ More

    Submitted 25 January, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Accepted to AISTATS 2024

  15. arXiv:2303.09350  [pdf, other

    cs.LG stat.ML

    Unsupervised domain adaptation by learning using privileged information

    Authors: Adam Breitholtz, Anton Matsson, Fredrik D. Johansson

    Abstract: Successful unsupervised domain adaptation is guaranteed only under strong assumptions such as covariate shift and overlap between input domains. The latter is often violated in high-dimensional applications like image classification which, despite this limitation, continues to serve as inspiration and benchmark for algorithm development. In this work, we show that training-time access to side info… ▽ More

    Submitted 12 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

  16. arXiv:2303.08720  [pdf, other

    cs.LG stat.ML

    Practicality of generalization guarantees for unsupervised domain adaptation with neural networks

    Authors: Adam Breitholtz, Fredrik D. Johansson

    Abstract: Understanding generalization is crucial to confidently engineer and deploy machine learning models, especially when deployment implies a shift in the data domain. For such domain adaptation problems, we seek generalization bounds which are tractably computable and tight. If these desiderata can be reached, the bounds can serve as guarantees for adequate performance in deployment. However, in appli… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  17. arXiv:2301.08649  [pdf, other

    stat.ML cs.LG

    Off-Policy Evaluation with Out-of-Sample Guarantees

    Authors: Sofia Ek, Dave Zachariah, Fredrik D. Johansson, Petre Stoica

    Abstract: We consider the problem of evaluating the performance of a decision policy using past observational data. The outcome of a policy is measured in terms of a loss (aka. disutility or negative reward) and the main problem is making valid inferences about its out-of-sample loss when the past data was observed under a different and possibly unknown policy. Using a sample-splitting method, we show that… ▽ More

    Submitted 30 June, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  18. arXiv:2209.07067  [pdf, other

    cs.LG stat.ML

    Efficient learning of nonlinear prediction models with time-series privileged information

    Authors: Bastian Jung, Fredrik D Johansson

    Abstract: In domains where sample sizes are limited, efficient learning algorithms are critical. Learning using privileged information (LuPI) offers increased sample efficiency by allowing prediction models access to auxiliary information at training time which is unavailable when the models are used. In recent work, it was shown that for prediction in linear-Gaussian dynamical systems, a LuPI learner with… ▽ More

    Submitted 20 November, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  19. Sharing pattern submodels for prediction with missing values

    Authors: Lena Stempfle, Ashkan Panahi, Fredrik D. Johansson

    Abstract: Missing values are unavoidable in many applications of machine learning and present challenges both during training and at test time. When variables are missing in recurring patterns, fitting separate pattern submodels have been proposed as a solution. However, fitting models independently does not make efficient use of all available data. Conversely, fitting a single shared model to the full data… ▽ More

    Submitted 24 November, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Journal ref: AAAI Conference on Artificial Intelligence. 37, 8 (Jun. 2023), 9882-9890

  20. arXiv:2111.11113  [pdf, other

    cs.LG

    Case-based off-policy policy evaluation using prototype learning

    Authors: Anton Matsson, Fredrik D. Johansson

    Abstract: Importance sampling (IS) is often used to perform off-policy policy evaluation but is prone to several issues, especially when the behavior policy is unknown and must be estimated from data. Significant differences between the target and behavior policies can result in uncertain value estimates due to, for example, high variance and non-evaluated actions. If the behavior policy is estimated using… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  21. arXiv:2111.06811  [pdf, other

    cs.LG stat.ML

    ADCB: An Alzheimer's disease benchmark for evaluating observational estimators of causal effects

    Authors: Newton Mwai Kinyanjui, Fredrik D. Johansson

    Abstract: Simulators make unique benchmarks for causal effect estimation since they do not rely on unverifiable assumptions or the ability to intervene on real-world systems, but are often too simple to capture important aspects of real applications. We propose a simulator of Alzheimer's disease aimed at modeling intricacies of healthcare data while enabling benchmarking of causal effect and policy estimato… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  22. arXiv:2110.14993  [pdf, other

    cs.LG stat.ML

    Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

    Authors: Rickard K. A. Karlsson, Martin Willbo, Zeshan Hussain, Rahul G. Krishnan, David Sontag, Fredrik D. Johansson

    Abstract: We study prediction of future outcomes with supervised models that use privileged information during learning. The privileged information comprises samples of time series observed between the baseline time of prediction and the future outcome; this information is only available at training time which differs from the traditional supervised learning. Our question is when using this privileged data… ▽ More

    Submitted 5 May, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:5459-5484, 2022

  23. arXiv:2109.01656  [pdf, other

    cs.LG

    Thompson Sampling for Bandits with Clustered Arms

    Authors: Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

    Abstract: We propose algorithms based on a multi-level Thompson sampling scheme, for the stochastic multi-armed bandit and its contextual variant with linear expected rewards, in the setting where arms are clustered. We show, both theoretically and empirically, how exploiting a given cluster structure can significantly improve the regret and computational cost compared to using standard Thompson sampling. I… ▽ More

    Submitted 15 June, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: IJCAI-2021. The supplementary material is not part of the IJCAI-21 Proceedings

  24. arXiv:2105.13857  [pdf, other

    cs.CL cs.AI

    Learning Approximate and Exact Numeral Systems via Reinforcement Learning

    Authors: Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson

    Abstract: Recent work (Xu et al., 2020) has suggested that numeral systems in different languages are shaped by a functional need for efficient communication in an information-theoretic sense. Here we take a learning-theoretic approach and show how efficient communication emerges via reinforcement learning. In our framework, two artificial agents play a Lewis signaling game where the goal is to convey a num… ▽ More

    Submitted 30 April, 2024; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: CogSci 2021. Fixed typos

    Journal ref: Proceedings of the Annual Meeting of the Cognitive Science Society, Volume 43 (2021)

  25. arXiv:2007.00973  [pdf, other

    cs.LG stat.ML

    Learning to search efficiently for causally near-optimal treatments

    Authors: Samuel Håkansson, Viktor Lindblom, Omer Gottesman, Fredrik D. Johansson

    Abstract: Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as learning a policy for finding a near-optimal treatment in a minimum number of trials using a causal inference framework. We give a model-based dynamic programmin… ▽ More

    Submitted 17 February, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  26. arXiv:2001.07426  [pdf, other

    cs.LG stat.ML

    Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects

    Authors: Fredrik D. Johansson, Uri Shalit, Nathan Kallus, David Sontag

    Abstract: Practitioners in diverse fields such as healthcare, economics and education are eager to apply machine learning to improve decision making. The cost and impracticality of performing experiments and a recent monumental increase in electronic record keeping has brought attention to the problem of evaluating decisions based on non-experimental observational data. This is the setting of this work. In… ▽ More

    Submitted 31 July, 2023; v1 submitted 21 January, 2020; originally announced January 2020.

  27. arXiv:1910.04817  [pdf, other

    cs.LG stat.ML

    Estimation of Bounds on Potential Outcomes For Decision Making

    Authors: Maggie Makar, Fredrik D. Johansson, John Guttag, David Sontag

    Abstract: Estimation of individual treatment effects is commonly used as the basis for contextual decision making in fields such as healthcare, education, and economics. However, it is often sufficient for the decision maker to have estimates of upper and lower bounds on the potential outcomes of decision alternatives to assess risks and benefits. We show that, in such cases, we can improve sample efficienc… ▽ More

    Submitted 12 August, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Journal ref: ICML 2020

  28. arXiv:1907.04138  [pdf, other

    cs.LG stat.ML

    Characterization of Overlap in Observational Studies

    Authors: Michael Oberst, Fredrik D. Johansson, Dennis Wei, Tian Gao, Gabriel Brat, David Sontag, Kush R. Varshney

    Abstract: Overlap between treatment groups is required for non-parametric estimation of causal effects. If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions. When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects,… ▽ More

    Submitted 3 June, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: To appear at AISTATS 2020

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:788-798, 2020

  29. A Survey on Graph Kernels

    Authors: Nils M. Kriege, Fredrik D. Johansson, Christopher Morris

    Abstract: Graph kernels have become an established and widely-used technique for solving classification tasks on graphs. This survey gives a comprehensive overview of techniques for kernel-based graph classification developed in the past 15 years. We describe and categorize graph kernels based on properties inherent to their design, such as the nature of their extracted graph features, their method of compu… ▽ More

    Submitted 4 February, 2020; v1 submitted 28 March, 2019; originally announced March 2019.

    Journal ref: Applied Network Science 5 (2020)

  30. arXiv:1903.03448  [pdf, other

    stat.ML cs.LG

    Support and Invertibility in Domain-Invariant Representations

    Authors: Fredrik D. Johansson, David Sontag, Rajesh Ranganath

    Abstract: Learning domain-invariant representations has become a popular approach to unsupervised domain adaptation and is often justified by invoking a particular suite of theoretical results. We argue that there are two significant flaws in such arguments. First, the results in question hold only for a fixed representation and do not account for information lost in non-invertible transformations. Second,… ▽ More

    Submitted 3 July, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

  31. arXiv:1811.05975  [pdf, other

    stat.AP cs.LG stat.ML

    Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions

    Authors: Fredrik D. Johansson

    Abstract: We study heterogeneity in the effect of a mindset intervention on student-level performance through an observational dataset from the National Study of Learning Mindsets (NSLM). Our analysis uses machine learning (ML) to address the following associated problems: assessing treatment group overlap and covariate balance, imputing conditional average treatment effects, and interpreting imputed effect… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  32. arXiv:1805.12002  [pdf, other

    stat.ML cs.LG

    Why Is My Classifier Discriminatory?

    Authors: Irene Chen, Fredrik D. Johansson, David Sontag

    Abstract: Recent attempts to achieve fairness in predictive models focus on the balance between fairness and accuracy. In sensitive applications such as healthcare or criminal justice, this trade-off is often undesirable as any increase in prediction error could have devastating consequences. In this work, we argue that the fairness of predictions should be evaluated in context of the data, and that unfairn… ▽ More

    Submitted 10 December, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Appeared in Advances in Neural Information Processing Systems (NeurIPS 2018); 3 figures, 8 pages, 6 page supplementary

    Report number: Advances in Neural Information Processing Systems 31, pages 3543--3554. Dec. 2018

  33. arXiv:1611.03218  [pdf, other

    cs.AI cs.CL cs.LG cs.MA

    Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence

    Authors: Emilio Jorge, Mikael Kågebäck, Fredrik D. Johansson, Emil Gustavsson

    Abstract: Acquiring your first language is an incredible feat and not easily duplicated. Learning to communicate using nothing but a few pictureless books, a corpus, would likely be impossible even for humans. Nevertheless, this is the dominating approach in most natural language processing today. As an alternative, we propose the use of situated interactions between agents as a driving force for communicat… ▽ More

    Submitted 15 March, 2017; v1 submitted 10 November, 2016; originally announced November 2016.

    Comments: Previous version was accepted to Deep Reinforcement Learning Workshop at NIPS 2016

  34. arXiv:1606.03976  [pdf, other

    stat.ML cs.AI cs.LG

    Estimating individual treatment effect: generalization bounds and algorithms

    Authors: Uri Shalit, Fredrik D. Johansson, David Sontag

    Abstract: There is intense interest in applying machine learning to problems of causal inference in fields such as healthcare, economics and education. In particular, individual-level causal inference has important applications such as precision medicine. We give a new theoretical analysis and family of algorithms for predicting individual treatment effect (ITE) from observational data, under the assumption… ▽ More

    Submitted 16 May, 2017; v1 submitted 13 June, 2016; originally announced June 2016.

    Comments: Added name "TARNet" to refer to version with alpha = 0. Removed supp

  35. arXiv:1605.03661  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Representations for Counterfactual Inference

    Authors: Fredrik D. Johansson, Uri Shalit, David Sontag

    Abstract: Observational studies are rising in importance due to the widespread accumulation of data in fields such as healthcare, education, employment and ecology. We consider the task of answering counterfactual questions such as, "Would this patient have lower blood sugar had she received a different medication?". We propose a new algorithmic framework for counterfactual inference which brings together i… ▽ More

    Submitted 6 June, 2018; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: Appeared in ICML 2016

  36. arXiv:1510.06492  [pdf, other

    cs.DS cs.LG

    Generalized Shortest Path Kernel on Graphs

    Authors: Linus Hermansson, Fredrik D. Johansson, Osamu Watanabe

    Abstract: We consider the problem of classifying graphs using graph kernels. We define a new graph kernel, called the generalized shortest path kernel, based on the number and length of shortest paths between nodes. For our example classification problem, we consider the task of classifying random graphs from two well-known families, by the number of clusters they contain. We verify empirically that the gen… ▽ More

    Submitted 22 October, 2015; originally announced October 2015.

    Comments: Short version presented at Discovery Science 2015 in Banff