ML-AIM Machine Learning and Artificial Intelligence for Medicine

Research Laboratory led by Prof. Mihaela van der Schaar

    Reinforcement Learning


  1. O. Atan, W. R. Zame, M. van der Schaar, "Sequential Patient Recruitment and Allocation for Adaptive Clinical Trials," International Conference on Artificial Intelligence and Statistics (AISTATS), 2019. [Link]
  2. O. Atan, W. R. Zame, M. van der Schaar, "Counterfactual Policy Optimization Using Domain-Adversarial Neural Networks," ICML 2018 Causal Machine Learning Workshop, 2018. [Link]
  3. O. Atan, J. Jordon, M. van der Schaar, "Deep-Treat: Learning Optimal Personalized Treatments from Observational Data using Neural Networks," AAAI, 2018. [Link]
  4. O. Atan, C. Tekin, M. van der Schaar, "Global Bandits," IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2018.
  5. C. Shen, C. Tekin, M. van der Schaar, "Generalized Global Bandit and Its Application in Cellular Coverage Optimization," IEEE Journal of Selected Topics in Signal Processing, 2018.
  6. O. Atan, W. R. Zame, Q. Feng, M. van der Schaar, "Constructing Effective Personalized Policies Using Counterfactual Inference from Biased Data Sets with Many Features," Submitted, 2017. [Link]
  7. R. Hellman, C. Tekin, M. van der Schaar, V. Santos, "Functional Contour-following via Haptic Perception and Reinforcement Learning," IEEE Transactions on Haptics, 2017.
  8. K. Kanoun, C. Tekin, D. Atienza, and M. van der Schaar, "Big-Data Streaming Applications Scheduling Based on Staged Multi-armed Bandits," IEEE Transactions on Computers, 2016. [Link] [Supplementary material]
  9. S. Amuru, C. Tekin, M. van der Schaar and M. Buehrer, "Jamming Bandits - A Novel Learning Method for Optimal Jamming," IEEE Transactions on Wireless Communications, vol. 15, no. 4, pp. 2792-2808, Apr. 2016. [Link]
  10. O. Atan, C. Tekin, J. Xu and M. van der Schaar, "Discovering Action-Dependent Relevance: Learning from Logged Data," Submitted, 2015. [Link]
  11. C. Tekin, O. Atan and M. van der Schaar, "Discover the Expert: Context-Adaptive Expert Selection for Medical Diagnosis," IEEE Transactions on Emerging Topics in Computing, vol. 3, no. 2, pp. 220-234, 2015. [Link]
  12. C. Tekin and M. van der Schaar, "Active Learning in Context-Driven Stream Mining with an Application to Image Mining," IEEE Trans. Image Process., vol. 24, no. 11, pp. 3666-3679, 2015. [Link]
  13. O. Atan and M. van der Schaar, "Discover Relevant Sources: A Multi-Armed Bandit Approach," Submitted, 2015. [Link]
  14. M. Wolf, M. van der Schaar, H. Kim and J. Xu, "Analysis and Decision-Making in Caring Environments for Adults with Special Needs," IEEE Design & Test, Special Issue on Cyber-Physical systems for Medical Applications, vol. 32, no. 5, Oct. 2015. [Link]
  15. C. Tekin and M. van der Schaar, "Distributed Online Learning via Cooperative Contextual Bandits," IEEE Trans. Signal Process., vol. 63, no. 14, pp. 3700-3714, 2015. [Link]
  16. O. Atan, C. Tekin, M. van der Schaar, "Global Multi-armed Bandits with Hölder Continuity," AISTATS, 2015. [Link]
  17. V. Di Valerio, C. Petrioli, L. Pescosolido, M. van der Schaar, "A Reinforcement Learning-based Data-Link Protocol for Underwater Acoustic Communications," ACM International Conference on Underwater Networks & Systems 2015 (WUWNet'15). [Link]
  18. B.-G. Kim, Y. Zhang, M. van der Schaar, and J.-W. Lee, "Dynamic Pricing and Energy Consumption Scheduling with Reinforcement Learning," IEEE Transactions on Smart Grid, 2015. [Link]
  19. O. Atan, Y. Andreopoulos, C. Tekin, and M. van der Schaar, "Bandit Framework For Systematic Learning In Wireless Video-Based Face Recognition," IEEE J. Sel. Topics Signal Process., vol. 9, no. 1, June 2014. [Link]
  20. L. Song, C. Tekin, and M. van der Schaar, "Clustering Based Online Learning in Recommender Systems: A Bandit Approach," ICASSP 2014. [Link]
  21. O. Atan, Y. Andreopoulos, C. Tekin, and M. van der Schaar, "Bandit Framework for Systematic Learning in Wireless Video-Based Face Recognition," ICASSP 2014. [Link]
  22. B. Kim, Y. Zhang, M. van der Schaar, and J. Lee, "Dynamic Pricing for Smart Grid with Reinforcement Learning," 2014 IEEE INFOCOM Workshop on Communications and Control for Smart Energy Systems. [Link]
  23. X. Zhu, C. Lan and M. van der Schaar, "Low-complexity reinforcement learning for delay-sensitive compression in networked video stream mining," in Proc. IEEE ICME, San Jose, USA, July 2013. [Link]
  24. N. Mastronarde, K. Kanoun, D. Atienza, and M. van der Schaar, "Markov Decision Process Based Energy-efficient Scheduling for Slice-parallel Video Decoding," in Proc. ICME 2013, San Jose, USA, July 2013. [Link]
  25. W. Zame, J. Xu and M. van der Schaar, "Winning the Lottery: Learning Perfect Coordination with Minimal Feedback," in IEEE J. Sel. Topics in Signal Process., vol. 7, no. 5, pp. 846-857, Oct. 2013. [Link]
  26. H.P. Shiang and M. van der Schaar, "Conjecture-Based Load Balancing for Delay-Sensitive Users Without Message Exchanges," in IEEE Trans. on Vehicular Technology, vol. 62, no. 8, pp. 3983-3995, Oct. 2013. [Link]
  27. Y. Zhang and M. van der Schaar, "Robust Reputation Protocol Design for Online Communities: A Stochastic Stability Analysis," in IEEE J. of Sel. Topics in Signal Process., vol. 7, no. 5, pp. 907-920, Oct. 2013. [Link]
  28. F. Fu and M. van der Schaar, "Structural Solutions for Dynamic Scheduling in Wireless Multimedia Transmission," IEEE Transactions Circuits Systems for Video Tech., vol. 22, no. 5, pp. 727-739, May 2012. [Link]
  29. R. Izhak-Ratzin, H. Park, and M. van der Schaar, "Online Learning in BitTorrent Systems," IEEE Trans. on Parallel and Distributed Systems, vol. 23, no. 12, pp. 2280-2288, Mar. 2012. [Link]
  30. H. Park and M. van der Schaar, "Evolution of Resource Reciprocation Strategies in P2P Networks," IEEE Trans. Signal Process., vol. 58, no. 3, pp. 1205-1218, Mar. 2010. [Link]
  31. N. Mastronarde, K. Kanoun, D. Atienza, P. Frossard, and M. van der Schaar, "Markov Decision Process Based Energy-Efficient On-Line Scheduling for Slice-Parallel Video Decoding on Multicore Systems," IEEE Trans. on Multimedia, vol. 15, no. 2, pp. 268-278, Feb. 2013. [Link]
  32. N. Mastronarde and M. van der Schaar, "Reinforcement learning for power management in wireless multimedia communications," IEEE International Conference on Multimedia & Expo (ICME), July 11-15, 2011. [Link] (Also featured in the IEEE COMSOC MMTC R-Letter, Dec. 2011. [Link])
  33. N. Mastronarde and M. van der Schaar, "Fast reinforcement learning for energy-efficient wireless communication," IEEE Trans. on Signal Processing, vol. 59, no. 12, pp. 6262-6266, Dec. 2011. [Link]
  34. N. Mastronarde and M. van der Schaar, "Reinforcement learning for energy-efficient wireless transmission," ICASSP 2011. [Link]
  35. R. Izhak-Ratzin, H. Park, and M. van der Schaar, "Reinforcement Learning in BitTorrent Systems," Infocom 2011 (mini conference). [Link]
  36. N. Mastronarde and M. van der Schaar, "Online Reinforcement Learning for Dynamic Multimedia Systems," IEEE Trans. on Image Processing, vol. 19, no. 2, pp. 290-305, Feb. 2010. [Link]
  37. N. Mastronarde and M. van der Schaar, "Online reinforcement learning for multimedia buffer control," ICASSP 2010. [Link]
  38. H. P. Shiang and M. van der Schaar, "Online Learning in Autonomic Multi-Hop Wireless Networks for Transmitting Mission-Critical Applications," IEEE J. Sel. Areas Commun., vol. 28, no. 5, pp. 728-741, June 2010. [Link]
  39. Y. Su and M. van der Schaar, "Dynamic Conjectures in Random Access Networks Using Bio-inspired Learning," IEEE J. Sel. Areas Commun., vol. 28, no. 4, pp. 587-601, May 2010. [Link] [Long version]
  40. Y. Su and M. van der Schaar, "Conjectural Equilibrium in Multi-user Power Control Games," IEEE Trans. Signal Process., vol. 57, no. 9, pp. 3638-3650, Sep. 2009. [Link]
  41. H. P. Shiang and M. van der Schaar, "Conjecture-Based Channel Selection Game for Delay-Sensitive Users in Multi-Channel Wireless Networks," in Proc. IEEE Gamenets '09, 2009. [Link]
  42. Y. Su and M. van der Schaar, "Minimum Required Learning and Impact of Information Feedback Delay for Cognitive Users," IEEE Trans. Veh. Tech., vol. 58, no. 6, pp. 2825-2834, July 2009. [Link]
  43. U. Berthold, F. Fu, M. van der Schaar, and F. K. Jondral, "Detection of Spectral Resources in Cognitive Radios Using Reinforcement Learning," in Proc. IEEE Dyspan 2008, 2008. [Link]