OpenReview.net
Pedro Pinto Santos
Joined February 2022
Names
Pedro Pinto Santos (Preferred), Pedro P. Santos
Emails
****@tecnico.ulisboa.pt (Confirmed)
Personal Links
Google Scholar
Career & Education History
PhD student, Instituto Superior Técnico (tecnico.ulisboa.pt), 2022–2025
Researcher, Instituto de Engenharia de Sistemas e Computadores - Investigação e Desenvolvimento (inesc-id.pt), 2021–2025
MS student, Instituto Superior Técnico (tecnico.ulisboa.pt), 2018–2021
Undergrad student, Instituto Superior Técnico (tecnico.ulisboa.pt), 2015–2018
Advisors, Relations & Conflicts
Coauthor: Diogo S. Carvalho (Present)
Coauthor: Miguel Vasco (Present)
PhD Advisor: Francisco S. Melo (Present)
PhD Advisor: Alberto Sardinha (Present)
Expertise
Reinforcement learning (Present)
Multi-agent systems (Present)
Planning (Present)
Sequential decision making (Present)
Publications
Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning
Pedro Pinto Santos, Alberto Sardinha, Francisco S. Melo
ICLR 2026 Poster
Readers: Everyone
The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes
Pedro Pinto Santos, Alberto Sardinha, Francisco S. Melo
EWRL 2025 Poster
Readers: Everyone
The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes
Pedro Pinto Santos, Alberto Sardinha, Francisco S. Melo
ICML 2025 Spotlight Poster
Readers: Everyone
The impact of data distribution on Q-learning with function approximation
Pedro P. Santos, Diogo S. Carvalho, Alberto Sardinha, Francisco S. Melo
Mach. Learn. 2024
Readers: Everyone
Generalizing Objective-Specification in Markov Decision Processes
Pedro P. Santos
AAMAS 2024
Readers: Everyone
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Pedro P. Santos, Diogo S. Carvalho, Miguel Vasco, Alberto Sardinha, Pedro A. Santos, Ana Paiva, Francisco S. Melo
AAMAS 2024
Readers: Everyone
The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes
Pedro P. Santos, Alberto Sardinha, Francisco S. Melo
CoRR 2024
Readers: Everyone
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Pedro Pinto Santos, Diogo S. Carvalho, Miguel Vasco, Pedro A. Santos, Ana Paiva, Alberto Sardinha, Francisco S. Melo
02 Jan 2023
OpenReview Archive Direct Upload
Readers: Everyone
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Pedro Pinto Santos, Diogo S. Carvalho, Miguel Vasco, Francisco S. Melo, Alberto Sardinha, Pedro A. Santos, Ana Paiva
Published: 01 Feb 2023, Last Modified: 14 Jan 2026
Submitted to ICLR 2023
Readers: Everyone
Co-Authors
Alberto Sardinha
Ana Paiva
Diogo S. Carvalho
Francisco S. Melo
Miguel Vasco
Pedro A. Santos