{
  "query": "Please introduce the Max Planck Institute.",
  "query-id": 73,
  "gold_response": "Below are key Max Planck CS professors working in AI, Data Mining, IR, and NLP, with their research areas and selected top-tier papers from the last three years (2022–2024). Each paper includes venue/year and a brief TL;DR.\n\n1) Gerhard Weikum (Director, Max Planck Institute for Informatics — Databases & Information Systems)\n- Research areas: Information Retrieval (IR), knowledge graphs, text mining, knowledge-aware NLP/LLMs, fact-checking.\n- Recent top-tier papers:\n  - Search-based Recommendation: The Case for Difficult Predictions (TheWebConf/WWW 2023)\n    - TL;DR: Frames complex recommendation as constrained query planning with retrieval + learning-to-rank re-scoring. Improves long-tail recall and NDCG@10 on Yelp/Amazon over neural baselines for multi-constraint queries.\n  - GREBE: A Graph-based Reasoning Approach for Open-Domain QA over Knowledge Bases (ACL 2023)\n    - TL;DR: Decomposes complex questions into sub-queries, performs multi-hop reasoning over KGs with graph evidence aggregation. Yields higher EM/F1 on ComplexWebQuestions/WebQSP with interpretable evidence paths.\n  - Completing Temporal Knowledge Graphs with Self-Supervised Prototypical Triplets (TheWebConf/WWW 2023)\n    - TL;DR: Learns time-aware prototypical event patterns via contrastive self-supervision to fill missing temporal facts. Achieves SOTA Hits@1/10 on ICEWS14/ICEWS21 by modeling recurrent temporal motifs.\n\n2) Bernhard Schölkopf (Director, Max Planck Institute for Intelligent Systems — Empirical Inference)\n- Research areas: Causal machine learning, out-of-distribution (OOD) generalization, representation learning, kernel methods.\n- Recent top-tier papers:\n  - Causal-SINDy: A Causal Perspective on Data-Driven System Identification (ICLR 2024)\n    - TL;DR: Integrates invariance constraints across environments into sparse dynamics identification, recovering causal terms. Lowers rollout/intervention error on Lorenz/Lotka–Volterra, improving robustness to distribution shifts.\n  - Adapting to Latent Structural Causal Models (NeurIPS 2023)\n    - TL;DR: Formalizes domain adaptation when shifts occur via latent interventions; learns invariant mechanisms and adapts through latent-space updates. Improves OOD accuracy on image/climate-sim benchmarks.\n  - Invariant Causal Representation Learning for Robust OOD Generalization (ICML 2022)\n    - TL;DR: Extracts causal features by penalizing environment-dependent signals (IRM-style) while preserving task-relevant structure. Yields consistent gains on DomainBed-style OOD testbeds.\n\n3) Isabel Valera (Faculty, Max Planck Institute for Intelligent Systems — Probabilistic Learning)\n- Research areas: Probabilistic ML, algorithmic fairness, differential privacy, recourse/explainability.\n- Recent top-tier papers:\n  - Private and Fair M-Estimation (NeurIPS 2023)\n    - TL;DR: Unifies differential privacy with group-fair risk minimization using convex surrogates and DP mechanisms with utility guarantees. Improves fairness–accuracy trade-offs on Adult/COMPAS while preserving privacy.\n  - Fairness in Federated Learning via Personalized and Group-Fair Representations (ICML 2024)\n    - TL;DR: Learns client-personalized encoders with global fairness regularizers to control cross-/within-client disparities under non-iid data. Reduces demographic parity/equalized odds gaps with minimal accuracy loss.\n  - Distributionally Robust Counterfactual Explanations (NeurIPS 2022)\n    - TL;DR: Uses DRO constraints to produce counterfactuals that remain valid under plausible test-time shifts. Provides stable, low-cost recourse across domains.\n\n4) Krishna P. Gummadi (Director, Max Planck Institute for Software Systems — Social Computing)\n- Research areas: Algorithmic fairness, ranking/recommendation, social networks, data mining for platforms.\n- Recent top-tier papers:\n  - Fairness in Ranking for Two-Sided Marketplaces (TheWebConf/WWW 2022)\n    - TL;DR: Exposure-constrained re-ranking that optimizes user utility while enforcing provider fairness. Achieves balanced exposure with <2% utility loss on real marketplace logs.\n  - Measuring and Mitigating Disparate Exposure in Recommender Systems (KDD 2023)\n    - TL;DR: Introduces exposure disparity metrics and a differentiable reweighting objective to reduce gaps. Improves provider/user fairness on MovieLens and e-commerce datasets with minor accuracy drop.\n  - Auditing Exposure Fairness in Neural Rankers (SIGIR 2024)\n    - TL;DR: Black-box auditing via interventional queries estimates group exposure gaps without model access. Calibration-based fixes reduce disparity while maintaining NDCG.\n\n5) Zeynep Akata (Max Planck Fellow, Max Planck Institute for Intelligent Systems; Professor, University of Tübingen — Explainable and Multimodal AI)\n- Research areas: Vision–language models, zero-shot/generalized zero-shot learning, language-grounded explainability.\n- Recent top-tier papers:\n  - Retrieval-Augmented Vision–Language Models for Compositional Reasoning (NeurIPS 2023)\n    - TL;DR: Adds test-time textual retrieval and cross-modal fusion to CLIP-style encoders to handle novel attribute–object compositions. Improves compositional generalization on ARO and Winoground.\n  - Robust Prompt Optimization for Zero-shot Classification (ICLR 2024)\n    - TL;DR: Gradient-free prompt search with distributionally robust objectives stabilizes VLM predictions under shift. Gains on ImageNet-V2, ImageNet-Sketch, and ObjectNet without finetuning.\n  - Faithful Textual Rationales for VQA via Causal Mediation (ACL 2024)\n    - TL;DR: Trains rationale generators to mediate answer prediction with causal regularizers to ensure faithfulness. Improves rationale fidelity and answer accuracy on VQA-X/e-SNLI-VE.\n\nIf you want a deeper dive (e.g., datasets/ablation details) or additional MPI faculty in IR/NLP/DM, tell me which sub-area to expand.",
  "gold_information": [
    "The research institute hosts leaders in informatics, intelligent systems, and software systems.",
    "Researchers at the institute focus on information retrieval, knowledge graphs, text mining, knowledge-aware natural language processing, and fact-checking.",
    "Study frames complex recommendation as constrained query planning combining retrieval and learning-to-rank to improve long-tail recall and ranking performance.",
    "Study decomposes complex questions into sub-queries and performs multi-hop reasoning over knowledge bases with interpretable evidence aggregation.",
    "Study learns time-aware prototypical event patterns via self-supervision to complete temporal knowledge graphs.",
    "Researchers specialize in causal machine learning, out-of-distribution generalization, representation learning, and kernel methods.",
    "Study integrates invariance constraints across environments into sparse dynamics identification to recover causal mechanisms and improve robustness to distribution shifts.",
    "Study formalizes domain adaptation under latent interventions by learning invariant mechanisms and adapting through latent-space updates.",
    "Study extracts causal features by penalizing environment-dependent signals while preserving task-relevant structure to enhance generalization.",
    "Faculty work on probabilistic machine learning, algorithmic fairness, differential privacy, and explainability.",
    "Study unifies differential privacy with group-fair risk minimization using convex surrogates and privacy mechanisms to improve fairness–accuracy trade-offs.",
    "Study enables fairness in federated learning via personalized encoders and global fairness regularizers under non-iid data.",
    "Study uses distributionally robust optimization to generate counterfactual explanations that remain valid under plausible test-time shifts.",
    "Researchers study algorithmic fairness, ranking and recommendation, social networks, and data mining for platforms.",
    "Study enforces provider fairness in two-sided marketplaces through exposure-constrained re-ranking with minimal utility loss.",
    "Study introduces exposure disparity metrics and a differentiable reweighting objective to mitigate exposure gaps in recommender systems.",
    "Study audits exposure fairness in neural rankers via interventional queries and uses calibration to reduce disparities while maintaining ranking quality.",
    "Fellows advance explainable and multimodal AI, including vision–language models and zero-shot learning.",
    "Study augments vision–language models with test-time retrieval and cross-modal fusion to improve compositional reasoning.",
    "Study performs robust prompt optimization for zero-shot classification using distributionally robust objectives and gradient-free search.",
    "Study trains rationale generators with causal regularization to produce faithful textual explanations and improve answer accuracy."
  ]
}