\section{Discussion and Conclusion}
This work investigated the possibility of estimating a holistic price for any product from available data on externalities caused by the producing company.
% Our preliminary experiments suggest that while conceptually sound, this cannot be done with to a satisfactory level of accuracy with the current data landscape.
% Illustrate with compeleteness? E.g.:
% At the time of writing the edge density between companies and metrics, even after filtering for the most documented companies on Wikirate, lies at $12.32\%$. A density this low indicates that any conclusion drawn from the data are incomplete.
True pricing is an indispensable tool for policymakers, consumers, and producers alike if we aim to reimagine our global supply chains in a sustainable way.
A requirement of this vision is that the tools involved, like that of this demo, are accessible and transparent.
The success of platforms such as Wikirate~\cite{Wikirate} already demonstrates the public demand for accountability, but the nature of individual and disconnected data points makes drawing concrete conclusions difficult.
Our demo and its open data \gls{kg} thus constitute the first concrete steps towards making true cost approximations more accessible. Furthermore, as data becomes increasingly available due to more interconnected and monitored economy, tools such as ours will become much more informative in the near future.

\paragraph{Limitations and Future Work}
A limitation of the system that remains is the difficulty of evaluation.
Besides a handful of true cost reports, which may act as sanity checks, there is no reliable ground truth.
One may perform quality control on the KG as a proxy for uncertainty, for example using the methodology of~\cite{wangKnowledgeGraphQuality2021} but reliably quantifying the expected error remains difficult.

Additionally, the approach makes strong simplifying assumptions, which are often violated in practice.
To name a few: ``All products contribute to all externalities in proportion to their market price'', ``The entire supply chain is vertically integrated'' or ``The costs associated with metrics are location independent and constant''.
Loosening these assumptions would be a valuable contribution, e.g., with a cost model as a function of time and space or a unified way of addressing over- or undercounting of externalities along a distributed supply chain.

As future work, we will make the pipeline more robust. The harmonization and categorization of sources rely on heuristics and keyword matching, and remain vulnerable to edge cases. Metrics such as ``Reduction of hazardous waste'' are commonly categorized under hazardous waste, which skews results.
Deduplication is also not trivial, as differences in gathering or preprocessing between metrics that target the same underlying concept make determining authority for competing values challenging.
Finally the \gls{llm}-based Text2SPARQL interface is error-prone and a more sophisticated approach such as an agentic system with query access is warranted to replace the current zero-shot setup.




