<!DOCTYPE html>
<html><head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
    <meta charset="UTF-8">
    <title>Supplementary material</title>
</head>
<body>
    <h1>Supplementary Material for Paper #8921</h1>

<h2>Contents</h2>
This archive contains:
<ul>
    <li><a href="proofs.pdf">proofs.pdf</a>: a .pdf file containing the proofs of all the propositions given in the paper</li>
    <li><a href="explainingDT-neurips21-heat-maps.pdf">explainingDT-neurips21-heat-maps.pdf</a>: some additional
    examples of heatmaps and sufficient reasons for mnist49 and mnist38 datasets</li>
    <li> Several programs that can be launched in order to test our scripts and reproduce our results (see below)</li>
    <li>All datasets used in the experiments (in the directory "dataset")</li>
    <li><a href="results.xlsx">results.xlsx</a>: a spreadsheet that contains the empirical results collected for all datasets</li>
    <ul>
        <li> the "results" tab corresponds to an extension of Table 1 
(as given in the  paper) to the 90 datasets considered in the 
experiments. A couple of additional columns have also been added 
(#Instances is the number of instances of the dataset, #Features gives 
the number of features used primarily for describing the instances, 
#Classes is the number of classes, and depth is the mean depth of the 
decision trees that have been generated)
            </li><li> the "time" tab focuses on the computation time 
required to get the number of sufficient reasons (#Sufficient) of an 
instance using d4, and to enumerate its minimal sufficient reasons up to
 10 000 (#Minimal). The median time, the maximum run time and the number
 of instances for which the computation did not terminate in due time 
(for the sufficient reasons) or led to generate 10 000 minimal
sufficient reasons (#failed) are reported.
                
    </li></ul>
    <li>The directory bin that contains the runtime codes of the softwares Muser (MUS computation) and d4 (model counter)</li>
</ul>


<h2>Software</h2>
<p>You need to have a Linux OS in order to run the scripts. First of all, create a conda environment:
    open a terminal in the right directory and type:
    <code>conda env create --file environment.yml</code>. Then, activate this environment: <code>conda activate
        paper8921</code>
</p>

<h4>mnist.py</h4>
<p>This script generates and displays a heatmap, two sufficient reasons, and relevant/necessary literals for the mnist
    instances.
    By default, you have 8 or 3, you can change and generate 4 and 9 by modifying line 28 of this script.
    To run it, please enter the following command <code>python3 mnist.py</code>
</p>

<h4>test_DT.py</h4>
<p>This script generates some examples for a given dataset. For 10 instances, it computes and displays
    a sufficient reason, a maximum number of 1000 minimal sufficient reasons, the necessary and relevant literals, a maximum number of 10 000 sufficient reasons, and the
    explanatory importance of
    literals. To run it, please enter the following command: <code>python3 test_DT.py dataset/compas.csv</code>
    (the dataset is selected as an argument of the script).</p>
<h4>generate_data_DT.py</h4>
<p>This script allows to generate results following a 10-fold cross validation process, as explained in the experimental section of the paper.
    To run the script, please enter the following command <code>python3 generate_data_DT.py dataset/compas.csv</code>
    (the dataset is selected as an argument of the script). The results are saved in a .json file to be found in the
    folder "json" (this folder is empty at start but it must exist for the script to run smoothly).
</p>

<h4>Other scripts</h4>
<p>The script my_tree.py contains the algorithms presented in the paper and used to derive sufficient reasons, to derive minimal sufficient reasons, to compute necessary and relevant features, etc.</p>
<p>The script coding_CNF.py contains several encodings of cardinality constraints into the CNF format.</p>
<h4></h4>



</body></html>
