</html>
<head>
  
  
  <style type="text/css">
body {
	background-color:#FFF;
	color:#000;
	margin-top:1px;
	margin-left:150px;
	margin-right:10%;
	margin-bottom:10%;
}
  
  
p, li, h1, h2, h3, h4, td {
	font-family:Arial;
}


h1 {
	color:#444;
}

h2 {
	color:#666;
}


h3 {
	color:#777;
}


h4 {
	color:#999;
}

span.marked {
	color:#84C;
	font-weight:bold;
}


.important {
	font-weight:bold;
	color:#84C;
}
.error {
	font-weight:bold;
	color:#A20;
}

.error2 {
	color:#900;
}


.version {
	border:1px solid black;
	background-color:#FFC;
}


.mono {
	font-family: monospace;
}

code {
	font-size: 120%;
	font-weight:bold;
	color:#444;
}

table {
	border: 1px solid;
	border-collapse: collapse;
}

td {
	border: 1px solid;
}

a {
	color:#333;
}  
</style>
  
<title>Supplementary Material</title>
</head>

<body>
<h1>Supplementary Material for AKBC submission </h1>

  <p>
    The following paragraphs explain how to reproduce the experiments in the paper
    and provide detailed information about the training of KGE models and hyperparameter searches performed.
  </p>


<h2>Reproducing results for one model/dataset</h2>

  <!-- p>In this section we assume that a pretrained libKGE model  (we specify below a link to the libKGE page, where these files are avalable for many datasets) and an AnyBURL ranking file (this file can be found as part of the supplementary material in this folder) is already available. For further information how to create these on your own, we refer to <a href="https://github.com/uma-pi1/kge">libKGE</a> and <a href="https://web.informatik.uni-mannheim.de/AnyBURL/">AnyBURL</a>.</p -->

  <p>
    Reproducing the experiments is shown exemplary for the ComplEx model on the dataset
    CoDEx-M, e.g, the entry with an MRR equal to 0.349 in the table in Appendix A and the corresponding marker
    of Figure 1 in the main text.
  </p>
  



  <p>
    To reproduce the example for ComplEx on CoDEx-M, please follow the steps below:
  </p>

<h4>1. Install the libKGE library in your python environment</h4>

  <p>
    Follow the quick start installation instructions of the <a href="https://github.com/uma-pi1/kge">libKGE</a>
    library. Also run <code>download.sh</code> as mentioned in the installation instructions.
    To check if the installation was successful run <code>kge</code> in your console,
    which should output <br> <br>
    <code>
      usage: kge [-h] {start,create,resume,eval,valid,test,dump,package} ... <br>
      kge: error: the following arguments are required: command
    </code> <br><br>
    Additionally keep track of the folder you cloned from Github which contains the library and is termed <code>kge</code>.
    The folder will be referenced as <code>/path/to/libKGE/kge</code> below.
  </p>

<h4>2. Download the pre-trained ComplEx model for CoDEx-M from the libKGE library</h4>

  <p>
    Download the file <code>codex-m-lp-complex.pt</code> from <a href="https://www.dropbox.com/s/psy21fvbn5pbmw6/codex-m-lp-complex.pt?dl=0">here</a>,
    it contains the pre-trained model checkpoint. Place the file in this folder, i.e, the folder where this readme.html is located.
  </p>

<h4>3. Ensure you have all files.</h4>
  <p>
    Ensure the files <code>anyburl-test</code> and <code>anyburl-valid</code>, which contain the original AnyBURL rankings,
    are located in this folder, i.e, the folder where the readme.html is located.
  </p>

  <p>
   You also need the location to the Codex-m dataset folder which is located in /path/to/libKGE/kge/data/codex-m
  </p>

<h4>4. Run the rescorex.py script, twice.</h4>

  <p>
    Ensure that you are within the python environment under which libKGE is installed.
    Navigate into this folder (alternatively, simply ensure that all the relativ/absolute paths to the files are correct)
    and run <br><br>

    <code>
      python rescorex.py --dataset_folder /path/to/libKGE/kge/data/codex-m
      --checkpoint codex-m-lp-complex.pt --ab_ranking anyburl-test --model_name complex
    </code>
    <br><br>
    subsequently, run<br><br>

    <code>
      python rescorex.py --dataset_folder /path/to/libKGE/kge/data/codex-m
      --checkpoint codex-m-lp-complex.pt --ab_ranking anyburl-valid --model_name complex
    </code>
    <br><br>
    When you are using Windows, additionally use the flag <code>-Xutf8</code>.
    After running the two commands, the two files <code>anyburl-test-complex</code>
    and <code>anyburl-valid-complex</code> have been created in the same folder.
    For help on the parameters of the script run <code>python rescorex.py --help</code>.
  </p>
  
  <p>When running these scripts some warnings are generated. These warnings can be ignored. They are based on the fact that the checkpoints available at the libKGE page have been generated with an older libKGE version.</p>
  
 <h4>5. Aggregate libKGE and AnyBURL ranking</h4> 
 
 <p>The code for aggregating the ranking files is written in Java and uses AnyBURL as dependency. This means that you have to download the latest AnyBURL version, which is available as <a href="https://web.informatik.uni-mannheim.de/AnyBURL/AnyBURL-JUNO.jar">AnyBURL-JUNO.jar<a>. The code for learning and performing the aggregation is available in the jar file JoinRE.jar located within this folder, i.e, the folder where this readme.html is located. Both jar files are compiled against the java 1.8 profile, which means that they should be also executable with older java versions.</p>

<p>Run the aggregation code with the following command (<strong>on Windows you have to separate the two jar files in the command with a ; semicolon, on Linux with a : colon</strong>). You have to specify seven input parameters seperated by blanks: (1) path to training file, (2) path to validation file, (3) path to ranking file created by a rule learner for the validation set, (4) path to rule-based ranking file with scores from kge model for the validation set, (5) path to ranking file created by a rule learner for the test set, (6) path to rule-based ranking file with scores from kge model for the test set, (7) output path where the aggregated ranking file should be stored.</p>

<p>You can use this command line call and run it from this folder, i.e, the folder where the readme.html is located. You have to change the first two arguments, to refer to your libKGE installation. The other parameters should fit to the content of this directory, if you conducted all previous steps.</p>


<!-- p> <code>java -cp JoinRE.jar;AnyBURL-JUNO.jar x.y.z.Rescorer E:/code/libKGE/kge/data/codex-m/train.txt E:/code/libKGE/kge/data/codex-m/valid.txt anyburl-valid anyburl-test anyburl-valid-complex anyburl-test-complex output</code> </p --> 

<p> <code>java -cp JoinRE.jar;AnyBURL-JUNO.jar x.y.z.Rescorer /path/to/libKGE/kge/data/codex-m/train.txt /path/to/libKGE/kge/data/codex-m/valid.txt anyburl-valid anyburl-test anyburl-valid-complex anyburl-test-complex output</code> </p> 

<p>As results, an additional ranking file named <code>output</code> is stored in this folder.</p>

<h4>6. Evaluate the results</h4> 

<p>You can use the evaluation method from AnyBURL to evaluate it. This requires to modify the first three paths (which refer to training, validation and test set of the used evaluation dataset) to /path/to/libKGE/kge in the file <code>config-eval.properties</code>, which is also available in this folder. Compute hist@k and MMR with this command line call.<p>

<p><code>java -cp AnyBURL-JUNO.jar de.unima.ki.anyburl.Eval</code></p>

<p>The output should look like this:</p>

<pre>
...
0.2772   0.3838   0.4922   0.3494
</pre>

<p>The numbers have the followng meaning: hits@1, hits@3, hits@10, and MRR.</p>


<!-- h4 style="color: #990000">5. Do some fancy JAVA stuff.</h4 -->


 <h2>Reproducing results for other models / datasets</h2> 
 
 <p>For all KGE models except of the transformer implementation we use the pre-trained models provided by libKGE. The AnyBURL rankings can be generated as described on the AnyBURL webpage. You can use the AnyBURL default parameter setting. There are only two exceptions:</p>


<ul>
	<li>Please add the additioanl parameter <code>TOP_K_OUTPUT = 100</code> to the <code>config-apply.configuration</code>. This generates the top 100 rankings (10 in the default setting).</li>
	<li>As recommended by the AnyBURL developer, add MAX_LENGTH_CYCLIC = 5 to the <code>config-learn.configuration</code> for the WN18RR dataset. The default is 3, which shold be used for the other datasets.</li>
</ul> 


<h2>Training KGE models</h2>
  <p>
    As mentioned in the main text, for all KGE models except of the transformer implementation we use the pre-trained models provided
    by the libKGE library which can be downloaded <a href="https://github.com/uma-pi1/kge#results-and-pretrained-models">here</a>.
    For the transformer  <a href="https://github.com/uma-pi1/kge/blob/master/kge/model/transformer.yaml">implementation</a>, we provide detailed config information in the folder
    <code>transformer-configs</code> within the supplementary materials. All the configs can be run seamlessly under the current libKGE master (June 2021).
  </p>

  <p>
    For the transformer model, we obtained hyperparameters for the FB237 dataset from the libKGE developers (<code>transformer-configs/training-configs/transformer-fb237-config.yaml</code>).
    For the remaining datasets, we use the Ax hyperparameter search provided by libKGE where we centered the search space around the FB237 parameters. For each dataset, we use the same search space.
    On Codex-L we run 15 trials out of which 7 are quasi-random. For the remaining datasets we use 30 trials with 15 quasi-random trials. The detailed specifications can be found in  <code>transformer-configs/search-configs</code>.
  </p>

  <p>
    The final models for the transformer are obtained by using the configurations of the
    search trials achieving the highest MRR on the validation split. The resulting configurations can be found in
    <code>transformer-configs/training-configs</code>.
  </p>


</body>

</html>

