# Algorithm 1: Knowledge Graph Construction and Embedding

The knowledge representation module forms the foundation of KARMA by converting domain knowledge into structured, learnable representations. This algorithm details the process of constructing knowledge graphs from various sources and learning embeddings that can be integrated with reinforcement learning agents.

## Input Processing and Knowledge Graph Construction

The knowledge graph construction process begins with the aggregation of domain knowledge from multiple sources including expert rules, ontologies, and textual descriptions. The algorithm systematically processes these heterogeneous knowledge sources to create a unified graph representation.

```
Algorithm 1.1: Knowledge Graph Construction
Input: Domain knowledge sources K_sources = {rules, ontologies, text}
Output: Knowledge graph G = (E, R_kg, T)

1: Initialize empty sets E ← ∅, R_kg ← ∅, T ← ∅
2: for each source s in K_sources do
3:    entities_s ← extract_entities(s)
4:    relations_s ← extract_relations(s)
5:    triples_s ← extract_triples(s)
6:    E ← E ∪ entities_s
7:    R_kg ← R_kg ∪ relations_s
8:    T ← T ∪ triples_s
9: end for
10: G ← construct_graph(E, R_kg, T)
11: G ← resolve_conflicts(G)  // Handle conflicting information
12: G ← validate_consistency(G)  // Ensure logical consistency
13: return G
```

The entity extraction process employs named entity recognition for textual sources and structured parsing for formal ontologies. Relation extraction utilizes dependency parsing and semantic role labeling to identify relationships between entities. The conflict resolution step addresses inconsistencies between different knowledge sources using confidence scores and source reliability metrics.

## Knowledge Graph Embedding Learning

Once the knowledge graph is constructed, the algorithm learns dense vector representations for entities and relations using the TransE embedding model, which has proven effective for capturing semantic relationships in knowledge graphs.

```
Algorithm 1.2: Knowledge Graph Embedding Learning
Input: Knowledge graph G = (E, R_kg, T), embedding dimension d
Output: Entity embeddings {e_i ∈ R^d}, relation embeddings {r_j ∈ R^d}

1: Initialize entity embeddings E_emb ← random_normal(|E|, d)
2: Initialize relation embeddings R_emb ← random_normal(|R_kg|, d)
3: Initialize optimizer Adam with learning rate α_kg
4: for epoch = 1 to max_epochs do
5:    for each triple (h, r, t) in T do
6:       // Positive sample
7:       score_pos ← ||h + r - t||_2
8:       
9:       // Generate negative samples
10:      (h	, r	, t	) ← corrupt_triple(h, r, t)
11:      score_neg ← ||h	 + r	 - t	||_2
12:      
13:      // Margin-based ranking loss
14:      loss ← max(0, γ_kg + score_pos - score_neg)
15:      
16:      // Update embeddings
17:      update_embeddings(E_emb, R_emb, loss)
18:   end for
19: end for
20: return E_emb, R_emb
```

The negative sampling strategy in line 10 employs a combination of random corruption and adversarial sampling to generate challenging negative examples. The corruption process randomly replaces either the head entity, relation, or tail entity while ensuring the corrupted triple is not present in the original knowledge graph.

## Knowledge-State Integration

The integration of knowledge graph embeddings with environment states requires a sophisticated mapping mechanism that identifies relevant knowledge for each state and computes attention-weighted representations.

```
Algorithm 1.3: Knowledge-State Integration
Input: State s, knowledge graph G, embeddings E_emb, R_emb
Output: Augmented state representation s	

1: // Map state to relevant entities
2: relevant_entities ← ∅
3: for each entity e in E do
4:    similarity ← compute_similarity(s, e)
5:    if similarity > θ_map then
6:       relevant_entities ← relevant_entities ∪ {e}
7:    end if
8: end for

9: // Compute attention weights
10: attention_scores ← ∅
11: for each entity e in relevant_entities do
12:    score ← neural_network(concat(s, E_emb[e]))
13:    attention_scores[e] ← score
14: end for
15: attention_weights ← softmax(attention_scores)

16: // Aggregate knowledge context
17: knowledge_context ← 0
18: for each entity e in relevant_entities do
19:    knowledge_context += attention_weights[e] * E_emb[e]
20: end for

21: // Create augmented state
22: s	 ← concat(s, knowledge_context)
23: return s	
```

The similarity computation in line 4 utilizes a learned similarity function that maps state features to entity descriptions using semantic embeddings. The attention mechanism ensures that the most relevant knowledge is emphasized while maintaining differentiability for end-to-end learning.

