RDF Storage

The construction is described in the paper "Hypergraph Based Query Optimization", which we will use by translating the logic to OT circuits, starting with decryption into the circuit using the extended decryption process of the address content. We adopt their term definitions in this section, Query Planner, and Query Evaluator. their definitions provided here roughly verbatim for convenience:

Notation:

RDF Graph – G = (V, E) where V = {v|v ∈ S ∪ O} and E = {e₁, e₂, ...}∃e = {u, v} where u, v ∈ V.
Edge Labeling Function – l_e(S, O) = P.
Node Labeling Function – l_v(v_t) = t where t ∈ (S ∪ O) and S = Subject(URI ∪ BLANKS), P = Predicate(URI), O = Object(URI ∪ BLANKS ∪ LIT).
Hypergraph – H(G) = (V, E) where node V = {v₁, ..., v_n} and E = {e₁, ..., e_n} where V = {v|v ∈ S ∪ O ∪ P} and each edge E is a non-empty set of V . ∀P, ∃e|(S_i, O_i) ∈ H(G) where 1 ≤ i ≤ n.
Overlapping Hyperedge – (h_i(G) ⊑ h_i+1(G)) where h₁(G) = (S₁, P₁, O₁) and h₂(G) = (S₂, P₂, O₂), (h₁(G) ⊑ h₂(G)) iff ∀s₁ ∈ S₁ ∈ h₁(G)∃s₂ ∈ S₂ ∈ h₂(G)∨∀o₁ ∈ O₁ ∈ h₁(G)∃o₂ ∈ O₂ ∈ h₂(G)∨∀p₁ ∈ P₁ ∈ h₁(G)∃p₂ ∈ P₂ ∈ h₂(G).
Predicate-Based Index – I(G) = (V, E) where V = {v|v ∈ P_i ∈ h_i ∧ δ} and E = (v_i, v_j) where v_i, v_j ∈ V and 1 ≤ i ≤ n − 1, 1 ≤ j ≤ n for δ ∈ V ∃e = (δ, v). δ is the root of the index.
SPARQL Query – Q^R contains <Q^q, Q^s, Q^p > where Q^q is the query form and Q^p is the match pattern if ?x ∈ var(Q^q) then ?x ∈ var(Q^p) and Q^s contains constraints like FILTER, OPTIONAL.
Query Graph – Q^G = (V, E), V ← {var ∈ Q^p_i} and E ← {P ∈ Q^p_i ∧ (var ∈ Q^p_i, var ∈ Q^p_i+1)} where 1 ≤ i ≤ n, n is the number of predicates, P is the predicate.
Query Path – Q^path∃Q^path, δ → P_i ∈ I(G)|P_i. size = minsize ∧ (P_i → P_i+1) ∈ I(G) if ∃var ∈ Q^p_i == var ∈ Q^p_i+1.
Data Insertion – Given Q^R and Q^p then check ∃P_i ∈ Q^p ∈ I(G), if true then check P_i ∈ H(G) ∨ create h_i ∈ P_i ∧ update H(G) with h_i. Check if ∃var ∈ Q^p ∈ H(G), if true then overlap h_i with var’s h ∨ update h_i with var ∈ Q^p.
Data Deletion – Given Q^R and Q^p then check ∃P_i ∈ Q^p ∈ I(G) ∧ P_i ∈ var, if true then check |h_i| == 0, if true then remove var ∈ Q^p ∈ H(G) ∨ copy of P_i exists.

Given these definitions, their paper proposes algorithms to perform the whole of the queries for this database:

Create Hypergraph:

Given an RDF graph G as triple (S, O, P).
V ← ∅, E ← ∅, e_i ← ∅
∀(S, O, P) ∈ G, V ← V ∪ V ∈ (S ∪ O ∪ P).
∀P_i|1 ≤ i ≤ n, 1 ≤ j ≤ n, ei ← {P_i, {S_j , O_j}}, E ← e_i.
H(G) = (V, E).

Create Predicate-Based Index:

Given a hypergraph H(G).
Sort hyperedges by size.
∀h_i|1 ≤ i ≤ n − 1, if MIN(size(h_i)) then I(G) = I(G) ∪ δ ↓ Pi, ∀j|i + 1 ≤ j ≤ n, if (h_i(G) ⊑ h_j(G)) ∧ (size(P_i) == size(P_j)) then I(G) = I(G) ∪ P_i ↔ P_j else I(G) = I(G) ∪ P_i → P_j.