{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Needs[\"PEP`\"]\n",
    "$Assumptions = {Element[_, Reals]};\n",
    "$Assumptions = \n",
    " Join[$Assumptions, {0 < \\[Mu] < L, \\[Mu] > 0, L > 0, L > \\[Mu], 0 < \\[Epsilon], \\[Epsilon] <= 1, 0 < \\[Eta], SubStar[f] == 0, Subscript[\\[Lambda], 1] >= 0, \\[Nu] > 0}]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Analyzing error feedback using our Lyapunov function\n",
    "Thus far, we have managed to arrive at the following Lyapunov function using numerics:\n",
    "$$\n",
    "L(\\xi_k) \\coloneqq c ||x_k - x_\\star||^2 - 2c (x_k - x_\\star)^\\top e_k + (1 - c) ||e_k||^2,\n",
    "$$\n",
    "where $c \\coloneqq \\frac{\\sqrt{1 - \\delta}}{2 \\sqrt{1 - \\delta} + 1}$. We will now try to come up with the correct closed form for that convergence rate using Mathematica, as well as the correct multipliers to construct a proof of that rate of convergence."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "const = 1 / (2 + 1 / Sqrt[\\[Epsilon]])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(*Lyapunov[xk_, ek_] := \\[Phi] * (xk - SubStar[x])^2 - 2 * \\[Phi] * (xk - SubStar[x]) * ek + (1 - \\[Phi]) * ek^2*)\n",
    "Lyapunov[xk_, ek_] := (const * (xk - SubStar[x])^2 - 2 * const * (xk - SubStar[x]) * ek + (1 - const) * ek^2) / const\n",
    "objective := FullSimplify[Lyapunov[Subscript[x, k+1], Subscript[e, k+1]]]\n",
    "objective"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "initialization := FullSimplify[Lyapunov[Subscript[x, k], Subscript[e, k]] - 1]\n",
    "initialization"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "constr1 := CvxConstraint[SubStar[x], Subscript[x, k], SubStar[f], Subscript[f, k], 0, Subscript[g, k], L, \\[Mu]]\n",
    "constr1"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "constr2 := CvxConstraint[Subscript[x, k], SubStar[x], Subscript[f, k], SubStar[f], Subscript[g, k], 0, L, \\[Mu]]\n",
    "constr2"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "And the compression constraint:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "compress := CompressConstraint[Subscript[e, k] + \\[Eta] * Subscript[g, k], Subscript[c, k], \\[Epsilon]]\n",
    "compress"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We are now ready to specify our Lagrangian!"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 17,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Lagrangian[objective_, constraints_, variables_] := objective - Sum[variables[[i]] * constraints[[i]], {i, 1, Length[constraints]}]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Lagrange := Lagrangian[objective, {initialization, constr1, constr2, compress}, {\\[Rho], Subscript[\\[Lambda], 1], Subscript[\\[Lambda], 2], \\[Nu]}]\n",
    "(* Expand by filling the expressions for x_{k+1}, e_{k+1} *)\n",
    "\\[ScriptCapitalL] := FullSimplify[Lagrange /. {Subscript[x, k+1] -> Subscript[x, k] - Subscript[c, k], Subscript[e, k+1] -> Subscript[e, k] + \\[Eta] * Subscript[g, k] - Subscript[c, k]}]\n",
    "\\[ScriptCapitalL]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Let us now construct the LMI:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(* We simplify the hessian of the Lagrangian *)\n",
    "LMIfull := FullSimplify[D[-\\[ScriptCapitalL], {{Subscript[x, k], Subscript[g, k], Subscript[c, k], Subscript[e, k]}, 2}] / 2]\n",
    "LMIfull"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "linconstr := FullSimplify[D[-\\[ScriptCapitalL], {Subscript[f, k]}]]\n",
    "linconstr"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Since this implies that $\\lambda_1 = \\lambda_2$ we will simply backsubstitute this in our LMI."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMI := FullSimplify[LMIfull /. {Subscript[\\[Lambda], 2] -> Subscript[\\[Lambda], 1]}];\n",
    "LMI // MatrixForm"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Constructing proof of convergence\n",
    "We will now solve for rank deficiency of our LMI to end up with expressions to choose."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "minors = Simplify[Minors[LMI, 3]];\n",
    "minors"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "minors[[1, 2]] /. {\\[Nu] -> 1 / Sqrt[\\[Epsilon]]} // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(*numult = Simplify[Solve[minors[[1,2]] == 0, \\[Nu]]] [[1, 1]]  THIS TAKES LONG TO LOAD SO DON'T RUN THIS*)\n",
    "numult = \\[Nu] -> 1 / Sqrt[\\[Epsilon]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "This tells us one of the multipliers."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMIVsubbed = FullSimplify[LMI /. {numult}];\n",
    "LMIVsubbed // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(* Create new matrix wtihout 3rd row and 3rd column from LMIVsubbed *)\n",
    "LMIVsubbed2 = {{LMIVsubbed[[1,1]], LMIVsubbed[[1,2]], LMIVsubbed[[1,4]]},\n",
    "                {LMIVsubbed[[2,1]], LMIVsubbed[[2,2]], LMIVsubbed[[2,4]]},\n",
    "                {LMIVsubbed[[4,1]], LMIVsubbed[[4,2]], LMIVsubbed[[4,4]]}};\n",
    "LMIVsubbed2 // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Substar[\\[Eta]] = 2 / (L + \\[Mu]) * (1 - \\[Epsilon]) / (1 + Sqrt[\\[Epsilon]])^2 // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Substar[\\[Eta]] - 2 / (L + \\[Mu]) * (1 - Sqrt[\\[Epsilon]]) / (1 + Sqrt[\\[Epsilon]]) // Simplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(1 - Sqrt[x]) * (1 + Sqrt[x]) / (1 + Sqrt[x])^2 // Simplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "2/(L + \\[Mu]) (1 - Sqrt[\\[Epsilon]])/(1 + Sqrt[\\[Epsilon]])^2"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "\n",
    "LMIVsubbedOptimal = FullSimplify[ReplaceAll[LMIVsubbed2, {\\[Eta] -> Substar[\\[Eta]]}]];\n",
    "LMIVsubbedOptimal // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "tau = FullSimplify[Solve[Det[matrix] == 0, \\[Tau]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rhoGeneralOptimal = FullSimplify[Solve[Det[LMIVsubbedOptimal] == 0, \\[Rho]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rho1OptimalL1 = rhoGeneralOptimal[[1, 1, 2]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rho2OptimalL1 = rhoGeneralOptimal[[2, 1, 2]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "l1Ugly = FullSimplify[Reduce[rho2OptimalL1 - rho1OptimalL1 == 0, Subscript[\\[Lambda], 1]]][[2, 1, 1, 1, 1]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "For whatever reason, the fact is that l1Ugly is the negative of the actual expression. This means that the multiplier is actually:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "l1Optimal = FullSimplify[-l1Ugly]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "wrong = Substar[\\[Eta]] / (L + \\[Mu]) * ((1 - Sqrt[\\[Epsilon]])*(L - \\[Mu]) + (1 + Sqrt[\\[Epsilon]]) * Sqrt[(L - \\[Mu])^2 + 4^2 * L * \\[Mu] * Sqrt[\\[Epsilon]] / (1 + Sqrt[\\[Epsilon]])^2])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "\\[Eta]/(L + \\[Mu]) ((1 - Sqrt[\\[Epsilon]]) (L - \\[Mu]) + (1 + \n",
    "      Sqrt[\\[Epsilon]]) Sqrt[(L - \\[Mu])^2 + \n",
    "     4^2 L \\[Mu] Sqrt[\\[Epsilon]]/(1 + Sqrt[\\[Epsilon]])^2])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "N[rho1OptimalL1/. {Subscript[\\[Lambda], 1] -> wrong} /. {\\[Epsilon] -> 0.9, L -> 1.0, \\[Mu] -> 0.5}]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "l1Optimal - wrong // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(* Try to rewrite as a product of the optimal eta and some other product *)\n",
    "l1Parameterized = \\[Eta] * FullSimplify[l1Optimal / Substar[\\[Eta]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "FullSimplify[(L^2 * (1 + Sqrt[\\[Epsilon]])^2 - 2*L*\\[Mu]*(1 - 6*Sqrt[\\[Epsilon]] + \\[Epsilon]) + \\[Mu]^2 * (1 + Sqrt[\\[Epsilon]])^2) / (1 + Sqrt[\\[Epsilon]])^2]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "FullSimplify[(1 - 6 Sqrt[\\[Epsilon]] + \\[Epsilon]) * ((1 - Sqrt[\\[Epsilon]]) / (1 - \\[Epsilon]))^2]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Lets also have a quick look at the resulting rate after backsubstituting:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rho1Optimal = FullSimplify[ReplaceAll[rho1OptimalL1, {Subscript[\\[Lambda], 1] -> l1Optimal}]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rho2Optimal = FullSimplify[ReplaceAll[rho2OptimalL1, {Subscript[\\[Lambda], 1] -> l1Optimal}]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Just to confirm: these two are in fact the same rate!"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "FullSimplify[rho1Optimal - rho2Optimal]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rhoParameterized = \\[Eta] * FullSimplify[rho2Optimal / Substar[\\[Eta]]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Extracting a natural proof"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Note that in our case we have the following things to deal with:\n",
    "- $x_k$,\n",
    "- $x_\\star$,\n",
    "- $g_k$,\n",
    "- $c_k$,\n",
    "- $e_k$,"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "etaRule = \\[Eta] -> Substar[\\[Eta]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "multipliers = {Subscript[\\[Lambda], 1] -> \\[Lambda], \n",
    "               Subscript[\\[Lambda], 2] -> \\[Lambda], \n",
    "               \\[Nu] -> 1 / Sqrt[\\[Epsilon]], \n",
    "               etaRule}"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "lSimple = Collect[\\[ScriptCapitalL] /. {Subscript[\\[Lambda], 2] -> Subscript[\\[Lambda], 1]}, Subscript[\\[Lambda], 1]] // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "SOS = \\[Lambda] * (constr1 + constr2) + \\[Nu] * compress - (Lyapunov[Subscript[x, k+1], Subscript[e, k+1]] - \\[Rho] * Lyapunov[Subscript[x, k], Subscript[e, k]]);\n",
    "SOS = SOS /. {Subscript[x, k+1] -> Subscript[x, k] - Subscript[c, k], Subscript[e, k+1] -> Subscript[e, k] + \\[Eta] * Subscript[g, k] - Subscript[c, k]} /. multipliers // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Variables[SOS]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We start eliminating any involvement from $e_k$. That means we need to find\n",
    "$$\n",
    "SOS_1 = a_1 \\|e_k + b_1 xk + c_1 x_\\star + d_1 g_k\\|^2\n",
    "$$"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "a1 = Coefficient[SOS, Subscript[e, k]^2] // FullSimplify\n",
    "b1 = Coefficient[SOS, Subscript[e, k] * Subscript[x, k]] / a1 / 2 // FullSimplify\n",
    "c1 = Coefficient[SOS, Subscript[e, k] * SubStar[x]] / a1 / 2 // FullSimplify\n",
    "d1 = Coefficient[SOS, Subscript[e, k] * Subscript[g, k]] / a1 / 2 // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "aSubbed = a1 /. {\\[Rho] -> rho1Optimal} // Simplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Reduce[aSubbed < 0 && L > \\[Mu] > 0 && 0 <\\[Epsilon] < 1, {L, \\[Mu], \\[Epsilon]}]\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Reduce[rho1Optimal < Sqrt[\\[Epsilon]] && L > \\[Mu] > 0 && 0 <\\[Epsilon] < 1, {L, \\[Mu], \\[Epsilon]}]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "SOS1 = a1 * (Subscript[e, k] + b1 * Subscript[x, k] + c1 * SubStar[x] + d1 * Subscript[g, k])^2;\n",
    "rest1 = Collect[SOS - SOS1, Subscript[e, k]] // FullSimplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Variables[rest1]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "a2 = Coefficient[rest1, Subscript[g, k]^2] // FullSimplify\n",
    "b2 = Coefficient[rest1, Subscript[g, k] * Subscript[x, k]] / a2 / 2 // FullSimplify\n",
    "c2 = Coefficient[rest1, Subscript[g, k] * SubStar[x]] / a2 / 2 // FullSimplify\n",
    "SOS2 = a2 * (Subscript[g, k] + b2 * Subscript[x, k] + c2 * SubStar[x])^2;"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "a2 /. {\\[Rho] -> rho1Optimal, \\[Lambda] -> l1Optimal} // Simplify"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "This means that SOS2 is actually equal to 0 after substitution, which will be one of the steps of the proof!"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rest2 = Collect[rest1 - SOS2, Subscript[g, k]] // FullSimplify"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Now it remains to check that this constant is positive:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Coefficient[rest2, Subscript[x, k]^2] /. {\\[Rho] -> rho1Optimal, \\[Lambda] -> l1Optimal} // Simplify"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "In summary, we have:\n",
    "$$\n",
    "SOS = a \\|c_k + b x_k + c x_\\star + d g_k\\|^2,\n",
    "$$\n",
    "after reordering inequalities and subsitution, we get that\n",
    "$$\n",
    "\\rho \\mathcal{V}(x_k, e_k) \\geq \\mathcal{V}(x_{k+1}, e_{k+1}) + SOS \\geq \\mathcal{V}(x_{k+1}, e_{k+1}).\n",
    "$$"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Reduce[Sqrt[\\[Epsilon]] > rho1Optimal, {\\[Epsilon], L, \\[Mu]}] // Simplify"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "a1 / \\[Rho] /. {\\[Rho] -> rho1Optimal} // FullSimplify"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Can we get the general rate?\n",
    "I will here give a short attempt showing whether this is feasible or not directly with Mathematica."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMIVsubbed2 // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMIVweird = {{LMIVsubbed2[[1,1]], Subscript[\\[Phi], 1], LMIVsubbed2[[1,3]]},\n",
    "             {Subscript[\\[Phi], 1], Subscript[\\[Phi], 2], Subscript[\\[Phi], 3]},\n",
    "             {LMIVsubbed2[[3,1]], Subscript[\\[Phi], 3], LMIVsubbed2[[3,3]]}};\n",
    "LMIVweird // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "a = FullSimplify[Det[LMIVsubbed2]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "b = FullSimplify[Det[LMIVsubbedOptimal]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "FullSimplify[a - b]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "\n",
    "rhoGeneral = FullSimplify[Solve[FullSimplify[Det[LMIVsubbed2]] == 0, \\[Rho]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rhoGeneral1 = rhoGeneral[[1, 1, 2]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "rho1 seems to be invalid"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "rhoGeneral2 = rhoGeneral[[2, 1, 2]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "This seems valid, but the optimizer is hella noisy for good conditioning and low level of compression. This means we should be worried about the validity of our formulas when doing symbolic regression as well."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMIVsubbed2 // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "maybe = Simplify[CholeskyDecomposition[LMIVsubbed2]];\n",
    "maybe // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "option2 = FullSimplify[Numerator[FullSimplify[maybe[[3, 3]]]^2]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "maybe = FullSimplify[Solve[option2 == 0, \\[Rho]][[1, 1, 2]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "Simplify[Numerator[Together[D[maybe, Subscript[\\[Lambda], 1]]]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "LMIVsubbedSubOptimal = FullSimplify[ReplaceAll[LMIVsubbed2, {\\[Eta] -> c * Substar[\\[Eta]]}]];\n",
    "LMIVsubbedSubOptimal // MatrixForm"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "detGeneral = FullSimplify[Numerator[Together[Numerator[FullSimplify[Det[LMIVsubbed2]]]]]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "test = Solve[detGeneral == 0, \\[Rho]][[1, 1, 2]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "detOptimal = FullSimplify[Det[LMIVsubbedOptimal]]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "detThing = FullSimplify[Together[Det[LMIVsubbedSubOptimal]]] /. {c -> 1/2}"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The problem appears to be that the specific setting of leads to a lot of cancellations of terms. So effectively a lot of parts of the polynomial disappear."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Looking at the LMI\n",
    "Let us now try to write some code to solve this optimization problem and visualize the values of the lagrange multipliers. We will also see if we have a rank deficiency we can exploit for the closed-form solution,"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Let us check for accuracy. We get the following results according to PEPit:\n",
    "- $(\\eta = 0.1, \\delta = 0.9) \\rightarrow 0.980$\n",
    "- $(\\eta = 0.2, \\delta = 0.9) \\rightarrow 0.9606$\n",
    "- $(\\eta = 0.3, \\delta = 0.9) \\rightarrow 0.9413$\n",
    "- $(\\eta = 0.8, \\delta = 0.9) \\rightarrow 0.8494$\n",
    "- $(\\eta = 1.2, \\delta = 0.9) \\rightarrow 1.3613$\n",
    "\n",
    "- $(\\eta = 0.1, \\delta = 0.1) \\rightarrow 1.099$"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "SolveDualProblem[LL_, mu_, eta_, delta_] := Module[\n",
    "  {d = delta, S},\n",
    "  \n",
    "  (* Define the dual variables and constraints *)\n",
    "  dualVars = {S, r, l1, nu};\n",
    "  LMIsubbed = LMI /. {\n",
    "      Subscript[\\[Lambda], 1] -> l1,\n",
    "      \\[Rho] -> r, \n",
    "      \\[Epsilon] -> 1 - d,\n",
    "      L -> LL, \n",
    "      \\[Mu] -> mu, \n",
    "      \\[Eta] -> eta,\n",
    "      \\[Nu] -> nu\n",
    "  };\n",
    "\n",
    "  constraints = {\n",
    "    S == LMIsubbed,\n",
    "    VectorGreaterEqual[{S, 0}, {\"SemidefiniteCone\", 4}],\n",
    "    l1 >= 0,\n",
    "    r >= 0,\n",
    "    nu >= 0\n",
    "  };\n",
    "  \n",
    "  (* Solve the dual problem *)\n",
    "  SemidefiniteOptimization[r, constraints, dualVars, Method -> \"MOSEK\", Tolerance -> 10^(-10)]\n",
    "]\n",
    "SolveDualProblem[1.0, 0.1, 0.1, 0.9]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Let us make a plot for what happens with varying delta, when we use the optimal $\\eta$ given by:\n",
    "$$\n",
    "\\eta \\coloneqq \\frac{2}{L + \\mu} \\Big( \\frac{\\delta}{(1 + \\sqrt{1 - \\delta})^2} \\Big)\n",
    "$$"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "localL = 1.0;\n",
    "localMu = 0.2;\n",
    "deltas = Range[0.001, 0.999, 0.01];\n",
    "optimalEtas = 2 / (localL + localMu) * (deltas / (1 + Sqrt[1 - deltas])^2);\n",
    "dualValues = Table[\n",
    "  SolveDualProblem[localL, localMu, optimalEtas[[i]], deltas[[i]]], \n",
    "  {i, Length[deltas]}\n",
    "];\n",
    "SValues = Table[S /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "rValues = Table[r /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "r1Values = Table[rho1Optimal /. {\\[Epsilon] -> 1 - deltas[[i]], L -> localL, \\[Mu] -> localMu}, {i, Length[dualValues]}];\n",
    "r2Values = Table[rho2Optimal /. {\\[Epsilon] -> 1 - deltas[[i]], L -> localL, \\[Mu] -> localMu}, {i, Length[dualValues]}];\n",
    "l1Values = Table[l1 /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "l1Optimal = Table[l1Optimal /. {\\[Epsilon] -> 1 - deltas[[i]], L -> localL, \\[Mu] -> localMu}, {i, Length[deltas]}];\n",
    "\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{deltas, rValues}],\n",
    "    Transpose[{deltas, r1Values}],\n",
    "    Transpose[{deltas, r2Values}],\n",
    "    Transpose[{deltas, l1Values}],\n",
    "    Transpose[{deltas, l1Optimal}]\n",
    "  },\n",
    "  PlotStyle -> {\n",
    "    {Red, Thickness[0.004]},\n",
    "    {Blue, Thickness[0.004], Dashing[{0.02, 0.01}]},\n",
    "    {Green, Thickness[0.004], Dashing[{0.04, 0.01, 0.01, 0.01}]},\n",
    "    {Purple, Thickness[0.004], Dotted},\n",
    "    {Orange, Thickness[0.004], Dashing[{0.02, 0.01}]}\n",
    "  },\n",
    "  PlotLegends -> {\"ρ (contraction)\", \"ρ1\", \"ρ2\", \"λ (convexity)\", \"λ (optimal)\"},\n",
    "  AxesLabel -> {\"δ\", \"Value\"},\n",
    "  PlotLabel -> \"ρ and λ as functions of δ\",\n",
    "  GridLines -> Automatic,\n",
    "  PlotMarkers -> {{Automatic, 6}, {Automatic, 8}, {Automatic, 7}, {Automatic, 9}},\n",
    "  ImageSize -> Large,\n",
    "  PlotRange -> All\n",
    "]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "localDelta = 0.2;\n",
    "localL = 1.0;\n",
    "localMu = 0.5;\n",
    "etas = Range[0.001, 1.0, 0.005];\n",
    "dualValues = Table[\n",
    "  SolveDualProblem[localL, localMu, eta, localDelta], \n",
    "  {eta, etas}\n",
    "];\n",
    "\n",
    "SValues = Table[S /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "rValues = Table[r /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "rEvalValues = Table[rhoGeneral2 /. {\\[Epsilon] -> 1 - localDelta, L -> localL, \\[Mu] -> localMu, \\[Eta] -> etas[[i]], Subscript[\\[Lambda], 1] -> l1Values[[i]]}, {i, Length[dualValues]}];\n",
    "l1Values = Table[l1 /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "nuValues = Table[nu /. dualValues[[i]], {i, Length[dualValues]}];\n",
    "testValues = Table[option2 /. {\\[Epsilon] -> 1 - localDelta, L -> localL, \\[Mu] -> localMu, \\[Eta] -> etas[[i]], Subscript[\\[Lambda], 1] -> l1Values[[i]], \\[Rho] -> rValues[[i]]}, {i, Length[dualValues]}];\n",
    "\n",
    "\n",
    "(* Create the line plot *)\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{etas, rValues}],\n",
    "    Transpose[{etas, rEvalValues}],\n",
    "    Transpose[{etas, l1Values}],\n",
    "    Transpose[{etas, nuValues}]\n",
    "  },\n",
    "  PlotLegends -> {\"ρ (contraction)\", \"ρ (function of multipliers)\", \"λ (convexity)\", \"ν\"},\n",
    "  PlotStyle -> {Red, Green, Blue, Orange},\n",
    "  AxesLabel -> {\"η\", \"Value\"},\n",
    "  PlotLabel -> \"Multipliers as Functions of η (δ = \" <> ToString[localDelta] <> \")\",\n",
    "  GridLines -> Automatic,\n",
    "  PlotMarkers -> {Automatic, 6},\n",
    "  ImageSize -> Large,\n",
    "  PlotRange -> All\n",
    "]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(* Plot the residuals when we approximate rho by a linear function (1 - c * eta) *)\n",
    "\n",
    "slope = (1 - rValues[[-1]]) / etas[[-1]];\n",
    "linearRho = 1 - slope * etas;\n",
    "residuals = rValues - linearRho;\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{etas, residuals}]\n",
    "  },\n",
    "  PlotLegends -> {\"ρ (contraction) - ρ (linear approximation)\"},\n",
    "  PlotStyle -> {Red},\n",
    "  AxesLabel -> {\"η\", \"Residual\"},\n",
    "  PlotLabel -> \"Residuals of Linear Approximation of ρ (δ = \" <> ToString[localDelta] <> \")\",\n",
    "  GridLines -> Automatic,\n",
    "  PlotMarkers -> {Automatic, 6},\n",
    "  ImageSize -> Large,\n",
    "  PlotRange -> All\n",
    "]\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "matrices = Table[dualValues[[i]][[1]][[2]], {i, Length[dualValues]}];\n",
    "eig1Values = Table[\n",
    "  Eigenvalues[matrices[[i]]][[1]], \n",
    "  {i, Length[matrices]}\n",
    "];\n",
    "eig2Values = Table[\n",
    "  Eigenvalues[matrices[[i]]][[2]], \n",
    "  {i, Length[matrices]}\n",
    "];\n",
    "eig3Values = Table[\n",
    "  Eigenvalues[matrices[[i]]][[3]], \n",
    "  {i, Length[matrices]}\n",
    "];\n",
    "\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{etas, eig1Values}],\n",
    "    Transpose[{etas, eig2Values}],\n",
    "    Transpose[{etas, eig3Values}]\n",
    "  },\n",
    "  PlotLegends -> {\"λ₁\", \"λ₂\", \"λ₃\"},\n",
    "  PlotStyle -> {Red, Blue, Green},\n",
    "  AxesLabel -> {\"η\", \"Eigenvalue\"},\n",
    "  PlotLabel -> \"Eigenvalues of S as Functions of η (δ = \" <> ToString[localDelta] <> \")\",\n",
    "  GridLines -> Automatic,\n",
    "  PlotMarkers -> {Automatic, 3},\n",
    "  ImageSize -> Large,\n",
    "  PlotRange -> All\n",
    "]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "(* Plot the values of all the entries in the matrices *)\n",
    "e11 = Table[matrices[[i]][[1,1]], {i, Length[matrices]}];\n",
    "e12 = Table[matrices[[i]][[1,2]], {i, Length[matrices]}];\n",
    "e13 = Table[matrices[[i]][[1,3]], {i, Length[matrices]}];\n",
    "e22 = Table[matrices[[i]][[2,2]], {i, Length[matrices]}];\n",
    "e23 = Table[matrices[[i]][[2,3]], {i, Length[matrices]}];\n",
    "e33 = Table[matrices[[i]][[3,3]], {i, Length[matrices]}];\n",
    "\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{etas, e11}],\n",
    "    Transpose[{etas, e12}],\n",
    "    Transpose[{etas, e13}],\n",
    "    Transpose[{etas, e22}],\n",
    "    Transpose[{etas, e23}],\n",
    "    Transpose[{etas, e33}]\n",
    "  },\n",
    "  PlotLegends -> {\"e11\", \"e12\", \"e13\", \"e22\", \"e23\", \"e33\"},\n",
    "  PlotStyle -> {Red, Blue, Green, Orange, Purple, Cyan},\n",
    "  AxesLabel -> {\"η\", \"Value\"},\n",
    "  PlotLabel -> \"Entries of S as Functions of η (δ = \" <> ToString[localDelta] <> \")\",\n",
    "  GridLines -> Automatic,\n",
    "  PlotMarkers -> {Automatic, 3},\n",
    "  ImageSize -> Large,\n",
    "  PlotRange -> All\n",
    "]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "vscode": {
     "languageId": "wolfram"
    }
   },
   "outputs": [],
   "source": [
    "minorsValues11 = Table[Minors[matrices[[i]], 1][[1,1]], {i, Length[matrices]}];\n",
    "minorsValues12 = Table[Minors[matrices[[i]], 1][[1,2]], {i, Length[matrices]}];\n",
    "minorsValues13 = Table[Minors[matrices[[i]], 1][[1,3]], {i, Length[matrices]}];\n",
    "minorsValues22 = Table[Minors[matrices[[i]], 1][[2,2]], {i, Length[matrices]}];\n",
    "minorsValues23 = Table[Minors[matrices[[i]], 1][[2,3]], {i, Length[matrices]}];\n",
    "minorsValues33 = Table[Minors[matrices[[i]], 1][[3,3]], {i, Length[matrices]}];\n",
    "sum = minorsValues11 + minorsValues12 + minorsValues13 + minorsValues22 + minorsValues23 + minorsValues33;\n",
    "\n",
    "ListLinePlot[\n",
    "  {\n",
    "    Transpose[{etas, minorsValues11}],\n",
    "    Transpose[{etas, minorsValues12}],\n",
    "    Transpose[{etas, minorsValues13}],\n",
    "    Transpose[{etas, minorsValues22}],\n",
    "    Transpose[{etas, minorsValues23}],\n",
    "    Transpose[{etas, minorsValues33}],\n",
    "    Transpose[{etas, sum}]\n",
    "  },\n",
    "  PlotLegends -> {\"minors(1,1)\", \"minors(1,2)\", \"minors(1,3)\", \"minors(2,2)\", \"minors(2,3)\", \"minors(3,3)\", \"sum\"}\n",
    "]"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Wolfram Language 14.2",
   "language": "Wolfram Language",
   "name": "wolframlanguage14.2"
  },
  "language_info": {
   "codemirror_mode": "mathematica",
   "file_extension": ".m",
   "mimetype": "application/vnd.wolfram.m",
   "name": "Wolfram Language",
   "pygments_lexer": "mathematica",
   "version": "12.0"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
}
