{
  "nbformat": 4,
  "nbformat_minor": 0,
  "metadata": {
    "colab": {
      "provenance": []
    },
    "kernelspec": {
      "name": "python3",
      "display_name": "Python 3"
    },
    "language_info": {
      "name": "python"
    }
  },
  "cells": [
    {
      "cell_type": "markdown",
      "source": [
        "In this notebook, we provide the implementation of various characteristics of labeled graphs used in the paper [Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond](https://arxiv.org/abs/2209.06177) (NeurIPS 2023)."
      ],
      "metadata": {
        "id": "99HDa76bW8PE"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "import numpy as np\n",
        "import networkx as nx"
      ],
      "metadata": {
        "id": "7t-z50nYW8k3"
      },
      "execution_count": 1,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "def convert_labels_to_consecutive_integers(labels):\n",
        "    unique_labels = np.unique(labels)\n",
        "    labels_map = {label: i for i, label in enumerate(unique_labels)}\n",
        "    new_labels = np.array([labels_map[label] for label in labels])\n",
        "\n",
        "    return new_labels\n"
      ],
      "metadata": {
        "id": "KoEmnFsyWoNs"
      },
      "execution_count": 2,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "$\\textit{Homophily}$ is a graph property describing the tendency of edges to connect similar nodes (i.e., nodes of the same class); the opposite property is called $\\textit{heterophily}$. How do we measure the level of homophily of a graph? There are several measures used in the literature: $\\textit{edge homophily}$ ($h_{edge}$), $\\textit{node homophily}$ ($h_{node}$), and $\\textit{class homophily}$ ($h_{class}$). But how do we decide which one to use?\n",
        "\n",
        "In our paper we conduct a theoretical analysis of the properties of these measures. We show that the three measures used in the literature do not satisfy some imporetant properties and thus cannot be used to compare homophily levels across different graphs. We recommend using another measure - $\\textit{adjusted homophily}$ ($h_{adj}$), which, as we show, satisfies more desirable properties than other measures.\n",
        "\n",
        "Further, we introduce another characteristic of labeled graphs - $\\textit{label informativenes}$ ($\\mathrm{LI}$), which shows how much information about a node's label is provided by its neighbor's label. It is often believed that the level of homophily of a graph is corellated with the performance of Graph Neural Networks (GNNs) on this graph. However, we show that label informativeness is much more correlated with the performance of GNNs than homophily. We introduce two versions of label informativeness: edge label informativeness ($\\mathrm{LI}_{edge}$) and node label informativeness ($\\mathrm{LI}_{node}$). In $\\mathrm{LI}_{node}$, averaging is over the nodes, so all nodes are weighted equally. In $\\mathrm{LI}_{edge}$, the averaging is over the edges, so high-degree nodes make a larger contribution to the final measure. We note that, for most real graph datasets, the values of these two measures are rather close. However, for some graphs they might be significantly different, especially if the degree distribution is very unbalanced.\n",
        "\n",
        "For more details about homophily and label informativeness measures (including formal definitions) see our paper.\n",
        "\n",
        "Below we provide the implementation of all the measures used in our paper. Each of the functions below takes a networkx graph and a numpy array of node labels as inputs."
      ],
      "metadata": {
        "id": "tPo75qgoXzLj"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "def h_edge(graph, labels):\n",
        "    \"\"\"Compute edge homophily.\"\"\"\n",
        "    edges_with_same_label = 0\n",
        "    for u, v in graph.edges:\n",
        "        if labels[u] == labels[v]:\n",
        "            edges_with_same_label += 1\n",
        "\n",
        "    h_edge = edges_with_same_label / len(graph.edges)\n",
        "\n",
        "    return h_edge\n",
        "\n",
        "\n",
        "def h_node(graph, labels):\n",
        "    \"\"\"Compute node homophily.\"\"\"\n",
        "    h_sum = 0\n",
        "    num_zero_degree_nodes = 0\n",
        "    for u in graph.nodes():\n",
        "        if graph.degree(u) == 0:\n",
        "            num_zero_degree_nodes += 1\n",
        "            continue\n",
        "\n",
        "        neighbors_with_same_label = 0\n",
        "        for v in graph.neighbors(u):\n",
        "            if labels[u] == labels[v]:\n",
        "                neighbors_with_same_label += 1\n",
        "\n",
        "        cur_node_h = neighbors_with_same_label / graph.degree(u)\n",
        "        h_sum += cur_node_h\n",
        "\n",
        "    h_node = h_sum / (len(graph.nodes) - num_zero_degree_nodes)\n",
        "\n",
        "    return h_node\n",
        "\n",
        "\n",
        "def h_class(graph, labels):\n",
        "    \"\"\"Compute class homophily.\"\"\"\n",
        "    labels = convert_labels_to_consecutive_integers(labels)\n",
        "\n",
        "    num_classes = len(np.unique(labels))\n",
        "    num_nodes = np.array([0 for _ in range(num_classes)])\n",
        "    numerator = np.array([0 for _ in range(num_classes)])\n",
        "    denominator = np.array([0 for _ in range(num_classes)])\n",
        "    for u in graph.nodes:\n",
        "        label = labels[u]\n",
        "        cur_numerator = 0\n",
        "        for v in graph.neighbors(u):\n",
        "            if label == labels[v]:\n",
        "                cur_numerator += 1\n",
        "\n",
        "        numerator[label] += cur_numerator\n",
        "        denominator[label] += graph.degree(u)\n",
        "        num_nodes[label] += 1\n",
        "\n",
        "    h = numerator / denominator\n",
        "    h -= num_nodes / len(graph.nodes)\n",
        "    h = np.maximum(h, 0)\n",
        "    h_class = h.sum() / (num_classes - 1)\n",
        "\n",
        "    return h_class\n",
        "\n",
        "\n",
        "def h_adj(graph, labels):\n",
        "    \"\"\"Compute adjusted homophily.\"\"\"\n",
        "    labels = convert_labels_to_consecutive_integers(labels)\n",
        "\n",
        "    num_classes = len(np.unique(labels))\n",
        "\n",
        "    degree_sums = np.zeros((num_classes,))\n",
        "    for u in graph.nodes:\n",
        "        label = labels[u]\n",
        "        degree_sums[label] += graph.degree(u)\n",
        "\n",
        "    adjust = (degree_sums ** 2 / (len(graph.edges) * 2) ** 2).sum()\n",
        "\n",
        "    h_adj = (h_edge(graph, labels) - adjust) / (1 - adjust)\n",
        "\n",
        "    return h_adj\n",
        "\n",
        "\n",
        "def li_edge(graph, labels, eps=1e-8):\n",
        "    \"\"\"Compute edge label informativeness.\"\"\"\n",
        "    labels = convert_labels_to_consecutive_integers(labels)\n",
        "\n",
        "    num_classes = len(np.unique(labels))\n",
        "\n",
        "    class_probs = np.array([0 for _ in range(num_classes)], dtype=float)\n",
        "    class_degree_weighted_probs = np.array([0 for _ in range(num_classes)], dtype=float)\n",
        "    for u in graph.nodes:\n",
        "        label = labels[u]\n",
        "        class_probs[label] += 1\n",
        "        class_degree_weighted_probs[label] += graph.degree(u)\n",
        "\n",
        "    class_probs /= class_probs.sum()\n",
        "    class_degree_weighted_probs /= class_degree_weighted_probs.sum()\n",
        "\n",
        "    edge_probs = np.zeros((num_classes, num_classes))\n",
        "    for u, v in graph.edges:\n",
        "        label_u = labels[u]\n",
        "        label_v = labels[v]\n",
        "        edge_probs[label_u, label_v] += 1\n",
        "        edge_probs[label_v, label_u] += 1\n",
        "\n",
        "    edge_probs /= edge_probs.sum()\n",
        "\n",
        "    edge_probs += eps\n",
        "\n",
        "    numerator = (edge_probs * np.log(edge_probs)).sum()\n",
        "    denominator = (class_degree_weighted_probs * np.log(class_degree_weighted_probs)).sum()\n",
        "    li_edge = 2 - numerator / denominator\n",
        "\n",
        "    return li_edge\n",
        "\n",
        "\n",
        "def li_node(graph, labels, eps=1e-8):\n",
        "    \"\"\"Compute node label informativeness.\"\"\"\n",
        "    labels = convert_labels_to_consecutive_integers(labels)\n",
        "\n",
        "    num_classes = len(np.unique(labels))\n",
        "\n",
        "    class_probs = np.array([0 for _ in range(num_classes)], dtype=float)\n",
        "    class_degree_weighted_probs = np.array([0 for _ in range(num_classes)], dtype=float)\n",
        "    num_zero_degree_nodes = 0\n",
        "    for u in graph.nodes:\n",
        "        if graph.degree(u) == 0:\n",
        "            num_zero_degree_nodes += 1\n",
        "            continue\n",
        "\n",
        "        label = labels[u]\n",
        "        class_probs[label] += 1\n",
        "        class_degree_weighted_probs[label] += graph.degree(u)\n",
        "\n",
        "    class_probs /= class_probs.sum()\n",
        "    class_degree_weighted_probs /= class_degree_weighted_probs.sum()\n",
        "    num_nonzero_degree_nodes = len(graph.nodes) - num_zero_degree_nodes\n",
        "\n",
        "\n",
        "    edge_probs = np.zeros((num_classes, num_classes))\n",
        "    for u, v in graph.edges:\n",
        "        label_u = labels[u]\n",
        "        label_v = labels[v]\n",
        "        edge_probs[label_u, label_v] += 1 / (num_nonzero_degree_nodes * graph.degree(u))\n",
        "        edge_probs[label_v, label_u] += 1 / (num_nonzero_degree_nodes * graph.degree(v))\n",
        "\n",
        "    edge_probs += eps\n",
        "\n",
        "    log = np.log(edge_probs / (class_probs.reshape(-1, 1) * class_degree_weighted_probs.reshape(1, -1)))\n",
        "    numerator = (edge_probs * log).sum()\n",
        "    denominator = (class_probs * np.log(class_probs)).sum()\n",
        "    li_node = - numerator / denominator\n",
        "\n",
        "    return li_node\n"
      ],
      "metadata": {
        "id": "cX7UEz91WqqA"
      },
      "execution_count": 3,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "Now, let's look at characteristics of some graphs. We will get datasets from PyTorch Geometric."
      ],
      "metadata": {
        "id": "EWMuEn8FuRus"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "!pip install torch_geometric"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "imMmMzhdWuWp",
        "outputId": "41416938-4db9-4e49-aee8-22f7a6b2b8d0"
      },
      "execution_count": 4,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Requirement already satisfied: torch_geometric in /usr/local/lib/python3.10/dist-packages (2.4.0)\n",
            "Requirement already satisfied: tqdm in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (4.66.1)\n",
            "Requirement already satisfied: numpy in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (1.23.5)\n",
            "Requirement already satisfied: scipy in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (1.11.3)\n",
            "Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (3.1.2)\n",
            "Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (2.31.0)\n",
            "Requirement already satisfied: pyparsing in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (3.1.1)\n",
            "Requirement already satisfied: scikit-learn in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (1.2.2)\n",
            "Requirement already satisfied: psutil>=5.8.0 in /usr/local/lib/python3.10/dist-packages (from torch_geometric) (5.9.5)\n",
            "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/dist-packages (from jinja2->torch_geometric) (2.1.3)\n",
            "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests->torch_geometric) (3.3.1)\n",
            "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->torch_geometric) (3.4)\n",
            "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->torch_geometric) (2.0.7)\n",
            "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->torch_geometric) (2023.7.22)\n",
            "Requirement already satisfied: joblib>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from scikit-learn->torch_geometric) (1.3.2)\n",
            "Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.10/dist-packages (from scikit-learn->torch_geometric) (3.2.0)\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "from torch_geometric import datasets"
      ],
      "metadata": {
        "id": "j6KhFM4VuaUP"
      },
      "execution_count": 5,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "def get_graph_and_labels_from_pyg_dataset(dataset):\n",
        "    graph = nx.Graph()\n",
        "    graph.add_nodes_from(range(len(dataset.x)))\n",
        "    graph.add_edges_from(dataset.edge_index.T.numpy())\n",
        "\n",
        "    labels = dataset.y.numpy()\n",
        "\n",
        "    return graph, labels\n",
        "\n",
        "\n",
        "def print_graph_characteristics(graph, labels):\n",
        "    print(f'number of nodes: {len(graph.nodes)}')\n",
        "    print(f'number of edges: {len(graph.edges)}')\n",
        "    print(f'average node degree: {len(graph.edges) * 2 / len(graph.nodes):.2f}')\n",
        "    print(f'number of classes: {len(np.unique(labels))}')\n",
        "    print(f'edge homophily: {h_edge(graph, labels):.2f}')\n",
        "    print(f'node homophily: {h_node(graph, labels):.2f}')\n",
        "    print(f'class_homophily: {h_class(graph, labels):.2f}')\n",
        "    print(f'adjusted homophily: {h_adj(graph, labels):.2f}')\n",
        "    print(f'edge label informativeness: {li_edge(graph, labels):.2f}')\n",
        "    print(f'node label informativeness: {li_node(graph, labels):.2f}')\n"
      ],
      "metadata": {
        "id": "IAvH6LRwveJ1"
      },
      "execution_count": 6,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "First, let's look at the popular $\\texttt{cora}$ graph."
      ],
      "metadata": {
        "id": "6GNafoErx2kA"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "dataset = datasets.Planetoid(name='cora', root='data')[0]"
      ],
      "metadata": {
        "id": "1EQ2GU8gvLAt"
      },
      "execution_count": 7,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "graph, labels = get_graph_and_labels_from_pyg_dataset(dataset)\n",
        "print_graph_characteristics(graph, labels)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "dX5wRWBLvcTU",
        "outputId": "f536dad5-964b-46c2-9cc5-3c399844e282"
      },
      "execution_count": 8,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "number of nodes: 2708\n",
            "number of edges: 5278\n",
            "average node degree: 3.90\n",
            "number of classes: 7\n",
            "edge homophily: 0.81\n",
            "node homophily: 0.83\n",
            "class_homophily: 0.77\n",
            "adjusted homophily: 0.77\n",
            "edge label informativeness: 0.59\n",
            "node label informativeness: 0.61\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "$\\texttt{Cora}$ is a classic example of a homophilous graph. Values of all homophily measures are high. Values of both label informativeness measures are also high, which is expected for a homophilous graph: knowing neighbor's label provides a lot of information about the node's label.\n",
        "\n",
        "Let's look at one more example of a homophilous graph - $\\texttt{lastfm-asia}$."
      ],
      "metadata": {
        "id": "PY-wdAN2yAvQ"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "dataset = datasets.LastFMAsia(root='data')"
      ],
      "metadata": {
        "id": "JX_6Jk2SxvkB"
      },
      "execution_count": 9,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "graph, labels = get_graph_and_labels_from_pyg_dataset(dataset)\n",
        "print_graph_characteristics(graph, labels)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "AgUxxSq1y74Y",
        "outputId": "46077ae5-d262-479b-93e2-5c66e49bc76e"
      },
      "execution_count": 10,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "number of nodes: 7624\n",
            "number of edges: 27806\n",
            "average node degree: 7.29\n",
            "number of classes: 18\n",
            "edge homophily: 0.87\n",
            "node homophily: 0.83\n",
            "class_homophily: 0.77\n",
            "adjusted homophily: 0.86\n",
            "edge label informativeness: 0.74\n",
            "node label informativeness: 0.68\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Once again, values of all homophily measures and both label informativeness measures are high.\n",
        "\n",
        "Now, let's look at $\\texttt{questions}$ graph - we will see a very different picture."
      ],
      "metadata": {
        "id": "7W0NSoNizB6i"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "dataset = datasets.HeterophilousGraphDataset(name='questions', root='data')"
      ],
      "metadata": {
        "id": "tDzXZ22Sy90u"
      },
      "execution_count": 11,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "graph, labels = get_graph_and_labels_from_pyg_dataset(dataset)\n",
        "print_graph_characteristics(graph, labels)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "jDAMFHuCzlU-",
        "outputId": "ea55a528-0cb7-4e25-fb47-4ccd8afcc66b"
      },
      "execution_count": 12,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "number of nodes: 48921\n",
            "number of edges: 153540\n",
            "average node degree: 6.28\n",
            "number of classes: 2\n",
            "edge homophily: 0.84\n",
            "node homophily: 0.90\n",
            "class_homophily: 0.08\n",
            "adjusted homophily: 0.02\n",
            "edge label informativeness: 0.00\n",
            "node label informativeness: 0.01\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "The values of edge homophily and node homophily are high. One might start to think that these graph is homophilous, however, this is not true. The reason for high values of these two homophily measures is that the classes in these dataset are very unbalanced: 97% of the nodes belong to the majority class. Thus, most of the edges connect nodes of this large class. However, the edge pattern does not depend on node labels, as shown by the value of adjusted homophily being almost zero. Thus, this graph is heterophilous. The values of both label informativeness measures are also almost zero: knowing a neighbor's label does not tell us anything about the node's label.\n",
        "\n",
        "Let's look at one more heterophilous graph - $\\texttt{roman-empire}$."
      ],
      "metadata": {
        "id": "YCbIrbZizrdk"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "dataset = datasets.HeterophilousGraphDataset(name='roman-empire', root='data')"
      ],
      "metadata": {
        "id": "B8Fq0ULR0FD8"
      },
      "execution_count": 13,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "graph, labels = get_graph_and_labels_from_pyg_dataset(dataset)\n",
        "print_graph_characteristics(graph, labels)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "EvR8OB-P1f40",
        "outputId": "287a0307-6690-42cf-fdba-210acc9a1d19"
      },
      "execution_count": 14,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "number of nodes: 22662\n",
            "number of edges: 32927\n",
            "average node degree: 2.91\n",
            "number of classes: 18\n",
            "edge homophily: 0.05\n",
            "node homophily: 0.05\n",
            "class_homophily: 0.02\n",
            "adjusted homophily: -0.05\n",
            "edge label informativeness: 0.11\n",
            "node label informativeness: 0.11\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "This time the values of all homophily measures agree - they are all almost zero, with adjusted homophily even being negative. This graph is heterophilous. What is interesting about this graph is that, despite its heterophily, the values of both label informativeness measures are significantly greater than zero: knowing a neighbor's label still provides us with useful useful information about the node's label."
      ],
      "metadata": {
        "id": "51o34sfa1vhF"
      }
    },
    {
      "cell_type": "markdown",
      "source": [
        "Finally, let's look at one more graph - $\\texttt{genius}$."
      ],
      "metadata": {
        "id": "eTHfKTPG2iQq"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "dataset = datasets.LINKXDataset(name='genius', root='data')"
      ],
      "metadata": {
        "id": "rk7BFMtr1hJb"
      },
      "execution_count": 15,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "graph, labels = get_graph_and_labels_from_pyg_dataset(dataset)\n",
        "print_graph_characteristics(graph, labels)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "phPQRXFI28NT",
        "outputId": "9af40c2f-5ba7-43de-9390-2f18608e711b"
      },
      "execution_count": 16,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "number of nodes: 421961\n",
            "number of edges: 922868\n",
            "average node degree: 4.37\n",
            "number of classes: 2\n",
            "edge homophily: 0.59\n",
            "node homophily: 0.51\n",
            "class_homophily: 0.02\n",
            "adjusted homophily: -0.05\n",
            "edge label informativeness: 0.00\n",
            "node label informativeness: 0.17\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "The values of edge homophily and node homophily are high, however, in fact, the edge pattern does not depend on node labels - the value of adjusted homophily is close to zero. Thus, this graph is heterophilous. What else is interesting about this graph is that the values of edge label informativeness and node label informativeness differ significantly. This is explained by the very unbalanced distribution of node degrees in this graph. Let's examine it."
      ],
      "metadata": {
        "id": "0uBCl-3-3Etg"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "degrees = [degree for node, degree in graph.degree]\n",
        "\n",
        "print(f'share of nodes with degree 1: {sum(deg == 1 for deg in degrees) / len(degrees):.2f}')\n",
        "print(f'max node degree {max(degrees)}')"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "asxD8hs-5NFV",
        "outputId": "ee4cdf76-0b64-418f-cf9e-94c5e3c252d9"
      },
      "execution_count": 17,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "share of nodes with degree 1: 0.66\n",
            "max node degree 10469\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "66% of nodes in this graph have degree 1. However, the highest node degree is more than 10K.\n",
        "\n",
        "Let's plot the histogram of this distribution (note the logarithmic scale of y-axis)."
      ],
      "metadata": {
        "id": "UAY5YL_s6Okt"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from matplotlib import pyplot as plt"
      ],
      "metadata": {
        "id": "uhrheq8W4Esm"
      },
      "execution_count": 18,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "plt.hist(degrees, bins=100)\n",
        "plt.yscale('log')\n",
        "plt.show()"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 430
        },
        "id": "GetnkZqC29qa",
        "outputId": "91b55736-acbd-475f-db5d-e043d4716c7c"
      },
      "execution_count": 19,
      "outputs": [
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "<Figure size 640x480 with 1 Axes>"
            ],
            "image/png": "iVBORw0KGgoAAAANSUhEUgAAAicAAAGdCAYAAADJ6dNTAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjcuMSwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy/bCgiHAAAACXBIWXMAAA9hAAAPYQGoP6dpAAAkXElEQVR4nO3dfXBU133/8Y8erBUqaHlQvUKALLvxQwVk5ejJcuzGNDtRZQYn5KE0QxxZydAmXVq729gVkwbG0zhi8sDQJLehSYfQTJxAmEmUxDikjoyj2JUtEBY2ke2YWsQqzq6gFC2SXcnePb8/8vPaawRmxUp7ru77NbMz7L1H5373SEKfufece/OMMUYAAACWyM91AQAAAG9GOAEAAFYhnAAAAKsQTgAAgFUIJwAAwCqEEwAAYBXCCQAAsArhBAAAWKUw1wVkKplM6qWXXtK8efOUl5eX63IAAMBFMMbo7NmzqqioUH7+hc+NuCacOI4jx3E0MTGh//qv/8p1OQAAYAqGhoa0dOnSC7bJc9vt60dGRjR//nwNDQ2ptLQ01+UAAICLEI/HtWzZMp05c0Z+v/+CbV1z5uR1r1/KKS0tJZwAAOAyFzMlwzUTYh3HUXV1terr63NdCgAAmEauu6wTj8fl9/s1MjLCmRMAAFwik7/frjlzAgAAvIFwAgAArOKacMKcEwAAvIE5JwAAYNox5wQAALgW4QQAAFiFcAIAAKzimnDChFgAALyBCbEAAGDaMSEWAAC4FuEEAABYxXVPJZ5uVe370t4f37o6R5UAAOBNnDkBAABWcU04YbUOAADe4JpwEg6HNTAwoIMHD+a6FAAAMI1cE04AAIA3EE4AAIBVCCcAAMAqhBMAAGAVwgkAALCKa8IJS4kBAPAG14QTlhIDAOANrgknAADAGwgnAADAKoQTAABgFcIJAACwCuEEAABYhXACAACsQjgBAABWIZwAAACrEE4AAIBVXBNOuH09AADe4Jpwwu3rAQDwBteEEwAA4A2EEwAAYBXCCQAAsArhBAAAWIVwAgAArEI4AQAAViGcAAAAqxBOAACAVQgnAADAKoQTAABglcJcHLSqqkqlpaXKz8/XggULdODAgVyUAQAALJSTcCJJ//mf/6m5c+fm6vAAAMBSXNYBAABWyTicdHd3a82aNaqoqFBeXp46OzvPaeM4jqqqqlRcXKzGxkb19vam7c/Ly9N73vMe1dfX6/77759y8QAAYPbJOJyMjY0pGAzKcZxJ9+/Zs0eRSERbtmzR4cOHFQwG1dzcrOHh4VSbRx99VH19ffrJT36iL3zhC3rqqaem/gkAAMCsknE4aWlp0ec//3mtXbt20v3btm3Thg0b1NbWpurqau3YsUMlJSXauXNnqs2SJUskSYsXL9att96qw4cPn/d44+PjisfjaS8AADB7ZXXOycTEhPr6+hQKhd44QH6+QqGQenp6JP3+zMvZs2clSaOjo3r44Ye1fPny8/bZ0dEhv9+fei1btiybJQMAAMtkNZycOnVKiURCgUAgbXsgEFA0GpUkxWIx3XTTTQoGg7rhhhv08Y9/XPX19eftc9OmTRoZGUm9hoaGslkyAACwzIwvJb7qqqt05MiRi27v8/nk8/nkOI4cx1EikZjG6gAAQK5l9cxJWVmZCgoKFIvF0rbHYjGVl5dfUt/hcFgDAwM6ePDgJfUDAADsltVwUlRUpNraWnV1daW2JZNJdXV1qampKZuHAgAAs1TGl3VGR0d17Nix1PvBwUH19/dr4cKFqqysVCQSUWtrq+rq6tTQ0KDt27drbGxMbW1tWS0cAADMThmHk0OHDmnVqlWp95FIRJLU2tqqXbt2ad26dTp58qQ2b96saDSqmpoa7d+//5xJsplizgkAAN6QZ4wxuS4iE/F4XH6/XyMjIyotLc16/1Xt+9LeH9+6OuvHAADAazL5+82zdQAAgFVcE04cx1F1dfUF74kCAADczzXhhKXEAAB4g2vCCQAA8AbCCQAAsIprwglzTgAA8AbXhBPmnAAA4A2uCScAAMAbCCcAAMAqhBMAAGAV14QTJsQCAOANrgknTIgFAMAbXBNOAACANxBOAACAVQgnAADAKoQTAABgFdeEE1brAADgDa4JJ6zWAQDAG1wTTgAAgDcQTgAAgFUIJwAAwCqEEwAAYBXCCQAAsIprwglLiQEA8AbXhBOWEgMA4A2uCScAAMAbCCcAAMAqhBMAAGAVwgkAALAK4QQAAFiFcAIAAKxCOAEAAFYhnAAAAKu4Jpxwh1gAALzBNeGEO8QCAOANrgknAADAGwgnAADAKoQTAABgFcIJAACwCuEEAABYhXACAACsQjgBAABWIZwAAACrEE4AAIBVCCcAAMAqhBMAAGCVnIWTl19+WVdccYU+85nP5KoEAABgoZyFk/vuu0833HBDrg4PAAAslZNw8vzzz+vZZ59VS0tLLg4PAAAslnE46e7u1po1a1RRUaG8vDx1dnae08ZxHFVVVam4uFiNjY3q7e1N2/+Zz3xGHR0dUy4aAADMXhmHk7GxMQWDQTmOM+n+PXv2KBKJaMuWLTp8+LCCwaCam5s1PDwsSfrxj3+sa665Rtdcc81FHW98fFzxeDztBQAAZq/CTL+gpaXlgpdjtm3bpg0bNqitrU2StGPHDu3bt087d+5Ue3u7Hn/8ce3evVt79+7V6OioXn31VZWWlmrz5s2T9tfR0aF777030zIBAIBLZXXOycTEhPr6+hQKhd44QH6+QqGQenp6JP0+bAwNDen48eP68pe/rA0bNpw3mEjSpk2bNDIyknoNDQ1ls2QAAGCZjM+cXMipU6eUSCQUCATStgcCAT377LNT6tPn88nn82WjPAAA4AJZDSeZuuOOOy66reM4chxHiURi+goCAAA5l9XLOmVlZSooKFAsFkvbHovFVF5efkl9h8NhDQwM6ODBg5fUDwAAsFtWw0lRUZFqa2vV1dWV2pZMJtXV1aWmpqZsHgoAAMxSGV/WGR0d1bFjx1LvBwcH1d/fr4ULF6qyslKRSEStra2qq6tTQ0ODtm/frrGxsdTqnanisg4AAN6QZ4wxmXzBI488olWrVp2zvbW1Vbt27ZIkff3rX9eXvvQlRaNR1dTU6Ktf/aoaGxuzUnA8Hpff79fIyIhKS0uz0uebVbXvS3t/fOvqrB8DAACvyeTvd8bhJNcIJwAAuE8mf79z9uA/AACAybgmnDiOo+rqatXX1+e6FAAAMI1cE05YSgwAgDe4JpwAAABvIJwAAACruCacMOcEAABvcE04Yc4JAADe4JpwAgAAvIFwAgAArOKacMKcEwAAvME14YQ5JwAAeINrwgkAAPAGwgkAALAK4QQAAFiFcAIAAKzimnDCah0AALzBNeGE1ToAAHiDa8IJAADwBsIJAACwCuEEAABYhXACAACsQjgBAABWIZwAAACruCaccJ8TAAC8wTXhhPucAADgDa4JJwAAwBsIJwAAwCqEEwAAYBXCCQAAsArhBAAAWIVwAgAArEI4AQAAViGcAAAAq7gmnHCHWAAAvME14YQ7xAIA4A2uCScAAMAbCCcAAMAqhBMAAGAVwgkAALAK4QQAAFiFcAIAAKxCOAEAAFYhnAAAAKsQTgAAgFUIJwAAwCozHk7OnDmjuro61dTUaMWKFfrWt7410yUAAACLFc70AefNm6fu7m6VlJRobGxMK1as0Ac/+EEtWrRopksBAAAWmvEzJwUFBSopKZEkjY+PyxgjY8xMlwEAACyVcTjp7u7WmjVrVFFRoby8PHV2dp7TxnEcVVVVqbi4WI2Njert7U3bf+bMGQWDQS1dulR33323ysrKpvwBAADA7JJxOBkbG1MwGJTjOJPu37NnjyKRiLZs2aLDhw8rGAyqublZw8PDqTbz58/XkSNHNDg4qO9973uKxWJT/wQAAGBWyTictLS06POf/7zWrl076f5t27Zpw4YNamtrU3V1tXbs2KGSkhLt3LnznLaBQEDBYFC/+tWvznu88fFxxePxtBcAAJi9sjrnZGJiQn19fQqFQm8cID9foVBIPT09kqRYLKazZ89KkkZGRtTd3a1rr732vH12dHTI7/enXsuWLctmyQAAwDJZDSenTp1SIpFQIBBI2x4IBBSNRiVJv/3tb3XzzTcrGAzq5ptv1t/8zd9o5cqV5+1z06ZNGhkZSb2GhoayWTIAALDMjC8lbmhoUH9//0W39/l88vl801cQAACwSlbPnJSVlamgoOCcCa6xWEzl5eWX1LfjOKqurlZ9ff0l9QMAAOyW1XBSVFSk2tpadXV1pbYlk0l1dXWpqanpkvoOh8MaGBjQwYMHL7VMAABgsYwv64yOjurYsWOp94ODg+rv79fChQtVWVmpSCSi1tZW1dXVqaGhQdu3b9fY2Jja2tqyWvhMqWrfd86241tX56ASAAC8IeNwcujQIa1atSr1PhKJSJJaW1u1a9curVu3TidPntTmzZsVjUZVU1Oj/fv3nzNJNlOO48hxHCUSiUvqBwAA2C3PuOze8fF4XH6/XyMjIyotLc16/5OdKXkrzpwAAJCZTP5+z/izdQAAAC7ENeGE1ToAAHiDa8IJq3UAAPAG14QTAADgDYQTAABgFdeEE+acAADgDa4JJ8w5AQDAG1wTTgAAgDcQTgAAgFUIJwAAwCquCSdMiAUAwBtcE06YEAsAgDe4JpwAAABvIJwAAACrEE4AAIBVCCcAAMAqrgknrNYBAMAb8owxJtdFZCIej8vv92tkZESlpaVZ77+qfV/GX3N86+qs1wEAwGySyd9v15w5AQAA3kA4AQAAViGcAAAAqxBOAACAVQgnAADAKq4JJywlBgDAG1wTTnjwHwAA3uCacAIAALyBcAIAAKxCOAEAAFYhnAAAAKsQTgAAgFUIJwAAwCqEEwAAYBXCCQAAsArhBAAAWKUw1wVcLMdx5DiOEolErks5R1X7vnO2Hd+6OgeVAADgfq45c8Lt6wEA8AbXhBMAAOANhBMAAGAVwgkAALAK4QQAAFiFcAIAAKxCOAEAAFYhnAAAAKsQTgAAgFUIJwAAwCqEEwAAYJUZDydDQ0O65ZZbVF1drXe+853au3fvTJcAAAAsNuMP/issLNT27dtVU1OjaDSq2tpa3XrrrfqDP/iDmS4FAABYaMbDyeLFi7V48WJJUnl5ucrKynT69GnCCQAAkDSFyzrd3d1as2aNKioqlJeXp87OznPaOI6jqqoqFRcXq7GxUb29vZP21dfXp0QioWXLlmVcOAAAmJ0yDidjY2MKBoNyHGfS/Xv27FEkEtGWLVt0+PBhBYNBNTc3a3h4OK3d6dOn9fGPf1zf/OY3p1Y5AACYlTK+rNPS0qKWlpbz7t+2bZs2bNigtrY2SdKOHTu0b98+7dy5U+3t7ZKk8fFxfeADH1B7e7tuvPHGCx5vfHxc4+PjqffxeDzTkgEAgItkdbXOxMSE+vr6FAqF3jhAfr5CoZB6enokScYY3XHHHfrTP/1T3X777W/bZ0dHh/x+f+rFJSAAAGa3rIaTU6dOKZFIKBAIpG0PBAKKRqOSpMcee0x79uxRZ2enampqVFNTo6effvq8fW7atEkjIyOp19DQUDZLBgAAlpnx1To33XSTksnkRbf3+Xzy+XzTWBEAALBJVs+clJWVqaCgQLFYLG17LBZTeXn5JfXtOI6qq6tVX19/Sf0AAAC7ZTWcFBUVqba2Vl1dXaltyWRSXV1dampquqS+w+GwBgYGdPDgwUstEwAAWCzjyzqjo6M6duxY6v3g4KD6+/u1cOFCVVZWKhKJqLW1VXV1dWpoaND27ds1NjaWWr0DAABwIRmHk0OHDmnVqlWp95FIRJLU2tqqXbt2ad26dTp58qQ2b96saDSqmpoa7d+//5xJsplyHEeO4yiRSFxSPwAAwG55xhiT6yIyEY/H5ff7NTIyotLS0qz3X9W+Lyv9HN+6Oiv9AAAwG2Ty93vGn0oMAABwIa4JJ6zWAQDAG1wTTlitAwCAN7gmnAAAAG8gnAAAAKu4Jpww5wQAAG9wTThhzgkAAN4w4w/+84q33i+F+54AAHBxXHPmBAAAeAPhBAAAWMU14YQJsQAAeINrwgkTYgEA8AbXhBMAAOANrNZxmcmemsxKIADAbMKZEwAAYBXCCQAAsIprwgmrdQAA8AbXhBNW6wAA4A2uCScAAMAbCCcAAMAqhBMAAGAVwgkAALAKN2GbIZPdPO2tuJkaAAAuOnPCUmIAALzBNeGEpcQAAHiDa8IJAADwBsIJAACwCuEEAABYhXACAACsQjgBAABWIZwAAACrEE4AAIBVCCcAAMAqhBMAAGAV14QTbl8PAIA3uObBf+FwWOFwWPF4XH6/P9flzJiLeWAgAACziWvOnAAAAG8gnAAAAKsQTgAAgFUIJwAAwCqEEwAAYBXXrNbxAlbmAADAmRMAAGAZwgkAALAK4QQAAFiFcAIAAKySk3Cydu1aLViwQB/+8IdzcXgAAGCxnISTO++8U9/5zndycWgAAGC5nISTW265RfPmzcvFoQEAgOUyDifd3d1as2aNKioqlJeXp87OznPaOI6jqqoqFRcXq7GxUb29vdmoFRepqn3fOS8AANwi43AyNjamYDAox3Em3b9nzx5FIhFt2bJFhw8fVjAYVHNzs4aHhy+5WAAAMPtlfIfYlpYWtbS0nHf/tm3btGHDBrW1tUmSduzYoX379mnnzp1qb2/PuMDx8XGNj4+n3sfj8Yz7AAAA7pHV29dPTEyor69PmzZtSm3Lz89XKBRST0/PlPrs6OjQvffem60SPeutl3aOb12do0oAALiwrE6IPXXqlBKJhAKBQNr2QCCgaDSaeh8KhfSRj3xEDz74oJYuXXrB4LJp0yaNjIykXkNDQ9ksGQAAWCYnD/77xS9+cdFtfT6ffD6fHMeR4zhKJBLTWBngPZNNmObMGoBcyuqZk7KyMhUUFCgWi6Vtj8ViKi8vv6S+w+GwBgYGdPDgwUvqBwAA2C2r4aSoqEi1tbXq6upKbUsmk+rq6lJTU1M2DwUAAGapjC/rjI6O6tixY6n3g4OD6u/v18KFC1VZWalIJKLW1lbV1dWpoaFB27dv19jYWGr1DgAAwIVkHE4OHTqkVatWpd5HIhFJUmtrq3bt2qV169bp5MmT2rx5s6LRqGpqarR///5zJslmijknAAB4Q8bh5JZbbpEx5oJtNm7cqI0bN065qMmEw2GFw2HF43H5/f6s9g0AAOyRk2frAAAAnE9OlhJPBZd1zo9n5wAAZhPXnDlhKTEAAN7gmnACAAC8gXACAACs4ppw4jiOqqurVV9fn+tSAADANHJNOGHOCQAA3uCacAIAALyBcAIAAKxCOAEAAFbhJmweNdmN245vXZ2DSgAASOeaMydMiAUAwBtcE04AAIA3EE4AAIBVCCcAAMAqhBMAAGAV14QTbl8PAIA3uCacsFoHAABvcE04AQAA3kA4AQAAViGcAAAAqxBOAACAVQgnAADAKjz4Dznx1gcP8tBBAMDrXHPmhKXEAAB4g2vCCQAA8AbCCQAAsArhBAAAWIVwAgAArEI4AQAAViGcAAAAqxBOAACAVQgnAADAKtwhFimz9a6ts/VzwV3e+nMo8bMInI9rzpxwh1gAALzBNeEEAAB4A+EEAABYhXACAACsQjgBAABWIZwAAACrEE4AAIBVCCcAAMAqhBMAAGAVwgkAALAK4QQAAFiFcAIAAKySk3DywAMP6Nprr9XVV1+tf/u3f8tFCQAAwFIz/lTi1157TZFIRAcOHJDf71dtba3Wrl2rRYsWzXQpAADAQjN+5qS3t1fLly/XkiVLNHfuXLW0tOg//uM/ZroMAABgqYzDSXd3t9asWaOKigrl5eWps7PznDaO46iqqkrFxcVqbGxUb29vat9LL72kJUuWpN4vWbJEJ06cmFr1AABg1sk4nIyNjSkYDMpxnEn379mzR5FIRFu2bNHhw4cVDAbV3Nys4eHhKRU4Pj6ueDye9gIAALNXxnNOWlpa1NLSct7927Zt04YNG9TW1iZJ2rFjh/bt26edO3eqvb1dFRUVaWdKTpw4oYaGhvP219HRoXvvvTfTMpEFVe37cnqs41tXT+nrbDLVzzWVvrPVrxvM5LhOxktjDXe7mN+V6fx9mqqszjmZmJhQX1+fQqHQGwfIz1coFFJPT48kqaGhQUePHtWJEyc0Ojqqn/3sZ2pubj5vn5s2bdLIyEjqNTQ0lM2SAQCAZbK6WufUqVNKJBIKBAJp2wOBgJ599tnfH7CwUF/5yle0atUqJZNJ3XPPPRdcqePz+eTz+bJZJgAAsNiMLyWWpNtuu0233XZbRl/jOI4cx1EikZimqgAAgA2yelmnrKxMBQUFisViadtjsZjKy8svqe9wOKyBgQEdPHjwkvoBAAB2y2o4KSoqUm1trbq6ulLbksmkurq61NTUlM1DAQCAWSrjyzqjo6M6duxY6v3g4KD6+/u1cOFCVVZWKhKJqLW1VXV1dWpoaND27ds1NjaWWr0zVVzWAQDAGzIOJ4cOHdKqVatS7yORiCSptbVVu3bt0rp163Ty5Elt3rxZ0WhUNTU12r9//zmTZDMVDocVDocVj8fl9/svqS8AAGCvjMPJLbfcImPMBdts3LhRGzdunHJRAADAu3LyVGIAAIDzcU04cRxH1dXVqq+vz3UpAABgGrkmnLCUGAAAb3BNOAEAAN5AOAEAAFZxTThhzgkAAN7gmnDCnBMAALwhJw/+uxSv32MlHo9PS//J8ZenpV9k7mK+x1P5fk3Xz85kJqsvW8d/a99T7Xc6a5wuMzmuk5nKsdw4znC/i/m5m6mfzdf7fLt7pUlSnrmYVhb57//+by1btizXZQAAgCkYGhrS0qVLL9jGdeEkmUzqpZde0rx585SXl5fVvuPxuJYtW6ahoSGVlpZmtW8vY1ynB+M6PRjX6cG4Tg83jasxRmfPnlVFRYXy8y88q8R1l3Xy8/PfNnFdqtLSUuu/yW7EuE4PxnV6MK7Tg3GdHm4Z14t9Np5rJsQCAABvIJwAAACrEE7exOfzacuWLfL5fLkuZVZhXKcH4zo9GNfpwbhOj9k6rq6bEAsAAGY3zpwAAACrEE4AAIBVCCcAAMAqhBMAAGAVwsn/5ziOqqqqVFxcrMbGRvX29ua6JGt0dHSovr5e8+bN0+WXX64PfOADeu6559La/N///Z/C4bAWLVqkuXPn6kMf+pBisVhamxdffFGrV69WSUmJLr/8ct1999167bXX0to88sgjete73iWfz6d3vOMd2rVr13R/PGts3bpVeXl5uuuuu1LbGNepOXHihD72sY9p0aJFmjNnjlauXKlDhw6l9htjtHnzZi1evFhz5sxRKBTS888/n9bH6dOntX79epWWlmr+/Pn65Cc/qdHR0bQ2Tz31lG6++WYVFxdr2bJl+uIXvzgjny9XEomEPve5z+nKK6/UnDlz9Ed/9Ef6p3/6p7RnpTC2b6+7u1tr1qxRRUWF8vLy1NnZmbZ/Jsdw7969uu6661RcXKyVK1fqwQcfzPrnnRIDs3v3blNUVGR27txpfv3rX5sNGzaY+fPnm1gsluvSrNDc3Gy+/e1vm6NHj5r+/n5z6623msrKSjM6Oppq86lPfcosW7bMdHV1mUOHDpkbbrjB3Hjjjan9r732mlmxYoUJhULmySefNA8++KApKyszmzZtSrV54YUXTElJiYlEImZgYMB87WtfMwUFBWb//v0z+nlzobe311RVVZl3vvOd5s4770xtZ1wzd/r0aXPFFVeYO+64wzzxxBPmhRdeMD//+c/NsWPHUm22bt1q/H6/6ezsNEeOHDG33XabufLKK80rr7ySavNnf/ZnJhgMmscff9z86le/Mu94xzvMRz/60dT+kZEREwgEzPr1683Ro0fN97//fTNnzhzzr//6rzP6eWfSfffdZxYtWmQeeOABMzg4aPbu3Wvmzp1r/vmf/znVhrF9ew8++KD57Gc/a374wx8aSeZHP/pR2v6ZGsPHHnvMFBQUmC9+8YtmYGDA/OM//qO57LLLzNNPPz3tY/B2CCfGmIaGBhMOh1PvE4mEqaioMB0dHTmsyl7Dw8NGkvnlL39pjDHmzJkz5rLLLjN79+5NtXnmmWeMJNPT02OM+f0vY35+volGo6k23/jGN0xpaakZHx83xhhzzz33mOXLl6cda926daa5uXm6P1JOnT171lx99dXmoYceMu95z3tS4YRxnZp/+Id/MDfddNN59yeTSVNeXm6+9KUvpbadOXPG+Hw+8/3vf98YY8zAwICRZA4ePJhq87Of/czk5eWZEydOGGOM+Zd/+RezYMGC1Di/fuxrr7022x/JGqtXrzaf+MQn0rZ98IMfNOvXrzfGMLZT8dZwMpNj+Od//udm9erVafU0Njaav/qrv8rqZ5wKz1/WmZiYUF9fn0KhUGpbfn6+QqGQenp6cliZvUZGRiRJCxculCT19fXp1VdfTRvD6667TpWVlakx7Onp0cqVKxUIBFJtmpubFY/H9etf/zrV5s19vN5mtn8fwuGwVq9efc5nZ1yn5ic/+Ynq6ur0kY98RJdffrmuv/56fetb30rtHxwcVDQaTRsTv9+vxsbGtHGdP3++6urqUm1CoZDy8/P1xBNPpNr8yZ/8iYqKilJtmpub9dxzz+l///d/p/tj5sSNN96orq4u/eY3v5EkHTlyRI8++qhaWlokMbbZMJNjaPP/DZ4PJ6dOnVIikUj7z12SAoGAotFojqqyVzKZ1F133aV3v/vdWrFihSQpGo2qqKhI8+fPT2v75jGMRqOTjvHr+y7UJh6P65VXXpmOj5Nzu3fv1uHDh9XR0XHOPsZ1al544QV94xvf0NVXX62f//zn+vSnP62//du/1b//+79LemNcLvQ7H41Gdfnll6ftLyws1MKFCzMa+9mmvb1df/EXf6HrrrtOl112ma6//nrdddddWr9+vSTGNhtmcgzP18aGMXbdU4mRW+FwWEePHtWjjz6a61Jcb2hoSHfeeaceeughFRcX57qcWSOZTKqurk5f+MIXJEnXX3+9jh49qh07dqi1tTXH1bnbD37wA91///363ve+p+XLl6u/v1933XWXKioqGFtklefPnJSVlamgoOCcFRCxWEzl5eU5qspOGzdu1AMPPKADBw5o6dKlqe3l5eWamJjQmTNn0tq/eQzLy8snHePX912oTWlpqebMmZPtj5NzfX19Gh4e1rve9S4VFhaqsLBQv/zlL/XVr35VhYWFCgQCjOsULF68WNXV1Wnb/viP/1gvvviipDfG5UK/8+Xl5RoeHk7b/9prr+n06dMZjf1sc/fdd6fOnqxcuVK33367/u7v/i515o+xvXQzOYbna2PDGHs+nBQVFam2tlZdXV2pbclkUl1dXWpqasphZfYwxmjjxo360Y9+pIcfflhXXnll2v7a2lpddtllaWP43HPP6cUXX0yNYVNTk55++um0X6iHHnpIpaWlqT8kTU1NaX283ma2fh/e+9736umnn1Z/f3/qVVdXp/Xr16f+zbhm7t3vfvc5S91/85vf6IorrpAkXXnllSovL08bk3g8rieeeCJtXM+cOaO+vr5Um4cffljJZFKNjY2pNt3d3Xr11VdTbR566CFde+21WrBgwbR9vlx6+eWXlZ+f/mejoKBAyWRSEmObDTM5hlb/35DrGbk22L17t/H5fGbXrl1mYGDA/OVf/qWZP39+2goIL/v0pz9t/H6/eeSRR8zvfve71Ovll19OtfnUpz5lKisrzcMPP2wOHTpkmpqaTFNTU2r/60te3/e+95n+/n6zf/9+84d/+IeTLnm9++67zTPPPGMcx5nVS14n8+bVOsYwrlPR29trCgsLzX333Weef/55c//995uSkhLz3e9+N9Vm69atZv78+ebHP/6xeeqpp8z73//+SZdqXn/99eaJJ54wjz76qLn66qvTlmqeOXPGBAIBc/vtt5ujR4+a3bt3m5KSklmz3HUyra2tZsmSJamlxD/84Q9NWVmZueeee1JtGNu3d/bsWfPkk0+aJ5980kgy27ZtM08++aT57W9/a4yZuTF87LHHTGFhofnyl79snnnmGbNlyxaWEtvma1/7mqmsrDRFRUWmoaHBPP7447kuyRqSJn19+9vfTrV55ZVXzF//9V+bBQsWmJKSErN27Vrzu9/9Lq2f48ePm5aWFjNnzhxTVlZm/v7v/968+uqraW0OHDhgampqTFFRkbnqqqvSjuEFbw0njOvU/PSnPzUrVqwwPp/PXHfddeab3/xm2v5kMmk+97nPmUAgYHw+n3nve99rnnvuubQ2//M//2M++tGPmrlz55rS0lLT1tZmzp49m9bmyJEj5qabbjI+n88sWbLEbN26ddo/Wy7F43Fz5513msrKSlNcXGyuuuoq89nPfjZtuSpj+/YOHDgw6f+pra2txpiZHcMf/OAH5pprrjFFRUVm+fLlZt++fdP2uTORZ8ybbu0HAACQY56fcwIAAOxCOAEAAFYhnAAAAKsQTgAAgFUIJwAAwCqEEwAAYBXCCQAAsArhBAAAWIVwAgAArEI4AQAAViGcAAAAqxBOAACAVf4fUDcZ2EDgseEAAAAASUVORK5CYII=\n"
          },
          "metadata": {}
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "We can see that most nodes have rather low degrees, but there are some outliers with huge node degrees. Since edge label informativeness and node label informativeness differ in how they weight high/low-degree nodes, the significant difference in their values for such a graph is not surprising."
      ],
      "metadata": {
        "id": "DwnRAlpG66UV"
      }
    }
  ]
}