{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 98,
   "id": "ad52d6c7-ebb9-40a0-9787-976b08cdc790",
   "metadata": {},
   "outputs": [],
   "source": [
    "from pathlib import Path\n",
    "from urllib.request import urlopen\n",
    "from bs4 import BeautifulSoup\n",
    "import numpy as np\n",
    "from sklearn.feature_extraction.text import CountVectorizer\n",
    "import matplotlib.pyplot as plt"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 99,
   "id": "bd896491-f236-4adc-9531-6260edbc76a6",
   "metadata": {},
   "outputs": [],
   "source": [
    "plt.rcParams.update({\n",
    "    \"text.usetex\": True,    \n",
    "    \"font.size\": 12\n",
    "})\n",
    "img_dir = Path('/home/cmendoza/MEGA/Research/dissertation/LaTeX/img/')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 100,
   "id": "4364f22e-9fec-4027-86ab-2c123237a004",
   "metadata": {},
   "outputs": [],
   "source": [
    "uf = urlopen('https://www.eecis.udel.edu/~boncelet/')\n",
    "html = uf.read()\n",
    "soup = BeautifulSoup(html, 'html.parser')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 101,
   "id": "e3fcac25-7624-44f8-bda1-c30933d953c6",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Charles Boncelet\\'s Home Page | Boncelet HomepageSkip to main contentToggle navigationBoncelet HomepageResearchPublicationsIPythonBlogArchiveTagsRSS feedSourceCharles Boncelet\\'s Home PageUniversity of DelawareProfessorDepartment of Electrical & Computer Engineering313 Evans HallNewark DE 19716boncelet@udel.edu302-831-8008 (w)  I have a joint appointment in the Department of Computer & Information Sciences.  I am a Past President of the University of Delaware Faculty Senate and was a long time Associate Chair of the ECE Departmet in charge of the undergraduate program.I have held visiting positions at the University of Michigan (1992), Tampere University of Technology in Tampere, Finland (1999), the Australian Defence Force Academy in Canberra, Australia (2000), the University of Electronic Science and Technology of China in Chengdu, China, and the Idaho National Laboratory.BookMy textbook, Probability, Statistics, and Random Signals, is out.  It is available through my publisher, Oxford University Press, and through Amazon.com.My book is designed for a junior or senior year class, but many reviewerssuggested they may use it in their graduate classes.  It has lots of carefully drawn illustrations, computational examples using Matlab, Python, and R, and many worked out examples. The first nine chapters cover discrete and continuous probability.The remaining five chapters cover statistics, hypothesis testing, randomvectors and linear regression, and random processes.  Contact me if you are interested in using the text in your class or your studies.  The Solutions Manual has solutions for every homeworkproblem (writing solutions to every problem took me much longer than I anticipated!).Oxford University Press link to Probability, Statistics, and Random SignalsAmazon link to Probability, Statistics, and Random SignalsEducationPhD, Electrical Engineering and Computer Science, Princeton University, 1984.  My dissertation, \"Approaches and Algorithms for Robust Signal Processing,\" was supervised by Prof. Bradley Dickinson.MS, Electrical Engineering and Computer Science, Princeton University, 1982.BS, Applied and Engineering Physics, Cornell University, 1980, Cum Laude (the highest category for engineering students). I rowed freshman lightweight crew and participated in various intramural sports.TeachingI regularly teach the following courses:ELEG 306, Digital Signal ProcessingELEG 310, Random Signals and NoiseOver the years, I have also taught analog and digital circuits and graduate courses in communications, signal processing, and information theory.Two years ago Prof. Rossi (Math) and I took a group of UD students to Melbourne,Australia, for a Study Abroad program.  This past January I took a group toLondon.  Next year, Rossi and I plan to return to Melbourne.ResearchI\\'ve published over 100 papers in various journals and technical conferences. See Publications for the list.My main research areas are the following:Information security, principally steganography, steganalysis, and message authentication.Signal and Image Processing, principally linear and nonlinear filtering, lossless and lossy data compression. Computational Algorithms for robust regression, L1 estimation, and order statistic probabilities.I have three patents:U. S. Patent #8770998, “Safety System Electrical Connection and Methods,” C. G. Boncelet Jr, 2014. U. S. Patent #6557103, “Spread Spectrum Image Steganography,” C. G. Boncelet Jr., L. M. Marvel, and C. T. Retter.U. S. Patent #6556689, “Watermarking Methods for Digital Images and Videos,” X. Xia, C. G. Boncelet Jr., and G. Arce.ConsultingI consult in various ways.  Please contact me for more details.Yearly with NSF on various signal processing and communications panels.Patent Infringement.  Over the past decade or so, I have worked with a number of law firms as an expert witness in various cases in wireless communications, signal processing, and data compression.Covington & BurlingRopes & GrayGibson, Dunn, & CrutcherKirkland & Ellis (twice)Cooley, Godward, & KronishMcCarter & EnglishDigital signal processing and data analysis.  I am expert in thesetechnical areas and programming in Matlab, Python, and R.BlogI have started a blog focusing on Python and IPython applications in signal processing, probability, and data analysis.Teaching and curriculum development.Golf and baseball (can\\'t work all day!)Current events (issues, not people).PersonalI am married with two adult children. I enjoy various hobbies: golf, bicycling (on a recumbent), basketball, tennis, hiking and camping, and a bit of woodworking.Curriculum VitaeMy CV can be found here.            Contents © 2017         Charles Boncelet - Powered by         Nikola'"
      ]
     },
     "execution_count": 101,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "txt = soup.get_text()\n",
    "txt = txt.replace('\\n', '')\n",
    "txt"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 102,
   "id": "0de48fcb-15ee-48c0-a2c9-e4106c8b61e4",
   "metadata": {},
   "outputs": [],
   "source": [
    "count_vect = CountVectorizer()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 103,
   "id": "8fcdd54d-3a60-4fec-85cf-7da4656030d1",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<1x363 sparse matrix of type '<class 'numpy.int64'>'\n",
       "\twith 363 stored elements in Compressed Sparse Row format>"
      ]
     },
     "execution_count": 103,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "txt_cnts = count_vect.fit_transform([soup.get_text()])\n",
    "txt_cnts"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 104,
   "id": "439e477e-c4b1-4861-be2f-181d6d0c67f2",
   "metadata": {},
   "outputs": [],
   "source": [
    "idx2word = {k:v for v, k in count_vect.vocabulary_.items()}"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 105,
   "id": "9fb9af53-5678-4459-b2e9-b9355a9345e0",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "and: 44\n",
      "the: 18\n",
      "in: 17\n",
      "of: 15\n",
      "university: 10\n",
      "boncelet: 9\n",
      "for: 9\n",
      "signal: 8\n",
      "to: 8\n",
      "processing: 8\n",
      "my: 6\n",
      "various: 6\n",
      "random: 6\n",
      "have: 6\n",
      "probability: 5\n",
      "engineering: 5\n",
      "electrical: 4\n",
      "digital: 4\n",
      "computer: 4\n",
      "data: 4\n",
      "patent: 4\n",
      "signals: 4\n",
      "statistics: 4\n",
      "past: 3\n",
      "blog: 3\n",
      "jr: 3\n",
      "charles: 3\n",
      "communications: 3\n",
      "on: 3\n",
      "or: 3\n",
      "it: 3\n",
      "research: 3\n",
      "is: 3\n",
      "over: 3\n",
      "information: 3\n",
      "python: 3\n",
      "solutions: 3\n",
      "science: 3\n",
      "me: 3\n",
      "am: 3\n",
      "took: 3\n",
      "with: 3\n",
      "are: 2\n",
      "following: 2\n",
      "problem: 2\n",
      "prof: 2\n",
      "program: 2\n",
      "expert: 2\n",
      "examples: 2\n",
      "every: 2\n",
      "publications: 2\n",
      "through: 2\n",
      "using: 2\n",
      "regression: 2\n",
      "algorithms: 2\n",
      "robust: 2\n",
      "rossi: 2\n",
      "department: 2\n",
      "delaware: 2\n",
      "melbourne: 2\n",
      "curriculum: 2\n",
      "amazon: 2\n",
      "was: 2\n",
      "principally: 2\n",
      "princeton: 2\n",
      "press: 2\n",
      "matlab: 2\n",
      "methods: 2\n",
      "many: 2\n",
      "main: 2\n",
      "years: 2\n",
      "link: 2\n",
      "linear: 2\n",
      "year: 2\n",
      "worked: 2\n",
      "cover: 2\n",
      "out: 2\n",
      "oxford: 2\n",
      "page: 2\n",
      "image: 2\n",
      "homepage: 2\n",
      "home: 2\n",
      "has: 2\n",
      "group: 2\n",
      "graduate: 2\n",
      "golf: 2\n",
      "ipython: 2\n",
      "courses: 2\n",
      "eleg: 2\n",
      "your: 2\n",
      "students: 2\n",
      "technical: 2\n",
      "class: 2\n",
      "steganography: 2\n",
      "chapters: 2\n",
      "two: 2\n",
      "book: 2\n",
      "compression: 2\n",
      "china: 2\n",
      "analysis: 2\n",
      "tampere: 2\n",
      "computational: 2\n",
      "australia: 2\n",
      "areas: 2\n",
      "can: 2\n",
      "technology: 2\n",
      "teaching: 2\n",
      "contact: 2\n",
      "by: 2\n",
      "textbook: 1\n",
      "physics: 1\n",
      "woodworking: 1\n",
      "text: 1\n",
      "testing: 1\n",
      "witness: 1\n",
      "phd: 1\n",
      "tennis: 1\n",
      "panels: 1\n",
      "personal: 1\n",
      "people: 1\n",
      "papers: 1\n",
      "participated: 1\n",
      "wireless: 1\n",
      "ways: 1\n",
      "patents: 1\n",
      "toggle: 1\n",
      "work: 1\n",
      "xia: 1\n",
      "time: 1\n",
      "message: 1\n",
      "three: 1\n",
      "michigan: 1\n",
      "this: 1\n",
      "more: 1\n",
      "they: 1\n",
      "ms: 1\n",
      "much: 1\n",
      "national: 1\n",
      "navigation: 1\n",
      "newark: 1\n",
      "next: 1\n",
      "these: 1\n",
      "theory: 1\n",
      "nikola: 1\n",
      "nine: 1\n",
      "noise: 1\n",
      "nonlinear: 1\n",
      "not: 1\n",
      "nsf: 1\n",
      "number: 1\n",
      "yearly: 1\n",
      "their: 1\n",
      "writing: 1\n",
      "order: 1\n",
      "than: 1\n",
      "plan: 1\n",
      "source: 1\n",
      "please: 1\n",
      "retter: 1\n",
      "return: 1\n",
      "reviewers: 1\n",
      "steganalysis: 1\n",
      "ropes: 1\n",
      "rowed: 1\n",
      "rss: 1\n",
      "safety: 1\n",
      "ud: 1\n",
      "sciences: 1\n",
      "use: 1\n",
      "security: 1\n",
      "see: 1\n",
      "statistic: 1\n",
      "senate: 1\n",
      "senior: 1\n",
      "started: 1\n",
      "spread: 1\n",
      "skip: 1\n",
      "sports: 1\n",
      "so: 1\n",
      "udel: 1\n",
      "undergraduate: 1\n",
      "remaining: 1\n",
      "regularly: 1\n",
      "recumbent: 1\n",
      "positions: 1\n",
      "teach: 1\n",
      "spectrum: 1\n",
      "president: 1\n",
      "taught: 1\n",
      "tags: 1\n",
      "system: 1\n",
      "probabilities: 1\n",
      "vitae: 1\n",
      "supervised: 1\n",
      "processes: 1\n",
      "visiting: 1\n",
      "professor: 1\n",
      "suggested: 1\n",
      "twice: 1\n",
      "study: 1\n",
      "studies: 1\n",
      "programming: 1\n",
      "published: 1\n",
      "publisher: 1\n",
      "videos: 1\n",
      "vectors: 1\n",
      "ve: 1\n",
      "watermarking: 1\n",
      "powered: 1\n",
      "100: 1\n",
      "may: 1\n",
      "chair: 1\n",
      "category: 1\n",
      "cases: 1\n",
      "carefully: 1\n",
      "canberra: 1\n",
      "camping: 1\n",
      "but: 1\n",
      "burling: 1\n",
      "bs: 1\n",
      "bradley: 1\n",
      "bit: 1\n",
      "bicycling: 1\n",
      "be: 1\n",
      "basketball: 1\n",
      "baseball: 1\n",
      "charge: 1\n",
      "available: 1\n",
      "chengdu: 1\n",
      "circuits: 1\n",
      "cum: 1\n",
      "crutcher: 1\n",
      "crew: 1\n",
      "covington: 1\n",
      "cornell: 1\n",
      "cooley: 1\n",
      "continuous: 1\n",
      "contents: 1\n",
      "content: 1\n",
      "consulting: 1\n",
      "consult: 1\n",
      "connection: 1\n",
      "conferences: 1\n",
      "com: 1\n",
      "classes: 1\n",
      "children: 1\n",
      "authentication: 1\n",
      "australian: 1\n",
      "at: 1\n",
      "6557103: 1\n",
      "6556689: 1\n",
      "313: 1\n",
      "310: 1\n",
      "306: 1\n",
      "302: 1\n",
      "2017: 1\n",
      "2014: 1\n",
      "2000: 1\n",
      "1999: 1\n",
      "1992: 1\n",
      "1984: 1\n",
      "1982: 1\n",
      "1980: 1\n",
      "19716: 1\n",
      "8008: 1\n",
      "831: 1\n",
      "8770998: 1\n",
      "abroad: 1\n",
      "associate: 1\n",
      "as: 1\n",
      "archive: 1\n",
      "arce: 1\n",
      "approaches: 1\n",
      "appointment: 1\n",
      "applied: 1\n",
      "current: 1\n",
      "applications: 1\n",
      "analog: 1\n",
      "an: 1\n",
      "also: 1\n",
      "all: 1\n",
      "ago: 1\n",
      "adult: 1\n",
      "academy: 1\n",
      "anticipated: 1\n",
      "mccarter: 1\n",
      "cv: 1\n",
      "de: 1\n",
      "you: 1\n",
      "joint: 1\n",
      "january: 1\n",
      "issues: 1\n",
      "intramural: 1\n",
      "interested: 1\n",
      "infringement: 1\n",
      "images: 1\n",
      "illustrations: 1\n",
      "if: 1\n",
      "idaho: 1\n",
      "hypothesis: 1\n",
      "homework: 1\n",
      "hobbies: 1\n",
      "hiking: 1\n",
      "junior: 1\n",
      "highest: 1\n",
      "kirkland: 1\n",
      "l1: 1\n",
      "math: 1\n",
      "marvel: 1\n",
      "married: 1\n",
      "manual: 1\n",
      "lots: 1\n",
      "lossy: 1\n",
      "lossless: 1\n",
      "longer: 1\n",
      "long: 1\n",
      "london: 1\n",
      "list: 1\n",
      "lightweight: 1\n",
      "law: 1\n",
      "laude: 1\n",
      "laboratory: 1\n",
      "kronish: 1\n",
      "here: 1\n",
      "held: 1\n",
      "hall: 1\n",
      "electronic: 1\n",
      "education: 1\n",
      "edu: 1\n",
      "ece: 1\n",
      "dunn: 1\n",
      "drawn: 1\n",
      "dissertation: 1\n",
      "discrete: 1\n",
      "dickinson: 1\n",
      "development: 1\n",
      "details: 1\n",
      "designed: 1\n",
      "departmet: 1\n",
      "defence: 1\n",
      "decade: 1\n",
      "ellis: 1\n",
      "english: 1\n",
      "enjoy: 1\n",
      "estimation: 1\n",
      "gray: 1\n",
      "godward: 1\n",
      "gibson: 1\n",
      "freshman: 1\n",
      "found: 1\n",
      "force: 1\n",
      "focusing: 1\n",
      "day: 1\n",
      "five: 1\n",
      "firms: 1\n",
      "finland: 1\n",
      "filtering: 1\n",
      "feed: 1\n",
      "faculty: 1\n",
      "events: 1\n",
      "evans: 1\n",
      "first: 1\n",
      "journals: 1\n"
     ]
    }
   ],
   "source": [
    "txt_cnts = txt_cnts.toarray().squeeze()\n",
    "isort = np.argsort(-txt_cnts)\n",
    "for idx in isort:\n",
    "    print(f'{idx2word[idx]}: {txt_cnts[idx]}')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 96,
   "id": "5ee09178-48e5-4995-9a81-337a663a581f",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<BarContainer object of 363 artists>"
      ]
     },
     "execution_count": 96,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAh0AAAGeCAYAAAAwrLMMAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjYuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8o6BhiAAAACXBIWXMAAA9hAAAPYQGoP6dpAAAVEElEQVR4nO3dT04jZ/rA8aejSKwCFWYXDVLHfQPPzA3cq9l2JzcwNzDNKksG3wBygm68zcqeE0C4QVdGQspuoCDST2IT/xYtPLgxf0ybB7v8+Uhe2Mb2+9bbNF+Vq+wXw+FwGAAAT+yr5x4AALAcRAcAkEJ0AAApRAcAkEJ0AAApRAcAkEJ0AAApRAcAkOLr5x7AdX/++Wf8/vvv8c0338SLFy+eezgAwAMMh8P4448/4rvvvouvvrp9f8ZcRcfvv/8eGxsbzz0MAOARTk5O4q9//eut989VdHzzzTcR8WnQq6urzzwaAOAhLi4uYmNjY/R3/DZzFR1Xb6msrq6KDgBYMPcdGuFAUgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAYCov3/3y3ENgQYkOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACCF6AAAUogOACDF11/y4NevX0e/3x+7rdvtRlEUERFRVVV0Op0veQkAoCYevaej1+vFYDAYu63b7UZERLvdjna7Hc1mMzY3N79shABALTwqOqqqitPT0xu37+zsRLvdHl1vtVqxv7//+NEBALXxqOj48OFD/PDDD2O3lWUZVVWN3lq57vM9IgDA8pk6OgaDQbRarRu3l2U58eeLooiqqqYeGABQL1MfSFpVVTQajQeHxPr6+sS3YiIiLi8v4/LycnT94uJi2uEAAAtiqj0d+/v78ebNm6le4LbgiPh0DMja2trosrGxMdVzAwCL48HRcXx8HH//+99vvb/RaEy8/WrPyCTb29txfn4+upycnDx0OADAgnnw2yunp6dxfHw8Oij048ePEfHpNNlGoxFv3ryJoiiiLMsbkTHpGJCIiJWVlVhZWXns2AGABfJiOBwOH/PA4+Pj+Nvf/hbXH371wWBXp832er3o9/uxt7f3oOe8uLiItbW1OD8/j9XV1ccMC4An9vLdL/Gff/3zuYfBHHno3+9HnTLb6/ViZ2cnIiK2trZGez86nU5UVRW9Xi96vV4cHh4+ODgAgHp79J6Op2BPB8D8s6eDzz3png4AgGmJDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAghegAAFKIDgAgxdfT/HBVVfHhw4eIiPj48WOUZRk///xzFEUx+plutzu6XlVVdDqdmQ0WAFhcU+3p2NrailarFe12O3Z3d2N9fT3evn07ur/b7UZERLvdjna7Hc1mMzY3N2c7YgBgIU0VHWVZRq/XG11/9epVHB0dja7v7OxEu90eXW+1WrG/vz+DYQIAi26qt1f6/f7Y9cPDw2i1WhHxKUiqqhp7q+XKYDAY/RwAsJymio7rer1eVFUVBwcHEfEpOiYpiiKqqnrsywAANTF1dFwdTFpVVbx9+3bino3r1tfX4/T0dOJ9l5eXcXl5Obp+cXEx7XAAgAUx9SmzRVFEu90enZXy7bff3rkn47bgiPh0DMja2trosrGxMe1wAIAF8eDoqKoqtra2xgKj1WpFVVUxGAyi0Wjc+rjb7tve3o7z8/PR5eTkZLrRAwAL48HRUZZldLvdsT0XVwFSFEU0Go0oimLisR23HUS6srISq6urYxcAoJ4eHB3NZjM6nc7YXov3799Hs9kcRcX29nYMBoPR/b1eb+wUWgBgeU11IOn29vboA8AiPu3p+Pe//z263ul0otvtjj7L4/DwMPb29mY0VABgkb0YDofD5x7ElYuLi1hbW4vz83NvtQDMqZfvfon//Oufzz0M5shD/377wjcAIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSiA4AIIXoAABSfD3tA7rdbkREfPz4MSIi9vb2btxfFEVERFRVFZ1O5wuHCADUwVTRsbW1Fbu7u6Prm5ub8fr16+j3+xHxvyBpt9sRETEYDGJzc/NGmAAAy+fBb69UVRXHx8dRVdXots3NzRgMBlGWZURE7OzsjIIjIqLVasX+/v7sRgsALKypjuk4OjoaBUZERKPRiIhPQVKWZVRVNXpr5brBYPBlowQAFt6D314piiLOzs7GbruKiUajEUdHR7c+7vreEQBgOU19IOl1Ozs7sbe3N3HvxpX19fU4PT2deN/l5WVcXl6Orl9cXHzJcACAOfboU2a3trbixx9/HDuGY5LbgiPiU7Ssra2NLhsbG48dDgAw5x4VHb1eL169ejV2OuzV8R2fq6rq1vu2t7fj/Px8dDk5OXnMcACABTD12ytXx3Fc7eGoqipOT0+j0WhEURRRluWNyGi1WhOfa2VlJVZWVqYdAgCwgKba03F8fBzHx8fRbDajLMsoyzL29/djfX09Ij7tubh+pkqv17v37RcAYDm8GA6Hw4f8YFVV8f333088E+X6U3S73dGejsPDw7EPE7vPxcVFrK2txfn5eayurj74cQDkefnul/jPv/753MNgjjz07/cXnTI7yfXjPN68efPQpwcAas4XvgEAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdABz4eW7X557CMATEx0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAUAqn8myvEQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdPAkXr775bmHAMCcER0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQArRAQCkEB0AQIqvp/nhqqriw4cPcXBwEP1+/8b93W43iqIY/Wyn05nJIAGAxffg6Dg+Po6jo6OoqipOT09v3N/tdiMiot1uR0TEYDCIzc3N2Nvbm9FQAYBF9uDoaDab0Ww2o9frTbx/Z2cnfvvtt9H1VqsVr1+/Fh0AQETM6JiOsiyjqqrRWyvXDQaDWbwEALDgZhYdkxRFEVVVzeIlAIAFN9WBpNNaX1+fePzHlcvLy7i8vBxdv7i4eMrhAADP6ElPmb0rOCI+HQeytrY2umxsbDzlcGCil+9+ee4hACyFmURHo9GYeHtVVbfeFxGxvb0d5+fno8vJyckshgMAzKGZvL3SaDSiKIooy/JGZLRarVsft7KyEisrK7MYAgAw56be03HbWybb29tjZ6r0er3RZ3YAADw4OsqyjG63G3t7e3F8fBxbW1tjn9nR6XSiqqro9XrR6/Xi8PDQZ3QAACMPfnul0WhEp9O586PNr9/35s2bLxsZAFArvvANAEghOgCAFKLjCfn8B6gnv9vwOKIDAEghOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOsI59zycfyuPY7sBEaIDAEgiOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOgCAFKIDAEghOhbYy3e/LMRzLhvbkKfm3xiLSnQAAClEBwCQQnQAAClEBwCQQnQAAClEBwCQQnQAAClEx2ec/w7Al/B35HaiAwBIIToAgBSiAwBIIToAgBSiAwBIIToAgBSiAwBIITp4Fp+fx+689ud3tQazXgtr+3C21fOzBk9LdAAAKUQHAJBCdAAAKUQHAJBCdAAAKUQHAJBCdAAAKZYqOp7y/OuX736Z6fMv2rnisxjvQ57jtp+Z9fZ/yGtmmocx3GXexzevFmW7PeXv12NeZ1G227yYp+21VNEBADwf0QEApBAdAEAK0QEApBAdAEAK0QEApBAdAECKpYyOq3OWPz93edL1284bn/a850nPM8257/N0nvU0Y7n+s4+d62O23V2f5/GYMTz2eR4zpsf823qsaV77MZ+fcNv6f+mY7/u9fMz6z/KzKGa5JrN4noz/ZzI+y+O2Nb9v3ad5vrvuz/x9uX7bNPP7/Gfn6W9HxJJGBwCQT3QAAClEBwCQQnQAAClEBwCQQnQAAClEBwCQ4sVwOBw+9yCuXFxcxNraWpyfn8fq6urMn3/S+cr/+dc/b9w+77fN23gWdYyLOu55G0+dxj1v46nTuOdtPHUa97SPfwoP/fttTwcAkOLrWT9ht9uNoigiIqKqquh0OrN+CQBgAc10T0e3242IiHa7He12O5rNZmxubs7yJQCABTXT6NjZ2Yl2uz263mq1Yn9/f5YvAQAsqJlFR1mWUVXV6K2V6waDwaxeBgBYUDONjkmKooiqqmb1MgDAgpr5gaSfW19fj9PT04n3XV5exuXl5ej6+fl5RHw69eYp/Hn5fzduu7i4uHH7vN82b+NZ1DEu6rjnbTx1Gve8jadO45638dRp3NM+/ilcPe+9n8IxnJF+vz+c9HRFUQz39vYmPuann34aRoSLi4uLi4tLDS4nJyd3tsLMPhysLMt49erVjcp58eJF9Pv9aLVaNx7z+Z6OP//8M05PT+Mvf/lLvHjxYhbDGrm4uIiNjY04OTl5kg8em1fmbd7LYlnnbt7mPQ+Gw2H88ccf8d1338VXX91+5MbM3l5pNBpRFEWUZRmNRmPsvknBERGxsrISKysrY7dNOhB1llZXV+dqobKY93JZ1nlHLO/czXu5zOO819bW7v2ZmZ4yu729PXamSq/XGzuFFgBYXjONjk6nE1VVRa/Xi16vF4eHh7G3tzfLlwAAFtTMz165/rHnb968mfXTP9rKykr89NNPN97OqTvzNu9lsaxzN2/zXiRz9S2zAEB9+ZZZACCF6AAAUogOACDFk38M+jzodrujz/+oqmrsYNe6GAwGsbe3F69fv45GoxH9fj/+8Y9/jB3MW4ftUFVVfPjwIQ4ODqLf79+4/745Luo2uGvedV/7brcbEREfP36MiLhxRlxd1zzi7rnXdd2v/q1HfJp3WZbx888/j32GUx3X/L5512a9v/DTz+fe7u7ucHd3d3S93+8P2+32M47oaRwcHAyLohhGxLDRaNz46Pk6bIdff/11uLe3N9zd3R02m80b9983x0XdBvfNu85r3+l0xq632+1hq9UaXa/rmg+H98+9ruvebreHHz9+HLu+DGt+37zrst61j46iKIZnZ2djt9WxtQ4ODm7M87o6bYeDg4OJf3zvm+Oib4Pb5l3XtT87Oxu2Wq2xsf/666/DiBj951zXNX/I3Ou67q1Wa+yP5+7u7rAoitH1uq75ffOuy3rX+piOsiyjqqqJH61+/ZNT624ZtsN9c1yGbTDJos/76OgoyrIcXb/6ioWqqmq/5nfN/T6LPPd+vz/2tsDh4eHoqzTqvOZ3zfs+izTvWh/Tcf0X9rqiKB70i7toPnz4EOvr63F6ehofP36M3d3diFiO7XDfHOu+Deq49kVRxNnZ2dhtV/+BNhqNODo6uvVxi77m9839Sh3X/bperxdVVcXBwUFELM/v+efzvlKH9a51dNzmatHqpNlsRsT//kPa39+Pt2/f3vhHe10dt8PnruZ42xcJ1mEbLNPa7+zsxN7e3p1fDFnXNf987nVe96uDKquqirdv3977RaB1WfO75l2X9V7K6Ji3RZiFz7/Z94cffojNzc07K7eO2+Fz982xDttgWdZ+a2srfvzxx3u/RLKOaz5p7nVe96IoRnPd39+Pb7/9Nn777bdbf74ua37bvIuiqM161/qYjs8X6UpVVbfet6h6vd7Y9atCLstyKbbDfXOs8zZYhrXv9Xrx6tWrsfe8l2XNJ8396vbr6rDuVVXF1tbW2B/SVqsVVVXFYDCo7ZrfN++IGq33cx/J+tSKohg7DWk4nM8jer/E2dnZ2FHt12+7Opq5TtvhrrNX7prjom+DSfNehrXv9/vDg4OD0fWzs7Oxs1fqvOa3zb2u6/75GTrXb+v3+8PhsJ5rft+867Tetd7TERGxvb09dvRur9e7d/fsoimKIjqdzljR7u/vx5s3b0Y1XKftcNsuw/vmuOjbYNK86772x8fHcXx8HM1mM8qyjLIsY39/P9bX1yOi3mt+19zruu7NZvPGvN6/fx/NZnN0Jkcd1/y+eddpvZfiW2a73e5osQ4PD0dH/NZJVVWxv78/uv7f//73xjwXfTuUZRm9Xi/ev38fx8fH0el0Jn4i311zXMRtcN+867r2VVXF999/P/E96+v/bdVxzR8y9zqv+/V5XZ2lcf2gyrqu+V3zrst6L0V0AADPr/ZvrwAA80F0AAApRAcAkEJ0AAApRAcAkEJ0AAApRAcAkEJ0AAApRAcAkEJ0AAApRAcAkEJ0AAAp/h88EUJmRJiOvQAAAABJRU5ErkJggg==\n",
      "text/plain": [
       "<Figure size 640x480 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.bar(isort, txt_cnts)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "1ce765e9-df37-4a6b-8426-2d2c85ae0302",
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python [conda env:ICWaves] *",
   "language": "python",
   "name": "conda-env-ICWaves-py"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.9.15"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
