{"message":{"transcript":[{"chunks":[{"end":0.44,"start":0.0,"text":"So"},{"end":1.0,"start":0.44,"text":"I"},{"end":7.8,"start":1.0,"text":"just"},{"end":8.4,"start":7.8,"text":"showed"},{"end":8.72,"start":8.4,"text":"you"},{"end":9.6,"start":8.72,"text":"that"},{"end":10.32,"start":9.6,"text":"empirically"},{"end":10.6,"start":10.32,"text":"the"},{"end":11.28,"start":10.6,"text":"likelihood"},{"end":11.72,"start":11.28,"text":"will"},{"end":12.88,"start":11.72,"text":"converge"},{"end":13.48,"start":12.88,"text":"but"},{"end":13.96,"start":13.48,"text":"theoretically"},{"end":14.0,"start":13.96,"text":"it"},{"end":14.6,"start":14.0,"text":"can"},{"end":15.04,"start":14.6,"text":"also"},{"end":15.08,"start":15.04,"text":"be"},{"end":15.56,"start":15.08,"text":"proved"},{"end":15.96,"start":15.56,"text":"that"},{"end":16.68,"start":15.96,"text":"EM"},{"end":17.6,"start":16.68,"text":"algorithm"},{"end":17.68,"start":17.6,"text":"would"},{"end":18.2,"start":17.68,"text":"converge"},{"end":18.2,"start":18.2,"text":"to"},{"end":18.24,"start":18.2,"text":"a"},{"end":18.72,"start":18.24,"text":"local"},{"end":19.24,"start":18.72,"text":"maximum."},{"end":19.84,"start":19.24,"text":"So"},{"end":20.2,"start":19.84,"text":"here's"},{"end":20.4,"start":20.2,"text":"just"},{"end":21.04,"start":20.4,"text":"the"},{"end":21.6,"start":21.04,"text":"illustration"},{"end":21.76,"start":21.6,"text":"of"},{"end":22.16,"start":21.76,"text":"what"},{"end":23.04,"start":22.16,"text":"happened"},{"end":23.56,"start":23.04,"text":"and"},{"end":23.72,"start":23.56,"text":"a"},{"end":24.56,"start":23.72,"text":"detailed"},{"end":26.08,"start":24.56,"text":"explanation"},{"end":26.44,"start":26.08,"text":"this"},{"end":27.76,"start":26.44,"text":"would"},{"end":28.76,"start":27.76,"text":"require"},{"end":29.96,"start":28.76,"text":"more"}],"text":" So I just showed you that empirically the likelihood will converge but theoretically it can also be proved that EM algorithm would converge to a local maximum. So here's just the illustration of what happened and a detailed explanation this would require more"},{"chunks":[{"end":30.48,"start":30.0,"text":"knowledge"},{"end":31.04,"start":30.48,"text":"about"},{"end":31.24,"start":31.04,"text":"some"},{"end":31.84,"start":31.24,"text":"of"},{"end":32.16,"start":31.84,"text":"the"},{"end":32.8,"start":32.16,"text":"inequalities"},{"end":33.2,"start":32.8,"text":"that"},{"end":33.44,"start":33.2,"text":"we"},{"end":34.12,"start":33.44,"text":"haven't"},{"end":34.2,"start":34.12,"text":"really"},{"end":35.2,"start":34.2,"text":"covered"},{"end":35.480000000000004,"start":35.2,"text":"yet."},{"end":35.8,"start":35.480000000000004,"text":"So"},{"end":35.88,"start":35.8,"text":"here"},{"end":35.88,"start":35.88,"text":"what"},{"end":35.92,"start":35.88,"text":"you"},{"end":36.32,"start":35.92,"text":"see"},{"end":36.480000000000004,"start":36.32,"text":"is"},{"end":36.68,"start":36.480000000000004,"text":"on"},{"end":38.0,"start":36.68,"text":"the"},{"end":39.24,"start":38.0,"text":"x"},{"end":39.84,"start":39.24,"text":"dimension"},{"end":40.12,"start":39.84,"text":"we"},{"end":40.68,"start":40.12,"text":"have"},{"end":40.88,"start":40.68,"text":"a"},{"end":41.44,"start":40.88,"text":"theta"},{"end":42.0,"start":41.44,"text":"value."},{"end":42.6,"start":42.0,"text":"This"},{"end":43.28,"start":42.6,"text":"is"},{"end":43.72,"start":43.28,"text":"the"},{"end":44.519999999999996,"start":43.72,"text":"parameter"},{"end":44.88,"start":44.519999999999996,"text":"that"},{"end":44.96,"start":44.88,"text":"we"},{"end":44.96,"start":44.96,"text":"are"},{"end":45.519999999999996,"start":44.96,"text":"asked."},{"end":46.120000000000005,"start":45.519999999999996,"text":"On"},{"end":47.08,"start":46.120000000000005,"text":"the"},{"end":48.08,"start":47.08,"text":"y-axis"},{"end":48.2,"start":48.08,"text":"we"},{"end":48.56,"start":48.2,"text":"see"},{"end":49.04,"start":48.56,"text":"the"},{"end":50.72,"start":49.04,"text":"likelihood"},{"end":52.44,"start":50.72,"text":"function."},{"end":52.92,"start":52.44,"text":"So"},{"end":52.96,"start":52.92,"text":"this"},{"end":52.96,"start":52.96,"text":"curve"},{"end":53.04,"start":52.96,"text":"is"},{"end":54.08,"start":53.04,"text":"the"},{"end":54.879999999999995,"start":54.08,"text":"original"},{"end":55.56,"start":54.879999999999995,"text":"likelihood"},{"end":56.08,"start":55.56,"text":"function."},{"end":56.760000000000005,"start":56.08,"text":"So"},{"end":56.96,"start":56.760000000000005,"text":"this"},{"end":57.2,"start":56.96,"text":"one."},{"end":57.6,"start":57.2,"text":"And"},{"end":57.6,"start":57.6,"text":"this"},{"end":57.6,"start":57.6,"text":"is"},{"end":58.04,"start":57.6,"text":"the"},{"end":58.32,"start":58.04,"text":"one"},{"end":59.16,"start":58.32,"text":"that"},{"end":59.36,"start":59.16,"text":"we"},{"end":59.64,"start":59.36,"text":"hope"},{"end":59.96,"start":59.64,"text":"to"}],"text":" knowledge about some of the inequalities that we haven't really covered yet. So here what you see is on the x dimension we have a theta value. This is the parameter that we are asked. On the y-axis we see the likelihood function. So this curve is the original likelihood function. So this one. And this is the one that we hope to"},{"chunks":[{"end":61.08,"start":60.0,"text":"maximize"},{"end":61.48,"start":61.08,"text":"and"},{"end":61.48,"start":61.48,"text":"we"},{"end":61.48,"start":61.48,"text":"hope"},{"end":61.6,"start":61.48,"text":"to"},{"end":62.24,"start":61.6,"text":"find"},{"end":62.64,"start":62.24,"text":"a"},{"end":62.96,"start":62.64,"text":"theta"},{"end":64.08,"start":62.96,"text":"value"},{"end":64.6,"start":64.08,"text":"at"},{"end":64.68,"start":64.6,"text":"this"},{"end":64.88,"start":64.68,"text":"point"},{"end":64.96,"start":64.88,"text":"to"},{"end":65.4,"start":64.96,"text":"maximize"},{"end":65.8,"start":65.4,"text":"this."},{"end":66.04,"start":65.8,"text":"But"},{"end":66.08,"start":66.04,"text":"in"},{"end":66.16,"start":66.08,"text":"the"},{"end":67.2,"start":66.16,"text":"case"},{"end":67.36,"start":67.2,"text":"of"},{"end":67.8,"start":67.36,"text":"mixture"},{"end":68.32,"start":67.8,"text":"model,"},{"end":68.36,"start":68.32,"text":"we"},{"end":68.92,"start":68.36,"text":"cannot"},{"end":69.32,"start":68.92,"text":"easily"},{"end":69.88,"start":69.32,"text":"find"},{"end":70.24,"start":69.88,"text":"the"},{"end":70.76,"start":70.24,"text":"analytical"},{"end":71.2,"start":70.76,"text":"solution"},{"end":71.24,"start":71.2,"text":"to"},{"end":71.24,"start":71.24,"text":"the"},{"end":71.44,"start":71.24,"text":"problem."},{"end":71.6,"start":71.44,"text":"So"},{"end":71.64,"start":71.6,"text":"we"},{"end":72.12,"start":71.64,"text":"have"},{"end":72.36,"start":72.12,"text":"to"},{"end":73.12,"start":72.36,"text":"resolve"},{"end":73.36,"start":73.12,"text":"a"},{"end":73.96000000000001,"start":73.36,"text":"numerical"},{"end":74.28,"start":73.96000000000001,"text":"algorithm"},{"end":74.72,"start":74.28,"text":"and"},{"end":74.84,"start":74.72,"text":"EM"},{"end":75.24,"start":74.84,"text":"algorithm"},{"end":75.6,"start":75.24,"text":"is"},{"end":76.08,"start":75.6,"text":"such"},{"end":76.08,"start":76.08,"text":"an"},{"end":76.08,"start":76.08,"text":"algorithm."},{"end":76.32,"start":76.08,"text":"It's"},{"end":76.4,"start":76.32,"text":"a"},{"end":77.08,"start":76.4,"text":"here"},{"end":77.84,"start":77.08,"text":"climb"},{"end":78.76,"start":77.84,"text":"algorithm."},{"end":78.88,"start":78.76,"text":"That"},{"end":78.88,"start":78.88,"text":"would"},{"end":78.88,"start":78.88,"text":"mean"},{"end":79.16,"start":78.88,"text":"you"},{"end":79.8,"start":79.16,"text":"start"},{"end":79.96000000000001,"start":79.8,"text":"with"},{"end":80.84,"start":79.96000000000001,"text":"some"},{"end":81.56,"start":80.84,"text":"random"},{"end":81.56,"start":81.56,"text":"guess."},{"end":82.0,"start":81.56,"text":"Let's"},{"end":82.36,"start":82.0,"text":"say"},{"end":82.92,"start":82.36,"text":"you"},{"end":83.8,"start":82.92,"text":"start"},{"end":84.12,"start":83.8,"text":"from"},{"end":84.52,"start":84.12,"text":"here."},{"end":84.84,"start":84.52,"text":"That's"},{"end":85.24,"start":84.84,"text":"your"},{"end":85.8,"start":85.24,"text":"starting"},{"end":86.12,"start":85.8,"text":"point."},{"end":86.2,"start":86.12,"text":"And"},{"end":86.52,"start":86.2,"text":"then"},{"end":86.56,"start":86.52,"text":"you"},{"end":86.84,"start":86.56,"text":"try"},{"end":87.16,"start":86.84,"text":"to"},{"end":87.56,"start":87.16,"text":"improve"},{"end":88.08,"start":87.56,"text":"this"},{"end":89.48,"start":88.08,"text":"by"},{"end":89.96000000000001,"start":89.48,"text":"moving"}],"text":" maximize and we hope to find a theta value at this point to maximize this. But in the case of mixture model, we cannot easily find the analytical solution to the problem. So we have to resolve a numerical algorithm and EM algorithm is such an algorithm. It's a here climb algorithm. That would mean you start with some random guess. Let's say you start from here. That's your starting point. And then you try to improve this by moving"},{"chunks":[{"end":90.52,"start":90.0,"text":"this"},{"end":91.16,"start":90.52,"text":"to"},{"end":91.44,"start":91.16,"text":"another"},{"end":91.64,"start":91.44,"text":"point"},{"end":91.64,"start":91.64,"text":"where"},{"end":91.76,"start":91.64,"text":"you"},{"end":92.0,"start":91.76,"text":"can"},{"end":93.12,"start":92.0,"text":"have"},{"end":93.6,"start":93.12,"text":"a"},{"end":94.4,"start":93.6,"text":"higher"},{"end":95.36,"start":94.4,"text":"likelihood."},{"end":95.52,"start":95.36,"text":"So"},{"end":96.12,"start":95.52,"text":"that's"},{"end":96.68,"start":96.12,"text":"the"},{"end":96.96,"start":96.68,"text":"idea"},{"end":97.16,"start":96.96,"text":"of"},{"end":97.48,"start":97.16,"text":"here"},{"end":98.68,"start":97.48,"text":"climbing."},{"end":99.2,"start":98.68,"text":"And"},{"end":99.68,"start":99.2,"text":"in"},{"end":99.68,"start":99.68,"text":"the"},{"end":100.08,"start":99.68,"text":"EAM"},{"end":100.56,"start":100.08,"text":"algorithm,"},{"end":100.56,"start":100.56,"text":"the"},{"end":100.56,"start":100.56,"text":"way"},{"end":100.56,"start":100.56,"text":"we"},{"end":100.92,"start":100.56,"text":"achieve"},{"end":101.6,"start":100.92,"text":"this"},{"end":102.12,"start":101.6,"text":"is"},{"end":102.28,"start":102.12,"text":"to"},{"end":102.32,"start":102.28,"text":"do"},{"end":102.56,"start":102.32,"text":"two"},{"end":103.0,"start":102.56,"text":"things."},{"end":103.76,"start":103.0,"text":"First,"},{"end":103.96000000000001,"start":103.76,"text":"we'll"},{"end":104.72,"start":103.96000000000001,"text":"fix"},{"end":104.84,"start":104.72,"text":"a"},{"end":105.24,"start":104.84,"text":"lower"},{"end":105.72,"start":105.24,"text":"bound"},{"end":105.84,"start":105.72,"text":"of"},{"end":106.4,"start":105.84,"text":"likelihood"},{"end":106.88,"start":106.4,"text":"function."},{"end":107.08,"start":106.88,"text":"So"},{"end":107.08,"start":107.08,"text":"this"},{"end":107.6,"start":107.08,"text":"is"},{"end":107.68,"start":107.6,"text":"the"},{"end":107.96000000000001,"start":107.68,"text":"lower"},{"end":108.28,"start":107.96000000000001,"text":"bound"},{"end":108.36,"start":108.28,"text":"you"},{"end":108.6,"start":108.36,"text":"can"},{"end":108.88,"start":108.6,"text":"see"},{"end":111.24,"start":108.88,"text":"here."},{"end":111.96000000000001,"start":111.24,"text":"And"},{"end":112.6,"start":111.96000000000001,"text":"once"},{"end":112.68,"start":112.6,"text":"we"},{"end":112.72,"start":112.68,"text":"fit"},{"end":112.96000000000001,"start":112.72,"text":"the"},{"end":113.28,"start":112.96000000000001,"text":"lower"},{"end":113.6,"start":113.28,"text":"bound,"},{"end":113.96000000000001,"start":113.6,"text":"we"},{"end":114.36,"start":113.96000000000001,"text":"can"},{"end":114.8,"start":114.36,"text":"then"},{"end":116.03999999999999,"start":114.8,"text":"maximize"},{"end":116.2,"start":116.03999999999999,"text":"the"},{"end":116.52,"start":116.2,"text":"lower"},{"end":117.16,"start":116.52,"text":"bound."},{"end":117.48,"start":117.16,"text":"And"},{"end":117.72,"start":117.48,"text":"of"},{"end":118.2,"start":117.72,"text":"course,"},{"end":118.36,"start":118.2,"text":"the"},{"end":118.6,"start":118.36,"text":"reason"},{"end":118.84,"start":118.6,"text":"why"},{"end":119.03999999999999,"start":118.84,"text":"this"},{"end":119.36,"start":119.03999999999999,"text":"works"},{"end":119.64,"start":119.36,"text":"is"},{"end":119.96000000000001,"start":119.64,"text":"because"}],"text":" this to another point where you can have a higher likelihood. So that's the idea of here climbing. And in the EAM algorithm, the way we achieve this is to do two things. First, we'll fix a lower bound of likelihood function. So this is the lower bound you can see here. And once we fit the lower bound, we can then maximize the lower bound. And of course, the reason why this works is because"},{"chunks":[{"end":120.28,"start":120.0,"text":"The"},{"end":120.6,"start":120.28,"text":"lower"},{"end":120.88,"start":120.6,"text":"bound"},{"end":121.04,"start":120.88,"text":"is"},{"end":121.4,"start":121.04,"text":"much"},{"end":121.52,"start":121.4,"text":"easier"},{"end":121.88,"start":121.52,"text":"to"},{"end":122.8,"start":121.88,"text":"optimize."},{"end":123.28,"start":122.8,"text":"So"},{"end":124.04,"start":123.28,"text":"we"},{"end":124.76,"start":124.04,"text":"know"},{"end":125.68,"start":124.76,"text":"our"},{"end":126.68,"start":125.68,"text":"current"},{"end":126.76,"start":126.68,"text":"gas"},{"end":126.76,"start":126.76,"text":"is"},{"end":126.76,"start":126.76,"text":"here."},{"end":126.8,"start":126.76,"text":"And"},{"end":127.08,"start":126.8,"text":"by"},{"end":128.04,"start":127.08,"text":"maximizing"},{"end":128.24,"start":128.04,"text":"the"},{"end":128.56,"start":128.24,"text":"lower"},{"end":129.84,"start":128.56,"text":"bound,"},{"end":130.2,"start":129.84,"text":"we'll"},{"end":130.2,"start":130.2,"text":"move"},{"end":130.28,"start":130.2,"text":"this"},{"end":130.32,"start":130.28,"text":"point"},{"end":130.48,"start":130.32,"text":"to"},{"end":130.88,"start":130.48,"text":"the"},{"end":131.24,"start":130.88,"text":"top,"},{"end":131.68,"start":131.24,"text":"to"},{"end":133.12,"start":131.68,"text":"here."},{"end":133.2,"start":133.12,"text":"And"},{"end":133.2,"start":133.2,"text":"we"},{"end":134.52,"start":133.2,"text":"can"},{"end":134.92,"start":134.52,"text":"then"},{"end":135.2,"start":134.92,"text":"map"},{"end":135.68,"start":135.2,"text":"to"},{"end":136.4,"start":135.68,"text":"the"},{"end":137.07999999999998,"start":136.4,"text":"original"},{"end":137.16,"start":137.07999999999998,"text":"liglow"},{"end":137.36,"start":137.16,"text":"function"},{"end":137.44,"start":137.36,"text":"we"},{"end":137.8,"start":137.44,"text":"find"},{"end":138.16,"start":137.8,"text":"at"},{"end":138.68,"start":138.16,"text":"this"},{"end":139.64,"start":138.68,"text":"point."},{"end":140.07999999999998,"start":139.64,"text":"Because"},{"end":140.4,"start":140.07999999999998,"text":"it's"},{"end":140.8,"start":140.4,"text":"a"},{"end":141.12,"start":140.8,"text":"lower"},{"end":141.52,"start":141.12,"text":"bound,"},{"end":141.76,"start":141.52,"text":"we"},{"end":142.04,"start":141.76,"text":"are"},{"end":142.68,"start":142.04,"text":"guaranteed"},{"end":143.0,"start":142.68,"text":"to"},{"end":143.96,"start":143.0,"text":"improve"},{"end":144.2,"start":143.96,"text":"this"},{"end":145.24,"start":144.2,"text":"gas."},{"end":145.48,"start":145.24,"text":"Because"},{"end":145.48,"start":145.48,"text":"we"},{"end":145.64,"start":145.48,"text":"improve"},{"end":146.12,"start":145.64,"text":"our"},{"end":146.24,"start":146.12,"text":"lower"},{"end":146.8,"start":146.24,"text":"bound,"},{"end":147.76,"start":146.8,"text":"and"},{"end":148.52,"start":147.76,"text":"then"},{"end":148.96,"start":148.52,"text":"the"},{"end":149.6,"start":148.96,"text":"original"},{"end":149.96,"start":149.6,"text":"liglow"}],"text":" The lower bound is much easier to optimize. So we know our current gas is here. And by maximizing the lower bound, we'll move this point to the top, to here. And we can then map to the original liglow function we find at this point. Because it's a lower bound, we are guaranteed to improve this gas. Because we improve our lower bound, and then the original liglow"},{"chunks":[{"end":150.64,"start":150.0,"text":"curve"},{"end":150.92,"start":150.64,"text":"which"},{"end":150.92,"start":150.92,"text":"is"},{"end":151.08,"start":150.92,"text":"above"},{"end":151.4,"start":151.08,"text":"this"},{"end":151.52,"start":151.4,"text":"lower"},{"end":152.12,"start":151.52,"text":"bound"},{"end":152.76,"start":152.12,"text":"will"},{"end":153.52,"start":152.76,"text":"definitely"},{"end":153.84,"start":153.52,"text":"be"},{"end":154.52,"start":153.84,"text":"improved"},{"end":154.8,"start":154.52,"text":"as"},{"end":155.88,"start":154.8,"text":"well."},{"end":156.12,"start":155.88,"text":"So"},{"end":156.4,"start":156.12,"text":"we"},{"end":156.88,"start":156.4,"text":"already"},{"end":157.24,"start":156.88,"text":"know"},{"end":157.6,"start":157.24,"text":"it's"},{"end":157.64,"start":157.6,"text":"improving"},{"end":158.04,"start":157.64,"text":"the"},{"end":158.36,"start":158.04,"text":"lower"},{"end":159.04,"start":158.36,"text":"bound."},{"end":159.24,"start":159.04,"text":"So"},{"end":159.32,"start":159.24,"text":"we"},{"end":159.72,"start":159.32,"text":"definitely"},{"end":160.08,"start":159.72,"text":"improve"},{"end":160.48,"start":160.08,"text":"this"},{"end":160.84,"start":160.48,"text":"original"},{"end":161.44,"start":160.84,"text":"likelihood"},{"end":161.84,"start":161.44,"text":"function"},{"end":162.4,"start":161.84,"text":"which"},{"end":163.28,"start":162.4,"text":"is"},{"end":163.84,"start":163.28,"text":"above"},{"end":164.32,"start":163.84,"text":"this"},{"end":165.04,"start":164.32,"text":"lower"},{"end":167.44,"start":165.04,"text":"bound."},{"end":167.96,"start":167.44,"text":"So"},{"end":168.28,"start":167.96,"text":"in"},{"end":168.72,"start":168.28,"text":"our"},{"end":169.4,"start":168.72,"text":"example,"},{"end":169.68,"start":169.4,"text":"the"},{"end":170.28,"start":169.68,"text":"current"},{"end":170.72,"start":170.28,"text":"guess"},{"end":170.8,"start":170.72,"text":"is"},{"end":171.2,"start":170.8,"text":"the"},{"end":171.84,"start":171.2,"text":"parameter"},{"end":172.28,"start":171.84,"text":"value"},{"end":172.52,"start":172.28,"text":"given"},{"end":172.72,"start":172.52,"text":"by"},{"end":172.76,"start":172.72,"text":"the"},{"end":173.4,"start":172.76,"text":"current"},{"end":173.84,"start":173.4,"text":"generation."},{"end":173.96,"start":173.84,"text":"And"},{"end":174.04,"start":173.96,"text":"then"},{"end":174.28,"start":174.04,"text":"the"},{"end":174.84,"start":174.28,"text":"next"},{"end":175.28,"start":174.84,"text":"guess"},{"end":175.56,"start":175.28,"text":"is"},{"end":175.68,"start":175.56,"text":"the"},{"end":176.48,"start":175.68,"text":"re-estimated"},{"end":176.88,"start":176.48,"text":"parameter"},{"end":177.6,"start":176.88,"text":"values."},{"end":178.24,"start":177.6,"text":"From"},{"end":178.84,"start":178.24,"text":"this"},{"end":179.24,"start":178.84,"text":"illustration,"},{"end":179.24,"start":179.24,"text":"you"},{"end":179.28,"start":179.24,"text":"can"},{"end":179.48,"start":179.28,"text":"see"},{"end":179.96,"start":179.48,"text":"the"}],"text":" curve which is above this lower bound will definitely be improved as well. So we already know it's improving the lower bound. So we definitely improve this original likelihood function which is above this lower bound. So in our example, the current guess is the parameter value given by the current generation. And then the next guess is the re-estimated parameter values. From this illustration, you can see the"},{"chunks":[{"end":180.32,"start":180.0,"text":"X"},{"end":180.4,"start":180.32,"text":"to"},{"end":181.04,"start":180.4,"text":"gas"},{"end":181.56,"start":181.04,"text":"is"},{"end":181.96,"start":181.56,"text":"always"},{"end":182.56,"start":181.96,"text":"better"},{"end":182.8,"start":182.56,"text":"than"},{"end":182.88,"start":182.8,"text":"the"},{"end":183.0,"start":182.88,"text":"current"},{"end":183.52,"start":183.0,"text":"gas,"},{"end":184.0,"start":183.52,"text":"unless"},{"end":184.0,"start":184.0,"text":"it"},{"end":184.36,"start":184.0,"text":"has"},{"end":184.68,"start":184.36,"text":"reached"},{"end":184.8,"start":184.68,"text":"the"},{"end":185.24,"start":184.8,"text":"maximum"},{"end":185.24,"start":185.24,"text":"where"},{"end":185.32,"start":185.24,"text":"it"},{"end":185.8,"start":185.32,"text":"would"},{"end":185.84,"start":185.8,"text":"be"},{"end":185.92,"start":185.84,"text":"stuck"},{"end":186.32,"start":185.92,"text":"there,"},{"end":186.92,"start":186.32,"text":"so"},{"end":187.2,"start":186.92,"text":"the"},{"end":187.8,"start":187.2,"text":"two"},{"end":189.52,"start":187.8,"text":"would"},{"end":189.52,"start":189.52,"text":"be"},{"end":189.76,"start":189.52,"text":"equal."},{"end":190.48,"start":189.76,"text":"So"},{"end":191.44,"start":190.48,"text":"the"},{"end":192.0,"start":191.44,"text":"E"},{"end":192.68,"start":192.0,"text":"step"},{"end":193.0,"start":192.68,"text":"is"},{"end":193.6,"start":193.0,"text":"basically"},{"end":193.76,"start":193.6,"text":"to"},{"end":194.68,"start":193.76,"text":"compute"},{"end":195.6,"start":194.68,"text":"this"},{"end":195.96,"start":195.6,"text":"lower"},{"end":197.07999999999998,"start":195.96,"text":"bound."},{"end":197.44,"start":197.07999999999998,"text":"And"},{"end":197.52,"start":197.44,"text":"we"},{"end":198.12,"start":197.52,"text":"don't"},{"end":198.68,"start":198.12,"text":"directly"},{"end":199.04,"start":198.68,"text":"just"},{"end":199.52,"start":199.04,"text":"compute"},{"end":199.92000000000002,"start":199.52,"text":"this"},{"end":200.36,"start":199.92000000000002,"text":"lycra-hole"},{"end":200.72,"start":200.36,"text":"function,"},{"end":201.07999999999998,"start":200.72,"text":"but"},{"end":201.07999999999998,"start":201.07999999999998,"text":"we"},{"end":201.36,"start":201.07999999999998,"text":"compute"},{"end":203.0,"start":201.36,"text":"the"},{"end":203.52,"start":203.0,"text":"latent"},{"end":204.32,"start":203.52,"text":"variable"},{"end":205.68,"start":204.32,"text":"values,"},{"end":206.0,"start":205.68,"text":"and"},{"end":206.48,"start":206.0,"text":"these"},{"end":206.88,"start":206.48,"text":"are"},{"end":207.44,"start":206.88,"text":"basically"},{"end":207.6,"start":207.44,"text":"a"},{"end":207.68,"start":207.6,"text":"part"},{"end":208.0,"start":207.68,"text":"of"},{"end":208.32,"start":208.0,"text":"this"},{"end":208.56,"start":208.32,"text":"lower"},{"end":208.88,"start":208.56,"text":"bound."},{"end":209.12,"start":208.88,"text":"This"},{"end":209.56,"start":209.12,"text":"helps"},{"end":209.6,"start":209.56,"text":"determine"},{"end":209.96,"start":209.6,"text":"the"}],"text":" X to gas is always better than the current gas, unless it has reached the maximum where it would be stuck there, so the two would be equal. So the E step is basically to compute this lower bound. And we don't directly just compute this lycra-hole function, but we compute the latent variable values, and these are basically a part of this lower bound. This helps determine the"},{"chunks":[{"end":210.4,"start":210.0,"text":"The"},{"end":210.8,"start":210.4,"text":"aim"},{"end":210.92,"start":210.8,"text":"step"},{"end":210.96,"start":210.92,"text":"on"},{"end":211.12,"start":210.96,"text":"the"},{"end":211.56,"start":211.12,"text":"other"},{"end":212.4,"start":211.56,"text":"hand"},{"end":212.72,"start":212.4,"text":"is"},{"end":212.72,"start":212.72,"text":"to"},{"end":213.4,"start":212.72,"text":"maximize"},{"end":213.68,"start":213.4,"text":"the"},{"end":213.72,"start":213.68,"text":"lower"},{"end":214.28,"start":213.72,"text":"bound."},{"end":214.48,"start":214.28,"text":"It"},{"end":215.2,"start":214.48,"text":"allows"},{"end":215.4,"start":215.2,"text":"us"},{"end":215.4,"start":215.4,"text":"to"},{"end":215.4,"start":215.4,"text":"move"},{"end":216.2,"start":215.4,"text":"parameters"},{"end":216.44,"start":216.2,"text":"to"},{"end":216.64,"start":216.44,"text":"a"},{"end":216.64,"start":216.64,"text":"new"},{"end":216.92,"start":216.64,"text":"point."},{"end":217.44,"start":216.92,"text":"And"},{"end":217.92,"start":217.44,"text":"that's"},{"end":218.44,"start":217.92,"text":"why"},{"end":218.92,"start":218.44,"text":"EM"},{"end":219.36,"start":218.92,"text":"algorithm"},{"end":219.64,"start":219.36,"text":"is"},{"end":220.12,"start":219.64,"text":"guaranteed"},{"end":220.12,"start":220.12,"text":"to"},{"end":220.12,"start":220.12,"text":"converge"},{"end":220.12,"start":220.12,"text":"to"},{"end":220.12,"start":220.12,"text":"a"},{"end":220.36,"start":220.12,"text":"local"},{"end":220.76,"start":220.36,"text":"maximum."},{"end":221.12,"start":220.76,"text":"Now,"},{"end":221.44,"start":221.12,"text":"as"},{"end":221.52,"start":221.44,"text":"you"},{"end":223.04,"start":221.52,"text":"can"},{"end":223.64,"start":223.04,"text":"imagine,"},{"end":224.0,"start":223.64,"text":"when"},{"end":224.04,"start":224.0,"text":"we"},{"end":224.84,"start":224.04,"text":"have"},{"end":225.36,"start":224.84,"text":"many"},{"end":225.76,"start":225.36,"text":"local"},{"end":226.56,"start":225.76,"text":"maxima,"},{"end":226.68,"start":226.56,"text":"we"},{"end":227.24,"start":226.68,"text":"also"},{"end":227.72,"start":227.24,"text":"have"},{"end":227.76,"start":227.72,"text":"to"},{"end":228.24,"start":227.76,"text":"repeat"},{"end":228.48,"start":228.24,"text":"the"},{"end":228.56,"start":228.48,"text":"EM"},{"end":228.96,"start":228.56,"text":"algorithm"},{"end":229.28,"start":228.96,"text":"multiple"},{"end":229.92000000000002,"start":229.28,"text":"times"},{"end":230.12,"start":229.92000000000002,"text":"in"},{"end":230.56,"start":230.12,"text":"order"},{"end":230.56,"start":230.56,"text":"to"},{"end":230.64,"start":230.56,"text":"figure"},{"end":231.0,"start":230.64,"text":"out"},{"end":231.12,"start":231.0,"text":"which"},{"end":231.56,"start":231.12,"text":"one"},{"end":231.84,"start":231.56,"text":"is"},{"end":232.12,"start":231.84,"text":"the"},{"end":232.96,"start":232.12,"text":"actual"},{"end":233.76,"start":232.96,"text":"global"},{"end":234.04,"start":233.76,"text":"maximum."},{"end":234.36,"start":234.04,"text":"And"},{"end":234.76,"start":234.36,"text":"this"},{"end":235.16,"start":234.76,"text":"actually"},{"end":235.4,"start":235.16,"text":"in"},{"end":235.96,"start":235.4,"text":"general"},{"end":236.04,"start":235.96,"text":"is"},{"end":236.04,"start":236.04,"text":"a"},{"end":236.76,"start":236.04,"text":"difficult"},{"end":237.64,"start":236.76,"text":"problem"},{"end":237.84,"start":237.64,"text":"in"},{"end":238.6,"start":237.84,"text":"numerical"},{"end":239.96,"start":238.6,"text":"optimization."}],"text":" The aim step on the other hand is to maximize the lower bound. It allows us to move parameters to a new point. And that's why EM algorithm is guaranteed to converge to a local maximum. Now, as you can imagine, when we have many local maxima, we also have to repeat the EM algorithm multiple times in order to figure out which one is the actual global maximum. And this actually in general is a difficult problem in numerical optimization."},{"chunks":[{"end":240.16,"start":240.0,"text":"For"},{"end":240.64,"start":240.16,"text":"example,"},{"end":241.32,"start":240.64,"text":"had"},{"end":241.44,"start":241.32,"text":"we"},{"end":241.88,"start":241.44,"text":"started"},{"end":242.72,"start":241.88,"text":"from"},{"end":242.84,"start":242.72,"text":"here,"},{"end":243.52,"start":242.84,"text":"then"},{"end":243.72,"start":243.52,"text":"we"},{"end":244.28,"start":243.72,"text":"gradually"},{"end":244.88,"start":244.28,"text":"just"},{"end":245.44,"start":244.88,"text":"climb"},{"end":245.52,"start":245.44,"text":"up"},{"end":245.52,"start":245.52,"text":"to"},{"end":245.8,"start":245.52,"text":"this"},{"end":246.72,"start":245.8,"text":"top."},{"end":246.92,"start":246.72,"text":"So"},{"end":247.4,"start":246.92,"text":"that's"},{"end":247.72,"start":247.4,"text":"not"},{"end":247.96,"start":247.72,"text":"optimal."},{"end":248.2,"start":247.96,"text":"And"},{"end":249.08,"start":248.2,"text":"we'd"},{"end":249.48,"start":249.08,"text":"like"},{"end":249.48,"start":249.48,"text":"to"},{"end":249.84,"start":249.48,"text":"climb"},{"end":250.0,"start":249.84,"text":"up"},{"end":250.08,"start":250.0,"text":"all"},{"end":250.16,"start":250.08,"text":"the"},{"end":250.56,"start":250.16,"text":"way"},{"end":250.72,"start":250.56,"text":"to"},{"end":251.28,"start":250.72,"text":"here."},{"end":251.44,"start":251.28,"text":"So"},{"end":251.84,"start":251.44,"text":"the"},{"end":252.28,"start":251.84,"text":"only"},{"end":252.52,"start":252.28,"text":"way"},{"end":252.84,"start":252.52,"text":"to"},{"end":253.32,"start":252.84,"text":"climb"},{"end":253.32,"start":253.32,"text":"up"},{"end":253.32,"start":253.32,"text":"to"},{"end":253.32,"start":253.32,"text":"this"},{"end":253.32,"start":253.32,"text":"here"},{"end":253.32,"start":253.32,"text":"is"},{"end":253.6,"start":253.32,"text":"to"},{"end":254.24,"start":253.6,"text":"start"},{"end":254.84,"start":254.24,"text":"from"},{"end":255.72,"start":254.84,"text":"somewhere"},{"end":256.84,"start":255.72,"text":"here"},{"end":257.4,"start":256.84,"text":"or"},{"end":258.08,"start":257.4,"text":"here."},{"end":258.92,"start":258.08,"text":"So"},{"end":259.56,"start":258.92,"text":"in"},{"end":260.12,"start":259.56,"text":"the"},{"end":260.52,"start":260.12,"text":"EM"},{"end":261.28,"start":260.52,"text":"algorithm,"},{"end":261.44,"start":261.28,"text":"we"},{"end":262.0,"start":261.44,"text":"generally"},{"end":262.08,"start":262.0,"text":"would"},{"end":262.24,"start":262.08,"text":"have"},{"end":262.52,"start":262.24,"text":"to"},{"end":263.0,"start":262.52,"text":"start"},{"end":263.2,"start":263.0,"text":"from"},{"end":263.52,"start":263.2,"text":"different"},{"end":263.8,"start":263.52,"text":"points"},{"end":264.12,"start":263.8,"text":"or"},{"end":264.92,"start":264.12,"text":"have"},{"end":265.32,"start":264.92,"text":"some"},{"end":265.72,"start":265.32,"text":"other"},{"end":265.72,"start":265.72,"text":"way"},{"end":265.72,"start":265.72,"text":"to"},{"end":265.72,"start":265.72,"text":"determine"},{"end":265.72,"start":265.72,"text":"a"},{"end":266.28,"start":265.72,"text":"good"},{"end":267.16,"start":266.28,"text":"initial"},{"end":267.6,"start":267.16,"text":"starting"},{"end":269.96,"start":267.6,"text":"point."}],"text":" For example, had we started from here, then we gradually just climb up to this top. So that's not optimal. And we'd like to climb up all the way to here. So the only way to climb up to this here is to start from somewhere here or here. So in the EM algorithm, we generally would have to start from different points or have some other way to determine a good initial starting point."},{"chunks":[{"end":270.08,"start":270.0,"text":"To"},{"end":270.64,"start":270.08,"text":"summarize,"},{"end":270.64,"start":270.64,"text":"in"},{"end":271.08,"start":270.64,"text":"this"},{"end":271.48,"start":271.08,"text":"lecture"},{"end":271.6,"start":271.48,"text":"we"},{"end":272.04,"start":271.6,"text":"introduced"},{"end":272.52,"start":272.04,"text":"the"},{"end":273.08,"start":272.52,"text":"EM"},{"end":273.6,"start":273.08,"text":"algorithm."},{"end":274.08,"start":273.6,"text":"This"},{"end":274.28,"start":274.08,"text":"is"},{"end":274.68,"start":274.28,"text":"a"},{"end":274.96,"start":274.68,"text":"general"},{"end":275.4,"start":274.96,"text":"algorithm"},{"end":275.56,"start":275.4,"text":"for"},{"end":276.12,"start":275.56,"text":"computing"},{"end":276.48,"start":276.12,"text":"maximum"},{"end":277.28,"start":276.48,"text":"microestimate"},{"end":277.64,"start":277.28,"text":"of"},{"end":277.76,"start":277.64,"text":"all"},{"end":277.92,"start":277.76,"text":"kinds"},{"end":278.4,"start":277.92,"text":"of"},{"end":278.72,"start":278.4,"text":"mixture"},{"end":279.8,"start":278.72,"text":"models."},{"end":279.96,"start":279.8,"text":"So"},{"end":280.16,"start":279.96,"text":"not"},{"end":280.52,"start":280.16,"text":"just"},{"end":280.64,"start":280.52,"text":"for"},{"end":280.96,"start":280.64,"text":"our"},{"end":281.36,"start":280.96,"text":"simple"},{"end":281.6,"start":281.36,"text":"mixture"},{"end":281.92,"start":281.6,"text":"model."},{"end":281.96,"start":281.92,"text":"And"},{"end":282.56,"start":281.96,"text":"it's"},{"end":282.84,"start":282.56,"text":"a"},{"end":283.08,"start":282.84,"text":"here"},{"end":283.48,"start":283.08,"text":"climbing"},{"end":284.2,"start":283.48,"text":"algorithm."},{"end":284.36,"start":284.2,"text":"So"},{"end":284.88,"start":284.36,"text":"can"},{"end":284.92,"start":284.88,"text":"only"},{"end":285.2,"start":284.92,"text":"converge"},{"end":285.2,"start":285.2,"text":"to"},{"end":285.24,"start":285.2,"text":"a"},{"end":285.68,"start":285.24,"text":"local"},{"end":286.08,"start":285.68,"text":"maximum"},{"end":286.2,"start":286.08,"text":"and"},{"end":286.56,"start":286.2,"text":"it"},{"end":286.64,"start":286.56,"text":"will"},{"end":287.16,"start":286.64,"text":"depend"},{"end":287.4,"start":287.16,"text":"on"},{"end":287.76,"start":287.4,"text":"initial"},{"end":289.2,"start":287.76,"text":"points."},{"end":289.84,"start":289.2,"text":"The"},{"end":290.4,"start":289.84,"text":"general"},{"end":290.64,"start":290.4,"text":"idea"},{"end":290.84,"start":290.64,"text":"is"},{"end":291.24,"start":290.84,"text":"that"},{"end":291.68,"start":291.24,"text":"we"},{"end":292.12,"start":291.68,"text":"will"},{"end":292.4,"start":292.12,"text":"have"},{"end":292.6,"start":292.4,"text":"two"},{"end":293.4,"start":292.6,"text":"steps"},{"end":293.4,"start":293.4,"text":"to"},{"end":293.72,"start":293.4,"text":"improve"},{"end":294.16,"start":293.72,"text":"the"},{"end":294.56,"start":294.16,"text":"estimate"},{"end":294.72,"start":294.56,"text":"of"},{"end":295.32,"start":294.72,"text":"parameters."},{"end":295.68,"start":295.32,"text":"In"},{"end":296.2,"start":295.68,"text":"the"},{"end":296.24,"start":296.2,"text":"E"},{"end":296.6,"start":296.24,"text":"step,"},{"end":296.68,"start":296.6,"text":"we"},{"end":296.88,"start":296.68,"text":"roughly"},{"end":297.32,"start":296.88,"text":"are"},{"end":297.76,"start":297.32,"text":"augmenting"},{"end":298.0,"start":297.76,"text":"our"},{"end":298.4,"start":298.0,"text":"data"},{"end":298.92,"start":298.4,"text":"by"},{"end":299.52,"start":298.92,"text":"predicting"},{"end":299.96,"start":299.52,"text":"values"}],"text":" To summarize, in this lecture we introduced the EM algorithm. This is a general algorithm for computing maximum microestimate of all kinds of mixture models. So not just for our simple mixture model. And it's a here climbing algorithm. So can only converge to a local maximum and it will depend on initial points. The general idea is that we will have two steps to improve the estimate of parameters. In the E step, we roughly are augmenting our data by predicting values"},{"chunks":[{"end":300.24,"start":300.0,"text":"of"},{"end":300.76,"start":300.24,"text":"useful"},{"end":300.88,"start":300.76,"text":"hidden"},{"end":301.48,"start":300.88,"text":"variables"},{"end":301.92,"start":301.48,"text":"that"},{"end":302.52,"start":301.92,"text":"we"},{"end":303.12,"start":302.52,"text":"would"},{"end":303.6,"start":303.12,"text":"use"},{"end":303.8,"start":303.6,"text":"to"},{"end":304.4,"start":303.8,"text":"simplify"},{"end":304.48,"start":304.4,"text":"the"},{"end":305.08,"start":304.48,"text":"estimation."},{"end":305.52,"start":305.08,"text":"In"},{"end":305.8,"start":305.52,"text":"our"},{"end":306.16,"start":305.8,"text":"case,"},{"end":307.32,"start":306.16,"text":"this"},{"end":307.48,"start":307.32,"text":"is"},{"end":307.88,"start":307.48,"text":"the"},{"end":308.36,"start":307.88,"text":"distribution"},{"end":308.8,"start":308.36,"text":"that"},{"end":309.16,"start":308.8,"text":"has"},{"end":309.24,"start":309.16,"text":"been"},{"end":309.24,"start":309.24,"text":"used"},{"end":309.28,"start":309.24,"text":"to"},{"end":309.6,"start":309.28,"text":"generate"},{"end":309.76,"start":309.6,"text":"the"},{"end":310.04,"start":309.76,"text":"work."},{"end":310.24,"start":310.04,"text":"In"},{"end":311.16,"start":310.24,"text":"the"},{"end":311.52,"start":311.16,"text":"end"},{"end":311.92,"start":311.52,"text":"step"},{"end":312.28,"start":311.92,"text":"then,"},{"end":312.4,"start":312.28,"text":"we"},{"end":312.68,"start":312.4,"text":"would"},{"end":313.36,"start":312.68,"text":"exploit"},{"end":313.72,"start":313.36,"text":"such"},{"end":314.32,"start":313.72,"text":"augmented"},{"end":314.76,"start":314.32,"text":"data,"},{"end":315.04,"start":314.76,"text":"which"},{"end":315.36,"start":315.04,"text":"would"},{"end":315.64,"start":315.36,"text":"make"},{"end":315.76,"start":315.64,"text":"it"},{"end":316.12,"start":315.76,"text":"easier"},{"end":316.68,"start":316.12,"text":"to"},{"end":317.2,"start":316.68,"text":"estimate"},{"end":317.24,"start":317.2,"text":"the"},{"end":317.88,"start":317.24,"text":"distribution"},{"end":318.4,"start":317.88,"text":"to"},{"end":318.68,"start":318.4,"text":"improve"},{"end":319.12,"start":318.68,"text":"the"},{"end":319.56,"start":319.12,"text":"estimate"},{"end":319.72,"start":319.56,"text":"of"},{"end":320.4,"start":319.72,"text":"parameters."},{"end":321.0,"start":320.4,"text":"Here,"},{"end":321.72,"start":321.0,"text":"improve"},{"end":322.0,"start":321.72,"text":"is"},{"end":322.48,"start":322.0,"text":"guaranteed"},{"end":322.76,"start":322.48,"text":"in"},{"end":323.36,"start":322.76,"text":"terms"},{"end":323.68,"start":323.36,"text":"of"},{"end":323.68,"start":323.68,"text":"the"},{"end":323.84,"start":323.68,"text":"likelihood"},{"end":324.76,"start":323.84,"text":"function."},{"end":324.96,"start":324.76,"text":"Note"},{"end":325.48,"start":324.96,"text":"that"},{"end":325.96,"start":325.48,"text":"it's"},{"end":326.28,"start":325.96,"text":"not"},{"end":327.08,"start":326.28,"text":"necessary"},{"end":327.48,"start":327.08,"text":"that"},{"end":328.36,"start":327.48,"text":"we"},{"end":329.12,"start":328.36,"text":"will"},{"end":329.52,"start":329.12,"text":"have"},{"end":329.52,"start":329.52,"text":"a"},{"end":329.56,"start":329.52,"text":"stable"},{"end":329.96,"start":329.56,"text":"convergence"}],"text":" of useful hidden variables that we would use to simplify the estimation. In our case, this is the distribution that has been used to generate the work. In the end step then, we would exploit such augmented data, which would make it easier to estimate the distribution to improve the estimate of parameters. Here, improve is guaranteed in terms of the likelihood function. Note that it's not necessary that we will have a stable convergence"},{"chunks":[{"end":330.76,"start":330.0,"text":"parameter"},{"end":331.4,"start":330.76,"text":"values,"},{"end":331.44,"start":331.4,"text":"even"},{"end":331.6,"start":331.44,"text":"though"},{"end":332.12,"start":331.6,"text":"the"},{"end":332.72,"start":332.12,"text":"likeholder"},{"end":333.24,"start":332.72,"text":"function"},{"end":333.56,"start":333.24,"text":"is"},{"end":333.96,"start":333.56,"text":"ensured"},{"end":334.32,"start":333.96,"text":"to"},{"end":335.0,"start":334.32,"text":"increase."},{"end":335.28,"start":335.0,"text":"There"},{"end":335.48,"start":335.28,"text":"are"},{"end":335.8,"start":335.48,"text":"some"},{"end":336.36,"start":335.8,"text":"properties"},{"end":336.6,"start":336.36,"text":"that"},{"end":336.64,"start":336.6,"text":"have"},{"end":336.72,"start":336.64,"text":"to"},{"end":336.92,"start":336.72,"text":"be"},{"end":337.6,"start":336.92,"text":"satisfied"},{"end":337.6,"start":337.6,"text":"in"},{"end":338.08,"start":337.6,"text":"order"},{"end":338.32,"start":338.08,"text":"for"},{"end":338.72,"start":338.32,"text":"the"},{"end":340.2,"start":338.72,"text":"parameters"},{"end":340.84,"start":340.2,"text":"also"},{"end":341.12,"start":340.84,"text":"to"},{"end":342.48,"start":341.12,"text":"converge"},{"end":342.68,"start":342.48,"text":"to"},{"end":343.32,"start":342.68,"text":"some"},{"end":343.88,"start":343.32,"text":"stable"},{"end":344.24,"start":343.88,"text":"value."},{"end":347.4,"start":344.24,"text":"Now"},{"end":347.68,"start":347.4,"text":"here,"},{"end":348.2,"start":347.68,"text":"data"},{"end":349.32,"start":348.2,"text":"augmentation"},{"end":349.64,"start":349.32,"text":"is"},{"end":349.96,"start":349.64,"text":"done"},{"end":350.72,"start":349.96,"text":"probabilistically."},{"end":350.96,"start":350.72,"text":"That"},{"end":351.32,"start":350.96,"text":"means"},{"end":351.48,"start":351.32,"text":"we're"},{"end":351.72,"start":351.48,"text":"not"},{"end":352.16,"start":351.72,"text":"going"},{"end":352.16,"start":352.16,"text":"to"},{"end":352.32,"start":352.16,"text":"just"},{"end":352.6,"start":352.32,"text":"say"},{"end":353.12,"start":352.6,"text":"exactly"},{"end":353.52,"start":353.12,"text":"what's"},{"end":353.52,"start":353.52,"text":"the"},{"end":354.04,"start":353.52,"text":"value"},{"end":354.32,"start":354.04,"text":"of"},{"end":354.32,"start":354.32,"text":"a"},{"end":354.32,"start":354.32,"text":"hidden"},{"end":354.68,"start":354.32,"text":"variable,"},{"end":354.68,"start":354.68,"text":"but"},{"end":354.68,"start":354.68,"text":"we're"},{"end":354.76,"start":354.68,"text":"going"},{"end":354.76,"start":354.76,"text":"to"},{"end":355.32,"start":354.76,"text":"have"},{"end":355.6,"start":355.32,"text":"a"},{"end":356.96,"start":355.6,"text":"probability"},{"end":357.52,"start":356.96,"text":"distribution"},{"end":357.76,"start":357.52,"text":"over"},{"end":357.8,"start":357.76,"text":"the"},{"end":358.12,"start":357.8,"text":"possible"},{"end":358.76,"start":358.12,"text":"values"},{"end":359.0,"start":358.76,"text":"of"},{"end":359.08,"start":359.0,"text":"these"},{"end":359.36,"start":359.08,"text":"hidden"},{"end":359.96,"start":359.36,"text":"variables."}],"text":" parameter values, even though the likeholder function is ensured to increase. There are some properties that have to be satisfied in order for the parameters also to converge to some stable value. Now here, data augmentation is done probabilistically. That means we're not going to just say exactly what's the value of a hidden variable, but we're going to have a probability distribution over the possible values of these hidden variables."},{"chunks":[{"end":361.0,"start":360.0,"text":"variables."},{"end":361.16,"start":361.0,"text":"So"},{"end":361.56,"start":361.16,"text":"this"},{"end":362.2,"start":361.56,"text":"causes"},{"end":362.4,"start":362.2,"text":"a"},{"end":362.84,"start":362.4,"text":"split"},{"end":363.08,"start":362.84,"text":"of"},{"end":363.96,"start":363.08,"text":"counts"},{"end":364.24,"start":363.96,"text":"of"},{"end":365.08,"start":364.24,"text":"events"},{"end":366.52,"start":365.08,"text":"probabilistically."},{"end":367.28,"start":366.52,"text":"And"},{"end":367.56,"start":367.28,"text":"in"},{"end":368.04,"start":367.56,"text":"our"},{"end":368.68,"start":368.04,"text":"case,"},{"end":369.56,"start":368.68,"text":"we'll"},{"end":369.84,"start":369.56,"text":"split"},{"end":370.08,"start":369.84,"text":"the"},{"end":370.28,"start":370.08,"text":"world"},{"end":370.6,"start":370.28,"text":"counts"},{"end":370.6,"start":370.6,"text":"between"},{"end":370.68,"start":370.6,"text":"the"},{"end":373.0,"start":370.68,"text":"two"},{"end":385.24,"start":373.0,"text":"distributions."}],"text":" variables. So this causes a split of counts of events probabilistically. And in our case, we'll split the world counts between the two distributions."}]}}