<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">

 <title>The ICLR Blog Track</title>
 <link href="https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/atom.xml" rel="self"/>
 <link href="https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/"/>
 <updated>2022-01-15T06:10:33-06:00</updated>
 <id>https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606</id>
 <author>
   <name>Mark Otto</name>
   <email>markdotto@gmail.com</email>
 </author>

 
 <entry>
   <title>Evaluation of Feature-based explanations</title>
   <link href="https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/2022/01/15/Urja/"/>
   <updated>2022-01-15T00:00:00-06:00</updated>
   <id>https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/2022/01/15/Urja</id>
   <content type="html">&lt;!-- content --&gt;

&lt;p&gt;The rising need to make Machine Learning (ML) models interpretable, fair, and trustworthy has led the research community to develop better explanations that enable interpretation, validation, and transparency when these models are used in domains such as healthcare or finance. But how do we judge whether one explanation is better than another? Different types of explanations require different evaluation metrics. The most common type is the feature importance explanation, usually presented as a relative ranking of features by their importance in determining the model’s output. Another common type is the counterfactual explanation, which tells us the minimum changes to the features that would lead the model to a different classification.&lt;/p&gt;

&lt;p&gt;For black-box classifiers, feature importance and counterfactuals are estimated using techniques that involve either training an interpretable classifier to mimic the black box locally, or perturbing the input by inserting and removing features. A set of evaluation metrics is required to measure how faithfully an explanation represents the black-box model, a property usually termed the “robustness” of the explanation.&lt;/p&gt;

&lt;h3 id=&quot;contribution-of-the-paper&quot;&gt;Contribution of the Paper&lt;/h3&gt;
&lt;p&gt;This blog post describes the contribution of the paper titled “EVALUATIONS AND METHODS FOR EXPLANATION THROUGH ROBUSTNESS ANALYSIS” by &lt;a href=&quot;https://openreview.net/forum?id=Hye4KeSYDr&quot;&gt;Hsieh et al.&lt;/a&gt;, which proposes a novel way of assessing robustness and of deriving more robust explanations, as an alternative to explanations based on the insertion and removal of features. Such explanations face two drawbacks:&lt;/p&gt;
&lt;ol&gt;
  &lt;li&gt;When feature importance is estimated by removing a feature, i.e. setting it to a baseline value, the method tends to attribute high importance to features whose values deviate strongly from the baseline. For example, setting RGB pixels to black gives bright pixels more importance.&lt;/li&gt;
  &lt;li&gt;When feature importance is estimated by replacing a feature with a value sampled from the data distribution (using a generative model), the bias of the generative model carries over into the explanation, and not all domains have a suitable generative model.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;To address this, the paper makes two contributions:&lt;/p&gt;
&lt;ol&gt;
  &lt;li&gt;Proposing novel evaluation criteria to assess the robustness of feature-based explanations (feature importance and counterfactuals), based on how the model’s output changes under small perturbations of different sets of features.&lt;/li&gt;
  &lt;li&gt;Optimising these evaluation criteria to obtain better explanations.&lt;/li&gt;
&lt;/ol&gt;

&lt;h4 id=&quot;robustness&quot;&gt;Robustness:&lt;/h4&gt;
&lt;p&gt;To evaluate explanations, two key assumptions are made:&lt;/p&gt;
&lt;ol&gt;
  &lt;li&gt;Changing the values of only the non-important features has a weak influence on the model’s output.&lt;/li&gt;
  &lt;li&gt;Changing the values of only the important features can easily change the model’s output.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Based on these assumptions, a robustness measure ε&lt;sup&gt;*&lt;/sup&gt; is defined by the following equation:&lt;/p&gt;

&lt;p&gt;ε&lt;sup&gt;*&lt;/sup&gt;&lt;sub&gt;&lt;strong&gt;x&lt;/strong&gt;s&lt;/sub&gt; = &lt;em&gt;g(f,&lt;strong&gt;x&lt;/strong&gt;, S)&lt;/em&gt; = min&lt;sub&gt;δ&lt;/sub&gt; |δ| &lt;em&gt;s.t.&lt;/em&gt; f(&lt;strong&gt;x&lt;/strong&gt;+δ) ≠ y, δ&lt;sub&gt;S&lt;sup&gt;c&lt;/sup&gt;&lt;/sub&gt; = 0&lt;/p&gt;

&lt;p&gt;In the above equation, &lt;em&gt;f&lt;/em&gt; is the model, &lt;em&gt;x&lt;/em&gt; is the input, &lt;em&gt;U&lt;/em&gt; is the set of all features, and &lt;em&gt;S&lt;/em&gt; is a subset of &lt;em&gt;U&lt;/em&gt;. The term δ is the adversarial perturbation, which is applied only to the features in &lt;em&gt;S&lt;/em&gt;, and ε&lt;sup&gt;*&lt;/sup&gt; is the size of the smallest such perturbation that changes the model’s output. When &lt;em&gt;S&lt;/em&gt; is a set of important features, this minimum perturbation should be &lt;strong&gt;low&lt;/strong&gt;, as per assumption 2. Similarly, minimising over a set of non-important features S&lt;sup&gt;c&lt;/sup&gt; should give a higher value, since large perturbations are required to change the model’s output by perturbing only non-important features.&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Based on this, we can say: R(S), where S is a set of important features, is given by ε&lt;sup&gt;*&lt;/sup&gt;&lt;sub&gt;&lt;strong&gt;x&lt;/strong&gt;s&lt;/sub&gt;, and&lt;/p&gt;
&lt;/blockquote&gt;

&lt;blockquote&gt;
  &lt;p&gt;R(S&lt;sup&gt;c&lt;/sup&gt;), where S&lt;sup&gt;c&lt;/sup&gt; is a set of non-important features, is given by ε&lt;sup&gt;*&lt;/sup&gt;&lt;sub&gt;&lt;strong&gt;x&lt;/strong&gt;s&lt;sup&gt;c&lt;/sup&gt;&lt;/sub&gt;.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;With this evaluation metric R, we can also plot R against the number K of top-ranked features included in the subset and summarise the resulting curve by its area under the curve (AUC).&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/public/images/2022-01-15-Urja/rnimp.png&quot; alt=&quot;AUC&quot; /&gt; 
&lt;img src=&quot;https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/public/images/2022-01-15-Urja/rsimp.png&quot; alt=&quot;AUC&quot; /&gt;&lt;/p&gt;
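
&lt;p&gt;To make the definition concrete, the following minimal Python sketch computes the robustness measure in closed form for a linear binary classifier, where the smallest L2 perturbation restricted to a subset S is the distance to the decision boundary, |w·x + b| divided by the norm of the weights in S. The linear model, its weights, and the function name &lt;code&gt;min_perturbation_norm&lt;/code&gt; are assumptions made purely for illustration; for a general black-box model this quantity would typically have to be approximated numerically.&lt;/p&gt;

&lt;pre&gt;
```python
import math

def min_perturbation_norm(w, b, x, subset):
    """Distance from x to the decision boundary of sign(w.x + b) when only
    the features in `subset` may change (the infimum of the L2 norms of
    perturbations that flip the prediction)."""
    margin = abs(sum(wi * xi for wi, xi in zip(w, x)) + b)
    w_sub_norm = math.sqrt(sum(w[i] ** 2 for i in subset))
    if w_sub_norm == 0.0:
        # No feature in the subset can move the score: no perturbation works.
        return math.inf
    return margin / w_sub_norm

# Toy linear model (hypothetical weights, chosen for illustration only).
w = [3.0, 0.0, 4.0, 0.1]
b = -1.0
x = [1.0, 2.0, 1.0, 5.0]

important = [0, 2]      # features with large weights
unimportant = [1, 3]    # features with small or zero weights

eps_important = min_perturbation_norm(w, b, x, important)
eps_unimportant = min_perturbation_norm(w, b, x, unimportant)
print(eps_important, eps_unimportant)
```
&lt;/pre&gt;

&lt;p&gt;With these toy weights, the important subset needs a perturbation of about 1.3, while the non-important subset needs about 65, in line with the two assumptions above.&lt;/p&gt;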

&lt;h4 id=&quot;counterfactual-flavor&quot;&gt;Counterfactual flavor&lt;/h4&gt;
&lt;p&gt;If we change the constraint in the previous definition of ε&lt;sup&gt;*&lt;/sup&gt;&lt;sub&gt;&lt;strong&gt;x&lt;/strong&gt;s&lt;/sub&gt; so that the perturbed input must be assigned a desired class &lt;em&gt;t&lt;/em&gt;, we obtain the following equation:&lt;/p&gt;

&lt;p&gt;ε&lt;sup&gt;*&lt;/sup&gt;&lt;sub&gt;&lt;strong&gt;x&lt;/strong&gt;s&lt;/sub&gt; = &lt;em&gt;g(f,&lt;strong&gt;x&lt;/strong&gt;, S)&lt;/em&gt; = min&lt;sub&gt;δ&lt;/sub&gt; |δ| &lt;em&gt;s.t.&lt;/em&gt; f(&lt;strong&gt;x&lt;/strong&gt;+δ) = t, δ&lt;sub&gt;S&lt;sup&gt;c&lt;/sup&gt;&lt;/sub&gt; = 0&lt;/p&gt;

&lt;p&gt;Because the optimisation now searches for the smallest perturbation of the features in S that leads to another desired class &lt;em&gt;t&lt;/em&gt;, it provides a counterfactual explanation in terms of the subset S of features.&lt;/p&gt;

&lt;h4 id=&quot;extracting-explanations&quot;&gt;Extracting explanations&lt;/h4&gt;

&lt;p&gt;Based on &lt;em&gt;g(f,&lt;strong&gt;x&lt;/strong&gt;, S)&lt;/em&gt;, we can extract a set of important features and a set of non-important features by solving the following two optimisation problems, respectively:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;minimise &lt;em&gt;g(f,&lt;strong&gt;x&lt;/strong&gt;, S)&lt;/em&gt; s.t. |S| ≤ K&lt;/p&gt;
&lt;/blockquote&gt;

&lt;blockquote&gt;
  &lt;p&gt;maximise &lt;em&gt;g(f,&lt;strong&gt;x&lt;/strong&gt;, S&lt;sup&gt;c&lt;/sup&gt;)&lt;/em&gt; s.t. |S&lt;sup&gt;c&lt;/sup&gt;| ≤ K&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;where K is the number of features we intend to analyse or consider.&lt;/p&gt;

&lt;p&gt;The above problems can be solved with a greedy approach: we initialise an empty set S (or S&lt;sup&gt;c&lt;/sup&gt;) and repeatedly add the feature that most improves the corresponding objective. However, this approach misses interactions among features: two features might be very important together but unimportant on their own. To account for this, the marginal contribution of a feature is also taken into consideration, by analysing the change in the model’s output when some of the not-yet-chosen features are included along with this feature. The concept is based on game theory and can be used to assess the contribution of each feature to the model’s output.&lt;/p&gt;
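
&lt;p&gt;The plain greedy procedure can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper’s implementation: the model is a toy linear classifier so that &lt;em&gt;g&lt;/em&gt; has a closed form, the weights are made up, and the function names &lt;code&gt;g&lt;/code&gt; and &lt;code&gt;greedy_important_features&lt;/code&gt; are hypothetical. The marginal-contribution refinement described above is not included.&lt;/p&gt;

&lt;pre&gt;
```python
import math

def g(w, b, x, subset):
    """Toy robustness g(f, x, S) for a linear classifier sign(w.x + b):
    L2 distance to the decision boundary when only `subset` may change."""
    margin = abs(sum(wi * xi for wi, xi in zip(w, x)) + b)
    norm = math.sqrt(sum(w[i] ** 2 for i in subset))
    return margin / norm if norm else math.inf

def greedy_important_features(w, b, x, k):
    """Greedily build S with up to k features, adding at each step the
    feature that makes g(f, x, S) smallest."""
    chosen = []
    remaining = list(range(len(x)))
    for _ in range(min(k, len(remaining))):
        best = min(remaining, key=lambda i: g(w, b, x, chosen + [i]))
        chosen.append(best)
        remaining.remove(best)
    return chosen

# Hypothetical weights and input, for illustration only.
w = [3.0, 0.0, 4.0, 0.1]
b = -1.0
x = [1.0, 2.0, 1.0, 5.0]

selected = greedy_important_features(w, b, x, 2)
print(selected, g(w, b, x, selected))
```
&lt;/pre&gt;

&lt;p&gt;For this toy model the greedy search picks the two features with the largest weights. A linear model has no feature interactions, so the greedy choice is adequate here; for a non-linear black-box model, &lt;em&gt;g&lt;/em&gt; would have to be estimated numerically and the interaction issue becomes relevant.&lt;/p&gt;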

&lt;p&gt;To avoid confusion, note that these evaluation criteria and explanations differ from SHAP: SHAP considers the removal of features by setting them to a baseline value, whereas here we are interested in capturing the change in the model’s output when the input features are slightly perturbed from their original values. This change can then be optimised to obtain a larger deflection in the output (for important features) or a smaller one (for non-important features).&lt;/p&gt;
</content>
 </entry>
 
 <entry>
   <title>Blog Posts as Conference Contributions</title>
   <link href="https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/2021/09/08/blog-posts-as-conference-contributions/"/>
   <updated>2021-09-08T00:00:00-05:00</updated>
   <id>https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/2021/09/08/blog-posts-as-conference-contributions</id>
   <content type="html">&lt;h1 id=&quot;motivations&quot;&gt;Motivations&lt;/h1&gt;

&lt;p&gt;The Machine Learning community is currently experiencing a
&lt;a href=&quot;https://neuripsconf.medium.com/designing-the-reproducibility-program-for-neurips-2020-7fcccaa5c6ad&quot;&gt;reproducibility
crisis&lt;/a&gt;
and a reviewing crisis &lt;a href=&quot;#Litt&quot;&gt;[Littman, 2021]&lt;/a&gt;. Because of the highly competitive and noisy
reviewing process of ML conferences &lt;a href=&quot;#Tran&quot;&gt;[Tran et al., 2020]&lt;/a&gt;, researchers have an incentive to
oversell their results, slowing down the progress and diminishing the
integrity of the scientific community. Moreover, with the growing number
of papers published and submitted at the main ML conferences &lt;a href=&quot;#Lin&quot;&gt;[Lin et al., 2020]&lt;/a&gt;, it has
become more challenging to keep track of the latest advances in the
field.&lt;/p&gt;

&lt;p&gt;Blog posts are becoming an increasingly popular and useful way to talk
about science &lt;a href=&quot;#Brow&quot;&gt;[Brown and Woolston, 2018]&lt;/a&gt;. They offer substantial value to the scientific community
by providing a flexible platform to foster open, human, and transparent
discussions about new insights or limitations of a scientific
publication. However, because they are not as recognized as standard
scientific publications, only a minority of researchers manage to
maintain an active blog and get visibility for their efforts. Many are
well-established researchers (&lt;a href=&quot;https://francisbach.com/&quot;&gt;Francis Bach&lt;/a&gt;,
&lt;a href=&quot;https://www.argmin.net/&quot;&gt;Ben Recht&lt;/a&gt;, &lt;a href=&quot;https://www.inference.vc/&quot;&gt;Ferenc
Huszár&lt;/a&gt;, &lt;a href=&quot;https://lilianweng.github.io/lil-log/&quot;&gt;Lilian
Weng&lt;/a&gt;) or big corporations that
leverage entire teams of graphic designers and writers to
polish their blogs (&lt;a href=&quot;https://ai.facebook.com/blog/?page=1&quot;&gt;Facebook AI&lt;/a&gt;,
&lt;a href=&quot;https://ai.googleblog.com/&quot;&gt;Google AI&lt;/a&gt;,
&lt;a href=&quot;https://deepmind.com/blog&quot;&gt;DeepMind&lt;/a&gt;,
&lt;a href=&quot;https://openai.com/blog/&quot;&gt;OpenAI&lt;/a&gt;). As a result, the incentives for
writing scientific blog posts are largely personal; it is unreasonable
to expect a significant portion of the machine learning community to
contribute to such an initiative when everyone is trying to establish
themselves through publications.&lt;/p&gt;

&lt;p&gt;Our goal is to create a formal call for blog posts at ICLR to
incentivize and reward researchers to review past work and summarize the
outcomes, develop new intuitions, or highlight some shortcomings. A very
influential initiative of this kind emerged in France in the 1930s, between the
two world wars. Because of the lack of up-to-date textbooks, a collective of
mathematicians writing under the pseudonym Nicolas Bourbaki &lt;a href=&quot;#Halm&quot;&gt;[Halmos, 1957]&lt;/a&gt; decided to start a
series of textbooks about the foundations of mathematics &lt;a href=&quot;#Bour&quot;&gt;[Bourbaki, 1939]&lt;/a&gt;.
In the same vein, we aim at providing a new way to summarize scientific knowledge in the ML community.&lt;/p&gt;

&lt;h1 id=&quot;our-idea-blog-post-conference-track&quot;&gt;Our Idea: Blog Post Conference Track&lt;/h1&gt;

&lt;p&gt;Due to the large diversity of topics that can be discussed in a blog
post, we decided to restrict the range of topics for this call for blog
posts. We identified that the blog posts that would bring the most value
to the community and the conference would be posts that distill and
discuss &lt;em&gt;previously published papers&lt;/em&gt;.&lt;/p&gt;

&lt;h2 id=&quot;call-for-blog-posts-on-papers-previously-published-at-iclr&quot;&gt;Call for blog posts on papers previously published at ICLR&lt;/h2&gt;

&lt;p&gt;The call for blog posts would take the following form:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Write a post about a paper previously published at ICLR, with the
constraint that one cannot write a blog post on work that they have
a conflict of interest with. This implies that one cannot review
their own work, or work originating from their institution or
company. We want to foster productive discussion about &lt;em&gt;ideas&lt;/em&gt;, and
prevent posts that intentionally aim to help or hurt individuals or
institutions.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Blogs will be peer-reviewed (double-blind, see
Section &lt;a href=&quot;#sub:sub_process&quot; data-reference-type=&quot;ref&quot; data-reference=&quot;sub:sub_process&quot;&gt;2.5&lt;/a&gt;)
for quality and novelty of the content: clarity and pedagogy of the
exposition, new theoretical or practical insights,
reproduction/extension of experiments, etc.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The posts will be published under a unified template (see
Section &lt;a href=&quot;#sub:sub_format&quot; data-reference-type=&quot;ref&quot; data-reference=&quot;sub:sub_format&quot;&gt;2.4&lt;/a&gt;
and
Section &lt;a href=&quot;#sub:sub_process&quot; data-reference-type=&quot;ref&quot; data-reference=&quot;sub:sub_process&quot;&gt;2.5&lt;/a&gt;)
and hosted on the conference website or our own GitHub page.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;positive-impact-for-the-community&quot;&gt;Positive Impact for the Community&lt;/h2&gt;

&lt;p&gt;We believe having this call for blog posts as a conference track would
increase the posts’ visibility, impact, and credibility, while
simultaneously providing benefits to the conference.&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Adoption&lt;/em&gt;: we think that, with the conference’s stamp, such a
format will be more broadly recognized and adopted by the community.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Accessibility&lt;/em&gt;: maintaining a blog is time-consuming, and it takes
many blog posts to gain a stable following. By allowing researchers
to publish a single post, we will permit occasional blog writers to
publish their ideas, something that is very difficult right
now. Moreover, it will make this format accessible to more
independent/junior blog writers that do not have a company or a
research lab to support them.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Synchronization&lt;/em&gt;: the fast-evolving field of ML advances at the
pace of its conferences. By following the same pace, the blog posts
will add value and momentum to the conference. The track will enjoy
the advantages that conferences have over scientific
journals: a faster publication process and cross-fertilization of
ideas.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;positive-impact-for-the-conference&quot;&gt;Positive Impact for the Conference&lt;/h2&gt;

&lt;p&gt;We outline the potential positive impact of a blog post track for the
conference itself:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Increases the value of the papers submitted to ICLR: blog posts will
discuss previously published papers, thus increasing their
visibility and quality.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Incentivizes researchers to submit their best research to ICLR: high
quality work will likely get highlighted in future years in a blog
post.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Improves reproducibility and transparency: the blog post track will
identify and publicly document pitfalls and “tricks” that were not
clearly communicated in the original publication.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Provides a scientific value by itself: such blog posts will
reproduce and extend results of previously published papers. They
will distill important theoretical and practical ideas improving
their adoption and impact.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Test of time: this track will provide a sort of crowd-sourced test
of time at a shorter timescale than the current test-of-time
awards.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Promotes accessibility: because many of this track’s blog posts will
popularize past content, this track will make the conference broadly
more accessible (to students, non-native speakers, and, more generally,
non-experts in the field).&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;submission-format&quot;&gt;Submission Format&lt;/h2&gt;

&lt;p&gt;Our goal is to avoid heavily engineered, professionally made
blog posts (such as the “100+ hours” mentioned as a standard by the &lt;a href=&quot;https://distill.pub/journal/&quot;&gt;Distill
  guidelines&lt;/a&gt;) and to encourage ideas and clear writing rather than dynamic
visualizations or embedded JavaScript engines.&lt;/p&gt;

&lt;p&gt;As a result, we restrict submissions to the Markdown format. We believe
this is a good trade-off between complexity and flexibility. Markdown
enables users to easily embed media such as images, gifs, audio, and
video as well as write mathematical equations using MathJax, without
requiring users to know how to create HTML web pages. This (mostly)
static format is also fairly portable; users can download the blog post
without much effort for offline reading or archival purposes. More
importantly, this format can be easily hosted and maintained through
GitHub.&lt;/p&gt;

&lt;h2 id=&quot;submission-process&quot;&gt;Submission Process&lt;/h2&gt;

&lt;p&gt;A full copy of the track’s blogs will always be publicly available as a
GitHub repository &lt;a href=&quot;https://github.com/bourbaki-blogchain/bourbaki-blogchain.github.io&quot;&gt;(mock-up
link)&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The process for creating and submitting a blog post is as follows:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;
    &lt;p&gt;Entrants will fork this repository and &lt;strong&gt;make their fork private&lt;/strong&gt;.
 Failure to do so will result in the submission being rejected, as it
 breaches the double-blind review process.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Users will modify their fork as they see fit; they will add their post
 along with any media files it might require. Since this is a full fork,
 they will be able to view their own copy of the blog. This means that
 they will be able to see exactly how their post will look and behave
 on the main website.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Once completed, entrants will &lt;strong&gt;anonymize&lt;/strong&gt; their blog post (i.e. strip their
 name, affiliation, etc).&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Entrants will download a ZIP of their &lt;strong&gt;anonymized&lt;/strong&gt; fork (see figure
 below), and submit the ZIP to our OpenReview venue.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;img src=&quot;https://iclr.iro.umontreal.ca/4f7d7869-9804-4ce5-bf63-403917c28947_1642248606/public/images/download_zip.png&quot; alt=&quot;Download instructions image&quot; /&gt;&lt;/p&gt;

&lt;ol start=&quot;5&quot;&gt;
  &lt;li&gt;Once accepted, entrants will de-anonymize their post, make their fork
public again, and make a &lt;em&gt;Pull Request&lt;/em&gt; on GitHub from their fork to the
main blog, allowing us to pull in their new blog post in a transparent
way.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Once the submission period has ended, the GitHub repository of our track will
be temporarily made private for the duration of the conference, allowing the
conference to host the website. After the conference, the GitHub repository will
be made public again to allow viewers to fork and download its contents.&lt;/p&gt;

&lt;h2 id=&quot;the-potential-pitfalls-of-our-blog-post-track&quot;&gt;The Potential Pitfalls of Our Blog Post Track&lt;/h2&gt;

&lt;p&gt;In this section we identify potential issues arising with such a track
and explain how to mitigate them:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Adversarial Blog Posts&lt;/em&gt;: Since the guidelines are to write a blog
post on a previously published paper, one may expect some researchers
to try to use bad-faith arguments to criticize a competing paper
through one of these blog posts. We do not think this will happen,
because these blog posts will be public and thus researchers would
discredit themselves by using bad faith arguments.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Too many/few submissions&lt;/em&gt;: As this is a new track, it may be
difficult to predict the volume of submissions. The fact that there
are currently many independent blog posts on the web is a good
indicator that there will be positive interest. To get a better
estimate of the volume of potential submissions, we intend to
leverage social media to gauge the interest of the ML community in
such a track; this will allow us to gather a large enough reviewing
committee.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Reviewing&lt;/em&gt;: Once again, as this is a new track, it may be unclear
how to judge blog posts during a review process. We will recruit a
large reviewing committee and define clear guidelines for the
reviewing process. Our primary focus will be on the originality of
the perspective and the novelty of the ideas, insights, and
experiments. For instance, posts that reuse less content from the
original paper (results, direct quotes) will be scored more
favourably than those that use more.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Too many posts on the same paper&lt;/em&gt;: We may mitigate this by only
selecting a small number of blog posts on the same paper. This
could actually be a strength since this can encourage discussion and
highlight different perspectives on the same work. Moreover, we
could explicitly state that we will have this hard limit (e.g.,
accepting a maximum of 3 blog posts on the same paper) to entice
researchers to submit blog posts on papers that have less
visibility.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ol&gt;

&lt;h1 id=&quot;related-initiatives&quot;&gt;Related Initiatives&lt;/h1&gt;

&lt;p&gt;We mainly address our differences with respect to
&lt;a href=&quot;https://distill.pub/&quot;&gt;Distill&lt;/a&gt;, the &lt;a href=&quot;https://ml-retrospectives.github.io&quot;&gt;ML Retrospectives
Workshop&lt;/a&gt;, a Tutorial Track, and
other workshops discussing alternative formats for publications.&lt;/p&gt;

&lt;h4 id=&quot;distill&quot;&gt;Distill.&lt;/h4&gt;

&lt;p&gt;Created in 2016, &lt;a href=&quot;https://distill.pub/&quot;&gt;Distill&lt;/a&gt; is an online scientific
journal based on blog post publications. We address our differences with
respect to Distill:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Visualizations&lt;/em&gt;: Blog posts should take advantage of the fact that
they’re not paperbound, and use innovative visualisations. But the
process of creating the intricate, dynamic visualisations associated
with Distill posts is daunting for most authors. Creating blog
posts should be more easily accessible to newer authors and
researchers. Sometimes, being able to embed videos and gifs is
enough.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Content&lt;/em&gt;: Distill does not target the same type of content as our
track. Distill aims at presenting new research, and at making this
research more accessible. We want our blog post track to incentivize
researchers to revisit and discuss other researchers’ work, in a
more natural way than scientific papers allow. Such a practice would
undoubtedly be useful for the community, both as a short-term “test
of time”, and also as a way to extract the key ideas from lengthy
articles.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Limited adoption by the community&lt;/em&gt;: we believe that since Distill
is not associated with a big conference track, its widespread
adoption is hindered. This lack of association confines it to a
small subset of the community that is already familiar with blog
posts.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Leveraging the momentum of the conference&lt;/em&gt;: Distill describes
itself as a scientific journal. A large share of the publications
in the ML community are conference papers. A blog post track that
follows conferences would be better suited to follow the pace of the
community.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;ml-retrospective-workshop&quot;&gt;ML-Retrospective Workshop.&lt;/h4&gt;

&lt;p&gt;A recurrent workshop in the ML community is the &lt;a href=&quot;https://ml-retrospectives.github.io&quot;&gt;ML Retrospectives
Workshop&lt;/a&gt; (NeurIPS 2019, 2020 and
ICML 2020). This workshop is a venue for researchers to talk about their
previous work in a more open and transparent way. More precisely,
emphasis has recently been put on addressing:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Flaws or mistakes in the paper’s methodology&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Limitations in the applicability of the work&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Changes in understanding or intuition&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We share the ultimate goal of “making research more human”, but with a
completely different format. We believe that the constraint to write
about someone else’s work using natural language will channel fruitful
discussions and provide more visibility to previously published papers.&lt;/p&gt;

&lt;h4 id=&quot;tutorial-track&quot;&gt;Tutorial Track.&lt;/h4&gt;

&lt;p&gt;We believe that our proposed blog post track differentiates itself from
a tutorial track because the two operate at different scales. On the
one hand, a tutorial regarding a whole topic (e.g. GANs, adversarial
examples, Random matrix theory in ML) contains a long talk, slides, and
potentially exercises to get familiar with the topics. It is usually
made by a team of expert researchers on the topic. On the other hand,
the call for blog posts we propose focuses on a single publication. It
regards a single paper that can concern a more precise and recent topic
(e.g., a specific paper that addresses mode collapse on GANs, a novel
technique to perform adversarial training, etc.) and could be written by
a single researcher (once again making it more accessible to junior
researchers).&lt;/p&gt;

&lt;h4 id=&quot;previous-workshops-on-rethinking-publication-formats&quot;&gt;Previous workshops on rethinking publication formats.&lt;/h4&gt;

&lt;p&gt;Recently, the &lt;a href=&quot;https://rethinkingmlpapers.github.io/&quot;&gt;Rethinking ML Papers
Workshop&lt;/a&gt; at ICLR 2021 fuelled
the discussion (see references therein for related past workshops). The
presenters discussed the importance of accessibility, web
demonstrations, visualization and blog posts (among others). One
particularly related discussion was the &lt;a href=&quot;https://slideslive.com/38956531/beyond-static-papers-rethinking-how-we-share-scientific-understanding-in-ml&quot;&gt;talk by Lilian Weng
(time=4h25mins)&lt;/a&gt;
on the usefulness of blog posts to get up-to-date with the field of ML.&lt;/p&gt;

&lt;p&gt;In alignment with these initiatives, this new track is another step in
the direction of making research more human.&lt;/p&gt;

&lt;h3 id=&quot;bibliography&quot;&gt;Bibliography&lt;/h3&gt;
&lt;p&gt;&lt;a name=&quot;Litt&quot;&gt;Michael L Littman. Collusion rings threaten the integrity of computer science research. Communications of the ACM, 2021.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a name=&quot;Tran&quot;&gt;David Tran, Alex Valtchanov, Keshav Ganapathy, Raymond Feng, Eric Slud, Micah Goldblum, and Tom Goldstein. An open review of openreview: A critical analysis of the machine learning conference review process. arXiv, 2020. &lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a name=&quot;Lin&quot;&gt;Hsuan-Tien Lin, Maria-Florina Balcan, Raia Hadsell, and Marc’Aurelio Ranzato. What we learned from NeurIPS 2020 reviewing process. Medium, https://medium.com/@NeurIPSConf/what-we-learned-from-neurips-2020-reviewing-process-e24549eea38f, 2020.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a name=&quot;Brow&quot;&gt;Eryn Brown and Chris Woolston. Why science blogging still matters. Nature, 2018.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a name=&quot;Halm&quot;&gt;Paul R Halmos. Nicolas Bourbaki. Scientific American, 1957.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a name=&quot;Bour&quot;&gt;Nicolas Bourbaki. Elements of mathematics. Éditions Hermann, 1939.&lt;/a&gt;&lt;/p&gt;

</content>
 </entry>
 

</feed>
