Issue 29882562: Issue 6956 - Move extension's style sheet generation into core

Manish Jethani

Sept. 16, 2018, 5:19 p.m. (2018-09-16 17:19:27 UTC) #1

Manish Jethani

Patch Set 1 All this code, except the documentation and unit tests, is copied from ...

Sept. 16, 2018, 5:23 p.m. (2018-09-16 17:23:13 UTC) #2

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29882563/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29882563/lib/elemHide.js#newcode92 lib/elemHide.js:92: yield selectors.slice(i, i + selectorGroupSize); Apparently this code is ...

Sept. 17, 2018, 6:59 p.m. (2018-09-17 18:59:58 UTC) #3

Manish Jethani

On 2018/09/17 18:59:58, Sebastian Noack wrote: > https://codereview.adblockplus.org/29882562/diff/29882563/lib/elemHide.js > File lib/elemHide.js (right): > > https://codereview.adblockplus.org/29882562/diff/29882563/lib/elemHide.js#newcode92 ...

Sept. 18, 2018, 2:51 p.m. (2018-09-18 14:51:53 UTC) #4

Manish Jethani

Patch Set 2: Make createStyleSheet top-level Patch Set 3: Avoid temporary array and join It ...

Sept. 18, 2018, 3:12 p.m. (2018-09-18 15:12:35 UTC) #5

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) Since splitSelectors() is only ...

Sept. 18, 2018, 3:33 p.m. (2018-09-18 15:33:24 UTC) #6

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 15:33:24, Sebastian ...

Sept. 18, 2018, 3:50 p.m. (2018-09-18 15:50:02 UTC) #7

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 15:50:02, Manish ...

Sept. 18, 2018, 4:41 p.m. (2018-09-18 16:41:45 UTC) #8

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 16:41:45, Sebastian ...

Sept. 18, 2018, 5:24 p.m. (2018-09-18 17:24:08 UTC) #9

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js
File lib/elemHide.js (right):

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#new...
lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors))
On 2018/09/18 16:41:45, Sebastian Noack wrote:
> On 2018/09/18 15:50:02, Manish Jethani wrote:
> > On 2018/09/18 15:33:24, Sebastian Noack wrote:
> > > Since splitSelectors() is only called from one code path (here), and isn't
> > > exported either, why not just merge the logic of splitSelectors() and
> > > createRules()?
> > 
> > I tried this and it seems to actually perform slightly worse. I think V8
> > optimizes smaller functions. Given that these two can be logically separated
> > (and there's no performance benefit of merging them), I'd rather leave it
like
> > it is.
> 
> V8 optimizes functions regardless of size, however, it only inlines small
> functions. However, we could not only merge splitSelectors() and createRules()

There is a specific optimization in TurboFan called "small function" that marks
a function for compilation (not inlining) if it's small enough. Anyway, I'll
admit I don't know about the details.

> but as well put the whole logic into createStyleSheet(). That way we'd get
away
> with neither using generators nor creating an array, avoiding a performance
> penalty on older browser engines that don't optimize generators, as well as
any
> performance penalty caused by functions not being inlined by the JIT, and
FWIW,

OK, so I merged everything into one function and ran the test mentioned in the
issue comments. I opened 25 windows, 8 times (so the function was called 200
times).

All merged: 2328.1999970786273
Split up into three functions: 1946.2000061757863

I'm not really seeing any case for merging these three perfectly independent and
well documented functions.

> it will be less code as well.

Well, it will be one big function that is harder to grok than three small
functions that specialize in their individual tasks. I know that this is
subjective, but since there's no good case for performance here (except on older
browsers, which is still speculative), I'd say let's leave it as it is at least
for this change.

Regarding older browsers: where there's a tradeoff between older browsers and
either one of maintainability and mobile (latest V8), I would go for the latter.

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 17:24:07, Manish ...

Sept. 18, 2018, 5:40 p.m. (2018-09-18 17:40:54 UTC) #10

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 17:40:54, Sebastian ...

Sept. 18, 2018, 5:47 p.m. (2018-09-18 17:47:47 UTC) #11

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 17:47:46, Manish ...

Sept. 18, 2018, 7:51 p.m. (2018-09-18 19:51:57 UTC) #12

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js
File lib/elemHide.js (right):

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#new...
lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors))
On 2018/09/18 17:47:46, Manish Jethani wrote:
> On 2018/09/18 17:40:54, Sebastian Noack wrote:
> > On 2018/09/18 17:24:07, Manish Jethani wrote:
> > > OK, so I merged everything into one function and ran the test mentioned in
> the
> > > issue comments. I opened 25 windows, 8 times (so the function was called
200
> > > times).
> > > 
> > > All merged: 2328.1999970786273
> > > Split up into three functions: 1946.2000061757863
> > 
> > This appears rather odd to me. On which version of Chrome did you do that
> > benchmark? Mind also trying on Firefox at least?
> 
> I did it on Chrome 71 (Canary). It's not going to be much different on stable
> (version 69), but I'll try it out regardless. Also Firefox.

Alright, here we go.

The alternative version of the `createStyleSheet` function is this:

  function createStyleSheet(selectors)
  {
    let s = performance.now();
    let styleSheet = "";

     // Chromium's Blink engine supports only up to 8,192 simple selectors, and
    // even fewer compound selectors, in a rule. The exact number of selectors
    // that would work depends on their sizes (e.g. "#foo .bar" has a size of
2).
    // Since we don't know the sizes of the selectors here, we simply split them
    // into groups of 1,024, based on the reasonable assumption that the average
    // selector won't have a size greater than 8. The alternative would be to
    // calculate the sizes of the selectors and divide them up accordingly, but
    // this approach is more efficient and has worked well in practice. In
theory
    // this could still lead to some selectors not working on Chromium, but it
is
    // highly unlikely.
    // See issue #6298 and https://crbug.com/804179
    for (let i = 0; i < selectors.length; i += selectorGroupSize)
      styleSheet += selectors.slice(i, i + selectorGroupSize).join(", ") +
                    " {display: none !important;}\n";

    window.n += performance.now() - s;

    return styleSheet;
  }

You can comment out the `performance.now()` lines to make it pass the unit
tests.

On Chrome 69:

 - three functions with generators: 1743.899998255074
 - all merged: 2081.1999943107367

On Firefox 61:

 - three functions with generators: 838
 - all merged: 760

Given that most of our user base is on the latest version of V8, it's a
no-contest. Firefox will catch up. Also this is just a single run for Firefox
(whereas on Chrome I also did this on version 71 with similar results).

I encourage you to try it out yourself, for example on Firefox:

  for i in $(seq 25); do open -a firefox "http://example.com/?n=$i"; done

I did this 8 times, resulting in 200 calls to the function (no other windows
open).

At this point, considering this issue wasn't about optimizing these functions
but merely about copying them over in the first place, we should just land this
patch.

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 17:47:46, Manish ...

Sept. 18, 2018, 9:21 p.m. (2018-09-18 21:21:04 UTC) #13

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/18 21:21:03, Sebastian ...

Sept. 19, 2018, 9:39 a.m. (2018-09-19 09:39:01 UTC) #14

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) Since I was modifying ...

Sept. 19, 2018, 10:18 a.m. (2018-09-19 10:18:34 UTC) #15

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/19 09:39:00, Manish ...

Sept. 19, 2018, 10:29 a.m. (2018-09-19 10:29:08 UTC) #16

Sebastian Noack

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/19 10:29:08, Manish ...

Sept. 19, 2018, 11:05 a.m. (2018-09-19 11:05:53 UTC) #17

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js
File lib/elemHide.js (right):

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#new...
lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors))
On 2018/09/19 10:29:08, Manish Jethani wrote:
> On 2018/09/19 09:39:00, Manish Jethani wrote:
> > On 2018/09/18 21:21:03, Sebastian Noack wrote:
> > 
> > > However, I might still argue that the code (now where I have seen how it
> would
> > > look like), is simpler and easier to follow if those functions are merged,
> and
> > 
> > A lot of the best programmers would disagree that one function doing
multiple
> > things is easier to follow than multiple functions each doing one thing [1]
> > 
> > [1]:
> >
>
https://softwareengineering.stackexchange.com/questions/308108/when-is-it-app...
> 
> Also:
>
https://arstechnica.com/information-technology/2013/08/ask-stack-is-it-ok-to-...

I'm obviously not trying to argue that functions should generally be avoided and
that code should always be inlined if possible. But an algorithm that would
result into 6 lines of code if implemented in a single function, hardly calls
for being split up into 3 functions.

For reference, this is how a prime number checker would look like following your
practice:

  function* generateNumbers(start, stop)
  {
    for (let i = start; i < stop; i++)
      yield i;
  }

  function isDividable(divident, divisor)
  {
    return dividend % divisor == 0;
  }

  function isValid(num)
  {
    return num != 0 && num != 1;
  }

  function isPrime(num) {
    for (let i of generateNumbers(2, num))
      if (isDividable(num, i)) return false;
    return isValid(num);
  }

Is this any more readable than if you'd merge those functions? It's not, because
now when following the code execution path you have to jump from one function to
another going up and down in the source file, while the complexity split out is
trivial, so that it would be easier to read the code in place rather than
following function calls. IMO, this is exactly the same with the code here.

If it's really faster (on the current version of Chrome) when splitting up the
code like that, this is another story, but still a mystery to me. Just one more
idea why that might be: Perhaps it performs better if you move that long comment
out of the function body. I remember that V8 determines what a small function
(subject to particular optimizations) is among others based on the number of
characters in the function body.

Manish Jethani

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) On 2018/09/19 11:05:53, Sebastian ...

Sept. 19, 2018, 1:14 p.m. (2018-09-19 13:14:41 UTC) #18

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js
File lib/elemHide.js (right):

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#new...
lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors))
On 2018/09/19 11:05:53, Sebastian Noack wrote:
> On 2018/09/19 10:29:08, Manish Jethani wrote:
> > On 2018/09/19 09:39:00, Manish Jethani wrote:
> > > On 2018/09/18 21:21:03, Sebastian Noack wrote:
> > > 
> > > > However, I might still argue that the code (now where I have seen how it
> > would
> > > > look like), is simpler and easier to follow if those functions are
merged,
> > and
> > > 
> > > A lot of the best programmers would disagree that one function doing
> multiple
> > > things is easier to follow than multiple functions each doing one thing
[1]
> > > 
> > > [1]:
> > >
> >
>
https://softwareengineering.stackexchange.com/questions/308108/when-is-it-app...
> > 
> > Also:
> >
>
https://arstechnica.com/information-technology/2013/08/ask-stack-is-it-ok-to-...
> 
> I'm obviously not trying to argue that functions should generally be avoided
and
> that code should always be inlined if possible. But an algorithm that would
> result into 6 lines of code if implemented in a single function, hardly calls
> for being split up into 3 functions.

If they are logically separate (as in, the programmer thinks of them as separate
operations), then why not? It should not depend on the number of lines of code.

> For reference, this is how a prime number checker would look like following
your
> practice:
>
> [...]
>
> Is this any more readable than if you'd merge those functions? It's not,
because

This would be fine if it's a program for children to learn programming. It
depends on the context, to a great extent.

> now when following the code execution path you have to jump from one function
to
> another going up and down in the source file, while the complexity split out
is
> trivial, so that it would be easier to read the code in place rather than
> following function calls. IMO, this is exactly the same with the code here.

Well it seems more readable to me this way, maybe because I'm already looking
ahead to what we're going to do here to deduplicate the style sheet/rules.

Anyway, I've got a new idea for how to optimize the style sheets, and indeed we
may be maintaining a cache of the first 7 out of the 8-10 selector groups
(20,000 selectors / 1,024 group size), in which case this function would get
larger with more logic and then it's better to split it up into specialized
functions.

> If it's really faster (on the current version of Chrome) when splitting up the
> code like that, this is another story, but still a mystery to me. Just one
more
> idea why that might be: Perhaps it performs better if you move that long
comment
> out of the function body. I remember that V8 determines what a small function
> (subject to particular optimizations) is among others based on the number of
> characters in the function body.

I thought maybe that was the case, maybe it is after all. But I also know that
the new V8 (TurboFan) doesn't take comments into account when deciding the size
of the function.

Jon Sonesen

https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js File lib/elemHide.js (right): https://codereview.adblockplus.org/29882562/diff/29884568/lib/elemHide.js#newcode288 lib/elemHide.js:288: for (let selectorGroup of splitSelectors(selectors)) FWIW I would say ...

Sept. 19, 2018, 4:36 p.m. (2018-09-19 16:36:27 UTC) #19

Manish Jethani

Alright, so the issue here was about literally copying and pasting code from the extension ...

Sept. 20, 2018, 12:30 p.m. (2018-09-20 12:30:19 UTC) #20

Manish Jethani

Sept. 20, 2018, 5:22 p.m. (2018-09-20 17:22:13 UTC) #22

On 2018/09/20 16:11:38, Sebastian Noack wrote:
> LGTM

Thanks!

Issue 29882562: Issue 6956 - Move extension's style sheet generation into core (Closed)

Patch Set 1 #

Patch Set 2 : Make createStyleSheet top-level #

Patch Set 3 : Avoid temporary array and join #

Messages