| Left: | ||
| Right: |
| OLD | NEW |
|---|---|
| 1 /* | 1 /* |
| 2 * This file is part of Adblock Plus <http://adblockplus.org/>, | 2 * This file is part of Adblock Plus <http://adblockplus.org/>, |
| 3 * Copyright (C) 2006-2014 Eyeo GmbH | 3 * Copyright (C) 2006-2014 Eyeo GmbH |
| 4 * | 4 * |
| 5 * Adblock Plus is free software: you can redistribute it and/or modify | 5 * Adblock Plus is free software: you can redistribute it and/or modify |
| 6 * it under the terms of the GNU General Public License version 3 as | 6 * it under the terms of the GNU General Public License version 3 as |
| 7 * published by the Free Software Foundation. | 7 * published by the Free Software Foundation. |
| 8 * | 8 * |
| 9 * Adblock Plus is distributed in the hope that it will be useful, | 9 * Adblock Plus is distributed in the hope that it will be useful, |
| 10 * but WITHOUT ANY WARRANTY; without even the implied warranty of | 10 * but WITHOUT ANY WARRANTY; without even the implied warranty of |
| 11 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | 11 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the |
| 12 * GNU General Public License for more details. | 12 * GNU General Public License for more details. |
| 13 * | 13 * |
| 14 * You should have received a copy of the GNU General Public License | 14 * You should have received a copy of the GNU General Public License |
| 15 * along with Adblock Plus. If not, see <http://www.gnu.org/licenses/>. | 15 * along with Adblock Plus. If not, see <http://www.gnu.org/licenses/>. |
| 16 */ | 16 */ |
| 17 | 17 |
| 18 #include <AdblockPlus/ReferrerMapping.h> | 18 #include <AdblockPlus/ReferrerMapping.h> |
| 19 | 19 |
| 20 using namespace AdblockPlus; | 20 using namespace AdblockPlus; |
| 21 | 21 |
| 22 ReferrerMapping::ReferrerMapping(const int maxCachedUrls) | 22 ReferrerMapping::ReferrerMapping(const int maxCachedUrls) |
| 23 : maxCachedUrls(maxCachedUrls) | 23 : maxCachedUrls(maxCachedUrls) |
| 24 { | 24 { |
| 25 } | 25 } |
| 26 | 26 |
| 27 void ReferrerMapping::Add(const std::string& url, const std::string& referrer) | 27 void ReferrerMapping::Add(const std::string& url, const std::string& referrer, |
| 28 FilterEngine::ContentType requestType) | |
| 28 { | 29 { |
| 29 if (mapping.find(url) != mapping.end()) | 30 if (mapping.find(url) != mapping.end()) |
| 30 cachedUrls.remove(url); | 31 cachedUrls.remove(url); |
| 31 cachedUrls.push_back(url); | 32 cachedUrls.push_back(url); |
| 32 mapping[url] = referrer; | 33 mapping.emplace(url, RequestInfo(referrer, requestType)); |
|
Wladimir Palant
2014/11/25 15:46:37
My understanding is that map::emplace won't replac
sergei
2014/11/26 10:34:43
Excellent point, I forget about it.
| |
| 33 | 34 |
| 34 const int urlsToPop = cachedUrls.size() - maxCachedUrls; | 35 const int urlsToPop = cachedUrls.size() - maxCachedUrls; |
| 35 for (int i = 0; i < urlsToPop; i++) | 36 for (int i = 0; i < urlsToPop; i++) |
| 36 { | 37 { |
| 37 const std::string poppedUrl = cachedUrls.front(); | 38 const std::string poppedUrl = cachedUrls.front(); |
| 38 cachedUrls.pop_front(); | 39 cachedUrls.pop_front(); |
| 39 mapping.erase(poppedUrl); | 40 mapping.erase(poppedUrl); |
| 40 } | 41 } |
| 41 } | 42 } |
| 42 | 43 |
| 43 std::vector<std::string> ReferrerMapping::BuildReferrerChain( | 44 std::vector<std::string> ReferrerMapping::BuildFrameStructure( |
| 44 const std::string& url) const | 45 const std::string& url) const |
| 45 { | 46 { |
| 46 std::vector<std::string> referrerChain; | 47 std::vector<std::string> referrerChain; |
|
Wladimir Palant
2014/11/25 15:46:37
If you rename the function you should also rename
sergei
2014/11/26 10:34:43
renamed to `frames`.
| |
| 47 referrerChain.push_back(url); | 48 if (url.empty()) |
| 49 { | |
| 50 return referrerChain; | |
|
Wladimir Palant
2014/11/25 15:46:37
Why is this special case necessary? An empty strin
sergei
2014/11/26 10:34:43
Because it's rather an invalid argument and we sho
Wladimir Palant
2014/12/09 21:20:03
I realized that I misunderstood how this method is
| |
| 51 } | |
| 48 // We need to limit the chain length to ensure we don't block indefinitely | 52 // We need to limit the chain length to ensure we don't block indefinitely |
| 49 // if there's a referrer loop. | 53 // if there's a referrer loop. |
| 50 const int maxChainLength = 10; | 54 // Despite there is quite effecient algorithm to work with such loops, merely |
| 51 std::map<std::string, std::string>::const_iterator currentEntry = | 55 // limit the chain to some number works fine on practice and seems quiet |
| 56 // elegant. | |
|
Wladimir Palant
2014/11/25 15:46:37
This snarky comment seems completely pointless to
sergei
2014/11/26 10:34:43
removed
| |
| 57 const int maxChainLength = 20; | |
|
Wladimir Palant
2014/11/25 15:46:37
Why increase this value? 10 should be enough for a
sergei
2014/11/26 10:34:43
It was needed for tests.
removed.
| |
| 58 referrerChain.reserve(maxChainLength); | |
|
Wladimir Palant
2014/11/25 15:46:37
I doubt that this makes sense. While we might get
sergei
2014/11/26 10:34:43
I've tested, it's indeed has only one frame in abo
Wladimir Palant
2014/12/09 21:20:03
Any sane allocation approach does, to minimize mem
sergei
2014/12/12 17:07:07
Actually on a lower level usual allocator rounds i
| |
| 59 std::map<std::string, RequestInfo>::const_iterator currentEntry = | |
|
Wladimir Palant
2014/11/25 15:46:37
Given that you use std::map<std::string, RequestIn
sergei
2014/11/26 10:34:43
changed.
| |
| 52 mapping.find(url); | 60 mapping.find(url); |
| 53 for (int i = 0; i < maxChainLength && currentEntry != mapping.end(); i++) | 61 // We need it to build [first, second] when there is only one entry in the |
| 62 // `mapping` member, [{second, {first, AnyTypeOfSecond}}]. | |
| 63 // So, if we exit from the cycle not because of loop limit and the first is | |
| 64 // not empty then add it. | |
|
Wladimir Palant
2014/11/25 15:46:37
I assume that the code this comment refers to will
| |
| 65 std::map<std::string, RequestInfo>::const_iterator prevEntry = mapping.end(); | |
| 66 | |
| 67 // process the current url beyond the loop below because we should add it | |
| 68 // even if `mapping` is empty. | |
| 69 if (currentEntry != mapping.end()) | |
| 54 { | 70 { |
| 55 const std::string& currentUrl = currentEntry->second; | 71 if (currentEntry->second.type == FilterEngine::ContentType::CONTENT_TYPE_SUB DOCUMENT) |
| 56 referrerChain.insert(referrerChain.begin(), currentUrl); | 72 { |
| 57 currentEntry = mapping.find(currentUrl); | 73 referrerChain.push_back(url); |
| 74 } | |
| 75 prevEntry = currentEntry; | |
| 76 currentEntry = mapping.find(currentEntry->second.referrer); | |
| 58 } | 77 } |
| 78 else | |
| 79 { | |
| 80 // If there is no information for the url in `mapping` then assume that | |
| 81 // it's a frame, despite it's not necessary to be a frame. | |
| 82 referrerChain.push_back(url); | |
|
Wladimir Palant
2014/11/25 15:46:37
This special case is unnecessary. An empty referre
sergei
2014/11/26 10:34:43
Not clear which special case is mentioned, could y
Wladimir Palant
2014/12/09 21:20:03
I just realized that this was designed to work dif
sergei
2014/12/12 17:07:07
I also misunderstood it the same way initially and
| |
| 83 } | |
| 84 for (int i = 0; i < maxChainLength && currentEntry != mapping.end(); i++, | |
| 85 currentEntry = mapping.find(currentEntry->second.referrer)) | |
|
Wladimir Palant
2014/11/25 15:46:37
Please put the assignment to currentEntry at the e
sergei
2014/11/26 10:34:43
Not sure whether it's easier to read, because 'for
| |
| 86 { | |
| 87 if (currentEntry->second.type == FilterEngine::ContentType::CONTENT_TYPE_SUB DOCUMENT) | |
| 88 { | |
| 89 referrerChain.push_back(currentEntry->first); | |
|
Wladimir Palant
2014/11/25 15:46:37
This should be .insert(referrerChain.begin(), ...)
sergei
2014/11/26 10:34:43
In general it's inefficient, I don't understand wh
Wladimir Palant
2014/12/09 21:20:03
That's what I meant - for tiny lists like what we
| |
| 90 } | |
| 91 prevEntry = currentEntry; | |
| 92 } | |
| 93 if (prevEntry != mapping.end() && !prevEntry->second.referrer.empty()) | |
| 94 { | |
| 95 referrerChain.push_back(prevEntry->second.referrer); | |
| 96 } | |
| 97 // it should be slightly more efficient to reverse it at the end instead | |
| 98 // of copying the tail of the referrerChain on each adding. | |
|
Wladimir Palant
2014/11/25 15:46:37
It should also be pretty pointless, with the refer
sergei
2014/11/26 10:34:43
The main reason of it was to use push_back and rev
| |
| 99 reverse(referrerChain.begin(), referrerChain.end()); | |
| 59 return referrerChain; | 100 return referrerChain; |
| 60 } | 101 } |
| OLD | NEW |