Given a set of sites, CV-Inspector will automate the crawling, data collection, differential analysis, and labeling of the sites.
- Label 1 = the site was able to circumvent the adblocker
- Label 0 = the site did not circumvent the adblocker, either because its attempt failed or because it did not attempt circumvention
CV-Inspector was developed and used in the paper: CV-Inspector: Towards Automating Detection of Adblock Circumvention.
We refer to the paper for more details.
Visit our CV-Inspector Project page for more information, including datasets that we utilized in the paper.
Visit our CV-Inspector Dataset page for more information.
If you create a publication (including web pages, papers published by a third party, and publicly available presentations) using CV-Inspector, please cite the corresponding paper as follows:
@inproceedings{le2021cvinspector,
title={{CV-Inspector: Towards Automating Detection of Adblock Circumvention}},
author={Le, Hieu and Markopoulou, Athina and Shafiq, Zubair},
booktitle={The Network and Distributed System Security Symposium (NDSS)},
url = {https://dx.doi.org/10.14722/ndss.2021.24055},
doi = {10.14722/ndss.2021.24055},
year={2021}
}
We also encourage you to provide us (athinagroupreleases@gmail.com) with a link to your publication. We use this information in reports to our funding agencies.
For quick use, you can use our AMI, which has CV-Inspector already set up on Ubuntu 18.04.3 LTS.
- Request access to the AMI by filling out the form
- You must have an existing AWS account. We will share the AMI directly with your account.
- See the README_AMI.md for the AMI.
If you want to set up your own environment, see the README_selfsetup.md.
- CV-Inspector Adblock Plus Chrome Extension: A custom version of Adblock Plus Chrome extension to annotate the page source
- npm: To build Chrome extensions
- mongodb: To save intermediate data collected
- chromedriver78: The ChromeDriver for Selenium (version 78)
- Python 3.6+: CV-Inspector is built on top of Python 3.6
- setup.py: List of Python packages
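Before a self-setup, it can help to confirm that the dependencies listed above are reachable. The following is a minimal sketch, not part of CV-Inspector: the executable names (`npm`, `mongod`, `chromedriver`) and the helper functions are assumptions for illustration, and the actual names on your system may differ.

```python
import shutil
import sys

# Assumed executable names; adjust them to match your installation.
REQUIRED_TOOLS = ["npm", "mongod", "chromedriver"]
MIN_PYTHON = (3, 6)


def missing_tools(tools, which=shutil.which):
    """Return the entries of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    missing = missing_tools(REQUIRED_TOOLS)
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    if not python_is_supported():
        print("Python 3.6 or newer is required")
    if not missing and python_is_supported():
        print("All checked dependencies were found")
```

This only checks that the tools exist; it does not verify that ChromeDriver is version 78 or that MongoDB is running.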
Throughout the code and datasets, you may see the following terms:
- control: This commonly means the case for "No Adblocker"
- variant: This commonly means the case for "With Adblocker"
- crawl_group_name: A unique identifier that holds together all data collected within one call of cvinspector_monitor.
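To make the terms above concrete, here is a small sketch of how records from a crawl could be organized. The record layout (the `case` and `url` fields) and the `group_by_crawl` helper are hypothetical and chosen only for illustration; they do not reflect CV-Inspector's actual data schema.

```python
from collections import defaultdict

CONTROL = "control"  # page loaded with no adblocker
VARIANT = "variant"  # page loaded with the adblocker enabled


def group_by_crawl(records):
    """Group page URLs by crawl_group_name, split into control and variant."""
    grouped = defaultdict(lambda: {CONTROL: [], VARIANT: []})
    for record in records:
        grouped[record["crawl_group_name"]][record["case"]].append(record["url"])
    return dict(grouped)


# Hypothetical records: two crawl groups, one of them missing its variant run.
records = [
    {"crawl_group_name": "example_crawl_1", "case": CONTROL, "url": "https://example.com"},
    {"crawl_group_name": "example_crawl_1", "case": VARIANT, "url": "https://example.com"},
    {"crawl_group_name": "example_crawl_2", "case": CONTROL, "url": "https://example.org"},
]

groups = group_by_crawl(records)
```

A differential analysis compares the control and variant entries within the same group, so a group with an empty variant list (as in `example_crawl_2` here) has nothing to compare against.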
CV-Inspector is licensed under Apache-2.0 License.
- The original DOM Chrome Extension was provided by the authors of Detecting Anti-Adblockers in the Wild. We modify the extension for CV-Inspector.