Dumviri: Detecting Trackers and Mixed Trackers with a Breakage Detector

Proceedings of the Twelfth ACM Workshop on Hot Topics in Networks

Adreveal: Improving transparency into online targeted advertising

B. Liu, A. Sheth, U. Weinsberg, J. Chandrashekar, R. Govindan

Published: 2013

Network and Distributed System Security Symposium (NDSS)

Selling off privacy at auction

C. Castelluccia, L. Olejnik, T. Minh-Dung

Published: 2014

2021 IEEE Symposium on Security and Privacy (SP)

Detecting filter list evasion with event-loop-turn granularity javascript signatures

Q. Chen, P. Snyder, B. Livshits, A. Kapravelos

Published: 2021

Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security

Jack-in-the-box: An empirical study of javascript bundling on the web and its security implications

J. Rack, C.-A. Staicu

Published: 2023

Proc. Priv. Enhancing Technol.

An automated approach for complementing ad blockers’ blacklists

D. Gugelmann, M. Happe, B. Ager, V. Lenders

Published: 2015

arxiv

被引用数 1

Towards Seamless Tracking-Free Web: Improved Detection of Trackers via One-class Learning

Muhammad Ikram, Hassan Jameel Asghar, Mohamed Ali Kaafar, Balachander Krishnamurthy, Anirban Mahanti

Published: 2016.3.21

Numerous tools have been developed to aggressively block the execution of popular JavaScript programs (JS) in Web browsers. Such blocking also affects functionality of webpages and impairs user experience. As a consequence, many privacy preserving tools (PP-Tools) that have been developed to limit online tracking, often executed via JS, may suffer from poor performance and limited uptake. A mechanism that can isolate JS necessary for proper functioning of the website from tracking JS would thus be useful. Through the use of a manually labelled dataset composed of 2,612 JS, we show how current PP-Tools are ineffective in finding the right balance between blocking tracking JS and allowing functional JS. To the best of our knowledge, this is the first study to assess the performance of current web PP-Tools. To improve this balance, we examine the two classes of JS and hypothesize that tracking JS share structural similarities that can be used to differentiate them from functional JS. The rationale of our approach is that web developers often borrow and customize existing pieces of code in order to embed tracking (resp. functional) JS into their webpages. We then propose one-class machine learning classifiers using syntactic and semantic features extracted from JS. When trained only on samples of tracking JS, our classifiers achieve an accuracy of 99%, where the best of the PP-Tools achieved an accuracy of 78%. We further test our classifiers and several popular PP-Tools on a corpus of 4K websites with 135K JS. The output of our best classifier on this data is between 20 to 64% different from the PP-Tools. We manually analyse a sample of the JS for which our classifier is in disagreement with all other PP-Tools, and show that our approach is not only able to enhance user web experience by correctly classifying more functional JS, but also discovers previously unknown tracking services.

モデル性能評価データ収集プライバシーリスク管理

2020 IEEE Symposium on Security and Privacy (SP)

Adgraph: A graph-based approach to ad and tracker blocking

U. Iqbal, P. Snyder, S. Zhu, B. Livshits, Z. Qian, Z. Shafiq

Published: 2020

arXiv preprint arXiv:2107.11309

Webgraph: Capturing advertising and tracking information flows for robust blocking

S. Siby, U. Iqbal, S. Englehardt, Z. Shafiq, C. Troncoso

Published: 2021

arXiv preprint arXiv:2301.10895

Astrack: Automatic detection and removal of web tracking code with minimal functionality loss

I. Castell-Uroz, K. Fukuda, P. Barlet-Ros

Published: 2023

Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security

Sugarcoat: Programmatically generating privacy-preserving, web-compatible resource replacements for content blocking

M. Smith, P. Snyder, B. Livshits, D. Stefan

Published: 2021

Privacy Enhancing Technologies Symposium (PETS)

Blocked or broken? Automatically detecting when privacy interventions break websites

M. Smith, P. Snyder, M. Haller, B. Livshits, D. Stefan, H. Haddadi

Published: 2022

Proceedings of the 32nd USENIX Security Symposium

Defining “broken”: User experiences and remediation tactics when ad-blocking or tracking-protection tools break a website’s user experience

A. Nisenoff, A. Borem, M. Pickering, G. Nakanishi, M. Thumpasery, B. Ur

Published: 2023

32nd USENIX Security Symposium (USENIX Security 23)

PoolParty: Exploiting browser resource pools for web tracking

P. Snyder, S. Karami, A. Edelstein, B. Livshits, H. Haddadi

Published: 2023

easylist/easylist commit d56ebb6

Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems

Who filters the filters: Understanding the growth, usefulness and efficiency of crowdsourced ad blocking

P. Snyder, A. Vastel, B. Livshits

Published: 2020

Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations

Trafilatura: A Web Scraping Library and Command-Line Tool for Text Discovery and Extraction

A. Barbaresi

Published: 2021

Perceptual ad highlighter

G. Storey, D. Reisman, J. Mayer, A. Narayanan

Published: 2017

brave/brave-browser wiki

Pagegraph

analytics-market

How google analytics collects data

2016 IEEE 17th International conference on information reuse and integration (IRI)

Clustering web pages based on structure and style similarity (application paper)

T. Gowda, C. A. Mattmann

Published: 2016

Proceedings of the 24th International Conference on World Wide Web

Privaricator: Deceiving fingerprinters with little white lies

N. Nikiforakis, W. Joosen, B. Livshits

Published: 2015

Proceedings of the 36th International Conference on Machine Learning

EfficientNet: Rethinking model scaling for convolutional neural networks

Mingxing Tan, Quoc Le

Published: 2019

16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22)

Jawa: Web archival in the era of JavaScript

A. Goel, J. Zhu, R. Netravali, H. V. Madhyastha

Published: 2022

Xgboost documentation

Implement blocking by post body parameters

All you need to know about cookies

Wayback machine

Tracking tags — docs — twitter developer platform

Adobe analytics 2.0 api reference

youtube tracker · issue #7878 · easylist/easylist

Tracker radar wiki

IEEE Symposium on Security and Privacy

Wtagraph: Web tracking and advertising detection

Z. Yang, W. Pei, M. Chen, C. Yue

Published: 2022

Proc. NDSS

Tranco: A research-oriented top sites ranking hardened against manipulation

V. L. Pochat, T. Van Goethem, S. Tajalizadehkhoob, M. Korczynski, W. Joosen

Published: 2019

google cloud

Pricing — compute engine: Virtual machines (vms)

U. B. of Labor Statistics

average hourly and weekly earnings of employees

Privacy badger

Proceedings of the 2014 Workshop on Artificial Intelligent and Security Workshop

Leveraging machine learning to improve unwanted resource filtering

S. Bhagavatula, C. Dunn, C. Kanich, M. Gupta, B. Ziebart

Published: 2014

The World Wide Web Conference

Cookie synchronization: Everything you always wanted to know but were afraid to ask

P. Papadopoulos, N. Kourtellis, E. Markatos

Published: 2019

Proceedings of the 2016 ACM SIGSAC conference on computer and communications security

Online tracking: A 1-million-site measurement and analysis

S. Englehardt, A. Narayanan

Published: 2016

2017 IEEE International Workshop on Measurement and Networking (M&N)

Towards accurate detection of obfuscated web tracking

H. Le, F. Fallace, P. Barlet-Ros

Published: 2017

Proceedings of the 2016 ACM on International Workshop on Security And Privacy Analytics

Towards automatic identification of javascript-oriented machine-based tracking

A. J. Kaizer, M. Gupta

Published: 2016

European Symposium on Research in Computer Security

A machine learning approach for detecting third-party trackers on the web

Q. Wu, Q. Liu, Y. Zhang, P. Liu, G. Wen

Published: 2016

Proceedings of the 21st ACM Internet Measurement Conference

Trackersift: Untangling mixed tracking and functional web resources

A. H. Amjad, D. Saleem, M. A. Gulzar, Z. Shafiq, F. Zaffar

Published: 2021

arXiv preprint arXiv:2302.01182

Blocking javascript without breaking the web: An empirical investigation

A. H. Amjad, Z. Shafiq, M. A. Gulzar

Published: 2023

Fourteenth symposium on usable privacy and security (SOUPS 2018)

Characterizing the use of browser-based blocking extensions to prevent online tracking

A. Mathur, J. Vitak, A. Narayanan, M. Chetty

Published: 2018

Adblock plus - open adblock plus forums

arXiv preprint arXiv:2202.12872

Autofr: Automated filter rule generation for adblocking

H. Le, S. Elmalaki, A. Markopoulou, Z. Shafiq

Published: 2022

2024 IEEE Symposium on Security and Privacy (SP)

Sinbad: Saliency-informed detection of breakage caused by ad blocking

S. E. H. Chehade, S. Siby, C. Troncoso

Published: 2024

Vips: a vision-based page segmentation algorithm

D. Cai, S. Yu, J.-R. Wen, W.-Y. Ma

Published: 2003

2017 14th Conference on Computer and Robot Vision (CRV)

Towards an improved vision-based web page segmentation algorithm

M. Cormer, R. Mann, K. Moffatt, R. Cohen

Published: 2017

Block brightdata · issue #1580 · adguardteam/adguardsdnsfilter

Zenodo