This project is in early alpha. It is an experimental proof-of-concept and should not be relied upon for personal safety, legal evidence, or any security-critical purpose. Expect bugs, breaking changes, and data loss.

WhatIsTracking.Me

Many have this question, but there are no definitive answers. In the age of no-consent scraping, we surrender all our digital data. What about our offline interactions in daily life?

How It Works

1

Go Outside

Live your normal daily life.

2

Wear a QR Code

Display a unique QR code from our app on a badge, sticker, device, or paper.

3

Get Captured

Someone takes a picture or video of you, knowingly or not.

4

Get Notified

Receive a notification when someone or something interacts with the QR code, even weeks later.

5

See the Map

View where you got captured and interaction data on an interactive map.

You leave your home, go outside, other people or companies take pictures of you without your consent or sometimes even knowledge. This is normalized in our current society.

Previously these pictures were never exposed to anyone, except if the person consciously shared them on social media or in person.

This has changed.

To keep the AI train running and to sell more personalized ads, companies chose to process and automatically upload the pictures and videos you gladly offered to them for a “free” service. In most cases they will aggressively coerce you to sign a hundred-page terms of service, where you surrender your rights. If you don’t agree, you can’t use their service and everyone just agrees as they don’t see a choice.

A prominent example is Meta.

If you see a “cloud processing” pop-up on Facebook and tap Allow, you’re agreeing to Meta’s AI Terms of Service and permitting your “media and facial features” to be analyzed by Meta’s AI. Once enabled, Facebook continuously uploads photos from your camera roll to Meta’s cloud to surface “hidden gems” and suggest creative edits or collages. source

But it does not end with a single company. To make things worse, regulators in several countries consider and plan requirements to certain companies to scan pictures on your phone for illegal content, regardless of their business terms of service. This opened the doors of acceptance for on-device scanning and processing of your personal data.

What Now?

This project aims to help you and us understand where, when, what, and maybe who is digitally tracking you in real life, even if you don’t own smart devices.

We approach this in a technical way, but cannot succeed without collaboration with the community. Therefore we are dependent on people using our project and donating their (anonymized) data as a conscious decision. Help us find answers, or at least get closer. Maybe it will also be useful in a regulatory debate to make sure we retain or gain the privilege of consent in real life, which we lost in digital life.

The Pipeline

1
Client Device

QR Generated

Encodes an encrypted timestamp and unique identifier

2
Real World

Bait Taken

CCTV cameras, bots, scanners, and browsers all trigger the trap

CCTV
Database
AI
Website
3
Tracking Server

Logged & Purged

DNS and HTTP requests stored pseudoanonymously, deleted once read

DNS
+
HTTP
Store
Request
Purge
4
Client Dashboard

Location Matching

Client aligns the decrypted timestamp with local position history

Location
+
Timestamp
=
Unmasking

Technical Breakdown

It starts with the fact that we have deployed several servers that will track any interaction with them. This means any DNS, HTTP, or HTTPS request aimed at these will be logged, analyzed, and processed. The technology used is known from something called Out-Of-Band testing (OOB), which is part of the workflow of security engineers, consultants and researchers. If you want to learn more about this part of the stack we would like to point you to the ProjectDiscovery organization that created the interactsh project.

We built on top of this and extended it with a processing and matching service, a user-facing API that allows you to register user identifiers and subscribe to events that are associated with unique identifier subdomains, a database that keeps the interactions until the user downloads them, and a user-facing client application. We have all of this code open source and you can learn more about technical details in our repository.

You register yourself on app.staging.whatistracking.me without providing any information about yourself. You will get a bunch of public identifiers that will be used to create unique subdomains that allow us to associate interactions with you. Let us assume one of them is totallyrandomidentifier.

Visiting https://totallyrandomidentifier.tracking-collector.com will create at minimum two interaction events. The first one is a DNS query to figure out where this (sub)domain is located and the second one is loading the content served on the webserver on this address via HTTP(S).

You can now create a QR code to this domain and print it, wear it and get notified whenever there is some interaction. But this won’t tell you where and when, just that it happened. To get this deeper insight, you need a more advanced solution.

The app will create a QR code with the link to the above domain but also append an encrypted timestamp that will be updated on an interval basis. This will regenerate a new QR code for each interval. The app also optionally tracks your location (fully local!) and will be able to match these timestamps with your location if some interaction happens even weeks or months after you went outside.

We also created an open-source physical device based on an e-paper toolkit from soldered. This is due to the physical limitations of a smartphone display or a badge/sticker printout. This allows us to display the QR code on a 10” e-paper display that is easier to scan from a distance. The display choice also allows us to wear this device without additional equipment for weeks and months without needing to charge. You can find the full implementation in the /Inkplate subfolder in the repository.

With this location, timestamp encoded in the subdomain, and the metadata from the interaction itself, you will be able to understand when, where, and likely what tracked you. The app has a map feature that shows you interactions mapped to locations to get a more visual picture.

Common Questions

Will this expose all of the trackers?

No. It will only show part of the surface.

Why does this even work?

Because companies are data hungry. They want to scrape any new data that they haven't seen before. They will decode the QR code and scrape the content to train their next-generation AI or ad profiling tool. This can happen instantly or delayed even after weeks or months. Our service will be able to notify you as soon as we are able to track an interaction.

Does participation hurt me in any way?

Yes, you will stand out from the crowd until everyone adopts it. The QR code will make you easier to track with our current technological capabilities.

Why should I donate my data?

The donated data will be used for the benefit of everyone and will allow our collective society to prove that regulations and changes are needed. You are not obligated to donate the interaction data and your anonymized local data.

Why did you create this?

This was born out of curiosity during security testing of AI infrastructure.

How can I collaborate?

Use our app in the wild and donate data or become involved in the community and development of the project. We accept PRs and welcome new members.

What about AI usage?

We do not reject AI and use it in our daily work. The code surrounding this project was also mostly vibe coded and only a few parts are completely handwritten. We still think that manual review and proper understanding is important and try to limit the code and features we maintain to a reasonable degree, where we can still handle the project without any AI usage.

Is it technically possible for you to see my data, even if I don't donate?

We are not able to see your location or timestamps at any time. We are technically able to see the raw interaction data and the metadata you create when interacting with our service. Nevertheless, we chose to not use them without explicit consent and we are open to implement effective technical measures to improve this.

Can I self-host this?

We are not able to see your location or timestamps at any time. We are technically able to see the raw interaction data and the metadata you create when interacting with our service. Nevertheless, we chose to not use them without explicit consent and we are open to implement effective technical measures to improve this.