Improving added value of Topics #319

Open
remysaissy opened this issue Jul 3, 2024 · 1 comment

Comments

@remysaissy
Contributor

Teads believes that the most important property of a signal is how rare it is.
Why? Because topics link browsing history with context in a privacy-compliant way.
This property can be used when joining topics on the server side to build contextual clusters without third-party cookies.

To achieve this, the current design should evolve to attach a weight value to each observed topic. That weight is the relative weight of a given topic for a browser compared to all browsers using the topic.

Also, all browsers can be considered within two scopes:

  • All browsers using Topics API
  • All browsers having these topics observed for a given Ad-Tech

For the latter case, a TEE service might enable an Ad-Tech to bear the cost of that analysis in a privacy-compliant way.

By analogy, this weighting works like TF-IDF.
TF-IDF weighs how often a word occurs in a document against how common that word is across all documents.
In Teads' case, words are topics and documents are browsers.
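
To make the analogy concrete, here is a minimal Python sketch of a TF-IDF style weight with topics as words and browsers as documents. It is only an illustration, not a proposed API: the function name `topic_weights`, the browser keys, and the input shape (a mapping from an opaque browser key to the list of topics observed for it) are all assumptions, and the Topics API exposes nothing like this today. The population passed in can be either scope listed above.

```python
import math
from collections import Counter


def topic_weights(observed: dict[str, list[str]]) -> dict[str, dict[str, float]]:
    """Hypothetical TF-IDF style weighting: topics are words, browsers are documents.

    `observed` maps an opaque browser key (an assumption for illustration)
    to the list of topics observed for that browser, repeats included.
    """
    n_browsers = len(observed)

    # "Document frequency": in how many browsers does each topic appear?
    browsers_with_topic: Counter[str] = Counter()
    for topics in observed.values():
        browsers_with_topic.update(set(topics))

    weights: dict[str, dict[str, float]] = {}
    for browser, topics in observed.items():
        counts = Counter(topics)
        total = sum(counts.values())
        weights[browser] = {
            # Term frequency (share of this browser's observations)
            # times inverse frequency (rarity across the population).
            topic: (count / total) * math.log(n_browsers / browsers_with_topic[topic])
            for topic, count in counts.items()
        }
    return weights


sample = {
    "browser_a": ["News", "Sports"],
    "browser_b": ["News"],
    "browser_c": ["News", "Cooking", "Cooking"],
}
weights = topic_weights(sample)
```

In this sample, "News" appears in every browser, so its weight is zero everywhere, while the rarer "Sports" and "Cooking" get positive weights.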

@jkarlin
Collaborator

jkarlin commented Jul 3, 2024

This is largely a duplicate of #42, where we discussed inverse frequency. In the end, we did model inverse frequency and found that it did not correlate terribly well with topic value, according to values provided by a few sellers and buyers.
