Article
Open access
Published: 05 July 2024

tracerDB: a crowdsourced fluorescent tracer database for target engagement analysis

Nature Communications volume 15, Article number: 5646 (2024)

Abstract

Investigating ligand-protein complexes is essential in the areas of chemical biology and drug discovery. However, detailed information on key reagents such as fluorescent tracers and associated data for the development of widely used bioluminescence resonance energy transfer (BRET) assays including NanoBRET, time-resolved Förster resonance energy transfer (TR-FRET) and fluorescence polarization (FP) assays are not easily accessible to the research community. We created tracerDB, a curated database of validated tracers. This resource provides an open access knowledge base and a unified system for tracer and assay validation. The database is freely available at https://www.tracerdb.org/.

Introduction

Well-characterized, selective small molecules, known as “chemical probes”, are essential tools for target validation during drug development and in basic biological research¹. Criteria for small molecule modulators to qualify as chemical probes have been established by chemical biologists and are widely accepted in the community². These include target-related criteria for potency, selectivity, and proof of target engagement, in addition to the suitability of the chemical matter itself¹. With these quality criteria in place, chemical probes became important and generally recognized tools that aid the scientific community and accelerate drug discovery. Inspired by this approach, our goal is the standardization of quality criteria within the drug candidate evaluation process. A ligand-protein interaction can be evaluated by either direct or indirect measurements. Direct binding assays such as isothermal titration calorimetry (ITC) and surface plasmon resonance (SPR) are state-of-the-art methods to measure dissociation constants (K_D). ITC measurements are label-free, require no immobilization, and allow the stoichiometry and thermodynamic parameters of binding to be determined, whereas SPR enables the determination of binding rate constants such as k_on and k_off. However, these methods require either large amounts of protein (ITC) or immobilized purified protein (SPR); they are therefore excellent tools for final binding validation but not optimal for larger screening and in-cell campaigns. Indirect biochemical and cellular assays often rely on in-solution (or in-cell) displacement assays using fluorescence-labeled molecules called tracers (sometimes referred to as fluorescent probes, not to be confused with chemical probes themselves or medical radiotracers)³,⁴,⁵.
Tracers are composed of (1) a moiety that binds to the protein of interest (POI), such as a small molecule, DNA, RNA, or a peptide, (2) a chemical linker, and (3) a reporter label, typically a fluorescent dye⁶,⁷. To keep the linker from interfering with binding to the POI, it is important to choose the right exit vector, that is, a solvent-exposed point on the molecule at which the linker is attached (Fig. 1a).

**Fig. 1: Composition of a tracer (T000001)²² and the principle underlying the tracerDB.**

Tracers are used in cellular (in cellulo) target engagement assays such as time-resolved Förster resonance energy transfer (TR-FRET)⁶ or bioluminescence resonance energy transfer (BRET)⁴ assays, and in biochemical in vitro studies, which can be based on BRET, TR-FRET or fluorescence polarization (FP)⁸. In particular, NanoBRET, a method frequently applied in kinase live-cell target engagement assays, critically relies on the use of suitable tracer molecules. This method validates the binding of a small molecule such as an inhibitor to its cognate target in the cell. It is also suitable for assessing cellular selectivity with a single tracer⁹. Owing to the stringent distance and orientation constraints of the BRET donor, tracers do not have to be specific for the protein of interest. Promiscuous BRET tracers are in fact ideal because they can survey multiple targets. Using this principle, we have enabled 206 validated kinase interactions (as of February 2024) with the single tracer K10 (T000008).

Results

Because quantifying protein-ligand interactions is so important, a large number of tracers have been reported in the literature. However, scientists face several problems when establishing displacement assays for their respective targets: (1) finding established tracers in the literature using search engines is difficult, as much of the required information is buried in the Supplementary Methods; (2) reproducibility of the reported assays is often problematic due to insufficient validation of the tracer or unfavorable assay parameters; (3) the availability of the tracer is often unknown. To address these problems, we created a database for fluorescent tracer molecules named tracerDB. It has been developed and standardized to provide design and application guidance based on strict performance criteria. For each tracer-based assay, the chemical structure or commercial availability is provided, as well as the assay parameters and a reference. tracerDB can be searched by protein of interest or by tracer, enabling fast assessment of the available assay options for a specific target. Within the first 6 months (as of April 2024), 42 tracers targeting 318 different proteins in 476 experimentally validated assays were reviewed and uploaded.

Scientists worldwide can submit their tracer data for review and inclusion in the database. A submission must contain all information (no physical molecules) required to judge the quality and reproducibility of a tracer-based assay. First, general information about the molecule (e.g. the simplified molecular-input line-entry system (SMILES) specification, fluorophore characteristics, storage conditions and trivial name) is required for the creation of a tracer page (Fig. 1b). In some special cases, tracer structures cannot be disclosed. In these cases, the availability of the tracer must be guaranteed so that all reported assays remain accessible, which would otherwise be ensured by the disclosed chemical structure. Every submitted structure is checked for structural features associated with “pan assay interference compounds” (PAINS)¹⁰, which are reported together with the tracer information. Since the filters applied for the detection of PAINS also detect fluorescent substructures, it is recommended to inspect the highlighted moieties of flagged tracers by opening the PAINS report within the tracer description panel. All target proteins bound by the tracer have to be listed in UniProt¹¹. Experimental data for the tracer titration and compound displacement are part of the validation process and must be uploaded together with information on a recommended concentration, the Z’ value of the assay, and the assay window observed. Here, the assay window describes the fold-change between signal (tracer bound) and noise (tracer only) at the recommended tracer concentration (https://www.tracerdb.org/about/). The experimental data are available for download by the user. To facilitate the upload and review process, data can be submitted via the submission page (https://www.tracerdb.org/submission). On this page, all information can be entered, with the essential items written in bold.
A submission cannot be completed until all required information has been provided. Once all required information has been entered, the data are automatically sent to info@tracerdb.org for final approval and upload, so that no user login is needed for submission. Additionally, tracer IDs can be assigned prior to publication, allowing a direct link to the database (in analogy to the PDB). In contrast to an automatically generated data repository, the submission of tracer data is followed by a review process, which makes tracerDB a reviewed and curated database. Thus, every entry has been examined for its agreement with the database’s quality criteria, ensuring consistent quality control across entries.
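The two quality metrics requested at submission can be illustrated with a minimal Python sketch. It assumes the commonly used definition of Z’ (one minus three times the summed standard deviations of the two control groups, divided by the absolute difference of their means) and takes the assay window as the ratio of mean signal (tracer bound) to mean noise (tracer only), as described above. The function names, the use of the sample standard deviation and the replicate readings are illustrative assumptions and are not taken from the tracerDB code base; the definitions actually applied are those given on the “about” page.

```python
from statistics import mean, stdev


def z_prime(signal, background):
    """Z' = 1 - 3 * (sd_signal + sd_background) / |mean_signal - mean_background|.

    Uses the sample standard deviation (an assumption of this sketch).
    """
    separation = abs(mean(signal) - mean(background))
    if separation == 0:
        raise ValueError("signal and background means are identical")
    return 1.0 - 3.0 * (stdev(signal) + stdev(background)) / separation


def assay_window(signal, background):
    """Fold-change between mean signal (tracer bound) and mean noise (tracer only)."""
    return mean(signal) / mean(background)


if __name__ == "__main__":
    # Hypothetical replicate readings at the recommended tracer concentration
    bound = [30.0, 31.0, 29.0, 30.0]
    tracer_only = [10.0, 10.5, 9.5, 10.0]
    print("Z' =", round(z_prime(bound, tracer_only), 3))
    print("assay window =", round(assay_window(bound, tracer_only), 2))
```

A larger assay window and a Z’ value closer to 1 both indicate better separation between the bound and unbound states relative to the replicate scatter.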

The interaction network between tracer molecules and their respective targets can be modeled as a many-to-many relationship in which many tracers can bind a single protein and a single tracer can bind many proteins. As a result, the underlying database structure consists of three entity sets: the tracer, the protein, and their interaction (Fig. 1b). To ensure user-friendly data submission and a standardized presentation, all molecular representations and calculations are created and executed on the server side. We chose Django¹² as a Python-based web framework together with a MySQL database to enable high-frequency read operations.
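The three entity sets can be sketched as follows. This is not the tracerDB schema: the production system uses Django models on MySQL, whereas the sketch uses Python’s built-in sqlite3 module so that it runs without any setup. All table names, column names and identifiers (`T-A`, `P-1`, and so on) are hypothetical placeholders.

```python
import sqlite3

# Hypothetical schema: two entity tables plus a junction table that carries
# the many-to-many relationship and per-assay attributes.
SCHEMA = """
CREATE TABLE tracer (
    tracer_id TEXT PRIMARY KEY,
    name      TEXT NOT NULL
);
CREATE TABLE protein (
    uniprot_id TEXT PRIMARY KEY,
    name       TEXT NOT NULL
);
CREATE TABLE interaction (
    tracer_id  TEXT NOT NULL REFERENCES tracer(tracer_id),
    uniprot_id TEXT NOT NULL REFERENCES protein(uniprot_id),
    assay_type TEXT NOT NULL,
    PRIMARY KEY (tracer_id, uniprot_id, assay_type)
);
"""


def build_demo_db():
    """Create an in-memory database filled with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO tracer VALUES (?, ?)",
                     [("T-A", "tracer A"), ("T-B", "tracer B")])
    conn.executemany("INSERT INTO protein VALUES (?, ?)",
                     [("P-1", "protein 1"), ("P-2", "protein 2")])
    conn.executemany("INSERT INTO interaction VALUES (?, ?, ?)",
                     [("T-A", "P-1", "NanoBRET"),
                      ("T-A", "P-2", "NanoBRET"),
                      ("T-B", "P-1", "FP")])
    return conn


def tracers_for_protein(conn, uniprot_id):
    """All tracers with at least one registered assay for the given protein."""
    rows = conn.execute(
        "SELECT DISTINCT tracer_id FROM interaction "
        "WHERE uniprot_id = ? ORDER BY tracer_id", (uniprot_id,))
    return [row[0] for row in rows]


def proteins_for_tracer(conn, tracer_id):
    """All proteins with at least one registered assay for the given tracer."""
    rows = conn.execute(
        "SELECT DISTINCT uniprot_id FROM interaction "
        "WHERE tracer_id = ? ORDER BY uniprot_id", (tracer_id,))
    return [row[0] for row in rows]


if __name__ == "__main__":
    db = build_demo_db()
    print(tracers_for_protein(db, "P-1"))
    print(proteins_for_tracer(db, "T-A"))
```

Keeping the interaction in its own table lets one tracer-protein pair carry several assays, each with its own parameters, and lets the database be queried from either side of the relationship.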

In addition to the information on the crowdsourced tracers, we have also included general information on tracer molecules and illustrations of different assay systems on the “about” page (https://www.tracerdb.org/about/). Here, we describe the quality control criteria and how to calculate the respective values. In order to further increase the reproducibility of the described assays, each assay is classified according to its parameters into robust, expert and unsuitable assays with exemplary data for clarification (Fig. 2). These assay levels are represented by a traffic light icon for each registered assay. In addition, we have included a methods section describing the different assays used to collect the submitted data (https://www.tracerdb.org/methods/). This section is supported by an illustration and key references.

**Fig. 2: Data input and processing carried out by the webserver.**

Discussion

Tracer-based assays are developed and implemented by many laboratories around the world. For every assay, proper validation and standardization are crucial to ensure assay quality. To support the reproducibility of established assays across different laboratories, tracerDB helps to standardize assays by providing a curated and constantly growing set of experimentally validated tracers with recommended concentrations. Additionally, the tracerDB “about” page (https://www.tracerdb.org/about/) summarizes the most important quality control information to ensure the generation of high-quality data. Further information, such as validated exit vectors for the development of other bifunctional compounds or indications of suitable protein fusion termini, can also be extracted from the database.

tracerDB is therefore a resource for drug-screening scientists and the chemical biology community that gathers detailed, reviewed and high-quality information on tracer-based assays and their applications.

Methods

Architecture of the database

RDKit¹³, a commonly used cheminformatics package for Python, is employed to render SMILES strings as two-dimensional molecular representations; its substructure search is used to detect and depict PAINS elements. The average molecular weight and the estimated logP value of the compound- and peptide-based tracers are calculated using RDKit’s built-in molecular descriptor methods. To avoid having to deal with the complex SMILES of large peptide tracers, the pyPept package¹⁴ has been incorporated into this project to allow for flexible declaration of custom amino acids, i.e. fluorophore peptide labels. These artificial building blocks are then included in the string representation of the peptides and stored in the database as BILN¹⁵. For the interactive depiction of the three-dimensional structure of protein-based tracers, the NGL viewer was incorporated¹⁶,¹⁷. To ensure consistency in the depiction and analysis of experimental data uploaded to the webserver, fitting and plotting are executed on the server side. The experimental titration data are plotted via Matplotlib¹⁸ and the fitting is conducted through SciPy¹⁹ using non-linear least squares optimization. It is assumed that the data from concentration response experiments exhibit a sigmoidal shape. Hence, the following logistic equation is employed to fit the data:

$$f(x)=\frac{a}{1+e^{-b\left(x-\mathrm{XC}_{50}\right)}}+c \tag{1}$$

The response of the measurement is a function of the logarithmic concentration $x$; the additional parameters $a$, $b$ and $c$ scale and shift $f$, because the input is not normalized. $\mathrm{XC}_{50}$ is the log concentration halfway between the plateaus of the sigmoidal curve. Depending on the experimental context, this parameter may be interpreted as $\mathrm{EC}_{50}$ or $\mathrm{IC}_{50}$. Protein titrations performed during the development of fluorescence polarization assays are commonly plotted as signal in millipolarization units versus the molar concentration. These saturation curves are fitted using the following hyperbolic model:

$$f(x)=\frac{B_{\max}\,x}{K_D+x}+cx+d \tag{2}$$

where $B_{\max}$ denotes the extrapolated maximum specific binding to the protein for high ligand concentrations. $K_D$ is the equilibrium dissociation constant, which specifies the concentration $x$ required for half-maximum binding at equilibrium. The parameter $c$ accounts for the ratio of nonspecific binding to total binding and $d$ corrects for background signals²⁰.
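Both models can be fitted with SciPy’s non-linear least squares routine, as in the following minimal sketch. It uses `scipy.optimize.curve_fit` with data-derived starting values. The function names, the starting-value heuristics and the `maxfev` setting are assumptions made for illustration and do not reproduce the server-side implementation.

```python
import numpy as np
from scipy.optimize import curve_fit


def logistic(x, a, b, xc50, c):
    """Eq. 1: x is the log concentration; a scales, c shifts, b sets the slope."""
    return a / (1.0 + np.exp(-b * (x - xc50))) + c


def saturation(x, bmax, kd, c, d):
    """Eq. 2: hyperbolic binding plus a linear term c*x and a background d."""
    return bmax * x / (kd + x) + c * x + d


def fit_logistic(log_conc, response):
    """Fit Eq. 1; starting values are simple heuristics (an assumption)."""
    log_conc = np.asarray(log_conc, dtype=float)
    response = np.asarray(response, dtype=float)
    p0 = [response.max() - response.min(),  # amplitude a
          1.0,                              # slope b
          float(np.median(log_conc)),       # midpoint XC50
          response.min()]                   # offset c
    popt, _ = curve_fit(logistic, log_conc, response, p0=p0, maxfev=10000)
    return dict(zip(("a", "b", "xc50", "c"), map(float, popt)))


def fit_saturation(conc, signal):
    """Fit Eq. 2 to a protein titration given as molar concentration vs signal."""
    conc = np.asarray(conc, dtype=float)
    signal = np.asarray(signal, dtype=float)
    p0 = [signal.max() - signal.min(),      # Bmax
          float(np.median(conc)),           # K_D
          0.0,                              # linear term c
          signal.min()]                     # background d
    popt, _ = curve_fit(saturation, conc, signal, p0=p0, maxfev=10000)
    return dict(zip(("bmax", "kd", "c", "d"), map(float, popt)))


if __name__ == "__main__":
    # Synthetic, noise-free example data generated from the models themselves
    x = np.linspace(-10.0, -4.0, 13)
    print(fit_logistic(x, logistic(x, 100.0, 2.0, -7.0, 10.0)))
    conc = np.array([1, 2, 5, 10, 20, 50, 100, 200, 500, 1000, 2000], float)
    print(fit_saturation(conc, saturation(conc, 200.0, 50.0, 0.05, 30.0)))
```

Because Eq. 1 is unchanged when the signs of $a$ and $b$ are flipped and $c$ is adjusted accordingly, a decreasing displacement curve may be returned with either a negative $a$ or a negative $b$; the fitted $\mathrm{XC}_{50}$ is unaffected by this choice.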

Protein information is automatically retrieved through the UniProt REST API, enabling the search for alternative protein and gene names. The retrieved XML files are processed using Biopython’s UniProt parser²¹, resulting in standardized and well-annotated protein entries, ultimately leading to more robust search functionality.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data are available for download from the database (https://www.tracerdb.org/).

References

1. Hartung, I. V., Rudolph, J., Mader, M. M., Mulder, M. P. C. & Workman, P. Expanding chemical probe space: quality criteria for covalent and degrader probes. J. Med. Chem. 66, 9297–9312 (2023).
2. Muller, S. et al. Target 2035—update on the quest for a probe for every protein. RSC Med. Chem. 13, 13–21 (2022).
3. Schwalm, M. P. et al. Tracking the PROTAC degradation pathway in living cells highlights the importance of ternary complex measurement for PROTAC optimization. Cell Chem. Biol. https://doi.org/10.1016/j.chembiol.2023.06.002 (2023).
4. Robers, M. B. et al. Target engagement and drug residence time can be observed in living cells with BRET. Nat. Commun. 6, 10091 (2015).
5. Cho, E. J. & Dalby, K. N. Luminescence energy transfer-based screening and target engagement approaches for chemical biology and drug discovery. SLAS Discov. 26, 984–994 (2021).
6. Payne, N. C., Kalyakina, A. S., Singh, K., Tye, M. A. & Mazitschek, R. Bright and stable luminescent probes for target engagement profiling in live cells. Nat. Chem. Biol. 17, 1168–1177 (2021).
7. Blazer, L. L. et al. A suite of biochemical assays for screening RNA methyltransferase BCDIN3D. SLAS Discov. 22, 32–39 (2017).
8. Schwalm, M. P. et al. Targeting LC3/GABARAP for degrader development and autophagy modulation. Preprint at bioRxiv https://doi.org/10.1101/2023.10.05.560930 (2023).
9. Robers, M. B. et al. Single tracer-based protocol for broad-spectrum kinase profiling in live cells with NanoBRET. STAR Protoc. 2, 100822 (2021).
10. Baell, J. & Walters, M. A. Chemistry: chemical con artists foil drug discovery. Nature 513, 481–483 (2014).
11. The UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
12. Django (Version 4.2). https://www.djangoproject.com/ (2023).
13. Landrum, G. RDKit: open-source cheminformatics. https://www.rdkit.org (2023).
14. Ochoa, R., Brown, J. B. & Fox, T. pyPept: a python library to generate atomistic 2D and 3D representations of peptides. J. Cheminform. 15, 79 (2023).
15. Fox, T. et al. BILN: a human-readable line notation for complex peptides. J. Chem. Inf. Model 62, 3942–3947 (2022).
16. Rose, A. S. et al. NGL viewer: web-based molecular graphics for large complexes. Bioinformatics 34, 3755–3758 (2018).
17. Rose, A. S. & Hildebrand, P. W. NGL Viewer: a web application for molecular visualization. Nucleic Acids Res. 43, W576–W579 (2015).
18. Hunter, J. D. Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
19. Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
20. Motulsky, H. & Christopoulos, A. Fitting Models to Biological Data Using Linear and Nonlinear Regression: A Practical Guide to Curve Fitting (Oxford University Press, 2004).
21. Cock, P. J. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).
22. Schwalm, M. P. et al. A toolbox for the generation of chemical probes for baculovirus IAP repeat containing proteins. Front. Cell Dev. Biol. 10, 886537 (2022).

Acknowledgements

The authors are thankful for all current and future tracer submissions from diverse laboratories, especially the extensive submissions of the Arrowsmith and Mazitschek Labs. M.P.S., J.D., S.M. and S.K. are grateful for support by the Structural Genomics Consortium (SGC), a registered charity (no: 1097737) that receives funds from Bayer AG, Boehringer Ingelheim, Bristol Myers Squibb, Genentech, Genome Canada through Ontario Genomics Institute, EU/EFPIA/OICR/McGill/KTH/Diamond Innovative Medicines Initiative 2 Joint Undertaking [EUbOPEN grant 875510], Janssen, Merck KGaA, Pfizer and Takeda, and by the German Cancer Research Center DKTK, and the Frankfurt Cancer Institute (FCI). M.P.S. is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), CRC1430 (Project-ID 424228829). J.D.V. and M.B.R. are employees of Promega Corp.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Pharmaceutical Chemistry, Goethe University Frankfurt, Max-von-Laue-Str. 9, 60438, Frankfurt am Main, Germany
Johannes Dopfer, Susanne Müller, Stefan Knapp & Martin P. Schwalm
Structural Genomics Consortium, Goethe University Frankfurt, Buchmann Institute for Life Sciences, Max-von-Laue-Str. 15, 60438, Frankfurt am Main, Germany
Johannes Dopfer, Susanne Müller, Stefan Knapp & Martin P. Schwalm
Promega Corporation, Madison, WI, USA
James D. Vasta & Matthew B. Robers
German Cancer Consortium (DKTK)/German Cancer Research Center (DKFZ), DKTK Site Frankfurt-Mainz, 69120, Heidelberg, Germany
Stefan Knapp & Martin P. Schwalm

Contributions

J.D.: website development, data processing and manuscript editing. J.D.V., S.M., S.K. and M.B.R.: manuscript editing. M.P.S.: conceptualization, data processing, website editing, manuscript preparation and editing.

Corresponding author

Correspondence to Martin P. Schwalm.

Ethics declarations

Competing interests

J.D.V. and M.B.R. are employees of Promega. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dopfer, J., Vasta, J.D., Müller, S. et al. tracerDB: a crowdsourced fluorescent tracer database for target engagement analysis. Nat Commun 15, 5646 (2024). https://doi.org/10.1038/s41467-024-49896-5

Received: 18 February 2024
Accepted: 20 June 2024
Published: 05 July 2024
DOI: https://doi.org/10.1038/s41467-024-49896-5
