This contains information, plans, and announcements relating to the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) series of blind predictive challenges. This NIH-funded series of challenges tests computational models on predictions of properties related to obstacles faced in a drug discovery setting. Use our sub-pages and links to access more information.

Current challenges

Current and upcoming SAMPL challenges

Learn More


A big picture view of upcoming SAMPL challenges and our long-term roadmap

Learn More


History of the SAMPL challenges and links to prior challenges and data

Learn More

Currently, we are running a SAMPL8 physical properties challenge on logD and pKa with data from GSK (dataset collection details here), with deadlines in Summer 2021. For exact deadlines please refer to the relevant GitHub sites. Stay tuned for further announcements relating to the SAMPL8 phase of challenges. See our roadmap for what we can announce so far.

To receive SAMPL e-mail announcements, please sign up for the SAMPL e-mail list. If interested in a particular challenge, be sure to select the relevant tags (e.g. “SAMPL7”, “SAMPL8”) on signup, and likewise for particular topic areas (pKa, logP etc.)

Our SAMPL7 host-guest challenge virtual workshop is up online. And check our the SAMPL Zenodo community for additional SAMPL-related content, such as talks from the GCC/EuroSAMPL workshop in Fall 2020. The SAMPL7 physical properties special issue is now also progressing through the review process.

Data donation: If you have, or can generate, high quality datasets which might be suitable for a future SAMPL challenge, please contact us, or see our Roadmap page for additional information on types of data which may be of particular interest.

Positions: We are currently hiring a software scientist to assist in creating a branch of SAMPL which involves submission of containerized methods rather than predictions. If you are interested, please apply via this Google form or contact David Mobley.

Acknowledging and citing SAMPL

If you’ve benefitted from our work on the SAMPL series of challenges, please be sure to acknowledge our SAMPL NIH grant in any publications/presentations. This funded host-guest experiments, as well as our work organizing and administrating these challenges. You may acknowledge SAMPL by saying something like, “We appreciate the National Institutes of Health for its support of the SAMPL project via R01GM124270 to David L. Mobley (UC Irvine).”

We also ask you to cite the SAMPL dataset(s) you used. These are versioned on Zenodo, and a link will be available from the SAMPL repository for the challenge in which you participated.

Of course, we also appreciate it if you cite any overview/experimental papers relevant to the particular SAMPL challenge you participated in.