Physics-Informed Machine Learning Enables Rapid Macroscopic pK_a Prediction

Corin C. Wagen

Apr 25, 2025

Accurate prediction of macroscopic pK_a values remains a central challenge in computational chemistry, critical for modeling pH-dependent properties like solubility, membrane permeability, and charge state. Here we introduce Starling, a physics-informed neural network based on the Uni-pKa architecture trained to predict per-microstate free energies and compute macroscopic pK_a values via thermodynamic ensemble modeling. Unlike approaches that treat protonation events in isolation, Starling explicitly resolves protonation and tautomeric microstates, enabling robust handling of complex molecules with multiple ionizable sites. We show that Starling achieves comparable or superior accuracy to leading commercial tools on multiple benchmark datasets, and demonstrate its utility in predicting isoelectric points, logD profiles, and blood–brain-barrier permeability. By maintaining thermodynamic consistency and enabling rapid microstate-ensemble generation, Starling enables accurate physicochemical property prediction with broad relevance to drug discovery and molecular design.

This preprint can also be viewed on ChemRxiv.

Introduction

The acid-dissociation constant (pK_a) is a fundamental descriptor of molecular behavior in solution: it governs membrane permeability, solubility, charge distribution, and available reaction mechanisms. Accurate, rapid prediction of pK_a values is critical to lead-optimization campaigns, physicochemical property modeling, and pH-dependent biophysical simulations.^1,21. Di, L.; Kerns, E. H. Drug-Like Properties: Concepts, Structure Design and Methods from ADME to Toxicity Optimization, 2nd ed.; Academic Press: Boston, 2016.
2. Navo, C. D.; Jiménez-Osés, G. Computer Prediction of pKa Values in Small Molecules and Proteins. ACS Med. Chem. Lett. 2021, 12, 1624–1628.

There are two complementary approaches to in silico pK_a prediction. Microscopic models target single protonation events at specific atoms, allowing local rationalization of substituent effects, while macroscopic models describe the ensemble free-energy landscape spanning all protonation and tautomeric microstates (Figure 1).^3,43. Zheng, J. W.; Leito, I.; Green, W. H. Widespread Misinterpretation of pKa Terminology for Zwitterionic Compounds and Its Consequences. J. Chem. Inf. Model. 2024, 64, 8838–8847.
4. Işık, M.; Rustenburg, A. S.; Rizzi, A.; Gunner, M. R.; Mobley, D. L.; Chodera, J. D. Overview of the SAMPL6 pKa Challenge: Evaluating Small Molecule Microscopic and Macroscopic pKa Predictions. J. Comput.-Aided Mol. Des. 2021, 35, 131–166.
For properties governed by overall speciation at a given pH, macroscopic pK_a prediction is indispensable—but exhaustive enumeration and weighting of all relevant microstates leads to a combinatorial explosion.

Figure 1: Macroscopic and microscopic pK_a prediction strategies compared. Individual circles represent microstates.

Consequently, most open-source and commercial workflows employ the microscopic pK_a paradigm, which breaks down in molecules with multiple, coupled protonation sites. Microscopic pK_a methods often generate bizarre or invalid predictions in cases where multiple microstates are relevant, as highlighted by recent work from Jonathan Zheng, Ivo Leito, and William Green.³3. Zheng, J. W.; Leito, I.; Green, W. H. Widespread Misinterpretation of pKa Terminology for Zwitterionic Compounds and Its Consequences. J. Chem. Inf. Model. 2024, 64, 8838–8847.
In contrast, the recently reported Uni-pKa model from Weiliang Luo and co-workers satisfies thermodynamic constraints by learning per-microstate free-energy predictions that are combined into an overall macroscopic pK_a.⁵5. Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
Zheng and co-workers specifically highlight the thermodynamic validity of this model in their article (emphasis added):³3. Zheng, J. W.; Leito, I.; Green, W. H. Widespread Misinterpretation of pKa Terminology for Zwitterionic Compounds and Its Consequences. J. Chem. Inf. Model. 2024, 64, 8838–8847.

Uni-pKa, published in 2024, also leverages the ChEMBL dataset to pretrain a module that leverages 3D information and computed free energies of individual microstates to calculate the overall macroscopic pK_a. The model accounts for tautomerism, capturing the microscopic pK_a of both the uncharged and zwitterionic tautomers. To our knowledge, this is the only recently released ML model that correctly distinguishes between those microstates.

While the original Uni-pKa paper focused on matching experimental pK_a measurements on benchmark datasets, we hypothesized that the free energies generated by this physics-informed neural network might be gainfully employed in a variety of contexts. Here, we report Starling, a re-trained Uni-pKa model that retains state-of-the-art accuracy while accelerating inference relative to the original report. We integrate Starling into an end-to-end workflow that enumerates all potential microstates, predicts per-microstate free energies and populations, and uses these values to predict downstream properties like logD, isoelectric points, and unbound brain-to-plasma partition coefficient (K_p,uu).

Methods

Model Training

We retrained the Uni-pKa model of Luo and co-workers, introducing several modifications to improve inference speed and reduce training time. During the ChEMBL pretraining phase, we limited 3D conformer generation to three conformers per molecule (compared to ten in the original work), enabling dataset preprocessing to finish within several days. For fine-tuning on the Dwar-iBond dataset and test-time inference, the full ten-conformer set was used, as in the original Uni-pKa workflow. Training was conducted on a single NVIDIA H100 GPU via DigitalOcean.

For benchmarking, Luo and co-workers employed an ensemble of five models.⁵5. Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
In discussions on GitHub, Gengmo Zhou (one of the Uni-pKa authors) noted that "the five models are relatively large and not very convenient to use" and recommended using only a single fine-tuned model for inference.⁶6. Pretrained Models For Inference? GitHub issue comment, https://github.com/dptech-corp/Uni-pKa/issues/3#issuecomment-2604100717 (accessed 2025-04-18).
Following this advice, we employed only a single fine-tuned model for inference. The resulting model retained the same architecture and input format as Uni-pKa and is compatible with the original inference pipeline.

To enable subsequent comparison to the original Uni-pKa work (and disambiguate comments about the model from comments about the Uni-pKa architecture), the retrained Uni-pKa model will hereafter be referred to as "Starling" (Figure 2). Starlings are extremely social birds which often congregate in large flocks and engage in emergent behavior called ``murmuration," similar to how macroscopic pK_a values emerge naturally from large numbers of microstates.

Figure 2: A common starling (Sturnus vulgaris), also known as a European starling in the United States.⁷7. Hsu, F. Common Starling at Half Moon Bay. https://en.wikipedia.org/wiki/Common_starling, 2018.

Microstate Enumeration

We employed a beam-search strategy to enumerate protonation microstates within a formal charge window, by default [ $-2$ , $+2$ ]. The algorithm proceeds as follows:

Initial state. The input molecule is sanitized with RDKit.⁸8. RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
Its canonical SMILES string is cached as a unique identifier, and its formal charge defines the starting beam.
Template-driven generation. At every iteration we generate new microstates via formal charge edits using an RDKit-based substructure matching protocol.⁸8. RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
Atom-wise transformations were applied based on SMARTS patterns (adapted from the Uni-pKa data processing pipeline) and sanitized using RDKit's built-in sanitization routines.⁸8. RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
For each molecule, we applied both protonation and deprotonation transformations, iteratively exploring new states until all reachable microstates in the allowed charge range were visited. Chemically unreasonable structures (e.g. pentavalent carbon) were filtered out using a list of hard-coded substructure patterns (also from the Uni-pKa data processing pipeline).
Scoring and pruning. Newly generated candidates are scored using AIMNet2 as an energy estimator.⁹9. Anstine, D.; Zubatyuk, R.; Isayev, O. AIMNet2: A Neural Network Potential to Meet Your Neutral, Charged, Organic, and Elemental-Organic Needs. ChemRxiv, 2024.
For each formal charge $q$ , we merge the old and new states, then retain the $N$ microstates with the lowest scores.
Convergence and termination. Previously visited microstates are tracked by canonical SMILES, which prevents revisiting identical microstates. The search terminates when (i) no new states are generated, (ii) all beams become stationary, or (iii) a maximum of ten iterations is reached.

Because the beam retains only the most promising microstates, the method scales roughly linearly with molecular size while still recovering all experimentally relevant tautomers and protonation states in our validation set. The trade-off is that exceedingly high-energy or topologically distant microstates may be omitted; we judged these to be irrelevant for p $K_\mathrm{a}$ prediction. A beam width of $N=20$ was found to best balance completeness and runtime; we observed c. 5x increase in overall speed for a model 22-atom molecule (SMILES NC(C(=O)O)C1OC(n2cc(CO)c(O)nc2=O)C(O)C1O, 523 microstates), while benchmark performance (vide infra) was virtually unaffected.

We note that this approach does not account for the entropy of symmetry of individual microstates (e.g. monoprotonated diamines). Future work could apply an entropic correction to the free energy of each microstate based on its symmetry number, by analogy to conventional best practices in quantum chemistry.¹⁰10. Plata, R. E.; Singleton, D. A. A Case Study of the Mechanism of Alcohol-Mediated Morita Baylis–Hillman Reactions. The Importance of Experimental Observations. J. Am. Chem. Soc. 2015, 137, 3811–3826.
However, the low symmetry of most druglike molecules means that this inaccuracy will not impact most practical applications.

Free Energy Prediction

We used the Starling model to predict dimensionless free energies for each microstate. Conformers were generated via ETKDG^11,1211. Riniker, S.; Landrum, G. A. Better Informed Distance Geometry: Using What We Know To Improve Conformation Generation. J. Chem. Inf. Model. 2015, 55, 2562–2574.
12. Wang, S. et al. Improving Conformer Generation for Small Rings and Macrocycles Based on Distance Geometry and Experimental Torsional-Angle Preferences. J. Chem. Inf. Model. 2020, 60, 2044–2058.
and MMFF94¹³13. Halgren, T. A. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 1996, 17, 490–519.
optimization as implemented in RDKit.⁸8. RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
For each conformer, a feature representation was constructed using atom and formal charge tokenization, spatial distance matrices, and edge-type encoding, as required by the Uni-pKa architecture.

Predictions for each conformer were aggregated using a log-sum-exponential (LSE) procedure to yield a Boltzmann-averaged microstate energy:

$E_\text{micro} = -\log \sum_i \exp(-E_i)$

where $E_i$ is the predicted energy of conformer $i$ .

Energies were further adjusted by adding a per-charge offset to account for the free energy of solvation of a proton. This offset, approximately $-14.96$ in model units, was inferred from the original Uni-pKa training data and computed as $-6.50 \times \ln(10)$ , where $-6.50$ is the shift applied to the Uni-pKa model to align with experimental pK_a data.

Macroscopic pK_a Calculation

Macroscopic pK_a values were computed by grouping microstates by formal charge and comparing free energies between adjacent charge states. For a charge transition $c \rightarrow c+1$ , the macroscopic pK_a was computed using:

$\text{p}K_\text{a} = \frac{1}{\ln(10)} \left[ \log \sum_{i \in c+1} \exp(-E_i) - \log \sum_{j \in c} \exp(-E_j) \right]$

Microstate Populations and Derived Properties

By default, microstate populations were computed across pH 0–14 in 0.1 pH steps. The relative population $w_i(\text{pH})$ of microstate $i$ was calculated as:

$w_i(\text{pH}) = \frac{\exp(-E_i - q_i \ln(10) \cdot \text{pH})}{Z(\text{pH})}$

where $q_i$ is the charge of microstate $i$ and $Z(\text{pH})$ is the pH-dependent partition function.

logD Prediction: logD(pH) was calculated using weighted averaging of logP values in linear space:

$\log D(\text{pH}) = \log_{10} \left( \sum_i w_i(\text{pH}) \cdot 10^{\log P_i} \right)$

logP values were computed using the Crippen logP function in RDKit for neutral species and set to $-2$ for ionic species.¹⁴14. Wildman, S. A.; Crippen, G. M. Prediction of Physicochemical Parameters by Atomic Contributions. J. Chem. Inf. Comput. Sci. 1999, 39, 868–873.
Proper handling of distribution coefficients for ionic species requires handling contact- and solvent-separated-ion pairs and becomes rather complex;¹⁵15. Chen, C.-S.; Lin, S.-T. Prediction of pH Effect on the Octanol–Water Partition Coefficient of Ionizable Pharmaceuticals. Ind. Eng. Chem. Res. 2016, 55, 9284–9294.
based on previous reports,¹⁶16. Avdeef, A. pH-Metric log P. II: Refinement of partition coefficients and lonization constants of multiprotic substances J. Pharm. Sci. 1993, 82, 183–190.
we found that simply employing a fixed logP of –2 for all non-neutral species gave results in good qualitative agreement with experiment. Since quantitative accuracy in the ionic limit is rarely needed in drug discovery, we expect this approximation to be suitable for routine use.

Isoelectric Point (pI): The average net charge across pH was computed, and the pI was defined as the pH where the net charge crossed zero, using bisection search with a convergence tolerance of $10^{-3}$ . If no sign change in net charge was detected across the pH range, the isoelectric point was omitted and a warning was logged.

Unbound Brain-to-Plasma Ratio Prediction

Following Morgan Lawrenz and co-workers at Schrödinger,¹⁷17. Lawrenz, M. et al. A Computational Physics-based Approach to Predict Unbound Brain-to-Plasma Partition Coefficient, Kp,uu. J. Chem. Inf. Model. 2023, 63, 3786–3798.
we compute the aqueous solvation energy by comparing the electronic energies of conformer ensembles with and without solvent. The reported workflow takes up to 40 hours per molecule: to accelerate our implementation, we employ the AIMNet2 neural network potential from Dylan Anstine and co-workers to conduct geometry optimizations, and correct the optimized conformer energies using the CPCM-X implicit water model.^9,189. Anstine, D.; Zubatyuk, R.; Isayev, O. AIMNet2: A Neural Network Potential to Meet Your Neutral, Charged, Organic, and Elemental-Organic Needs. ChemRxiv, 2024.
18. Stahn, M.; Ehlert, S.; Grimme, S. Extended Conductor-like Polarizable Continuum Solvation Model (CPCM-X) for Semiempirical Methods. J. Phys. Chem. A 2023, 127, 7036–7043.
Conformer ensembles are generated using the ETKDG algorithm as implemented in RDKit.^8,11,128. RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
11. Riniker, S.; Landrum, G. A. Better Informed Distance Geometry: Using What We Know To Improve Conformation Generation. J. Chem. Inf. Model. 2015, 55, 2562–2574.
12. Wang, S. et al. Improving Conformer Generation for Small Rings and Macrocycles Based on Distance Geometry and Experimental Torsional-Angle Preferences. J. Chem. Inf. Model. 2020, 60, 2044–2058.
Geometry optimizations were run using geomeTRIC 1.0.2.¹⁹19. Wang, L.-P.; Song, C. Geometry optimization made simple with translation and rotation coordinates. J. Chem. Phys. 2016, 144, 214108.

We generalize the Lawrenz state-penalty concept by directly computing the free-energy penalty associated with the molecule not being in a chemically neutral, non-zwitterionic form at physiological pH. Specifically, we evaluate the Boltzmann-weighted population of microstates that are formally neutral and minimize atom-centered charges—preferring, for example, the glycine microstate with a neutral backbone (i.e., protonated carboxylic acid and deprotonated amine) over its zwitterionic counterpart. When no fully neutral form exists, we select the microstate with the fewest formal atom-centered charges. The state penalty is then defined as the free energy required to shift the ionization equilibrium toward this reference state:

$\text{SP} := -RT \cdot \ln \left( \sum_{i \in \text{neutral }} p_i \right)$

where $p_i$ is the population of neutral microstates at pH 7.4. This approach accounts for multiple protonation sites, tautomerism, and protonation-coupled conformational changes, and it avoids the need to heuristically assign a single dominant pK_a or ionizable group.

To compare the predictive efficacy of the Schrödinger-reported solvation energy to our AIMNet2/CPCM-X(water)/Starling solvation energy, we binarized log-transformed K_p,uu values at 0.3 to define a "brain-penetrant" class, trained logistic-regression models using both descriptors, and assessed model performance by 5-fold stratified cross validation and receiver operating characteristic (ROC).

Results and Discussion

pK_a Prediction

Our focus here is on pragmatic physics-informed macroscopic pK_a modeling that can be deployed in a variety of tasks, not in maximizing benchmark accuracy. Towards this end, we reduced the size of the parent Uni-pKa model by employing only a single model and not an ensemble of five models, following the authors' advice.⁶6. Pretrained Models For Inference? GitHub issue comment, https://github.com/dptech-corp/Uni-pKa/issues/3#issuecomment-2604100717 (accessed 2025-04-18).
The resultant Starling model is small enough to easily run inference on CPU-only machines, dramatically simplifying usage and deployment.

To assess the impact of this change on overall pK_a-prediction accuracy, we compared our Starling model to a wide variety of pK_a-prediction models: the original Uni-pK_a model,⁵5. Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
the MolGpka graph-convolutional model from Xiaolin Pan and co-workers,²⁰20. Pan, X. et al. MolGpka: A Web Server for Small Molecule pKa Prediction Using a Graph-Convolutional Neural Network. J. Chem. Inf. Model. 2021, 61, 3159–3165.
ChemAxon's commercial Marvin pKa plugin, Schrödingers commercial Epik Classic and Epik 7 models,^21,2221. Shelley, J. C. et al. Epik: a software program for pK( a ) prediction and protonation state generation for drug-like molecules. J. Comput.-Aided Mol. Des. 2007, 21, 681–691.
22. Johnston, R. C. et al. Epik: pKa and Protonation State Prediction through Machine Learning. J. Chem. Theory Comput. 2023, 19, 2380–2388.
and the QupKake model from Omri Abarbanel and Geoff Hutchison.²³23. Abarbanel, O. D.; Hutchison, G. R. QupKake: Integrating Machine Learning and Quantum Chemistry for Micro-pKa Predictions. J. Chem. Theory Comput. 2024, 20, 6946–6956.
Following the original Uni-pKa publication, we benchmarked Starling on the Czodrowski Novartis dataset and the SAMPL6, SAMPL7, and SAMPL8 pK_a-prediction challenges (Table 1).^4,24–264. Işık, M.; Rustenburg, A. S.; Rizzi, A.; Gunner, M. R.; Mobley, D. L.; Chodera, J. D. Overview of the SAMPL6 pKa Challenge: Evaluating Small Molecule Microscopic and Macroscopic pKa Predictions. J. Comput.-Aided Mol. Des. 2021, 35, 131–166.
24. Machine learning meets pKa. https://github.com/czodrowskilab/Machine-learning-meets-pKa (accessed 2025-04-18).
25. Grosjean, H. et al. SAMPL7 Protein-Ligand Challenge: A community-wide evaluation of computational methods against fragment screening and pose-prediction. J. Comput.-Aided Mol. Des. 2022, 36, 291–311.
26. The SAMPL8 Blind Prediction Challenges for Computational Chemistry. https://github.com/samplchallenges/SAMPL8 (accessed 2025-04-18).

To assess the impact of our beam-search enumeration strategy, we also compared against a naïve breadth-first-search strategy, denoted below as ”Starling-BFS.” This strategy explicitly considers all microstates and thus scales exponentially with the number of ionizable functional groups.

Method	Novartis Base	Novartis Acid	SAMPL6	SAMPL7	SAMPL8
Uni-pKa	0.653	1.061	0.716	0.735	0.878
MolGpka	1.064	1.287	0.773	0.980	1.150
ChemAxon Marvin	1.145	1.144	1.248	0.708	1.511
Epik Classic	1.175	1.531	0.962	1.648	---
Epik 7 (ensemble)	---	---	0.61	---	---
QupKake	---	---	0.44	0.85	1.04
Starling	0.790	1.083	1.118	0.734	1.142
Starling-BFS	0.797	1.109	1.102	0.746	1.141

Table 1: Root-mean-squared error of pK_a predictions by benchmark set. Blank entries denote benchmarks for which data is not available. All results taken from the SI of the Uni-pKa manuscript⁵5. Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
except for Starling and Starling-BFS, which were computed for this manuscript, and Epik 7.²²22. Johnston, R. C. et al. Epik: pKa and Protonation State Prediction through Machine Learning. J. Chem. Theory Comput. 2023, 19, 2380–2388.

As expected, we found that going from an ensemble of five models (Uni-pKa) to a single model (Starling) produced a consistent but modest increase in RMSE across all benchmark sets (Table 1). Nevertheless, Starling itself maintains equal or superior accuracy to existing commercial pK_a-prediction methods, and beam-search microstate enumeration leads to virtually no change in accuracy. Analysis of the root-mean-squared error across the pK_a scale shows that the model achieves maximum accuracy in the medicinally relevant 4–12 window, while high or low pKa values are predicted with less accuracy (Figure 3).

Figure 3: Root-mean-squared error of Starling-BFS across the pK_a scale, as assessed on the finetuning Dwar-iBond set. Error bars denote bootstrap 95% confidence intervals (5000 samples).

At the outset of this project, we envisioned that pruning could lead to further reductions in size while maintaining useful accuracy. We applied L1-unstructured pruning to Starling to yield a further 67% reduction in size. While pruning caused minimal changes in RMSE for some benchmark sets, other benchmark sets saw substantially increased error. The pruned model also showed significant systematic error: low pK_a values were overpredicted, while high pK_a values were underpredicted. This led to poor performance in downstream tasks like isoelectric-point prediction (vide infra)—accordingly, we did not investigate pruning-based strategies further.

Challenging Molecules

Unlike many of the models discussed above, the Uni-pKa architecture used in Starling explicitly considers microstates, allowing it to produce reasonable predictions in more complex cases. Glycine is a particularly troublesome case, as highlighted by Zheng and co-workers—the microscopic pK_a values predicted from the canonical "neutral" microstate are quite different from the macroscopic pK_a values, leading to pathological behavior in many pK_a prediction workflows.³3. Zheng, J. W.; Leito, I.; Green, W. H. Widespread Misinterpretation of pKa Terminology for Zwitterionic Compounds and Its Consequences. J. Chem. Inf. Model. 2024, 64, 8838–8847.
Starling correctly predicts that the zwitterionic microstate dominates at physiological pH (Figure 4), and the predicted pK_a values (2.23 and 9.81) comport with experimental pK_a values (2.34 and 9.60).²⁷27. Gokel, G. W.; Dean, J. A. Dean's handbook of organic chemistry, 2nd ed.; McGraw-Hill: New York, 2004.

Figure 4: Significant microstates of glycine by pH.

Starling can also account for symmetry breaking in molecules containing multiple ionizable sites. Succinic acid is a C\textsubscript2-symmetric dicarboxylic acid which has experimental pK_a values of 4.21 and 5.64.²⁷27. Gokel, G. W.; Dean, J. A. Dean's handbook of organic chemistry, 2nd ed.; McGraw-Hill: New York, 2004.
While MolGpka predicts identical pK_a values of 3.4 and 3.4 for both acids,²⁰20. Pan, X. et al. MolGpka: A Web Server for Small Molecule pKa Prediction Using a Graph-Convolutional Neural Network. J. Chem. Inf. Model. 2021, 61, 3159–3165.
Starling correctly predicts non-symmetric pK_a values of 3.80 and 5.10 (Figure 5). This case study highlights the advantages of considering multiple microstates, as methods that look only at the neutral microstate are fundamentally constrained to predict two identical pK_a values.

Figure 5: Unique microstates of succinic acid by pH. (Note that this approach does not differentiate between the two symmetry-equivalent singly deprotonated microstates.)

In proteolysis-targeting chimeras (PROTACs) and other bifunctional degraders, saturated N-heterocycles like piperazines are frequently found to be effective linkers.²⁸28. Troup, R. I.; Fallan, C.; Baud, M. G. J. Current strategies for the design of PROTAC linkers: a critical review. Explor. Target. Anti-tumor Ther. 2020, 1, 273–312.
While the protonation state of these heterocycles has a massive impact on the physicochemical properties of the resultant molecule, predicting the effect of chemical modifications on these linkers remains challenging, and most microstate-based methods struggle with the aforementioned symmetry considerations. We evaluated Starling on a database of PROTAC-relevant fragments with associated experimental pK_a values that was reported in 2022 by Jenny Desantis and co-workers.²⁹29. Desantis, J. et al. PROTACs bearing piperazine-containing linkers: what effect on their protonation state? RSC Adv. 2022, 12, 21968–21977.
Starling predicts pK_a values of 5.64 and 9.46 for piperazine (Figure 6), closely matching the experimental pK_a values of 5.44 and 9.67. Over the entire set, Starling shows excellent performance (MAE: 0.432, RMSE: 0.668), with the biggest errors originating from the overestimation of low pK_a values (Figure 7).

Figure 6: Unique microstates of piperazine by pH. (Note that this approach does not differentiate between the two symmetry-equivalent monoprotonated microstates.)

Figure 7: Starling-BFS performance on the Desantis PROTAC pK_a dataset.²⁹29. Desantis, J. et al. PROTACs bearing piperazine-containing linkers: what effect on their protonation state? RSC Adv. 2022, 12, 21968–21977.

Some limitations of the Uni-pKa architecture merit mention. While the strategy of predicting per-microstate free energies does, in general, seem more robust than direct prediction of pK_a values, in certain complex cases with many similar ionizable groups the predictions can become confused and incoherent. For instance, ethylenediaminetetraacetic acid (EDTA) has six experimental pK_a values and 21 relevant microstates; in this complex case, the computational predictions generated by Starling are improperly ordered and not particularly accurate (Table 2). While cases like this are currently rare in small-molecule drug discovery, we anticipate that the rise of oligopeptide therapeutics and other beyond-rule-of-five compounds will make accuracy in this regime more important moving forward, and plan to address this shortcoming in future pK_a-prediction work.

Initial Charge	Final Charge	pKa (Exp.)	pKa (Starling)
+2	+1	0.00	2.29
+1	0	1.50	3.55
0	-1	2.00	4.87
-1	-2	2.69	3.57
-2	-3	6.13	5.58
-3	-4	10.37	8.37

Table 2: Comparison of experimental³⁰30. Harris, D. C. Quantitative Chemical Analysis, 8th ed.; Macmillan Learning: New York, 2010.
and Starling-computed pK_a values for EDTA.

Microstate Distribution

As a part of the SAMPL6 pK_a challenge, Ikenna Ndukwe and co-workers at Merck measured which microstates were responsible for the observed (de)protonation events for two compounds, SM07 and SM14.⁴4. Işık, M.; Rustenburg, A. S.; Rizzi, A.; Gunner, M. R.; Mobley, D. L.; Chodera, J. D. Overview of the SAMPL6 pKa Challenge: Evaluating Small Molecule Microscopic and Macroscopic pKa Predictions. J. Comput.-Aided Mol. Des. 2021, 35, 131–166.
We used this experimental data to assess the accuracy of the microstates predicted by Starling. For SM07, we found that the observed protonation event at pH 6.08 was predicted to occur at pH 5.13 by Starling. NMR spectroscopy predicted that this mainly corresponded to protonation at N1 of the 4-aminoquinazoline ring, while Starling predicted that N1 protonation and aniline protonation were equally favorable (Figure 8). For SM14, Starling predicted benzimidazole protonation at pH 5.71 and aniline protonation at pH 2.48, matching the experimental microstates (pK_a of 5.30 and 2.58, respectively) (Figure 8).

Figure 8: ChemDraw visualization of microstate transitions in SM07 and SM14.

SAMPL6 does not quantify the relative abundance of minor microstates, making it difficult to assess the accuracy of the entire microstate ensemble, but these results indicate at a minimum that training a Uni-pKa model on macroscopic pK_a values yields generally reasonable microstate predictions. We note that the original Uni-pKa paper reports low accuracy on tautomer-specific benchmarks, commenting that "the model tends to attach similar free energies to different tautomers and fails to distinguish the tautomers with drastic energy differences."⁵5. Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
These results comport with our observations.

Isoelectric Points

With a list of pH-dependent microstate populations, determining the isoelectric point—the pH at which the average charge across all microstates is zero—is straightforward. We benchmarked the accuracy of Starling's isoelectric-point predictions for the 20 canonical amino acids, for which experimental isoelectric-point measurements are readily available.²⁷27. Gokel, G. W.; Dean, J. A. Dean's handbook of organic chemistry, 2nd ed.; McGraw-Hill: New York, 2004.
The Starling-predicted isoelectric points were somewhat compressed relative to experiment—high isoelectric points were underpredicted, while low isoelectric points were overpredicted (Figure 9).

Figure 9: Experimental isoelectric points of canonical amino acids compared to Starling-predicted values.

This approach leads to good error metrics (MAE: 0.282, RMSE: 0.527) and an excellent Pearson correlation coefficient (0.979). While Starling likely underperforms modality-specific isoelectric-point-prediction algorithms like pIChemiSt,³¹31. Frolov, A. I. et al. pIChemiSt Free Tool for the Calculation of Isoelectric Points of Modified Peptides. J. Chem. Inf. Model. 2023, 63, 187–196.
this approach is quick and requires no isoelectric-point-specific fine-tuning, making it a useful way to estimate the isoelectric points of novel chemical scaffolds.

logD/pH Profile Prediction

As described in the "Methods" section, the pH-dependent microstate populations produced by the global pK_a prediction workflow can be used to predict logD, the distribution coefficient between water and n-octanol at a given pH. This approach gives good quantitative agreement with reported logD/pH studies for pentachlorophenol (Figure 10) and metoprolol (Figure 11).^15,3215. Chen, C.-S.; Lin, S.-T. Prediction of pH Effect on the Octanol–Water Partition Coefficient of Ionizable Pharmaceuticals. Ind. Eng. Chem. Res. 2016, 55, 9284–9294.
32. Markovic, M. et al. Segmental-Dependent Solubility and Permeability as Key Factors Guiding Controlled Release Drug Product Development. Pharmaceutics 2020, 12.
Pentachlorophenol is predicted to be highly lipophilic at low pH values, but to become hydrophilic after deprotonation at higher pH values. In contrast, the amine of metoprolol is protonated at low pH values, making the molecule strongly hydrophilic; only at higher pH is the molecule predicted to populate the octanol phase.

Figure 10: Predicted logD of pentachlorophenol as a function of pH. The dashed line corresponds to the predicted pK_a of 4.24.

Figure 11: Predicted logD of metoprolol as a function of pH. The dashed line corresponds to the predicted pK_a values of 9.29 and 13.72.

These logD predictions rely on the venerable Crippen logP-prediction scheme and, while qualitatively reasonable, are unlikely to be quantitatively accurate.¹⁴14. Wildman, S. A.; Crippen, G. M. Prediction of Physicochemical Parameters by Atomic Contributions. J. Chem. Inf. Comput. Sci. 1999, 39, 868–873.
Nevertheless, this scheme is minimally parameterized and does not require fitting to any experimental logD data, making it well-suited to extrapolation to data-scarce regions of chemical space. We expect that improvements to the underlying logP-prediction method will dramatically increase the accuracy of these predictions.

More abstractly, our approach partitions the task of predicting logD into two smaller subtasks: (1) predicting per-microstate weights at a given pH and (2) predicting logP values for a given microstate. Each of these subtasks can be independently trained and benchmarked to experimental data, making it possible to interpret errors and systematically improve the quality of output predictions. The notorious difficulty of predicting logD may stem from the fact that most logD-prediction algorithms attempt to implicitly learn both subtasks from the same dataset; we suspect that this two-task strategy may prove more tractable.

K_p,uu Prediction

Blood–brain-barrier permeability is a critical factor in developing effective central nervous system (CNS) therapeutics. The most common metric used to quantify blood–brain-barrier penetrance is unbound brain-to-plasma drug partition coefficient, or K_p,uu; in a 2022 study, over 3/4ths of drug-discovery companies surveyed found that the use of K_p,uu had a "game-changing" impact on their preclinical research.³³33. Loryan, I. et al. Unbound Brain-to-Plasma Partition Coefficient, Kp,uu,brain—A Game Changing Parameter for CNS Drug Discovery and Development. Pharm. Res. 2022, 39, 1321–1341.
Unfortunately, experimental measurement of K_p,uu requires time-consuming animal studies and non-trivial amounts of material. A high-throughput computational method capable of accurate K_p,uu prediction could drive improved decision making in preclinical CNS campaigns.

Recently, Morgan Lawrenz and co-workers at Schrödinger found that the energy of solvation predicted by density-functional theory (DFT) was highly predictive of K_p,uu, the unbound brain-to-plasma ratio.¹⁷17. Lawrenz, M. et al. A Computational Physics-based Approach to Predict Unbound Brain-to-Plasma Partition Coefficient, Kp,uu. J. Chem. Inf. Model. 2023, 63, 3786–3798.
Lawrenz and co-workers compute the solvation energy only for neutral species: to correct for cases in which the molecule is protonated at physiological pH and must undergo an uphill deprotonation, Lawrenz and co-workers add a pK_a-dependent "state penalty" corresponding to the energy needed to neutralize a molecule at pH 7.4. Unfortunately, their expression only accounts for cases in which a single site on the molecule is protonated at low pH and deprotonated at high pH, making it poorly suited for complex zwitterionic cases and limiting the method's applicability to high-throughput screening or black-box usage.

We envisioned that our rapid macroscopic pK_a-prediction algorithm might provide a more robust and physically accurate way of estimating the free energy of neutralization. Here, we use Starling-computed microstate weights at pH 7.4 to directly compute the free-energy penalty associated with the molecule not being in a chemically neutral, non-zwitterionic form. We also dramatically accelerate the underlying solvation energy prediction by using the AIMNet2 neural network potential to optimize conformer ensembles and computing the solvation energy using the CPCM-X implicit water model.^9,189. Anstine, D.; Zubatyuk, R.; Isayev, O. AIMNet2: A Neural Network Potential to Meet Your Neutral, Charged, Organic, and Elemental-Organic Needs. ChemRxiv, 2024.
18. Stahn, M.; Ehlert, S.; Grimme, S. Extended Conductor-like Polarizable Continuum Solvation Model (CPCM-X) for Semiempirical Methods. J. Phys. Chem. A 2023, 127, 7036–7043.
Overall, our method is approximately two orders of magnitude faster than the reported speed of the Lawrenz workflow, and will almost certainly be faster if hardware acceleration is employed.

We evaluated our solvation-energy-prediction workflow against a dataset of 123 experimental K_p,uu values and DFT-computed solvation energies reported by Schrödinger. The solvation energies predicted by our AIMNet2/CPCM-X(water)/Starling workflow were in good agreement with the solvation energies computed at the M06-2X/LACVP**/PBF(water) // B3LYP/LACVP* level of theory. (We also note that our method does not employ any experimentally measured data, unlike the method of Lawrenz and co-workers.)

To compare the ability of both solvation-energy schemes to predict experimental K_p,uu data, log-transformed K_p,uu values were binarized at 0.3 to define a "brain-penetrant" class (following Lawrenz and co-workers) and logistic-regression models were trained using both energy values. The model based on the Lawrenz solvation-energy workflow achieved an AUC=0.91 and 85% accuracy at a 0.5 decision threshold, while the Starling-based model yielded AUC=0.85 and 75% accuracy (mainly due to an increased number of false positives). Thus, while the high-level DFT solvation energies do yield improved categorization accuracy, AIMNet2/CPCM-X(water)/Starling-computed solvation energies remain highly predictive of experimental K_p,uu.

Figure 12: Comparison of DFT-based solvation energies from Lawrenz and coworkers and AIMNet2/CPCM-X(water)/Starling-BFS-computed solvation energies.

Figure 13: ROC/AUC analysis of employing AIMNet2/CPCM-X(water)/Starling-BFS-predicted energy of solvation to predict K_p,uu at a threshold of 0.3.

Conclusion

We present Starling, a lightweight, physics-informed neural network for macroscopic pK_a prediction. Starling explicitly resolves microstate ensembles and their populations, enabling the prediction of derived properties including logD, isoelectric point, and neutralization free energy—all from a single forward pass. Furthermore, the underlying free-energy-prediction scheme employed by the Uni-pKa architecture make Starling robust to failure modes common to other pK_a-prediction methods. We expect Starling to be capable of generating immediate practical insights in medicinal chemistry, from estimating logD/pH profiles to guiding the development of CNS-active therapeutics through rapid blood–brain-barrier predictions.

Nevertheless, considerable theoretical improvements to Starling are possible. Because the Uni-pKa architecture explicitly resolves the microstate ensemble, it can naturally accommodate new multiobjective training targets: tautomer equilibrium data, experimental isoelectric points, experimental logD measurements, and more. This flexibility could allow similar microstate-scoring architectures to incorporate larger and more diverse datasets. We anticipate that further efforts in this area can address the limitations of Starling and reach unprecedented levels of accuracy for physicochemical property-prediction tasks.

Acknowledgement

C.C.W. thanks Ari Wagen, Jonathon Vandezande, Elias Mann, and Spencer Schneider for editing drafts of this manuscript. C.C.W also thanks Jonathan Zheng for helpful discussions.

References

Di, L.; Kerns, E. H. Drug-Like Properties: Concepts, Structure Design and Methods from ADME to Toxicity Optimization, 2nd ed.; Academic Press: Boston, 2016.
Navo, C. D.; Jiménez-Osés, G. Computer Prediction of pKa Values in Small Molecules and Proteins. ACS Med. Chem. Lett. 2021, 12, 1624–1628.
Zheng, J. W.; Leito, I.; Green, W. H. Widespread Misinterpretation of pKa Terminology for Zwitterionic Compounds and Its Consequences. J. Chem. Inf. Model. 2024, 64, 8838–8847.
Işık, M.; Rustenburg, A. S.; Rizzi, A.; Gunner, M. R.; Mobley, D. L.; Chodera, J. D. Overview of the SAMPL6 pKa Challenge: Evaluating Small Molecule Microscopic and Macroscopic pKa Predictions. J. Comput.-Aided Mol. Des. 2021, 35, 131–166.
Luo, W. et al. Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction. JACS Au 2024, 4, 3451–3465.
Pretrained Models For Inference? GitHub issue comment, https://github.com/dptech-corp/Uni-pKa/issues/3#issuecomment-2604100717 (accessed 2025-04-18).
Hsu, F. Common Starling at Half Moon Bay. https://en.wikipedia.org/wiki/Common_starling, 2018.
RDKit: Open-source cheminformatics. https://www.rdkit.org. https://www.rdkit.org, 2013–2025.
Anstine, D.; Zubatyuk, R.; Isayev, O. AIMNet2: A Neural Network Potential to Meet Your Neutral, Charged, Organic, and Elemental-Organic Needs. ChemRxiv, 2024.
Plata, R. E.; Singleton, D. A. A Case Study of the Mechanism of Alcohol-Mediated Morita Baylis–Hillman Reactions. The Importance of Experimental Observations. J. Am. Chem. Soc. 2015, 137, 3811–3826.
Riniker, S.; Landrum, G. A. Better Informed Distance Geometry: Using What We Know To Improve Conformation Generation. J. Chem. Inf. Model. 2015, 55, 2562–2574.
Wang, S. et al. Improving Conformer Generation for Small Rings and Macrocycles Based on Distance Geometry and Experimental Torsional-Angle Preferences. J. Chem. Inf. Model. 2020, 60, 2044–2058.
Halgren, T. A. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 1996, 17, 490–519.
Wildman, S. A.; Crippen, G. M. Prediction of Physicochemical Parameters by Atomic Contributions. J. Chem. Inf. Comput. Sci. 1999, 39, 868–873.
Chen, C.-S.; Lin, S.-T. Prediction of pH Effect on the Octanol–Water Partition Coefficient of Ionizable Pharmaceuticals. Ind. Eng. Chem. Res. 2016, 55, 9284–9294.
Avdeef, A. pH-Metric log P. II: Refinement of partition coefficients and lonization constants of multiprotic substances J. Pharm. Sci. 1993, 82, 183–190.
Lawrenz, M. et al. A Computational Physics-based Approach to Predict Unbound Brain-to-Plasma Partition Coefficient, Kp,uu. J. Chem. Inf. Model. 2023, 63, 3786–3798.
Stahn, M.; Ehlert, S.; Grimme, S. Extended Conductor-like Polarizable Continuum Solvation Model (CPCM-X) for Semiempirical Methods. J. Phys. Chem. A 2023, 127, 7036–7043.
Wang, L.-P.; Song, C. Geometry optimization made simple with translation and rotation coordinates. J. Chem. Phys. 2016, 144, 214108.
Pan, X. et al. MolGpka: A Web Server for Small Molecule pKa Prediction Using a Graph-Convolutional Neural Network. J. Chem. Inf. Model. 2021, 61, 3159–3165.
Shelley, J. C. et al. Epik: a software program for pK( a ) prediction and protonation state generation for drug-like molecules. J. Comput.-Aided Mol. Des. 2007, 21, 681–691.
Johnston, R. C. et al. Epik: pKa and Protonation State Prediction through Machine Learning. J. Chem. Theory Comput. 2023, 19, 2380–2388.
Abarbanel, O. D.; Hutchison, G. R. QupKake: Integrating Machine Learning and Quantum Chemistry for Micro-pKa Predictions. J. Chem. Theory Comput. 2024, 20, 6946–6956.
Machine learning meets pKa. https://github.com/czodrowskilab/Machine-learning-meets-pKa (accessed 2025-04-18).
Grosjean, H. et al. SAMPL7 Protein-Ligand Challenge: A community-wide evaluation of computational methods against fragment screening and pose-prediction. J. Comput.-Aided Mol. Des. 2022, 36, 291–311.
The SAMPL8 Blind Prediction Challenges for Computational Chemistry. https://github.com/samplchallenges/SAMPL8 (accessed 2025-04-18).
Gokel, G. W.; Dean, J. A. Dean's handbook of organic chemistry, 2nd ed.; McGraw-Hill: New York, 2004.
Troup, R. I.; Fallan, C.; Baud, M. G. J. Current strategies for the design of PROTAC linkers: a critical review. Explor. Target. Anti-tumor Ther. 2020, 1, 273–312.
Desantis, J. et al. PROTACs bearing piperazine-containing linkers: what effect on their protonation state? RSC Adv. 2022, 12, 21968–21977.
Harris, D. C. Quantitative Chemical Analysis, 8th ed.; Macmillan Learning: New York, 2010.
Frolov, A. I. et al. pIChemiSt Free Tool for the Calculation of Isoelectric Points of Modified Peptides. J. Chem. Inf. Model. 2023, 63, 187–196.
Markovic, M. et al. Segmental-Dependent Solubility and Permeability as Key Factors Guiding Controlled Release Drug Product Development. Pharmaceutics 2020, 12.
Loryan, I. et al. Unbound Brain-to-Plasma Partition Coefficient, Kp,uu,brain—A Game Changing Parameter for CNS Drug Discovery and Development. Pharm. Res. 2022, 39, 1321–1341.

Physics-Informed Machine Learning Enables Rapid Macroscopic pK_a Prediction

Introduction