Summary

New photocatalysts are traditionally identified through trial and error. Machine learning has shown considerable promise for improving the efficiency of photocatalyst discovery from a large pool of candidates. Here, we describe a multi-step, target-driven consensus method using a stacking meta-learning algorithm that robustly predicts the bandgaps and H2 evolution activities of photocatalysts. Trained on small datasets, these models can rapidly screen a large space (>10 million materials) to identify promising, non-toxic compounds as candidate water-splitting photocatalysts. We synthesized four compounds: two predicted to be effective, and two controls predicted to have optimal bandgap values (∼2 eV) but no photoactivity. Their experimentally measured bandgaps and H2 evolution activities were consistent with the predictions. Notably, the two effective compounds showed strong photoactivity under UV and visible light, making them promising visible-light-driven water-splitting photocatalysts. This study demonstrates the power of machine learning and the potential of big data to accelerate the discovery of next-generation photocatalysts.

Highlights

• Stacking models predict bandgap and H2 evolution activity of oxide photocatalysts
• Models predict robustly across a wide range of material structures
• Models rapidly identify promising photocatalysts from 10 million materials
• Four synthesized compounds confirm the predicted results

Chemistry; Catalysis; Computational chemistry
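
For readers unfamiliar with stacking, the general idea can be illustrated with a minimal sketch. It is not the authors' pipeline. The descriptors, the target function, the choice of base learners, and the ±0.2 eV screening tolerance are all assumptions made for illustration; only the ∼2 eV target bandgap comes from the summary above. The sketch uses scikit-learn's `StackingRegressor`, which fits several base models and trains a final estimator on their cross-validated predictions.

```python
import numpy as np
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption): 200 "materials" with 8 numeric
# descriptors and a bandgap-like target in eV. Not data from the study.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(200, 8))
y = 1.0 + 2.0 * X[:, 0] + np.sin(3.0 * X[:, 1]) + 0.1 * rng.normal(size=200)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Base learners produce out-of-fold predictions (cv=5); the final estimator
# (the meta-learner) is trained on those predictions.
stack = StackingRegressor(
    estimators=[
        ("rf", RandomForestRegressor(n_estimators=100, random_state=0)),
        ("gbr", GradientBoostingRegressor(random_state=0)),
    ],
    final_estimator=Ridge(alpha=1.0),
    cv=5,
)
stack.fit(X_train, y_train)
r2 = stack.score(X_test, y_test)  # coefficient of determination on held-out data

# Screening step: predict the target for a pool of unseen candidates and keep
# those near the ~2 eV bandgap. The 0.2 eV tolerance is an assumption.
candidates = rng.uniform(0.0, 1.0, size=(1000, 8))
predicted_gap = stack.predict(candidates)
shortlist = np.flatnonzero(np.abs(predicted_gap - 2.0) < 0.2)

print(f"held-out R^2: {r2:.2f}; shortlisted {shortlist.size} of {len(candidates)}")
```

In a real screening workflow the synthetic arrays would be replaced by computed material descriptors and measured bandgaps, and a second model of the same kind could be trained on H2 evolution activity so that only candidates passing both predictions are shortlisted.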