Abstract: Marine phytoplankton are responsible for half of the global net primary production and perform multiple other ecological functions and services of the global ocean. These photosynthetic organisms comprise more than 4300 marine species, but their biogeographic patterns and the resulting species diversity are poorly known, mostly owing to severe data limitations. Here, we compile, synthesize, and harmonize marine phytoplankton occurrence records from the two largest biological occurrence archives (Ocean Biogeographic Information System, OBIS; and Global Biodiversity Information Facility, GBIF) and three independent recent data collections. We bring together over 1.36 million phytoplankton occurrence records (1.28 million at the level of species) for a total of 1704 species, spanning the principal groups of the diatoms, dinoflagellates, and haptophytes, as well as several other groups. This data compilation increases the amount of marine phytoplankton records available through the single largest contributing archive (OBIS) by 65 %. Data span all ocean basins, latitudes, and most seasons. Analyzing the oceanic inventory of sampled phytoplankton species richness at the broadest spatial scales possible using a resampling procedure, we find that richness tends to saturate at ∼93 % of all species in our database in the pantropics, at ∼64 % in temperate waters, and at ∼35 % in the cold Northern Hemisphere, while the Southern Hemisphere remains under-explored. We provide metadata on the cruise, research institution, depth, and date for each data record, and we include phytoplankton cell counts for 193 763 records. We strongly recommend consideration of spatiotemporal biases in sampling intensity and varying taxonomic sampling scopes between research cruises or institutions when analyzing the occurrence data spatially.
Incorporating such information into predictive tools, such as statistical species distribution models, may serve to project the diversity, niches, and distribution of species in the contemporary and future ocean, opening the door for quantitative macroecological analyses of phytoplankton. PhytoBase can be downloaded from PANGAEA: https://doi.org/10.1594/PANGAEA.904397 (Righetti et al., 2019a).
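
To illustrate what a resampling-based estimate of richness saturation can look like, the following is a minimal sketch of a generic species accumulation curve. It is not the procedure used for the results above, which the abstract does not specify. The function name `accumulation_curve`, its parameters, and the toy records are hypothetical; each record is assumed to be reduced to a species label.

```python
import random


def accumulation_curve(records, n_draws, n_repeats=100, seed=0):
    """Mean number of distinct species seen after 1..n_draws random records.

    `records` is a list of species labels, one per occurrence record
    (an assumed simplification of a real occurrence table).
    """
    rng = random.Random(seed)
    n_draws = min(n_draws, len(records))
    totals = [0] * n_draws
    for _ in range(n_repeats):
        # Draw records without replacement and track newly seen species.
        sample = rng.sample(records, n_draws)
        seen = set()
        for i, species in enumerate(sample):
            seen.add(species)
            totals[i] += len(seen)
    return [t / n_repeats for t in totals]


# Toy data (invented for illustration): four species with uneven sampling.
records = ["A"] * 50 + ["B"] * 30 + ["C"] * 15 + ["D"] * 5
curve = accumulation_curve(records, n_draws=len(records))

# Fraction of the species pool recovered once every record has been drawn.
fraction_recovered = curve[-1] / len(set(records))
```

A curve that flattens well before the last draw suggests that additional sampling adds few new species in that region. A curve that is still rising indicates an under-sampled inventory.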