Downes.ca ~ Stephen's Web ~ PD12M

Stephen Downes

Knowledge, Learning, Community

PD12M

Source.Plus, Dec 06, 2024

From Alan Levine comes this link: "At 12.4 million image-caption pairs, PD12M is the largest public domain image-text dataset to date, with sufficient size to train foundation models while minimizing copyright concerns. Through the Source.Plus platform, we also introduce novel, community-driven dataset governance mechanisms that reduce harm and support reproducibility over time." Search could be better, but the images are great.


Stephen Downes, Casselman, Canada
stephen@downes.ca

Copyright 2024
Last Updated: Dec 12, 2024 02:49 a.m.

Creative Commons License.
