kakaobrain/coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Python, Shell
Star and fork statistics for the kakaobrain/coyo-dataset repository. As of 06 May 2024, this repository has 974 stars and 33 forks.

🐺 COYO-700M: Image-Text Pair Dataset

COYO-700M is a large-scale dataset that contains 747M image-text pairs, along with many other meta-attributes that make it more usable for training various models. Our dataset follows a similar strategy to previous vision-and-language datasets, collecting many informative pairs of alt-text and its associated image from HTML documents. We expect COYO to be used to train popular large-scale foundation models, complementing other similar datasets. More details on the...

repo | techs | stars | stars weekly | forks | forks weekly
pypa/pipenv | Python, Other | 24.2k | 0 | 1.9k | 0
charliermarsh/ruff | Rust, TypeScript, Python | 18.1k | 0 | 588 | 0
Dkazem91/holberton-system_engineering-devops | Shell, Python, Puppet | 115 | 0 | 271 | 0
philipyoo/holbertonschool-sysadmin_devops | Shell, Python, Puppet | 74 | 0 | 140 | 0
crystal-linux/iso | Shell | 124 | 0 | 9 | 0
OreosLab/checkinpanel | Perl, Python, JavaScript | 1.4k | 0 | 382 | 0
ljvmiranda921/prodigy-pdf-custom-recipe | Python | 192 | 0 | 19 | 0
Gabattal/Scripts-LeagueOfLegends | Python, Batchfile | 43 | 0 | 4 | 0
umd-cmsc330/fall2022 | OCaml, Ruby, Standard ML | 29 | 0 | 22 | 0
veloren/veloren | Rust, Fluent, GLSL | 4.5k | 0 | 311 | 0