Dataset Store
Bring your AI project to the next level with high-quality, licensed data
Dataset Store
Bring your AI project to the next level with high-quality, licensed data
Expert Q&A Dataset
- Expert responses across various fields, including law, lifestyle, finance, and health
- Over 2.3 million counseling conversations
Academic Dataset
Processed from KCI journals and academic publications by KSI
Customizable into formats for training AI models
Image & Video Datasets
- Partnership with a global image platform
- Around 500 million items, including people, transportation, animals, food, and more
- Full-body photos of individuals (20 images per person, totaling 160,000)
Multilingual Conversation Dataset
- Conversations in 110 languages between individuals and chatbot agents
- 20 million speakers, over 1TB of multi-turn and single-turn conversations
Multilingual Translation Dataset
- 91 types of translations, including foreign language-to-foreign language and foreign language-to-Korean
Synthetic Dataset
- Unreal Engine simulations for rare cases in molding, manufacturing, and safety
- Data processing for 3D segmentation, cuboid annotation, and more
Coding Test Dataset
- Coding test dataset with problems and solutions paired in various languages, including Python3, Java, and C++
Credit Card Dataset
- Payment card transaction dataset by merchant location
Korean Independent Film Dataset
- Film and Korean/English script dataset
- EMDM(Entertainment Metadata Management) included
Media Dataset
- Text, video, and image data from Korean financial news outlets (can be organized in multimodal format)
- 50 million articles and images across specialized domains like economics and society, with multimodal configuration options available
- Official distributor for a total of 97 media outlets across various domains
* Additional collection and processing of purchase datasets upon request
Make an effort to change the world
The perfect data for your AI
Datumo provides a platform connecting AI companies with crowd workers to solve data challenges. Through research on your projects and crowd workers, we implement a more efficient, cutting-edge crowdsourcing platform.