{"id":12353,"date":"2025-05-19T14:02:45","date_gmt":"2025-05-19T05:02:45","guid":{"rendered":"https:\/\/crowdworks.blog\/build-ai-training-data-to-match-complex-infographics-with-text\/"},"modified":"2025-09-26T11:18:58","modified_gmt":"2025-09-26T02:18:58","slug":"ai-training-data-infographics-text","status":"publish","type":"post","link":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/","title":{"rendered":"Building AI Training Data that Matches Complex Infographics with Text"},"content":{"rendered":"\n<div style=\"height:31px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"800\" height=\"540\" data-attachment-id=\"12358\" data-permalink=\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/building-ai-training-data_th\/\" data-orig-file=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675\" data-orig-size=\"1000,675\" data-comments-opened=\"0\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Building AI Training Data_th\" data-image-description=\"&lt;p&gt;Building AI Training Data that Matches Complex Infographics with Text&lt;\/p&gt;\n\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=300%2C203\" data-large-file=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=800%2C540\" tabindex=\"0\" role=\"button\" 
src=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?resize=800%2C540\" alt=\"Building AI Training Data that Matches Complex Infographics with Text\" class=\"wp-image-12358\" srcset=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?w=1000 1000w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?resize=300%2C203 300w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?resize=768%2C518 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" data-recalc-dims=\"1\" \/><\/figure>\n\n\n\n<div style=\"height:32px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>About Client A<\/strong><\/h3>\n\n\n\n<p>Client A is one of Korea\u2019s leading telecommunications companies, driving business innovation through ICT convergence technologies such as AI, big data, and cloud. Recently, the company has been introducing AI company-wide to improve operational efficiency.<\/p>\n\n\n\n<div style=\"height:101px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading has-text-align-justify\"><strong>Project Overview<\/strong><\/h3>\n\n\n\n<p>To further improve efficiency, Client A wanted its AI systems to better understand, summarize, and analyze internal documents. As part of this effort, the company aimed to build large-scale AI training datasets. 
Specifically, through Crowdworks, they sought to construct a dataset of Korean-language infographics\u2014hierarchical flowcharts, diagrams, and other graphic elements\u2014meticulously labeled and matched with text.<\/p>\n\n\n\n<div style=\"height:28px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" width=\"800\" height=\"450\" src=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?resize=800%2C450\" alt=\"This is a schematic image of the organizational structure of the Korean judiciary, with the Supreme Court at the top, followed by the Court Administration Office, the Judicial Training Institute, the Court Officials Education Institute, the Court Library, the High Court, and the Patent Court, and below the High Court are the District Courts, Family Courts, and Administrative Courts, with support and registry offices at the bottom.\" class=\"wp-image-11532\" style=\"width:646px;height:auto\" srcset=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?resize=1024%2C576 1024w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?resize=300%2C169 300w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?resize=768%2C432 768w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?resize=1200%2C675 1200w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/example.jpg?w=1280 1280w\" sizes=\"(max-width: 800px) 100vw, 800px\" data-recalc-dims=\"1\" \/><\/figure><\/div>\n\n\n<p class=\"has-text-align-center has-very-dark-gray-color has-text-color has-link-color has-small-font-size wp-elements-757f3f1d32b9e02f8e3523b6ac97012c\">Example images<\/p>\n\n\n\n<div style=\"height:43px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>The specific tasks of this project are as 
follows:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Collect Korean infographic images from publicly available datasets that are free of licensing issues<\/li>\n\n\n\n<li>Select images that have a hierarchical structure and express relationships between components<\/li>\n\n\n\n<li>Label each component (node) and match it with information about the relationships between nodes<\/li>\n\n\n\n<li>Write summary captions that describe each image<\/li>\n<\/ul>\n\n\n\n<div style=\"height:101px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Why Client A Chose Crowdworks<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>End-to-end data services<\/strong>: From collection to processing, Crowdworks covers the entire workflow.<\/li>\n\n\n\n<li><strong>Quality assurance<\/strong>: A systematic validation system ensures high-quality data.<\/li>\n\n\n\n<li><strong>Expertise<\/strong>: Experienced professionals can design and execute even the most complex projects.<\/li>\n<\/ul>\n\n\n\n<div style=\"height:101px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>This was our first project of this kind, too. Was it really possible?<\/strong><\/h3>\n\n\n\n<p>Client A expected data quality that fully met its high standards. Even for Crowdworks, with its years of industry-leading expertise in data labeling and AI dataset construction, this was a challenging project. 
The reasons were:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The data had to be license-cleared and also meet the project&#8217;s specific conditions<\/li>\n\n\n\n<li>The construction work itself was highly difficult: it involved defining and connecting relationships between images (complex infographics) and text<\/li>\n\n\n\n<li>This was a new initiative for Client A as well, with no comparable prior experience, so the process and methods were hard to predict and the standards and requirements were not yet clearly defined<\/li>\n<\/ul>\n\n\n\n<div style=\"height:101px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Project Solution Process<\/strong><\/h3>\n\n\n\n<p><strong>1) Designing the Data-Building Plan<\/strong><\/p>\n\n\n\n<p>We analyzed the requirements and sample data provided by Client A and, in doing so, identified areas where generative AI could automate parts of the work. By combining our proprietary solutions with various open-source tools, we developed a customized data-building toolkit tailored to the client\u2019s needs. 
The datasets were then categorized and managed by task difficulty, based on the number of nodes and connecting lines between objects.<\/p>\n\n\n\n<div style=\"height:28px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"800\" height=\"391\" src=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00-1024x500.png?resize=800%2C391&#038;ssl=1\" alt=\"A guided infographic explaining the process of the autonomous learning organization, including a step-by-step flow from open call \u2192 Planning and Evaluation Committee review \u2192 Chairman approval \u2192 Inauguration report \u2192 Operation \u2192 Final presentation, with explanations of each step and examples of key activities.\" class=\"wp-image-11468\" srcset=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=1024%2C500 1024w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=300%2C147 300w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=768%2C375 768w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=1536%2C750 1536w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=2048%2C1000 2048w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?resize=1200%2C586 1200w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?w=1600 1600w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_1\ub2e8\uacc4_\ucd94\uac00.png?w=2400 2400w\" sizes=\"(max-width: 
800px) 100vw, 800px\" data-recalc-dims=\"1\" \/><\/figure><\/div>\n\n\n<div style=\"height:5px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"has-text-align-center has-very-dark-gray-color has-text-color has-link-color has-small-font-size wp-elements-09ba7808ce91b6a016efe3f1497a6891\">Step 1<\/p>\n\n\n\n<div style=\"height:28px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"390\" src=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00-1024x499.png?resize=800%2C390&#038;ssl=1\" alt=\"A labeling task screen with multiple colored boxes overlaying an image of the Supreme Court organizational chart, with each organizational unit (e.g., Supreme Court, High Court, Support, etc.) color-coded, and a list of names and settings for each label on the right.\" class=\"wp-image-11471\" srcset=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=1024%2C499 1024w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=300%2C146 300w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=768%2C374 768w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=1536%2C749 1536w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=2048%2C998 2048w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?resize=1200%2C585 1200w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?w=1600 1600w, 
https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_2\ub2e8\uacc4_\ucd94\uac00.png?w=2400 2400w\" sizes=\"(max-width: 800px) 100vw, 800px\" data-recalc-dims=\"1\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-very-dark-gray-color has-text-color has-link-color has-small-font-size wp-elements-19dcb2f9c5c8f1ed3f2b4be4dd922942\">Step 2<\/p>\n\n\n\n<div style=\"height:1px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"391\" src=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4-1024x501.png?resize=800%2C391&#038;ssl=1\" alt=\"A complex flowchart depicting the entire child abuse response system, with multiple labeled boxes overlaid on top of each other, visually organizing the response actors and processes from identification of a child in crisis to outreach to services to case management, with a list of labels on the right.\" class=\"wp-image-11473\" srcset=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=1024%2C501 1024w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=300%2C147 300w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=768%2C376 768w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=1536%2C751 1536w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=2048%2C1001 2048w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?resize=1200%2C587 1200w, https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?w=1600 1600w, 
https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/\uc608\uc2dc_3\ub2e8\uacc4.png?w=2400 2400w\" sizes=\"(max-width: 800px) 100vw, 800px\" data-recalc-dims=\"1\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-very-dark-gray-color has-text-color has-link-color has-small-font-size wp-elements-6afebf8569d51e52b4a1d25ae8b01769\">Step 3<\/p>\n\n\n\n<div style=\"height:27px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><strong>2) Deploying Qualified Experts<\/strong><\/p>\n\n\n\n<p>We determined that this project required specialists with strong expertise and a deep understanding of data context. To that end, we assigned verified data specialists who understand mathematical and logical structures and algorithmic frameworks, and who can create or interpret such structures. They were also required to understand JSON and object-based data models. Before the project began, all workers completed comprehensive guideline training and pre-tests to minimize risks that could affect quality.<\/p>\n\n\n\n<div style=\"height:27px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><strong>3) Ongoing Review, Feedback, and Adjustments<\/strong><\/p>\n\n\n\n<p>Throughout the four-month project, we maintained regular communication with Client A, conducting periodic reviews of sample data and flexibly adjusting the workflow to reflect interim feedback. Since this was more than a simple image-labeling task, we implemented an integrated operational strategy that combined multiple tasks such as captioning, object mapping, and text transcription. 
Project managers collaborated closely with internal data engineers to map out the entire workflow and establish detailed guidelines, ensuring consistency and high quality across all deliverables.<\/p>\n\n\n\n<div style=\"height:74px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>\u201cWe want to work with Crowdworks again!\u201d<\/strong><\/h3>\n\n\n\n<p>At the beginning, Client A had significant concerns: the data structures were complex, and the tasks were entirely new territory, raising questions like, <em>\u201cCan this really work?\u201d<\/em> As the project progressed, however, Crowdworks built trust through fast and accurate communication, thorough requirement analysis, and flexible, tailored proposals. Client A was impressed with the project manager\u2019s direction and operational strategy, and ultimately expressed strong satisfaction with the quality of the data delivered\u2014so much so that they concluded, <em>\u201cWe want to work with Crowdworks again on our next project.\u201d<\/em><\/p>\n\n\n\n<p>As AI technology advances and companies build increasingly diverse services, the need for high-quality, complex datasets continues to grow. If you ever encounter a project that seems almost impossible, Crowdworks is here to help. 
For us, such a project is not just a challenge but an opportunity to find the most efficient and effective solution together with our clients.<\/p>\n\n\n\n<div style=\"height:76px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-1 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button has-custom-width wp-block-button__width-75 is-style-fill\"><a class=\"wp-block-button__link has-vivid-cyan-blue-to-vivid-purple-gradient-background has-background has-text-align-center wp-element-button\" href=\"https:\/\/www.crowdworks.ai\/en\/company\/contact\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Contact Crowdworks for Data-Building Inquiries<\/strong><\/a><\/div>\n<\/div>\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Client A is one of Korea\u2019s leading telecommunications companies, driving business innovation through ICT convergence technologies such as AI, big data, and cloud. Recently, the company has been introducing AI company-wide to improve operational efficiency. 
Project Overview To further improve efficiency, Client A wanted its AI systems to better understand, summarize, and analyze internal [&hellip;]<\/p>\n","protected":false},"author":235377076,"featured_media":12358,"comment_status":"closed","ping_status":"open","sticky":false,"template":"elementor_theme","format":"standard","meta":{"om_disable_all_campaigns":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[120654],"tags":[120656,121280,121275,120719,121274,121273,121279,121276,121278,120723,121277],"class_list":["post-12353","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-project","tag-ai-en","tag-ai-data-en","tag-ai-training-data","tag-artificial-intelligence","tag-data-building-examples","tag-data-collection","tag-data-labeling","tag-data-preprocessing","tag-text","tag-training-data","tag-usecase-en"],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS Blog<\/title>\n<meta name=\"description\" content=\"Build complex AI training datasets that pair infographics with text. 
Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS Blog\" \/>\n<meta property=\"og:description\" content=\"Build complex AI training datasets that pair infographics with text. Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.\" \/>\n<meta property=\"og:url\" content=\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\" \/>\n<meta property=\"og:site_name\" content=\"CROWDWORKS Blog\" \/>\n<meta property=\"article:published_time\" content=\"2025-05-19T05:02:45+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-26T02:18:58+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"675\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"hakim44a993f691\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"hakim44a993f691\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#article\",\"isPartOf\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\"},\"author\":{\"name\":\"hakim44a993f691\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/ba7dcb25cb0fbfc16f21f9bf73637cd4\"},\"headline\":\"Building AI Training Data that Matches Complex Infographics with Text\",\"datePublished\":\"2025-05-19T05:02:45+00:00\",\"dateModified\":\"2025-09-26T02:18:58+00:00\",\"mainEntityOfPage\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\"},\"wordCount\":710,\"publisher\":{\"@id\":\"https:\/\/crowdworks.blog\/en\/#organization\"},\"image\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675\",\"keywords\":[\"AI\",\"AI Data\",\"AI Training Data\",\"Artificial Intelligence\",\"Data building examples\",\"Data collection\",\"Data labeling\",\"Data Preprocessing\",\"Text\",\"Training Data\",\"usecase\"],\"articleSection\":[\"Use Cases\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\",\"url\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\",\"name\":\"Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS 
Blog\",\"isPartOf\":{\"@id\":\"https:\/\/crowdworks.blog\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage\"},\"image\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675\",\"datePublished\":\"2025-05-19T05:02:45+00:00\",\"dateModified\":\"2025-09-26T02:18:58+00:00\",\"description\":\"Build complex AI training datasets that pair infographics with text. Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.\",\"breadcrumb\":{\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675\",\"contentUrl\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675\",\"width\":1000,\"height\":675,\"caption\":\"Building AI Training Data that Matches Complex Infographics with Text\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\ud648\",\"item\":\"https:\/\/crowdworks.blog\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Building AI Training Data that Matches Complex Infographics with 
Text\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#website\",\"url\":\"https:\/\/crowdworks.blog\/en\/\",\"name\":\"CROWDWORKS Blog\",\"description\":\"Trustworthy AI built on your data\",\"publisher\":{\"@id\":\"https:\/\/crowdworks.blog\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/crowdworks.blog\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#organization\",\"name\":\"CROWDWORKS Blog\",\"url\":\"https:\/\/crowdworks.blog\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2023\/05\/blog_cw_logo.png?fit=350%2C100\",\"contentUrl\":\"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2023\/05\/blog_cw_logo.png?fit=350%2C100\",\"width\":350,\"height\":100,\"caption\":\"CROWDWORKS Blog\"},\"image\":{\"@id\":\"https:\/\/crowdworks.blog\/en\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/ba7dcb25cb0fbfc16f21f9bf73637cd4\",\"name\":\"hakim44a993f691\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/1.gravatar.com\/avatar\/aa4d3511ef8391d29376f95b1d48a6ce?s=96&d=identicon&r=g\",\"contentUrl\":\"http:\/\/1.gravatar.com\/avatar\/aa4d3511ef8391d29376f95b1d48a6ce?s=96&d=identicon&r=g\",\"caption\":\"hakim44a993f691\"},\"url\":\"http:\/\/crowdworks.blog\/en\/author\/hakim44a993f691\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS Blog","description":"Build complex AI training datasets that pair infographics with text. Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/","og_locale":"en_US","og_type":"article","og_title":"Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS Blog","og_description":"Build complex AI training datasets that pair infographics with text. Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.","og_url":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/","og_site_name":"CROWDWORKS Blog","article_published_time":"2025-05-19T05:02:45+00:00","article_modified_time":"2025-09-26T02:18:58+00:00","og_image":[{"width":1000,"height":675,"url":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675&ssl=1","type":"image\/jpeg"}],"author":"hakim44a993f691","twitter_card":"summary_large_image","twitter_misc":{"Written by":"hakim44a993f691","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#article","isPartOf":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/"},"author":{"name":"hakim44a993f691","@id":"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/ba7dcb25cb0fbfc16f21f9bf73637cd4"},"headline":"Building AI Training Data that Matches Complex Infographics with Text","datePublished":"2025-05-19T05:02:45+00:00","dateModified":"2025-09-26T02:18:58+00:00","mainEntityOfPage":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/"},"wordCount":710,"publisher":{"@id":"https:\/\/crowdworks.blog\/en\/#organization"},"image":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675","keywords":["AI","AI Data","AI Training Data","Artificial Intelligence","Data building examples","Data collection","Data labeling","Data Preprocessing","Text","Training Data","usecase"],"articleSection":["Use Cases"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/","url":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/","name":"Building AI Training Data that Matches Complex Infographics with Text - CROWDWORKS Blog","isPartOf":{"@id":"https:\/\/crowdworks.blog\/en\/#website"},"primaryImageOfPage":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage"},"image":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675","datePublished":"2025-05-19T05:02:45+00:00","dateModified":"2025-09-26T02:18:58+00:00","description":"Build 
complex AI training datasets that pair infographics with text. Discover how Crowdworks helped a leading telecom company create high-quality, license-cleared data with expert labeling, custom toolkits, and strict quality assurance.","breadcrumb":{"@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#primaryimage","url":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675","contentUrl":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675","width":1000,"height":675,"caption":"Building AI Training Data that Matches Complex Infographics with Text"},{"@type":"BreadcrumbList","@id":"http:\/\/crowdworks.blog\/en\/ai-training-data-infographics-text\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\ud648","item":"https:\/\/crowdworks.blog\/en\/"},{"@type":"ListItem","position":2,"name":"Building AI Training Data that Matches Complex Infographics with Text"}]},{"@type":"WebSite","@id":"https:\/\/crowdworks.blog\/en\/#website","url":"https:\/\/crowdworks.blog\/en\/","name":"CROWDWORKS Blog","description":"Trustworthy AI built on your data","publisher":{"@id":"https:\/\/crowdworks.blog\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/crowdworks.blog\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/crowdworks.blog\/en\/#organization","name":"CROWDWORKS 
Blog","url":"https:\/\/crowdworks.blog\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/crowdworks.blog\/en\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2023\/05\/blog_cw_logo.png?fit=350%2C100","contentUrl":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2023\/05\/blog_cw_logo.png?fit=350%2C100","width":350,"height":100,"caption":"CROWDWORKS Blog"},"image":{"@id":"https:\/\/crowdworks.blog\/en\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/ba7dcb25cb0fbfc16f21f9bf73637cd4","name":"hakim44a993f691","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/crowdworks.blog\/en\/#\/schema\/person\/image\/","url":"http:\/\/1.gravatar.com\/avatar\/aa4d3511ef8391d29376f95b1d48a6ce?s=96&d=identicon&r=g","contentUrl":"http:\/\/1.gravatar.com\/avatar\/aa4d3511ef8391d29376f95b1d48a6ce?s=96&d=identicon&r=g","caption":"hakim44a993f691"},"url":"http:\/\/crowdworks.blog\/en\/author\/hakim44a993f691\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/crowdworks.blog\/wp-content\/uploads\/2025\/05\/Building-AI-Training-Data_th.jpg?fit=1000%2C675","jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pfYPYV-3df","jetpack-related-posts":[],"_links":{"self":[{"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/posts\/12353"}],"collection":[{"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/users\/235377076"}],"replies":[{"embeddable":true,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/comments?post=12353"}],"version-history":[{"count":3,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/posts\/12353\/revisions"}],"predecessor-version":[{"id":12361,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/w
p\/v2\/posts\/12353\/revisions\/12361"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/media\/12358"}],"wp:attachment":[{"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/media?parent=12353"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/categories?post=12353"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/crowdworks.blog\/en\/wp-json\/wp\/v2\/tags?post=12353"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}