{"id":8244,"date":"2022-03-22T17:14:22","date_gmt":"2022-03-22T16:14:22","guid":{"rendered":"https:\/\/sourcing-force.com\/?p=8244"},"modified":"2023-05-05T23:09:37","modified_gmt":"2023-05-05T21:09:37","slug":"how-machine-learning-improves-data-cleansing","status":"publish","type":"post","link":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/","title":{"rendered":"How Machine Learning Improves Data Cleansing"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; next_background_color=&#8221;#ffffff&#8221; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; background_color=&#8221;rgba(237,237,237,0.82)&#8221; custom_padding=&#8221;||104px|||&#8221; bottom_divider_style=&#8221;slant&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row column_structure=&#8221;1_2,1_2&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_post_title comments=&#8221;off&#8221; featured_image=&#8221;off&#8221; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; title_font=&#8221;Source Sans Pro||||||||&#8221; custom_margin=&#8221;|-50px||||&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_post_title][\/et_pb_column][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_image src=&#8221;https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png&#8221; alt=&#8221;Machine learning Data Cleansing&#8221; title_text=&#8221;ERP d\u00e9finition Sourcing Force&#8221; _builder_version=&#8221;4.19.4&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;|||79px||&#8221; border_radii=&#8221;on|500px|500px|500px|500px&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_image][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.16&#8243; global_colors_info=&#8221;{}&#8221;][et_pb_row _builder_version=&#8221;4.16&#8243; background_size=&#8221;initial&#8221; background_position=&#8221;top_left&#8221; background_repeat=&#8221;repeat&#8221; min_height=&#8221;104.2px&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; custom_padding=&#8221;|||&#8221; global_colors_info=&#8221;{}&#8221; custom_padding__hover=&#8221;|||&#8221;][et_pb_image src=&#8221;https:\/\/sourcing-force.com\/wp-content\/uploads\/2022\/05\/five-sources-of-value-using-a-contract-management-softwar-min.png&#8221; alt=&#8221;Contract Management&#8221; title_text=&#8221;Five Sources of Value using a Contract Management softwar-min&#8221; url=&#8221;https:\/\/sourcing-force.com\/en\/five-sources-of-value-using-a-contract-management-software\/&#8221; url_new_window=&#8221;on&#8221; _builder_version=&#8221;4.17.3&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_image][et_pb_text _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<h2>Machine Learning Improves Data Cleansing<\/h2>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.19.4&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<p><em><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">One of the many Holy Grails of <strong>machine learning<\/strong> within the spend analysis domain is\u00a0the ability to disambiguate and classify customer purchases accurately, quickly, and automatically. It&#8217;s a fun problem to try to tackle since it&#8217;s approachable from many different angles.<\/span><\/em><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><br \/>At a very simple level, one could iterate through all item purchases and try to categorize each purchase based on the name of the purchase and the name of the available categories to which you&#8217;re mapping. As an example, a &#8220;spoon&#8221; can be mapped to the following UNSPSC categories which all have the word &#8220;spoon&#8221; in them.<br \/><\/span><\/p>\n<p>&nbsp;<\/p>\n<ul>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>41123402 &#8211; Dosing Spoon<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>42181512 &#8211; Typhoid Carrier Examination Spoons<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>42294000 &#8211; Surgical Spatulas and Spoons and Scoops and Related Products<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>42294003 &#8211; Surgical Spoons<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>42294519 &#8211; Ophthalmic Spoons or Curettes<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>52151617 &#8211; Domestic Wooden Spoon<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>52151651 &#8211; Domestic Measuring Spoon<\/strong><\/span><\/li>\n<li><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>52151704 &#8211; Domestic Spoons<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-size: 12pt; font-family: tahoma, arial, helvetica, sans-serif;\">If an automated system were to use this scheme, which category of &#8220;spoon&#8221; would it select? Hopefully there would be some context in the item description that could provide some hints such as the word &#8220;kitchen&#8221; or perhaps a supplier where you purchase the spoon such as &#8220;Staples&#8221;, but that&#8217;s an additional layer of complexity that one would have to account for (think lots of $$$).<\/span><\/p>\n<p>[\/et_pb_text][et_pb_image src=&#8221;https:\/\/sourcing-force.com\/wp-content\/uploads\/2022\/09\/improves-data-cleansing.jpeg&#8221; alt=&#8221;Improves Data Cleansing &#8221; title_text=&#8221;Improves Data Cleansing&#8221; align=&#8221;center&#8221; _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_image][et_pb_text _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<h2>Using the Machine Learning to classify<\/h2>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.19.4&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><strong>Sourcing Force<\/strong> is fortunate enough to have been in the business long enough to have developed a significant edge. Quite simply, we&#8217;ve classified a ton of items using custom created classification rules.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">When a researcher is toying with <a href=\"https:\/\/sourcing-force.com\/en\/category\/source-to-pay\/spend-analytics\/\">machine learning<\/a> algorithms such as Neural Networks (NN), Naive Bayes Classifiers (NBC), Hidden Markov Models (HMM) for Word Sense Disambiguation, etc., frequently he\/she runs into a huge roadblock in that in order to effectively apply these algorithms, one needs\u00a0<em>training data<\/em> in order teach and tune the algorithms.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">Training data for some domains can be purchased while other training data needs to be painfully constructed by the researchers (or probably grad students). It&#8217;s not easy to come by in other words.<\/span><\/p>\n<p><em><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">Our hard working Analysts have to date written hundreds of thousands of distinct classification rules\u00a0that map item descriptions to category codes that we&#8217;ve used to classify items for a lot of companies.<\/span><\/em><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"> These rules allow us to do a great job classifying items for our clients, but they are also an undeniable treasure trove of\u00a0<em>implicit<\/em>\u00a0<em>semantic knowledge\u00a0<\/em>that can be used for algorithm training.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">A great example that comes to mind of the &#8220;implicit&#8221; semantics that I refer to above can be seen in the problem of &#8220;how does one classify Tylenol?&#8221; There is no UNSPSC code for Tylenol but there is one for Tylenol&#8217;s chemical name: Acetaminophen. <\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">The code is 51142001.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">I fortunately knew that important detail from which I can write a classification rule. Consider this: an algorithm that was trained off these Sourcing Force classification rules just learned that mapping of Tylenol to 51142001 <em>for free<\/em>.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"> Once upon a time, I wrote a classification rule for a company which I turned around and used to train an algorithm.<\/span><\/p>\n<p>[\/et_pb_text][et_pb_text _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><b>Now that rule, to a degree, can help me classify forever.<\/b><\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.17.6&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.21.0&#8243; _module_preset=&#8221;default&#8221; hover_enabled=&#8221;0&#8243; global_colors_info=&#8221;{}&#8221; sticky_enabled=&#8221;0&#8243;]<\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">Figuring out how to classify some items can be quite a nasty puzzle sometimes for a human especially when it comes to chemicals, and so having an Analyst figure out a mapping for an obscure item in a sense becomes &#8220;a gift that keeps on giving.&#8221; <\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">As an added benefit, the more obscure the item is, the more accurate algorithmic predictions are going to be.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">The reason for that is the context in which certain items appear for strange purchases is going to be rather limited. There won\u2019t be much \u201cnoise\u201d in the data to confuse an <strong>automated system<\/strong>.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">One must add one last point in order to come full circle within the machine learning domain of <a href=\"https:\/\/sourcing-force.com\/en\/spend-analysis-to-spend-management\/\">spend analysis<\/a>. Human beings are still masters here. Algorithmic approaches to spend analysis, albeit cool, cannot match the pattern recognition capabilities wired into the human brain &#8211; especially a Sourcing Force Analysts&#8217; brain. <\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">Machine learning approaches so far can only mirror what it is that they&#8217;ve learned and repeat back answers that have the highest probability of being correct within the limited context that they know. The large number of rules that Sourcing Force has to play with, broaden a machine&#8217;s perception of reality and give it a rich context to learn from.<\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">Even though I personally have been the one trying to mature software to do <strong>automatic classification<\/strong>, I must give credit where credit is due.\u00a0<\/span><\/p>\n<p><em><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\">The &#8220;parents&#8221; of our little electronic child are the Sourcing Force Analysts, none of whom were harmed during the training of any algorithms.<\/span><\/em><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><span style=\"color: #000000; font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><a href=\"https:\/\/sourcing-force.com\/en\/procure-to-pay-automation\/\"><strong>See how Sourcing Force helps businesses automate their procurement processes<\/strong><\/a><\/span><\/p>\n<p><span style=\"font-family: tahoma, arial, helvetica, sans-serif; font-size: 12pt;\"><a href=\"https:\/\/sourcing-force.com\/en\/procure-to-pay-automation\/\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-8003\" src=\"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite-1024x186.png\" alt=\"machine learning\" width=\"1024\" height=\"186\" srcset=\"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite-1024x186.png 1024w, https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite-300x55.png 300w, https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite-768x140.png 768w, https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite-1080x196.png 1080w, https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/e-procurement-suite.png 1210w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/a><\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; background_color=&#8221;#f7f7f7&#8243; custom_padding=&#8221;5px|||||&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<h3 style=\"text-align: center;\">Our latest articles<\/h3>\n<p style=\"text-align: center;\">\n<h3 style=\"text-align: center;\"><\/h3>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_blog fullwidth=&#8221;off&#8221; posts_number=&#8221;3&#8243; include_categories=&#8221;all&#8221; show_author=&#8221;off&#8221; show_date=&#8221;off&#8221; show_categories=&#8221;off&#8221; show_pagination=&#8221;off&#8221; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; box_shadow_style=&#8221;preset1&#8243; global_colors_info=&#8221;{}&#8221;][\/et_pb_blog][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Machine Learning Improves Data CleansingOne of the many Holy Grails of machine learning within the spend analysis domain is\u00a0the ability to disambiguate and classify customer purchases accurately, quickly, and automatically. It&#8217;s a fun problem to try to tackle since it&#8217;s approachable from many different angles. At a very simple level, one could iterate through all [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":8502,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[36],"tags":[],"class_list":["post-8244","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-spend-analytics"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>How Machine Learning Improves Data Cleansing - Sourcing Force<\/title>\n<meta name=\"description\" content=\"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How Machine Learning Improves Data Cleansing - Sourcing Force\" \/>\n<meta property=\"og:description\" content=\"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/\" \/>\n<meta property=\"og:site_name\" content=\"Sourcing Force\" \/>\n<meta property=\"article:published_time\" content=\"2022-03-22T16:14:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-05-05T21:09:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png\" \/>\n\t<meta property=\"og:image:width\" content=\"935\" \/>\n\t<meta property=\"og:image:height\" content=\"412\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Olivier Audino\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@SourcingForce\" \/>\n<meta name=\"twitter:site\" content=\"@SourcingForce\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Olivier Audino\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/\"},\"author\":{\"name\":\"Olivier Audino\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/#\\\/schema\\\/person\\\/481e000db1e02e01e9dc0913e2430fbd\"},\"headline\":\"How Machine Learning Improves Data Cleansing\",\"datePublished\":\"2022-03-22T16:14:22+00:00\",\"dateModified\":\"2023-05-05T21:09:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/\"},\"wordCount\":1396,\"image\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2019\\\/04\\\/Machine-learning-3.png\",\"articleSection\":[\"Spend analytics\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/\",\"url\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/\",\"name\":\"How Machine Learning Improves Data Cleansing - Sourcing Force\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2019\\\/04\\\/Machine-learning-3.png\",\"datePublished\":\"2022-03-22T16:14:22+00:00\",\"dateModified\":\"2023-05-05T21:09:37+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/#\\\/schema\\\/person\\\/481e000db1e02e01e9dc0913e2430fbd\"},\"description\":\"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#primaryimage\",\"url\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2019\\\/04\\\/Machine-learning-3.png\",\"contentUrl\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2019\\\/04\\\/Machine-learning-3.png\",\"width\":935,\"height\":412,\"caption\":\"Machine learning Data Cleansing\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/how-machine-learning-improves-data-cleansing\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How Machine Learning Improves Data Cleansing\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/#website\",\"url\":\"https:\\\/\\\/sourcing-force.com\\\/\",\"name\":\"Sourcing Force\",\"description\":\"Digitalisez l\u2019int\u00e9gralit\u00e9 de votre processus Achat !\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/sourcing-force.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/#\\\/schema\\\/person\\\/481e000db1e02e01e9dc0913e2430fbd\",\"name\":\"Olivier Audino\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2022\\\/01\\\/olivier-audino-150x150.png\",\"url\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2022\\\/01\\\/olivier-audino-150x150.png\",\"contentUrl\":\"https:\\\/\\\/sourcing-force.com\\\/wp-content\\\/uploads\\\/2022\\\/01\\\/olivier-audino-150x150.png\",\"caption\":\"Olivier Audino\"},\"url\":\"https:\\\/\\\/sourcing-force.com\\\/en\\\/author\\\/olivieraudino\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How Machine Learning Improves Data Cleansing - Sourcing Force","description":"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/","og_locale":"en_US","og_type":"article","og_title":"How Machine Learning Improves Data Cleansing - Sourcing Force","og_description":"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.","og_url":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/","og_site_name":"Sourcing Force","article_published_time":"2022-03-22T16:14:22+00:00","article_modified_time":"2023-05-05T21:09:37+00:00","og_image":[{"width":935,"height":412,"url":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png","type":"image\/png"}],"author":"Olivier Audino","twitter_card":"summary_large_image","twitter_creator":"@SourcingForce","twitter_site":"@SourcingForce","twitter_misc":{"Written by":"Olivier Audino","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#article","isPartOf":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/"},"author":{"name":"Olivier Audino","@id":"https:\/\/sourcing-force.com\/#\/schema\/person\/481e000db1e02e01e9dc0913e2430fbd"},"headline":"How Machine Learning Improves Data Cleansing","datePublished":"2022-03-22T16:14:22+00:00","dateModified":"2023-05-05T21:09:37+00:00","mainEntityOfPage":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/"},"wordCount":1396,"image":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#primaryimage"},"thumbnailUrl":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png","articleSection":["Spend analytics"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/","url":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/","name":"How Machine Learning Improves Data Cleansing - Sourcing Force","isPartOf":{"@id":"https:\/\/sourcing-force.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#primaryimage"},"image":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#primaryimage"},"thumbnailUrl":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png","datePublished":"2022-03-22T16:14:22+00:00","dateModified":"2023-05-05T21:09:37+00:00","author":{"@id":"https:\/\/sourcing-force.com\/#\/schema\/person\/481e000db1e02e01e9dc0913e2430fbd"},"description":"One of the many Holy Grails of machine learning within the spend analysis is the ability to disambiguate and classify customer purchases.","breadcrumb":{"@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#primaryimage","url":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png","contentUrl":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2019\/04\/Machine-learning-3.png","width":935,"height":412,"caption":"Machine learning Data Cleansing"},{"@type":"BreadcrumbList","@id":"https:\/\/sourcing-force.com\/en\/how-machine-learning-improves-data-cleansing\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/sourcing-force.com\/en\/"},{"@type":"ListItem","position":2,"name":"How Machine Learning Improves Data Cleansing"}]},{"@type":"WebSite","@id":"https:\/\/sourcing-force.com\/#website","url":"https:\/\/sourcing-force.com\/","name":"Sourcing Force","description":"Digitalisez l\u2019int\u00e9gralit\u00e9 de votre processus Achat !","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sourcing-force.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/sourcing-force.com\/#\/schema\/person\/481e000db1e02e01e9dc0913e2430fbd","name":"Olivier Audino","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2022\/01\/olivier-audino-150x150.png","url":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2022\/01\/olivier-audino-150x150.png","contentUrl":"https:\/\/sourcing-force.com\/wp-content\/uploads\/2022\/01\/olivier-audino-150x150.png","caption":"Olivier Audino"},"url":"https:\/\/sourcing-force.com\/en\/author\/olivieraudino\/"}]}},"_links":{"self":[{"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/posts\/8244","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/comments?post=8244"}],"version-history":[{"count":4,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/posts\/8244\/revisions"}],"predecessor-version":[{"id":24556,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/posts\/8244\/revisions\/24556"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/media\/8502"}],"wp:attachment":[{"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/media?parent=8244"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/categories?post=8244"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sourcing-force.com\/en\/wp-json\/wp\/v2\/tags?post=8244"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}