{"id":738,"date":"2022-04-04T08:09:38","date_gmt":"2022-04-04T07:09:38","guid":{"rendered":"https:\/\/metrics.blogg.gu.se\/?p=738"},"modified":"2022-02-25T14:25:48","modified_gmt":"2022-02-25T13:25:48","slug":"testing-of-ml-systems","status":"publish","type":"post","link":"https:\/\/metrics.blogg.gu.se\/?p=738","title":{"rendered":"Testing of ML systems"},"content":{"rendered":"\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"640\" src=\"https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-1024x640.png\" alt=\"\" class=\"wp-image-739\" srcset=\"https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-1024x640.png 1024w, https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-300x188.png 300w, https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-768x480.png 768w, https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-1200x750.png 1200w, https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459-1320x825.png 1320w, https:\/\/metrics.blogg.gu.se\/files\/2022\/02\/test-pattern-152459.png 1920w\" sizes=\"(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px\" \/><figcaption>Image by <a href=\"https:\/\/pixabay.com\/sv\/users\/openclipart-vectors-30363\/?utm_source=link-attribution&amp;utm_medium=referral&amp;utm_campaign=image&amp;utm_content=152459\">OpenClipart-Vectors<\/a> from <a href=\"https:\/\/pixabay.com\/sv\/?utm_source=link-attribution&amp;utm_medium=referral&amp;utm_campaign=image&amp;utm_content=152459\">Pixabay<\/a><\/figcaption><\/figure>\n\n\n\n<p><a href=\"https:\/\/link.springer.com\/article\/10.1007\/s10664-021-10073-7\">Smoke testing for machine learning: simple tests to discover severe bugs | SpringerLink<\/a><\/p>\n\n\n\n<p class=\"has-drop-cap\">Machine learning systems are very popular today, at least when it comes to research applications. 
They are not as popular as one would wish (or like) in real applications. One of the reasons is that they are hard to test. We do not know how to check whether an algorithm will behave as expected in all similar situations; in fact, we do not even know which situations count as similar both for us and for the ML system. <\/p>\n\n\n\n<p>This paper looks at the problem from a different angle. The research question is: <em>RQ: What are simple and generic software tests that are capable of finding bugs and improving the quality of machine learning algorithms?<\/em><\/p>\n\n\n\n<p>The authors developed a set of smoke tests that, in their view, all ML algorithms should pass. The paper is quite exhaustive, and if you are interested, I recommend taking a look at this table: <\/p>\n\n\n\n<p><a href=\"https:\/\/link.springer.com\/article\/10.1007\/s10664-021-10073-7\/tables\/1\">Table 1 | Smoke testing for machine learning: simple tests to discover severe bugs | SpringerLink<\/a><\/p>\n\n\n\n<p>I love the article. It is simple, to the point and very applied. I&#8217;m going to use it in my own tests of ML algorithms in the future. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Smoke testing for machine learning: simple tests to discover severe bugs | SpringerLink Machine learning systems are very popular today, at least when it comes to research applications. They are not as popular as one would wish (or like) in real applications. 
One of the reasons is the fact that they are hard to &hellip; <a href=\"https:\/\/metrics.blogg.gu.se\/?p=738\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Testing of ML systems&#8221;<\/span><\/a><\/p>\n","protected":false},"author":68,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6,4],"tags":[],"_links":{"self":[{"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/posts\/738"}],"collection":[{"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/users\/68"}],"replies":[{"embeddable":true,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=738"}],"version-history":[{"count":1,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/posts\/738\/revisions"}],"predecessor-version":[{"id":740,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=\/wp\/v2\/posts\/738\/revisions\/740"}],"wp:attachment":[{"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=738"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=738"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/metrics.blogg.gu.se\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=738"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}