{"id":355764,"date":"2025-06-04T10:20:57","date_gmt":"2025-06-04T04:50:57","guid":{"rendered":"https:\/\/www.technologyforyou.org\/?p=355764"},"modified":"2025-06-04T10:20:57","modified_gmt":"2025-06-04T04:50:57","slug":"teaching-ai-models-what-they-dont-know","status":"publish","type":"post","link":"https:\/\/www.technologyforyou.org\/teaching-ai-models-what-they-dont-know\/","title":{"rendered":"Teaching AI models what they don\u2019t know"},"content":{"rendered":"<div id=\"block-mit-page-title\">\n<div class=\"block-inner\">\n<figure id=\"attachment_355765\" aria-describedby=\"caption-attachment-355765\" style=\"width: 692px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-355765\" src=\"https:\/\/www.technologyforyou.org\/wp-content\/uploads\/2025\/06\/mitai5-300x200.jpg\" alt=\"\" width=\"692\" height=\"461\" srcset=\"https:\/\/www.technologyforyou.org\/wp-content\/uploads\/2025\/06\/mitai5-300x200.jpg 300w, https:\/\/www.technologyforyou.org\/wp-content\/uploads\/2025\/06\/mitai5.jpg 595w\" sizes=\"auto, (max-width: 692px) 100vw, 692px\" \/><figcaption id=\"caption-attachment-355765\" class=\"wp-caption-text\">Caption:\u201cWe want to enable AI in the highest-stakes applications of every industry,\u201d says Themis AI co-founder Alexander Amini \u201917, SM \u201918, PhD \u201922. Credits: Credit: MIT News; iStock<\/figcaption><\/figure>\n<p><strong><span style=\"color: #222222; font-family: Verdana, BlinkMacSystemFont, -apple-system, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif; font-size: 12pt;\">A team of MIT researchers founded Themis AI to quantify AI model uncertainty and address knowledge gaps.<\/span><\/strong><\/p>\n<\/div>\n<\/div>\n<div id=\"block-mit-content\">\n<div class=\"block-inner\">\n<article>\n<div class=\"news-article--authored-by\">Source:\u00a0 <span class=\"news-article--source\">MIT News<\/span><\/div>\n<div class=\"news-article--publication-date\"><\/div>\n<div class=\"news-article--press-inquiries\" data-nosnippet=\"true\">\n<div class=\"news-article--press-inquiries--header\"><\/div>\n<\/div>\n<div class=\"news-article--full-width-wrapper\">\n<div class=\"news-article--full-width-wrapper-inner\">\n<div class=\"news-article--images-gallery--wrapper\">\n<div class=\"news-article--images-gallery flickity-enabled\" tabindex=\"0\">\n<div class=\"flickity-viewport\">\n<div class=\"flickity-slider\">\n<div class=\"news-article--image-item is-selected\">\n<div class=\"news-article--media--image--file\">Artificial intelligence systems like ChatGPT provide plausible-sounding answers to any question you might ask. But they don\u2019t always reveal the gaps in their knowledge or areas where they\u2019re uncertain. That problem can have huge consequences as AI systems are increasingly used to do things like develop drugs, synthesize information, and drive autonomous cars.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"news-article--content\">\n<div class=\"news-article--content--body\">\n<div class=\"news-article--content--body--inner\">\n<div class=\"paragraph paragraph--type--content-block-text paragraph--view-mode--default\">\n<p>Now, the MIT spinout Themis AI is helping quantify model uncertainty and correct outputs before they cause bigger problems. The company\u2019s Capsa platform can work with any machine-learning model to detect and correct unreliable outputs in seconds. It works by modifying AI models to enable them to detect patterns in their data processing that indicate ambiguity, incompleteness, or bias.<\/p>\n<p>\u201cThe idea is to take a model, wrap it in Capsa, identify the uncertainties and failure modes of the model, and then enhance the model,\u201d says Themis AI co-founder and MIT Professor Daniela Rus, who is also the director of the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). \u201cWe\u2019re excited about offering a solution that can improve models and offer guarantees that the model is working correctly.\u201d<\/p>\n<p>Rus founded Themis AI in 2021 with Alexander Amini \u201917, SM \u201918, PhD \u201922 and Elaheh Ahmadi \u201920, MEng \u201921, two former research affiliates in her lab. Since then, they\u2019ve helped telecom companies with network planning and automation, helped oil and gas companies use AI to understand seismic imagery, and published papers on developing more reliable and trustworthy chatbots.<\/p>\n<p>\u201cWe want to enable AI in the highest-stakes applications of every industry,\u201d Amini says. \u201cWe\u2019ve all seen examples of AI hallucinating or making mistakes. As AI is deployed more broadly, those mistakes could lead to devastating consequences. Themis makes it possible that any AI can forecast and predict its own failures, before they happen.\u201d<\/p>\n<p><strong>Helping models know what they don\u2019t know<\/strong><\/p>\n<p>Rus\u2019 lab has been researching model uncertainty for years. In 2018, she received funding from Toyota to study the reliability of a machine learning-based autonomous driving solution.<\/p>\n<p>\u201cThat is a safety-critical context where understanding model reliability is very important,\u201d Rus says.<\/p>\n<p>In separate\u00a0<a href=\"https:\/\/dl.acm.org\/doi\/abs\/10.1145\/3306618.3314243\" target=\"_blank\" rel=\"noopener\">work<\/a>, Rus, Amini, and their collaborators built an algorithm that could detect racial and gender bias in facial recognition systems and automatically reweight the model\u2019s training data, showing it eliminated bias. The algorithm worked by identifying the unrepresentative parts of the underlying training data and generating new, similar data samples to rebalance it.<\/p>\n<p>In 2021, the eventual co-founders showed a\u00a0<a href=\"https:\/\/pubs.acs.org\/doi\/full\/10.1021\/acscentsci.1c00546\" target=\"_blank\" rel=\"noopener\">similar approach<\/a>\u00a0could be used to help pharmaceutical companies use AI models to predict the properties of drug candidates. They founded Themis AI later that year.<\/p>\n<p>\u201cGuiding drug discovery could potentially save a lot of money,\u201d Rus says. \u201cThat was the use case that made us realize how powerful this tool could be.\u201d<\/p>\n<p>Today Themis AI is working with enterprises in a variety of industries, and many of those companies are building large language models. By using Capsa, these models are able to quantify their own uncertainty for each output.<\/p>\n<p>\u201cMany companies are interested in using LLMs that are based on their data, but they\u2019re concerned about reliability,\u201d observes Stewart Jamieson SM \u201920, PhD \u201924, Themis AI&#8217;s head of technology. \u201cWe help LLMs self-report their confidence and uncertainty, which enables more reliable question answering and flagging unreliable outputs.\u201d<\/p>\n<p>Themis AI is also in discussions with semiconductor companies building AI solutions on their chips that can work outside of cloud environments.<\/p>\n<p>\u201cNormally these smaller models that work on phones or embedded systems aren\u2019t very accurate compared to what you could run on a server, but we can get the best of both worlds: low latency, efficient edge computing without sacrificing quality,\u201d Jamieson explains. \u201cWe see a future where edge devices do most of the work, but whenever they\u2019re unsure of their output, they can forward those tasks to a central server.\u201d<\/p>\n<p>Pharmaceutical companies can also use Capsa to improve AI models being used to identify drug candidates and predict their performance in clinical trials.<\/p>\n<p>\u201cThe predictions and outputs of these models are very complex and hard to interpret \u2014 experts spend a lot of time and effort trying to make sense of them,\u201d Amini remarks. \u201cCapsa can give insights right out of the gate to understand if the predictions are backed by evidence in the training set or are just speculation without a lot of grounding. That can accelerate the identification of the strongest predictions, and we think that has a huge potential for societal good.\u201d<\/p>\n<p><strong>Research for impact<\/strong><\/p>\n<p>Themis AI\u2019s team believes the company is well-positioned to improve the cutting edge of constantly evolving AI technology. For instance, the company is exploring Capsa\u2019s ability to improve accuracy in an AI technique known as chain-of-thought reasoning, in which LLMs explain the steps they take to get to an answer.<\/p>\n<p>\u201cWe\u2019ve seen signs Capsa could help guide those reasoning processes to identify the highest-confidence chains of reasoning,\u201d Jamieson says. \u201cWe think that has huge implications in terms of improving the LLM experience, reducing latencies, and reducing computation requirements. It\u2019s an extremely high-impact opportunity for us.\u201d<\/p>\n<p>For Rus, who has co-founded several companies since coming to MIT, Themis AI is an opportunity to ensure her MIT research has impact.<\/p>\n<p>\u201cMy students and I have become increasingly passionate about going the extra step to make our work relevant for the world,&#8221; Rus says. \u201cAI has tremendous potential to transform industries, but AI also raises concerns. What excites me is the opportunity to help develop technical solutions that address these challenges and also build trust and understanding between people and the technologies that are becoming part of their daily lives.\u201d<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/article>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>A team of MIT researchers founded Themis AI to quantify AI model uncertainty and address knowledge gaps. Source:\u00a0 MIT News Artificial intelligence systems like ChatGPT provide plausible-sounding answers to any question you might ask. But they don\u2019t always reveal the gaps in their knowledge or areas where they\u2019re uncertain. That problem can have huge consequences [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":355765,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":{"0":"post-355764","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technology"},"_links":{"self":[{"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/posts\/355764","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/comments?post=355764"}],"version-history":[{"count":0,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/posts\/355764\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/media\/355765"}],"wp:attachment":[{"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/media?parent=355764"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/categories?post=355764"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.technologyforyou.org\/wp-json\/wp\/v2\/tags?post=355764"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}