Roy AbitbolIBM Research Israel, Eyal CohenIBM Research Israel, Muhammad KanaanIBM Research Israel, Bhavna AgrawalIBM Research USA, Yingjie LiIBM Research USA, Anuradha BhamidipatyIBM Research USA, Erez BilgoryIBM Research Israel
{"title":"KModels: Unlocking AI for Business Applications","authors":"Roy AbitbolIBM Research Israel, Eyal CohenIBM Research Israel, Muhammad KanaanIBM Research Israel, Bhavna AgrawalIBM Research USA, Yingjie LiIBM Research USA, Anuradha BhamidipatyIBM Research USA, Erez BilgoryIBM Research Israel","doi":"arxiv-2409.05919","DOIUrl":null,"url":null,"abstract":"As artificial intelligence (AI) continues to rapidly advance, there is a\ngrowing demand to integrate AI capabilities into existing business\napplications. However, a significant gap exists between the rapid progress in\nAI and how slowly AI is being embedded into business environments. Deploying\nwell-performing lab models into production settings, especially in on-premise\nenvironments, often entails specialized expertise and imposes a heavy burden of\nmodel management, creating significant barriers to implementing AI models in\nreal-world applications. KModels leverages proven libraries and platforms (Kubeflow Pipelines, KServe)\nto streamline AI adoption by supporting both AI developers and consumers. It\nallows model developers to focus solely on model development and share models\nas transportable units (Templates), abstracting away complex production\ndeployment concerns. KModels enables AI consumers to eliminate the need for a\ndedicated data scientist, as the templates encapsulate most data science\nconsiderations while providing business-oriented control. This paper presents the architecture of KModels and the key decisions that\nshape it. We outline KModels' main components as well as its interfaces.\nFurthermore, we explain how KModels is highly suited for on-premise deployment\nbut can also be used in cloud environments. The efficacy of KModels is demonstrated through the successful deployment of\nthree AI models within an existing Work Order Management system. These models\noperate in a client's data center and are trained on local data, without data\nscientist intervention. One model improved the accuracy of Failure Code\nspecification for work orders from 46% to 83%, showcasing the substantial\nbenefit of accessible and localized AI solutions.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"29 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As artificial intelligence (AI) continues to rapidly advance, there is a
growing demand to integrate AI capabilities into existing business
applications. However, a significant gap exists between the rapid progress in
AI and how slowly AI is being embedded into business environments. Deploying
well-performing lab models into production settings, especially in on-premise
environments, often entails specialized expertise and imposes a heavy burden of
model management, creating significant barriers to implementing AI models in
real-world applications. KModels leverages proven libraries and platforms (Kubeflow Pipelines, KServe)
to streamline AI adoption by supporting both AI developers and consumers. It
allows model developers to focus solely on model development and share models
as transportable units (Templates), abstracting away complex production
deployment concerns. KModels enables AI consumers to eliminate the need for a
dedicated data scientist, as the templates encapsulate most data science
considerations while providing business-oriented control. This paper presents the architecture of KModels and the key decisions that
shape it. We outline KModels' main components as well as its interfaces.
Furthermore, we explain how KModels is highly suited for on-premise deployment
but can also be used in cloud environments. The efficacy of KModels is demonstrated through the successful deployment of
three AI models within an existing Work Order Management system. These models
operate in a client's data center and are trained on local data, without data
scientist intervention. One model improved the accuracy of Failure Code
specification for work orders from 46% to 83%, showcasing the substantial
benefit of accessible and localized AI solutions.