{"title":"用机器学习预测专利诉讼","authors":"Steffen Juranek , Håkon Otneim","doi":"10.1016/j.irle.2024.106228","DOIUrl":null,"url":null,"abstract":"<div><p>We use machine learning methods to predict which patents end up in court using the population of US patents granted between 2002 and 2005. We show that patent characteristics have significant predictive power, particularly value indicators and patent-owner characteristics. Furthermore, we analyze the predictive performance concerning the number of observations used to train the model, which patent characteristics to use, and which predictive model to choose. We find that extending the set of patent characteristics has the biggest positive impact on predictive performance. The model choice matters as well. More sophisticated machine learning methods provide additional value relative to a simple logistic regression. This result highlights the existence of non-linearities among and interactions across the predictors. Our results provide practical advice to anyone building patent litigation models, e.g., for litigation insurance or patent management more generally.</p></div>","PeriodicalId":47202,"journal":{"name":"International Review of Law and Economics","volume":"80 ","pages":"Article 106228"},"PeriodicalIF":0.9000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0144818824000486/pdfft?md5=4ba76bd8d870447814bf120c9c534c7b&pid=1-s2.0-S0144818824000486-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Predicting patent lawsuits with machine learning\",\"authors\":\"Steffen Juranek , Håkon Otneim\",\"doi\":\"10.1016/j.irle.2024.106228\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>We use machine learning methods to predict which patents end up in court using the population of US patents granted between 2002 and 2005. We show that patent characteristics have significant predictive power, particularly value indicators and patent-owner characteristics. Furthermore, we analyze the predictive performance concerning the number of observations used to train the model, which patent characteristics to use, and which predictive model to choose. We find that extending the set of patent characteristics has the biggest positive impact on predictive performance. The model choice matters as well. More sophisticated machine learning methods provide additional value relative to a simple logistic regression. This result highlights the existence of non-linearities among and interactions across the predictors. Our results provide practical advice to anyone building patent litigation models, e.g., for litigation insurance or patent management more generally.</p></div>\",\"PeriodicalId\":47202,\"journal\":{\"name\":\"International Review of Law and Economics\",\"volume\":\"80 \",\"pages\":\"Article 106228\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0144818824000486/pdfft?md5=4ba76bd8d870447814bf120c9c534c7b&pid=1-s2.0-S0144818824000486-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Review of Law and Economics\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0144818824000486\",\"RegionNum\":3,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Law and Economics","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0144818824000486","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
We use machine learning methods to predict which patents end up in court using the population of US patents granted between 2002 and 2005. We show that patent characteristics have significant predictive power, particularly value indicators and patent-owner characteristics. Furthermore, we analyze the predictive performance concerning the number of observations used to train the model, which patent characteristics to use, and which predictive model to choose. We find that extending the set of patent characteristics has the biggest positive impact on predictive performance. The model choice matters as well. More sophisticated machine learning methods provide additional value relative to a simple logistic regression. This result highlights the existence of non-linearities among and interactions across the predictors. Our results provide practical advice to anyone building patent litigation models, e.g., for litigation insurance or patent management more generally.
期刊介绍:
The International Review of Law and Economics provides a forum for interdisciplinary research at the interface of law and economics. IRLE is international in scope and audience and particularly welcomes both theoretical and empirical papers on comparative law and economics, globalization and legal harmonization, and the endogenous emergence of legal institutions, in addition to more traditional legal topics.