Wei Ma, Kendall Nowocin, Niraj Marathe, George H. Chen
{"title":"An interpretable produce price forecasting system for small and marginal farmers in India using collaborative filtering and adaptive nearest neighbors","authors":"Wei Ma, Kendall Nowocin, Niraj Marathe, George H. Chen","doi":"10.1145/3287098.3287100","DOIUrl":null,"url":null,"abstract":"Small and marginal farmers, who account for over 80% of India's agricultural population, often sell their harvest at low, unfavorable prices before spoilage. These farmers often lack access to either cold storage or market forecasts. In particular, by having access to cold storage, farmers can store their produce for longer and thus have more flexibility as to when they should sell their harvest by. Meanwhile, by having access to market forecasts, farmers can more easily identify which markets to sell at and when. While affordable cold storage solutions have become more widely available, there has been less work on produce price forecasting. A key challenge is that in many regions of India, predominantly in rural and remote areas, we have either very limited or no produce pricing data available from public online sources. In this paper, we present a produce price forecasting system that pulls data from the Indian Ministry of Agriculture and Farmers Welfare's website Agmarknet, trains a model of prices using over a thousand markets, and displays interpretable price forecasts in a web application viewable from a mobile phone. Due to the pricing data being extremely sparse, our method first imputes missing entries using collaborative filtering to obtain a dense dataset. Using this imputed dense dataset, we then train a decision-tree-based classifier to predict whether the price for a specific produce at a specific market will go up, stay the same, or go down. In terms of interpretability, we display the most relevant historical pricing data that drive each forecasted price trend, where we take advantage of the fact that a wide family of decision-tree-based ensemble learning methods are adaptive nearest neighbor methods. We also show how our approach generalizes to forecasting exact produce prices and constructing heuristic price uncertainty intervals. We validate forecast accuracy on data from Agmarknet and a small field survey of a few markets in Odisha.","PeriodicalId":159525,"journal":{"name":"Proceedings of the Tenth International Conference on Information and Communication Technologies and Development","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Tenth International Conference on Information and Communication Technologies and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3287098.3287100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
Small and marginal farmers, who account for over 80% of India's agricultural population, often sell their harvest at low, unfavorable prices before spoilage. These farmers often lack access to either cold storage or market forecasts. In particular, by having access to cold storage, farmers can store their produce for longer and thus have more flexibility as to when they should sell their harvest by. Meanwhile, by having access to market forecasts, farmers can more easily identify which markets to sell at and when. While affordable cold storage solutions have become more widely available, there has been less work on produce price forecasting. A key challenge is that in many regions of India, predominantly in rural and remote areas, we have either very limited or no produce pricing data available from public online sources. In this paper, we present a produce price forecasting system that pulls data from the Indian Ministry of Agriculture and Farmers Welfare's website Agmarknet, trains a model of prices using over a thousand markets, and displays interpretable price forecasts in a web application viewable from a mobile phone. Due to the pricing data being extremely sparse, our method first imputes missing entries using collaborative filtering to obtain a dense dataset. Using this imputed dense dataset, we then train a decision-tree-based classifier to predict whether the price for a specific produce at a specific market will go up, stay the same, or go down. In terms of interpretability, we display the most relevant historical pricing data that drive each forecasted price trend, where we take advantage of the fact that a wide family of decision-tree-based ensemble learning methods are adaptive nearest neighbor methods. We also show how our approach generalizes to forecasting exact produce prices and constructing heuristic price uncertainty intervals. We validate forecast accuracy on data from Agmarknet and a small field survey of a few markets in Odisha.