{"title":"Radon-Nikodym机器学习并行化研究","authors":"A. Bobyl, Vadim V. Davydov, V. Malyshkin","doi":"10.1145/3440749.3442604","DOIUrl":null,"url":null,"abstract":"A novel Radon–Nikodym approach to machine learning (ML) is developed, implemented numerically, tested on various datasets, and it’s parallelization is discussed. It consists in building a “wavefunction” ψ(x), a linear function on input attributes x then constructing a classificator of the form ⟨ψ2f⟩/⟨ψ2⟩. The solution is obtained from a generalized eigenproblem |f|ψ[i]⟩ = λ[i]|ψ[i]⟩ with left– and right– hand side matrices calculated from the input dataset. A solution to classification problem (predict f on an unseen x) is found without using a norm testing how predicted f differs from the one observed. Possible outcomes and their probabilities are obtained separately using the Lebesgue quadrature technique, this makes the method robust to input data with outliers, spikes, etc. A remarkable feature is the knowledge of the invariant group (what input attributes transforms do not change the solution). Radon–Nikodym approach is typically slower than the other methods used, this is the cost of being a “norm–free”. A parallel implementation is expected to improve this deficiency.","PeriodicalId":344578,"journal":{"name":"Proceedings of the 4th International Conference on Future Networks and Distributed Systems","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On The Radon–Nikodym Machine Learning Parallelization\",\"authors\":\"A. Bobyl, Vadim V. Davydov, V. Malyshkin\",\"doi\":\"10.1145/3440749.3442604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A novel Radon–Nikodym approach to machine learning (ML) is developed, implemented numerically, tested on various datasets, and it’s parallelization is discussed. It consists in building a “wavefunction” ψ(x), a linear function on input attributes x then constructing a classificator of the form ⟨ψ2f⟩/⟨ψ2⟩. The solution is obtained from a generalized eigenproblem |f|ψ[i]⟩ = λ[i]|ψ[i]⟩ with left– and right– hand side matrices calculated from the input dataset. A solution to classification problem (predict f on an unseen x) is found without using a norm testing how predicted f differs from the one observed. Possible outcomes and their probabilities are obtained separately using the Lebesgue quadrature technique, this makes the method robust to input data with outliers, spikes, etc. A remarkable feature is the knowledge of the invariant group (what input attributes transforms do not change the solution). Radon–Nikodym approach is typically slower than the other methods used, this is the cost of being a “norm–free”. A parallel implementation is expected to improve this deficiency.\",\"PeriodicalId\":344578,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Future Networks and Distributed Systems\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Future Networks and Distributed Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3440749.3442604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Future Networks and Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3440749.3442604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On The Radon–Nikodym Machine Learning Parallelization
A novel Radon–Nikodym approach to machine learning (ML) is developed, implemented numerically, tested on various datasets, and it’s parallelization is discussed. It consists in building a “wavefunction” ψ(x), a linear function on input attributes x then constructing a classificator of the form ⟨ψ2f⟩/⟨ψ2⟩. The solution is obtained from a generalized eigenproblem |f|ψ[i]⟩ = λ[i]|ψ[i]⟩ with left– and right– hand side matrices calculated from the input dataset. A solution to classification problem (predict f on an unseen x) is found without using a norm testing how predicted f differs from the one observed. Possible outcomes and their probabilities are obtained separately using the Lebesgue quadrature technique, this makes the method robust to input data with outliers, spikes, etc. A remarkable feature is the knowledge of the invariant group (what input attributes transforms do not change the solution). Radon–Nikodym approach is typically slower than the other methods used, this is the cost of being a “norm–free”. A parallel implementation is expected to improve this deficiency.