首页 > 最新文献

arXiv (Cornell University)最新文献

英文 中文
Plane partitions and rowmotion on rectangular and trapezoidal posets 平面分区和在矩形和梯形上的移动
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07133
Johnson, Joseph, Liu, Ricky Ini
We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birational rowmotion on trapezoidal posets has finite order.
我们定义了一个矩形偏序集和它的关联的梯形偏序集的标记之间的双向映射。这张图热带化为这些固定高度的偏置集的平面分区之间的双射,给出了Proctor结果的一个新的双射证明。我们还证明了该映射对于两族运动是等变的,解决了Williams的一个猜想,并暗示了梯形偏集上的两族运动具有有限阶。
{"title":"Plane partitions and rowmotion on rectangular and trapezoidal posets","authors":"Johnson, Joseph, Liu, Ricky Ini","doi":"10.48550/arxiv.2311.07133","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07133","url":null,"abstract":"We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birational rowmotion on trapezoidal posets has finite order.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models WaterBench:迈向大型语言模型水印的整体评估
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07138
Tu, Shangqing, Sun, Yuliang, Bai, Yushi, Yu, Jifan, Hou, Lei, Li, Juanzi
To mitigate the potential misuse of large language models (LLMs), recent research has developed watermarking algorithms, which restrict the generation process to leave an invisible trace for watermark detection. Due to the two-stage nature of the task, most studies evaluate the generation and detection separately, thereby presenting a challenge in unbiased, thorough, and applicable evaluations. In this paper, we introduce WaterBench, the first comprehensive benchmark for LLM watermarks, in which we design three crucial factors: (1) For textbf{benchmarking procedure}, to ensure an apples-to-apples comparison, we first adjust each watermarking method's hyper-parameter to reach the same watermarking strength, then jointly evaluate their generation and detection performance. (2) For textbf{task selection}, we diversify the input and output length to form a five-category taxonomy, covering $9$ tasks. (3) For textbf{evaluation metric}, we adopt the GPT4-Judge for automatically evaluating the decline of instruction-following abilities after watermarking. We evaluate $4$ open-source watermarks on $2$ LLMs under $2$ watermarking strengths and observe the common struggles for current methods on maintaining the generation quality. The code and data are available at url{https://github.com/THU-KEG/WaterBench}.
为了减少对大型语言模型(llm)的潜在滥用,最近的研究开发了水印算法,该算法限制了生成过程,为水印检测留下了不可见的痕迹。由于任务的两阶段性质,大多数研究分别评估生成和检测,从而对公正,彻底和适用的评估提出了挑战。本文介绍了首个LLM水印综合基准测试WaterBench,其中设计了三个关键因素:(1)在textbf{基准测试过程中,首先调整每种水印}方法的超参数,使其达到相同的水印强度,然后共同评估它们的生成和检测性能,以确保两者之间的比较。(2)对于textbf{任务选择},我们将输入和输出长度多样化,形成一个五类分类法,涵盖$9$任务。(3)textbf{评价指标}采用GPT4-Judge自动评价水印后指令跟随能力的下降情况。我们在$2$水印强度下评估了$2$ llm上的$4$开源水印,并观察了当前方法在保持生成质量方面的常见问题。代码和数据可在url{https://github.com/THU-KEG/WaterBench}上获得。
{"title":"WaterBench: Towards Holistic Evaluation of Watermarks for Large Language\u0000 Models","authors":"Tu, Shangqing, Sun, Yuliang, Bai, Yushi, Yu, Jifan, Hou, Lei, Li, Juanzi","doi":"10.48550/arxiv.2311.07138","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07138","url":null,"abstract":"To mitigate the potential misuse of large language models (LLMs), recent research has developed watermarking algorithms, which restrict the generation process to leave an invisible trace for watermark detection. Due to the two-stage nature of the task, most studies evaluate the generation and detection separately, thereby presenting a challenge in unbiased, thorough, and applicable evaluations. In this paper, we introduce WaterBench, the first comprehensive benchmark for LLM watermarks, in which we design three crucial factors: (1) For textbf{benchmarking procedure}, to ensure an apples-to-apples comparison, we first adjust each watermarking method's hyper-parameter to reach the same watermarking strength, then jointly evaluate their generation and detection performance. (2) For textbf{task selection}, we diversify the input and output length to form a five-category taxonomy, covering $9$ tasks. (3) For textbf{evaluation metric}, we adopt the GPT4-Judge for automatically evaluating the decline of instruction-following abilities after watermarking. We evaluate $4$ open-source watermarks on $2$ LLMs under $2$ watermarking strengths and observe the common struggles for current methods on maintaining the generation quality. The code and data are available at url{https://github.com/THU-KEG/WaterBench}.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing and Predicting Low-Listenership Trends in a Large-Scale Mobile Health Program: A Preliminary Investigation 分析和预测大规模移动医疗计划的低听众趋势:初步调查
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07139
Lalan, Arshika, Verma, Shresth, Sudan, Kumar Madhu, Mahale, Amrita, Hegde, Aparna, Tambe, Milind, Taneja, Aparna
Mobile health programs are becoming an increasingly popular medium for dissemination of health information among beneficiaries in less privileged communities. Kilkari is one of the world's largest mobile health programs which delivers time sensitive audio-messages to pregnant women and new mothers. We have been collaborating with ARMMAN, a non-profit in India which operates the Kilkari program, to identify bottlenecks to improve the efficiency of the program. In particular, we provide an initial analysis of the trajectories of beneficiaries' interaction with the mHealth program and examine elements of the program that can be potentially enhanced to boost its success. We cluster the cohort into different buckets based on listenership so as to analyze listenership patterns for each group that could help boost program success. We also demonstrate preliminary results on using historical data in a time-series prediction to identify beneficiary dropouts and enable NGOs in devising timely interventions to strengthen beneficiary retention.
移动医疗方案正日益成为向贫困社区受益人传播卫生信息的一种流行媒介。Kilkari是世界上最大的移动医疗项目之一,它向孕妇和新妈妈提供时间敏感的音频信息。我们一直在与印度一家运营Kilkari项目的非营利组织ARMMAN合作,以确定瓶颈,提高项目效率。特别是,我们提供了受益者与移动医疗项目互动轨迹的初步分析,并检查了该项目中可以潜在增强以促进其成功的要素。我们根据听众人数将这群人分成不同的群组,以便分析每个群组的听众模式,从而有助于提高节目的成功率。我们还展示了在时间序列预测中使用历史数据来识别受益人辍学并使非政府组织能够制定及时的干预措施以加强受益人保留的初步结果。
{"title":"Analyzing and Predicting Low-Listenership Trends in a Large-Scale Mobile\u0000 Health Program: A Preliminary Investigation","authors":"Lalan, Arshika, Verma, Shresth, Sudan, Kumar Madhu, Mahale, Amrita, Hegde, Aparna, Tambe, Milind, Taneja, Aparna","doi":"10.48550/arxiv.2311.07139","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07139","url":null,"abstract":"Mobile health programs are becoming an increasingly popular medium for dissemination of health information among beneficiaries in less privileged communities. Kilkari is one of the world's largest mobile health programs which delivers time sensitive audio-messages to pregnant women and new mothers. We have been collaborating with ARMMAN, a non-profit in India which operates the Kilkari program, to identify bottlenecks to improve the efficiency of the program. In particular, we provide an initial analysis of the trajectories of beneficiaries' interaction with the mHealth program and examine elements of the program that can be potentially enhanced to boost its success. We cluster the cohort into different buckets based on listenership so as to analyze listenership patterns for each group that could help boost program success. We also demonstrate preliminary results on using historical data in a time-series prediction to identify beneficiary dropouts and enable NGOs in devising timely interventions to strengthen beneficiary retention.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Analysis of Integrated Data and Energy Transfer Assisted by Fluid Antenna Systems 流体天线系统辅助下集成数据和能量传递的性能分析
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07134
Lin, Xiao, Yang, Halvin, Zhao, Yizhe, Hu, Jie, Wong, Kai-Kit
Fluid antenna multiple access (FAMA) is capable of exploiting the high spatial diversity of wireless channels to mitigate multi-user interference via flexible port switching, which achieves a better performance than traditional multi-input-multi-output (MIMO) systems. Moreover, integrated data and energy transfer (IDET) is able to provide both the wireless data transfer (WDT) and wireless energy transfer (WET) services towards low-power devices. In this paper, a FAMA assisted IDET system is studied, where $N$ access points (APs) provide dedicated IDET services towards $N$ user equipments (UEs). Each UE is equipped with a single fluid antenna. The performance of WDT and WET , textit{i.e.}, the WDT outage probability, the WET outage probability, the reliable throughput and the average energy harvesting amount, are analysed theoretically by using time switching (TS) between WDT and WET. Numerical results validate our theoretical analysis, which reveals that the number of UEs and TS ratio should be optimized to achieve a trade-off between the WDT and WET performance. Moreover, FAMA assisted IDET achieves a better performance in terms of both WDT and WET than traditional MIMO with the same antenna size.
流体天线多址(FAMA)能够利用无线信道的高空间分集,通过灵活的端口交换来减轻多用户干扰,从而获得比传统多输入多输出(MIMO)系统更好的性能。此外,集成数据和能量传输(IDET)能够为低功耗设备提供无线数据传输(WDT)和无线能量传输(WET)服务。本文研究了FAMA辅助IDET系统,其中$N$接入点(ap)向$N$用户设备(ue)提供专用IDET服务。每个UE都配备了一个流体天线。利用WDT和WET之间的时间切换(TS),从理论上分析了WDT和WET的性能,textit{即}WDT中断概率、WET中断概率、可靠吞吐量和平均能量收集量。数值结果验证了我们的理论分析,表明应优化ue的数量和TS比,以实现WDT和WET性能之间的权衡。此外,FAMA辅助的IDET在WDT和WET两方面的性能都优于相同天线尺寸的传统MIMO。
{"title":"Performance Analysis of Integrated Data and Energy Transfer Assisted by\u0000 Fluid Antenna Systems","authors":"Lin, Xiao, Yang, Halvin, Zhao, Yizhe, Hu, Jie, Wong, Kai-Kit","doi":"10.48550/arxiv.2311.07134","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07134","url":null,"abstract":"Fluid antenna multiple access (FAMA) is capable of exploiting the high spatial diversity of wireless channels to mitigate multi-user interference via flexible port switching, which achieves a better performance than traditional multi-input-multi-output (MIMO) systems. Moreover, integrated data and energy transfer (IDET) is able to provide both the wireless data transfer (WDT) and wireless energy transfer (WET) services towards low-power devices. In this paper, a FAMA assisted IDET system is studied, where $N$ access points (APs) provide dedicated IDET services towards $N$ user equipments (UEs). Each UE is equipped with a single fluid antenna. The performance of WDT and WET , textit{i.e.}, the WDT outage probability, the WET outage probability, the reliable throughput and the average energy harvesting amount, are analysed theoretically by using time switching (TS) between WDT and WET. Numerical results validate our theoretical analysis, which reveals that the number of UEs and TS ratio should be optimized to achieve a trade-off between the WDT and WET performance. Moreover, FAMA assisted IDET achieves a better performance in terms of both WDT and WET than traditional MIMO with the same antenna size.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decision-making under risk: when is utility maximization equivalent to risk minimization? 风险下的决策:效用最大化何时等于风险最小化?
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07269
Ruscitti, Francesco, Dubey, Ram Sewak, Laguzzi, Giorgio
Motivated by the analysis of a general optimal portfolio selection problem, which encompasses as special cases an optimal consumption and an optimal debt-arrangement problem, we are concerned with the questions of how a personality trait like risk-perception can be formalized and whether the two objectives of utility-maximization and risk-minimization can be both achieved simultaneously. We address these questions by developing an axiomatic foundation of preferences for which utility-maximization is equivalent to minimizing a utility-based shortfall risk measure. Our axiomatization hinges on a novel axiom in decision theory, namely the risk-perception axiom.
受一般最优投资组合选择问题分析的启发,其中包括最优消费和最优债务安排问题作为特殊情况,我们关注的问题是如何将风险感知等人格特征形式化,以及效用最大化和风险最小化这两个目标是否可以同时实现。我们通过开发偏好的公理基础来解决这些问题,其中效用最大化等同于最小化基于效用的短缺风险度量。我们的公理化依赖于决策理论中的一个新公理,即风险感知公理。
{"title":"Decision-making under risk: when is utility maximization equivalent to\u0000 risk minimization?","authors":"Ruscitti, Francesco, Dubey, Ram Sewak, Laguzzi, Giorgio","doi":"10.48550/arxiv.2311.07269","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07269","url":null,"abstract":"Motivated by the analysis of a general optimal portfolio selection problem, which encompasses as special cases an optimal consumption and an optimal debt-arrangement problem, we are concerned with the questions of how a personality trait like risk-perception can be formalized and whether the two objectives of utility-maximization and risk-minimization can be both achieved simultaneously. We address these questions by developing an axiomatic foundation of preferences for which utility-maximization is equivalent to minimizing a utility-based shortfall risk measure. Our axiomatization hinges on a novel axiom in decision theory, namely the risk-perception axiom.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Optimization on Quantum Hardware: Feasibility for a Process Industry Use Case 量子硬件的动态优化:过程工业用例的可行性
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07310
Nenno, Dennis Michael, Caspari, Adrian
The quest for real-time dynamic optimization solutions in the process industry represents a formidable computational challenge, particularly within the realm of applications like model predictive control where rapid and reliable computations are critical. Conventional methods can struggle to surmount the complexities of such tasks. Quantum computing and quantum annealing emerge as avant-garde contenders to transcend conventional computational constraints. We convert a dynamic optimization problem, characterized by a system of differential equations, into a Quadratic Unconstrained Binary Optimization problem, enabling quantum computational approaches. The empirical findings synthesized from classical methods, simulated annealing, quantum annealing via D-Wave's quantum annealer, and hybrid solver methodologies, illuminate the intricate landscape of computational prowess essential for tackling complex and high-dimensional dynamic optimization problems. Our findings suggest that while quantum annealing is a maturing technology that currently does not outperform state-of-the-art classical solvers, continuous improvements could eventually aid in increasing efficiency within the chemical process industry.
在过程工业中,对实时动态优化解决方案的追求是一个巨大的计算挑战,特别是在模型预测控制等应用领域,快速可靠的计算至关重要。传统方法很难克服这类任务的复杂性。量子计算和量子退火成为超越传统计算约束的先锋竞争者。我们将一个以微分方程系统为特征的动态优化问题转换为二次无约束二进制优化问题,使量子计算方法成为可能。从经典方法、模拟退火、通过D-Wave的量子退火器进行的量子退火和混合求解器方法合成的经验发现,阐明了解决复杂和高维动态优化问题所必需的计算能力的复杂景观。我们的研究结果表明,虽然量子退火是一项成熟的技术,目前还没有超越最先进的经典求解器,但持续的改进最终可能有助于提高化学过程工业的效率。
{"title":"Dynamic Optimization on Quantum Hardware: Feasibility for a Process\u0000 Industry Use Case","authors":"Nenno, Dennis Michael, Caspari, Adrian","doi":"10.48550/arxiv.2311.07310","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07310","url":null,"abstract":"The quest for real-time dynamic optimization solutions in the process industry represents a formidable computational challenge, particularly within the realm of applications like model predictive control where rapid and reliable computations are critical. Conventional methods can struggle to surmount the complexities of such tasks. Quantum computing and quantum annealing emerge as avant-garde contenders to transcend conventional computational constraints. We convert a dynamic optimization problem, characterized by a system of differential equations, into a Quadratic Unconstrained Binary Optimization problem, enabling quantum computational approaches. The empirical findings synthesized from classical methods, simulated annealing, quantum annealing via D-Wave's quantum annealer, and hybrid solver methodologies, illuminate the intricate landscape of computational prowess essential for tackling complex and high-dimensional dynamic optimization problems. Our findings suggest that while quantum annealing is a maturing technology that currently does not outperform state-of-the-art classical solvers, continuous improvements could eventually aid in increasing efficiency within the chemical process industry.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian Approach to Strong Lens Finding in the Era of Wide-area Surveys 广域观测时代强透镜发现的贝叶斯方法
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07455
Holloway, Philip, Marshall, Philip J., Verma, Aprajita, More, Anupreeta, Cañameras, Raoul, Jaelani, Anton T., Ishida, Yuichiro, Wong, Kenneth C.
The arrival of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST), Euclid-Wide and Roman wide area sensitive surveys will herald a new era in strong lens science in which the number of strong lenses known is expected to rise from $mathcal{O}(10^3)$ to $mathcal{O}(10^5)$. However, current lens-finding methods still require time-consuming follow-up visual inspection by strong-lens experts to remove false positives which is only set to increase with these surveys. In this work we demonstrate a range of methods to produce calibrated probabilities to help determine the veracity of any given lens candidate. To do this we use the classifications from citizen science and multiple neural networks for galaxies selected from the Hyper Suprime-Cam (HSC) survey. Our methodology is not restricted to particular classifier types and could be applied to any strong lens classifier which produces quantitative scores. Using these calibrated probabilities, we generate an ensemble classifier, combining citizen science and neural network lens finders. We find such an ensemble can provide improved classification over the individual classifiers. We find a false positive rate of $10^{-3}$ can be achieved with a completeness of $46%$, compared to $34%$ for the best individual classifier. Given the large number of galaxy-galaxy strong lenses anticipated in LSST, such improvement would still produce significant numbers of false positives, in which case using calibrated probabilities will be essential for population analysis of large populations of lenses.
Vera C. Rubin天文台的时空遗产巡天(LSST),欧几里得宽和罗马广域敏感巡天的到来,将宣告强透镜科学的新时代,其中已知的强透镜数量预计将从$mathcal{O}(10^3)$增加到$mathcal{O}(10^5)$。然而,目前的透镜寻找方法仍然需要由强透镜专家进行耗时的后续目视检查,以消除假阳性,而假阳性只会随着这些调查而增加。在这项工作中,我们展示了一系列方法来产生校准概率,以帮助确定任何给定候选透镜的准确性。为了做到这一点,我们使用了来自公民科学和多个神经网络的分类,这些分类是从超级超级相机(HSC)调查中选择的星系。我们的方法不局限于特定的分类器类型,可以应用于任何产生定量分数的强透镜分类器。使用这些校准的概率,我们生成了一个集成分类器,结合了公民科学和神经网络寻镜器。我们发现这样的集成可以提供比单个分类器更好的分类。我们发现假阳性率为$10^{-3}$,完备性为$46%$,而最佳单个分类器的完备性为$34%$。考虑到LSST中预期的大量星系-星系强透镜,这种改进仍然会产生大量的假阳性,在这种情况下,使用校准的概率对于大量透镜的种群分析是必不可少的。
{"title":"A Bayesian Approach to Strong Lens Finding in the Era of Wide-area\u0000 Surveys","authors":"Holloway, Philip, Marshall, Philip J., Verma, Aprajita, More, Anupreeta, Cañameras, Raoul, Jaelani, Anton T., Ishida, Yuichiro, Wong, Kenneth C.","doi":"10.48550/arxiv.2311.07455","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07455","url":null,"abstract":"The arrival of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST), Euclid-Wide and Roman wide area sensitive surveys will herald a new era in strong lens science in which the number of strong lenses known is expected to rise from $mathcal{O}(10^3)$ to $mathcal{O}(10^5)$. However, current lens-finding methods still require time-consuming follow-up visual inspection by strong-lens experts to remove false positives which is only set to increase with these surveys. In this work we demonstrate a range of methods to produce calibrated probabilities to help determine the veracity of any given lens candidate. To do this we use the classifications from citizen science and multiple neural networks for galaxies selected from the Hyper Suprime-Cam (HSC) survey. Our methodology is not restricted to particular classifier types and could be applied to any strong lens classifier which produces quantitative scores. Using these calibrated probabilities, we generate an ensemble classifier, combining citizen science and neural network lens finders. We find such an ensemble can provide improved classification over the individual classifiers. We find a false positive rate of $10^{-3}$ can be achieved with a completeness of $46%$, compared to $34%$ for the best individual classifier. Given the large number of galaxy-galaxy strong lenses anticipated in LSST, such improvement would still produce significant numbers of false positives, in which case using calibrated probabilities will be essential for population analysis of large populations of lenses.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Translational symmetry broken magnetization plateau of the S=1 antiferromagnetic Heisenberg chain with competing anisotropies 具有竞争各向异性的S=1反铁磁海森堡链的平移对称性破缺磁化平台
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07123
Sakai, Tôru, Okamoto, Kiyomi, Okunishi, Kouichi, Hashimoto, Masaru, Houda, Tomoki, Furuchi, Rito, Nakano, Hiroki
We investigate the S=1 antiferromagnetic quantum spin chain with the exchange and single-ion anisotropies in a magnetic field, using the numerical exact diagonalization of finite-size clusters, the level spectroscopy analysis, and the density matrix renormalization group (DMRG) method. It is found that a translational symmetry broken magnetization plateau possibly appears at the half of the saturation magnetization, when the anisotropies compete with each other. The level spectroscopy analysis gives the phase diagram at half the saturation magnetization. The DMRG calculation presents the magnetization curves for some typical parameters and clarifies the spin structure in the plateau phase.
利用有限大小团簇的数值精确对角化、能级光谱分析和密度矩阵重整化群(DMRG)方法,研究了具有交换和单离子各向异性的S=1反铁磁量子自旋链。当各向异性相互竞争时,在饱和磁化强度的一半可能出现平移对称性破缺的磁化平台。能级光谱分析给出了半饱和磁化时的相图。DMRG计算给出了一些典型参数的磁化曲线,阐明了平台相的自旋结构。
{"title":"Translational symmetry broken magnetization plateau of the S=1\u0000 antiferromagnetic Heisenberg chain with competing anisotropies","authors":"Sakai, Tôru, Okamoto, Kiyomi, Okunishi, Kouichi, Hashimoto, Masaru, Houda, Tomoki, Furuchi, Rito, Nakano, Hiroki","doi":"10.48550/arxiv.2311.07123","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07123","url":null,"abstract":"We investigate the S=1 antiferromagnetic quantum spin chain with the exchange and single-ion anisotropies in a magnetic field, using the numerical exact diagonalization of finite-size clusters, the level spectroscopy analysis, and the density matrix renormalization group (DMRG) method. It is found that a translational symmetry broken magnetization plateau possibly appears at the half of the saturation magnetization, when the anisotropies compete with each other. The level spectroscopy analysis gives the phase diagram at half the saturation magnetization. The DMRG calculation presents the magnetization curves for some typical parameters and clarifies the spin structure in the plateau phase.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reconfiguration of basis pairs in regular matroids 正则拟阵中基对的重构
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07130
Bérczi, Kristóf, Mátravölgyi, Bence, Schwarcz, Tamás
In recent years, combinatorial reconfiguration problems have attracted great attention due to their connection to various topics such as optimization, counting, enumeration, or sampling. One of the most intriguing open questions concerns the exchange distance of two matroid basis sequences, a problem that appears in several areas of computer science and mathematics. In 1980, White proposed a conjecture for the characterization of two basis sequences being reachable from each other by symmetric exchanges, which received a significant interest also in algebra due to its connection to toric ideals and Gr"obner bases. In this work, we verify White's conjecture for basis sequences of length two in regular matroids, a problem that was formulated as a separate question by Farber, Richter, and Shan and Andres, Hochst"attler, and Merkel. Most of previous work on White's conjecture has not considered the question from an algorithmic perspective. We study the problem from an optimization point of view: our proof implies a polynomial algorithm for determining a sequence of symmetric exchanges that transforms a basis pair into another, thus providing the first polynomial upper bound on the exchange distance of basis pairs in regular matroids. As a byproduct, we verify a conjecture of Gabow from 1976 on the serial symmetric exchange property of matroids for the regular case.
近年来,组合重构问题由于与优化、计数、枚举或抽样等各种主题的联系而引起了人们的广泛关注。最有趣的开放问题之一涉及两个矩阵基序列的交换距离,这个问题出现在计算机科学和数学的几个领域。1980年,White提出了一个关于两个基序列可通过对称交换相互到达的表征的猜想,由于它与环理想和Gr obner基的联系,在代数中也引起了极大的兴趣。在这项工作中,我们验证了White关于正则拟阵中长度为2的基序列的猜想,这个问题被Farber、Richter、Shan、Andres、Hochst attler和Merkel作为一个单独的问题公式化。怀特猜想之前的大部分工作都没有从算法的角度考虑这个问题。我们从最优化的角度研究了这个问题:我们的证明包含了一个多项式算法,用于确定将一个基对转化为另一个基对的对称交换序列,从而提供了正则拟阵中基对交换距离的第一个多项式上界。作为一个副产品,我们验证了1976年Gabow关于正则情况下矩阵的序列对称交换性质的一个猜想。
{"title":"Reconfiguration of basis pairs in regular matroids","authors":"Bérczi, Kristóf, Mátravölgyi, Bence, Schwarcz, Tamás","doi":"10.48550/arxiv.2311.07130","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07130","url":null,"abstract":"In recent years, combinatorial reconfiguration problems have attracted great attention due to their connection to various topics such as optimization, counting, enumeration, or sampling. One of the most intriguing open questions concerns the exchange distance of two matroid basis sequences, a problem that appears in several areas of computer science and mathematics. In 1980, White proposed a conjecture for the characterization of two basis sequences being reachable from each other by symmetric exchanges, which received a significant interest also in algebra due to its connection to toric ideals and Gr\"obner bases. In this work, we verify White's conjecture for basis sequences of length two in regular matroids, a problem that was formulated as a separate question by Farber, Richter, and Shan and Andres, Hochst\"attler, and Merkel. Most of previous work on White's conjecture has not considered the question from an algorithmic perspective. We study the problem from an optimization point of view: our proof implies a polynomial algorithm for determining a sequence of symmetric exchanges that transforms a basis pair into another, thus providing the first polynomial upper bound on the exchange distance of basis pairs in regular matroids. As a byproduct, we verify a conjecture of Gabow from 1976 on the serial symmetric exchange property of matroids for the regular case.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions Gen-Z:基于语境化标签描述的生成零射击文本分类
Pub Date : 2023-11-13 DOI: 10.48550/arxiv.2311.07115
Kumar, Sachin, Park, Chan Young, Tsvetkov, Yulia
Language model (LM) prompting--a popular paradigm for solving NLP tasks--has been shown to be susceptible to miscalibration and brittleness to slight prompt variations, caused by its discriminative prompting approach, i.e., predicting the label given the input. To address these issues, we propose Gen-Z--a generative prompting framework for zero-shot text classification. GEN-Z is generative, as it measures the LM likelihood of input text, conditioned on natural language descriptions of labels. The framework is multivariate, as label descriptions allow us to seamlessly integrate additional contextual information about the labels to improve task performance. On various standard classification benchmarks, with six open-source LM families, we show that zero-shot classification with simple contextualization of the data source of the evaluation set consistently outperforms both zero-shot and few-shot baselines while improving robustness to prompt variations. Further, our approach enables personalizing classification in a zero-shot manner by incorporating author, subject, or reader information in the label descriptions.
语言模型(LM)提示——解决NLP任务的一种流行范例——已被证明容易受到错误校准和轻微提示变化的脆弱性,这是由其判别提示方法引起的,即预测给定输入的标签。为了解决这些问题,我们提出了Gen-Z——一个用于零样本文本分类的生成提示框架。GEN-Z是生成的,因为它测量输入文本的LM可能性,以标签的自然语言描述为条件。这个框架是多元的,因为标签描述允许我们无缝地集成关于标签的其他上下文信息,以提高任务性能。在六个开源LM家族的各种标准分类基准测试中,我们表明,使用评估集数据源的简单上下文化的零射击分类始终优于零射击和少射击基线,同时提高鲁棒性以提示变化。此外,我们的方法通过在标签描述中合并作者、主题或读者信息,以零枪击的方式实现个性化分类。
{"title":"Gen-Z: Generative Zero-Shot Text Classification with Contextualized\u0000 Label Descriptions","authors":"Kumar, Sachin, Park, Chan Young, Tsvetkov, Yulia","doi":"10.48550/arxiv.2311.07115","DOIUrl":"https://doi.org/10.48550/arxiv.2311.07115","url":null,"abstract":"Language model (LM) prompting--a popular paradigm for solving NLP tasks--has been shown to be susceptible to miscalibration and brittleness to slight prompt variations, caused by its discriminative prompting approach, i.e., predicting the label given the input. To address these issues, we propose Gen-Z--a generative prompting framework for zero-shot text classification. GEN-Z is generative, as it measures the LM likelihood of input text, conditioned on natural language descriptions of labels. The framework is multivariate, as label descriptions allow us to seamlessly integrate additional contextual information about the labels to improve task performance. On various standard classification benchmarks, with six open-source LM families, we show that zero-shot classification with simple contextualization of the data source of the evaluation set consistently outperforms both zero-shot and few-shot baselines while improving robustness to prompt variations. Further, our approach enables personalizing classification in a zero-shot manner by incorporating author, subject, or reader information in the label descriptions.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136352472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv (Cornell University)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1