首页 > 最新文献

IEEE journal on selected areas in information theory最新文献

英文 中文
Shadow Area and Degrees of Freedom for Free-Space Communication 自由空间通信的阴影面积和自由度
IF 2.2 Pub Date : 2025-08-19 DOI: 10.1109/JSAIT.2025.3600363
Mats Gustafsson
The number of degrees of freedom (NDoF) in a communication channel fundamentally limits the number of independent spatial modes available for transmitting and receiving information. Although the NDoF can be computed numerically for specific configurations using singular value decomposition (SVD) of the channel operator, this approach provides limited physical insight. In this paper, we introduce a simple analytical estimate for the NDoF between arbitrarily shaped transmitter and receiver regions in free space. In the electrically large limit, where the NDoF is high, it is well approximated by the mutual shadow area, measured in units of wavelength squared. This area corresponds to the projected overlap of the regions, integrated over all lines of sight, and captures their effective spatial coupling. The proposed estimate generalizes and unifies several previously established results, including those based on Weyl’s law, shadow area, and the paraxial approximation. We analyze several example configurations to illustrate the accuracy of the estimate and validate it through comparisons with numerical SVD computations of the propagation channel. The results provide both practical tools and physical insight for the design and analysis of high-capacity communication and sensing systems.
通信信道中的自由度(NDoF)从根本上限制了可用于发送和接收信息的独立空间模式的数量。尽管可以使用信道算子的奇异值分解(SVD)对特定配置进行数值计算,但这种方法提供的物理洞察力有限。本文介绍了在自由空间中任意形状的发射端和接收端之间的nof的一种简单的解析估计方法。在较大的电极限下,NDoF较高的地方,可以用相互阴影面积很好地近似,以波长平方为单位测量。该区域对应于区域的投影重叠,整合在所有视线上,并捕获它们的有效空间耦合。提出的估计推广和统一了几个先前建立的结果,包括基于Weyl定律、阴影面积和近轴近似的结果。我们分析了几个示例配置来说明估计的准确性,并通过与传播信道的SVD数值计算的比较来验证它。研究结果为高容量通信和传感系统的设计和分析提供了实用工具和物理见解。
{"title":"Shadow Area and Degrees of Freedom for Free-Space Communication","authors":"Mats Gustafsson","doi":"10.1109/JSAIT.2025.3600363","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3600363","url":null,"abstract":"The number of degrees of freedom (NDoF) in a communication channel fundamentally limits the number of independent spatial modes available for transmitting and receiving information. Although the NDoF can be computed numerically for specific configurations using singular value decomposition (SVD) of the channel operator, this approach provides limited physical insight. In this paper, we introduce a simple analytical estimate for the NDoF between arbitrarily shaped transmitter and receiver regions in free space. In the electrically large limit, where the NDoF is high, it is well approximated by the mutual shadow area, measured in units of wavelength squared. This area corresponds to the projected overlap of the regions, integrated over all lines of sight, and captures their effective spatial coupling. The proposed estimate generalizes and unifies several previously established results, including those based on Weyl’s law, shadow area, and the paraxial approximation. We analyze several example configurations to illustrate the accuracy of the estimate and validate it through comparisons with numerical SVD computations of the propagation channel. The results provide both practical tools and physical insight for the design and analysis of high-capacity communication and sensing systems.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"325-337"},"PeriodicalIF":2.2,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Achievable Rates and Error Probability Bounds of Frequency-Based Channels of Unlimited Input Resolution 无限输入分辨率下基于频率信道的可达率和误差概率界限
IF 2.2 Pub Date : 2025-08-18 DOI: 10.1109/JSAIT.2025.3599794
Ran Tamir;Nir Weinberger
We consider a molecular channel, in which messages are encoded to the frequency of objects in a pool, and whose output during reading time is a noisy version of the input frequencies, as obtained by sampling with replacement from the pool. Motivated by recent DNA storage techniques, we focus on the regime in which the input resolution is unlimited. We propose two error probability bounds for this channel; the first bound is based on random coding analysis of the error probability of the maximum likelihood decoder and the second bound is derived by code expurgation techniques. We deduce an achievable bound on the capacity of this channel, and compare it to both the achievable bounds under limited input resolution, as well as to a converse bound.
我们考虑一个分子通道,其中消息被编码为池中对象的频率,其在读取时间的输出是输入频率的带噪声版本,通过从池中替换采样获得。受最近的DNA存储技术的启发,我们专注于输入分辨率无限的机制。我们提出了该信道的两个误差概率界限;第一个边界是基于随机编码分析最大似然解码器的错误概率,第二个边界是通过编码删减技术推导出来的。我们推导出该信道容量的可实现界,并将其与有限输入分辨率下的可实现界以及反向界进行比较。
{"title":"Achievable Rates and Error Probability Bounds of Frequency-Based Channels of Unlimited Input Resolution","authors":"Ran Tamir;Nir Weinberger","doi":"10.1109/JSAIT.2025.3599794","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3599794","url":null,"abstract":"We consider a molecular channel, in which messages are encoded to the frequency of objects in a pool, and whose output during reading time is a noisy version of the input frequencies, as obtained by sampling with replacement from the pool. Motivated by recent DNA storage techniques, we focus on the regime in which the input resolution is unlimited. We propose two error probability bounds for this channel; the first bound is based on random coding analysis of the error probability of the maximum likelihood decoder and the second bound is derived by code expurgation techniques. We deduce an achievable bound on the capacity of this channel, and compare it to both the achievable bounds under limited input resolution, as well as to a converse bound.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"283-295"},"PeriodicalIF":2.2,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145061913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Achievable Rates of Nanopore-Based DNA Storage 基于纳米孔的DNA存储的可实现速率
IF 2.2 Pub Date : 2025-08-14 DOI: 10.1109/JSAIT.2025.3598756
Brendon McBain;Emanuele Viterbo
This paper studies achievable rates of nanopore-based DNA storage when nanopore signals are decoded using a tractable channel model that does not rely on a basecalling algorithm. Specifically, the noisy nanopore channel (NNC) with the Scrappie pore model generates average output levels via i.i.d. geometric sample duplications corrupted by i.i.d. Gaussian noise (NNC-Scrappie). Simplified message passing algorithms are derived for efficient soft decoding of nanopore signals using NNC-Scrappie. Previously, evaluation of this channel model was limited by the lack of DNA storage datasets with nanopore signals included. This is solved by deriving an achievable rate based on the dynamic time-warping (DTW) algorithm that can be applied to genomic sequencing datasets subject to constraints that make the resulting rate applicable to DNA storage. Using a publicly-available dataset from Oxford Nanopore Technologies (ONT), it is demonstrated that coding over multiple DNA strands of 100 bases in length and decoding with the NNC-Scrappie decoder can achieve rates of at least $0.64-1.18$ bits per base, depending on the channel quality of the nanopore that is chosen in the sequencing device per channel-use, and 0.96 bits per base on average assuming uniformly chosen nanopores. These rates are pessimistic since they only apply to single reads and do not include calibration of the pore model to specific nanopores.
本文研究了使用不依赖于基调用算法的可处理通道模型解码纳米孔信号时基于纳米孔的DNA存储的可实现速率。具体来说,具有Scrappie孔模型的噪声纳米孔通道(NNC)通过i.i.d高斯噪声(NNC-Scrappie)破坏的几何样本重复产生平均输出水平。推导了简化的消息传递算法,利用NNC-Scrappie对纳米孔信号进行有效的软解码。以前,由于缺乏包含纳米孔信号的DNA存储数据集,对该通道模型的评估受到限制。这是通过基于动态时间规整(DTW)算法推导可实现的速率来解决的,该算法可应用于受约束的基因组测序数据集,使结果速率适用于DNA存储。使用来自牛津纳米孔技术公司(ONT)的公开数据集,证明了对长度为100个碱基的多条DNA链进行编码,并使用NNC-Scrappie解码器进行解码,可以实现每个碱基至少0.64-1.18美元比特的速率,这取决于每个通道使用的测序设备中选择的纳米孔的通道质量,假设均匀选择的纳米孔,平均每个碱基0.96比特。这些比率是悲观的,因为它们只适用于单次读取,不包括对特定纳米孔的孔模型的校准。
{"title":"Achievable Rates of Nanopore-Based DNA Storage","authors":"Brendon McBain;Emanuele Viterbo","doi":"10.1109/JSAIT.2025.3598756","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3598756","url":null,"abstract":"This paper studies achievable rates of nanopore-based DNA storage when nanopore signals are decoded using a tractable channel model that does not rely on a basecalling algorithm. Specifically, the noisy nanopore channel (NNC) with the Scrappie pore model generates average output levels via i.i.d. geometric sample duplications corrupted by i.i.d. Gaussian noise (NNC-Scrappie). Simplified message passing algorithms are derived for efficient soft decoding of nanopore signals using NNC-Scrappie. Previously, evaluation of this channel model was limited by the lack of DNA storage datasets with nanopore signals included. This is solved by deriving an achievable rate based on the dynamic time-warping (DTW) algorithm that can be applied to genomic sequencing datasets subject to constraints that make the resulting rate applicable to DNA storage. Using a publicly-available dataset from Oxford Nanopore Technologies (ONT), it is demonstrated that coding over multiple DNA strands of 100 bases in length and decoding with the NNC-Scrappie decoder can achieve rates of at least <inline-formula> <tex-math>$0.64-1.18$ </tex-math></inline-formula> bits per base, depending on the channel quality of the nanopore that is chosen in the sequencing device per channel-use, and 0.96 bits per base on average assuming uniformly chosen nanopores. These rates are pessimistic since they only apply to single reads and do not include calibration of the pore model to specific nanopores.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"261-269"},"PeriodicalIF":2.2,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Achievable Rates Over Noisy Nanopore Channels 噪声纳米孔通道上可实现的速率
IF 2.2 Pub Date : 2025-08-14 DOI: 10.1109/JSAIT.2025.3598773
V. Arvind Rameshwar;Nir Weinberger
In this paper, we consider a recent channel model of a nanopore sequencer proposed by McBain, Viterbo, and Saunderson (2024), termed the noisy nanopore channel (NNC). In essence, an NNC is a duplication channel with structured, Markov inputs, that is corrupted by memoryless noise. We first discuss a (tight) lower bound on the capacity of the NNC in the absence of random noise. Next, we present lower and upper bounds on the channel capacity of general noisy nanopore channels. We then consider two interesting regimes of operation of an NNC: first, where the memory of the input process is large and the random noise introduces erasures, and second, where the rate of measurements of the electric current (also called the sampling rate) is high. For these regimes, we show that it is possible to achieve information rates close to the noise-free capacity, using low-complexity encoding and decoding schemes. In particular, our decoder for the regime of high sampling rates makes use of a change-point detection procedure – a subroutine of immediate relevance for practitioners.
在本文中,我们考虑了McBain, Viterbo和Saunderson(2024)提出的纳米孔测序器的通道模型,称为噪声纳米孔通道(NNC)。本质上,NNC是一个具有结构化马尔可夫输入的重复信道,它被无记忆噪声破坏。我们首先讨论了在没有随机噪声的情况下NNC容量的(紧)下界。其次,我们给出了一般噪声纳米孔通道容量的下界和上界。然后,我们考虑了NNC的两种有趣的操作机制:首先,输入过程的内存很大,随机噪声会引入擦除,其次,电流的测量率(也称为采样率)很高。对于这些体制,我们表明有可能实现接近无噪声容量的信息率,使用低复杂度的编码和解码方案。特别是,我们的高采样率制度的解码器利用了一个变化点检测程序-一个直接相关的从业者子程序。
{"title":"On Achievable Rates Over Noisy Nanopore Channels","authors":"V. Arvind Rameshwar;Nir Weinberger","doi":"10.1109/JSAIT.2025.3598773","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3598773","url":null,"abstract":"In this paper, we consider a recent channel model of a nanopore sequencer proposed by McBain, Viterbo, and Saunderson (2024), termed the noisy nanopore channel (NNC). In essence, an NNC is a duplication channel with structured, Markov inputs, that is corrupted by memoryless noise. We first discuss a (tight) lower bound on the capacity of the NNC in the absence of random noise. Next, we present lower and upper bounds on the channel capacity of general noisy nanopore channels. We then consider two interesting regimes of operation of an NNC: first, where the memory of the input process is large and the random noise introduces erasures, and second, where the rate of measurements of the electric current (also called the sampling rate) is high. For these regimes, we show that it is possible to achieve information rates close to the noise-free capacity, using low-complexity encoding and decoding schemes. In particular, our decoder for the regime of high sampling rates makes use of a change-point detection procedure – a subroutine of immediate relevance for practitioners.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"270-282"},"PeriodicalIF":2.2,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144990072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequence Reconstruction for the Single-Deletion Single-Substitution Channel 单缺失单替换通道的序列重建
IF 2.2 Pub Date : 2025-08-08 DOI: 10.1109/JSAIT.2025.3597013
Wentu Song;Kui Cai;Tony Q. S. Quek
The central problem in sequence reconstruction is to find the minimum number of distinct channel outputs required to uniquely reconstruct the transmitted sequence. According to Levenshtein’s work in 2001, this number is determined by the size of the maximum intersection between the error balls of any two distinct input sequences of the channel. In this work, we study the sequence reconstruction problem for the q-ary single-deletion single-substitution channel for any fixed integer $qgeq 2$ . First, we prove that if two q-ary sequences of length n have a Hamming distance $dgeq 2$ , then the intersection size of their error balls is upper bounded by $2qn-3q-2-delta _{q,2}$ , where $delta _{i,j}$ is the Kronecker delta, and this bound is achievable. Next, we prove that if two q-ary sequences have a Hamming distance $dgeq 3$ and a Levenshtein distance $d_{text {L}}geq 2$ , then the intersection size of their error balls is upper bounded by $3q+11$ , and we show that the gap between this bound and the tight bound is at most 2.
序列重建的核心问题是找到唯一重建传输序列所需的不同信道输出的最小数量。根据Levenshtein在2001年的工作,这个数字是由任意两个不同的信道输入序列的误差球之间的最大交集的大小决定的。在这项工作中,我们研究了任意固定整数$qgeq 2$的q元单删除单替换通道的序列重建问题。首先,我们证明了如果两个长度为n的q-ary序列有一个汉明距离$dgeq 2$,那么它们的误差球的相交大小的上界为$2qn-3q-2-delta _{q,2}$,其中$delta _{i,j}$为Kronecker delta,并且这个上界是可以实现的。接下来,我们证明了如果两个q-ary序列具有Hamming距离$dgeq 3$和Levenshtein距离$d_{text {L}}geq 2$,那么它们的误差球相交大小的上界为$3q+11$,并证明了该界与紧界之间的差距不超过2。
{"title":"Sequence Reconstruction for the Single-Deletion Single-Substitution Channel","authors":"Wentu Song;Kui Cai;Tony Q. S. Quek","doi":"10.1109/JSAIT.2025.3597013","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3597013","url":null,"abstract":"The central problem in sequence reconstruction is to find the minimum number of distinct channel outputs required to uniquely reconstruct the transmitted sequence. According to Levenshtein’s work in 2001, this number is determined by the size of the maximum intersection between the error balls of any two distinct input sequences of the channel. In this work, we study the sequence reconstruction problem for the q-ary single-deletion single-substitution channel for any fixed integer <inline-formula> <tex-math>$qgeq 2$ </tex-math></inline-formula>. First, we prove that if two q-ary sequences of length n have a Hamming distance <inline-formula> <tex-math>$dgeq 2$ </tex-math></inline-formula>, then the intersection size of their error balls is upper bounded by <inline-formula> <tex-math>$2qn-3q-2-delta _{q,2}$ </tex-math></inline-formula>, where <inline-formula> <tex-math>$delta _{i,j}$ </tex-math></inline-formula> is the Kronecker delta, and this bound is achievable. Next, we prove that if two q-ary sequences have a Hamming distance <inline-formula> <tex-math>$dgeq 3$ </tex-math></inline-formula> and a Levenshtein distance <inline-formula> <tex-math>$d_{text {L}}geq 2$ </tex-math></inline-formula>, then the intersection size of their error balls is upper bounded by <inline-formula> <tex-math>$3q+11$ </tex-math></inline-formula>, and we show that the gap between this bound and the tight bound is at most 2.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"232-247"},"PeriodicalIF":2.2,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Survey of Sequence Reconstruction Problems and Their Applications in DNA-Based Storage 序列重建问题及其在dna存储中的应用综述
IF 2.2 Pub Date : 2025-08-04 DOI: 10.1109/JSAIT.2025.3595457
Yaoyu Yang
In DNA sequencing, we often need to infer an unknown sequence from a collection of its corrupted copies. Each copy cannot faithfully tell the truth due to DNA fragmentation, point mutations, and measurement errors. The theoretical guarantee of unique reconstruction is thus of concern. This motivated the study of sequence reconstruction problems three decades ago. Recently, synthetic DNA has been regarded as an ultra-dense data storage medium. Sequence reconstruction is a crucial step in achieving reliable and efficient data readout. In this survey, we summarize mainly two types of problems, reconstruction from subsequences or substrings, in both combinatorial and probabilistic settings. Meanwhile, we discuss codes and algorithms that may assist with the future development of DNA-based data storage systems.
在DNA测序中,我们经常需要从其损坏副本的集合中推断出未知序列。由于DNA断裂、点突变和测量误差,每个副本不能忠实地告诉真相。因此,独特重建的理论保证值得关注。这激发了三十年前序列重建问题的研究。近年来,合成DNA被认为是一种超高密度的数据存储介质。序列重构是实现可靠、高效数据读取的关键步骤。在这个调查中,我们主要总结了两种类型的问题,从子序列或子字符串重建,在组合和概率设置。同时,我们讨论了可能有助于未来基于dna的数据存储系统发展的代码和算法。
{"title":"Survey of Sequence Reconstruction Problems and Their Applications in DNA-Based Storage","authors":"Yaoyu Yang","doi":"10.1109/JSAIT.2025.3595457","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3595457","url":null,"abstract":"In DNA sequencing, we often need to infer an unknown sequence from a collection of its corrupted copies. Each copy cannot faithfully tell the truth due to DNA fragmentation, point mutations, and measurement errors. The theoretical guarantee of unique reconstruction is thus of concern. This motivated the study of sequence reconstruction problems three decades ago. Recently, synthetic DNA has been regarded as an ultra-dense data storage medium. Sequence reconstruction is a crucial step in achieving reliable and efficient data readout. In this survey, we summarize mainly two types of problems, reconstruction from subsequences or substrings, in both combinatorial and probabilistic settings. Meanwhile, we discuss codes and algorithms that may assist with the future development of DNA-based data storage systems.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"352-366"},"PeriodicalIF":2.2,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Input Optimization in the Composite DNA Storage Channel 复合DNA存储通道的输入优化
IF 2.2 Pub Date : 2025-08-01 DOI: 10.1109/JSAIT.2025.3595005
Adir Kobovich;Nir Weinberger
Recent advancements in DNA storage show that composite DNA letters can significantly enhance storage capacity. We model this process as a multinomial channel and propose an optimization algorithm to determine its capacity-achieving input distribution (CAID) for an arbitrary number of output reads. Our empirical results match a scaling law that determines that the support size grows exponentially with capacity. In addition, we introduce a limited-support optimization algorithm that optimizes the input distribution under a restricted support size, making it more feasible for real-world DNA storage systems. We also extend our model to account for noise and study its effect on capacity and input design.
DNA存储的最新进展表明,复合DNA字母可以显著提高存储容量。我们将这一过程建模为多项式通道,并提出了一种优化算法来确定任意数量的输出读取的容量实现输入分布(CAID)。我们的经验结果匹配的缩放定律,决定了支持的大小与容量呈指数增长。此外,我们引入了一种有限支持优化算法,该算法在有限支持大小下优化输入分布,使其更适用于现实世界的DNA存储系统。我们还扩展了我们的模型,以考虑噪声并研究其对容量和输入设计的影响。
{"title":"Input Optimization in the Composite DNA Storage Channel","authors":"Adir Kobovich;Nir Weinberger","doi":"10.1109/JSAIT.2025.3595005","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3595005","url":null,"abstract":"Recent advancements in DNA storage show that composite DNA letters can significantly enhance storage capacity. We model this process as a multinomial channel and propose an optimization algorithm to determine its capacity-achieving input distribution (CAID) for an arbitrary number of output reads. Our empirical results match a scaling law that determines that the support size grows exponentially with capacity. In addition, we introduce a limited-support optimization algorithm that optimizes the input distribution under a restricted support size, making it more feasible for real-world DNA storage systems. We also extend our model to account for noise and study its effect on capacity and input design.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"248-260"},"PeriodicalIF":2.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotically Good Generalized Quantum Tanner Codes 渐近良好广义量子Tanner码
IF 2.2 Pub Date : 2025-07-30 DOI: 10.1109/JSAIT.2025.3594310
Olai Å. Mostad;Eirik Rosnes;Hsuan-Yin Lin
In this work, we present a generalization of the recently proposed quantum Tanner codes by Leverrier and Zémor, which contains a construction of asymptotically good quantum low-density parity-check codes. Quantum Tanner codes have so far been constructed equivalently from groups, Cayley graphs, or square complexes constructed from groups. We show how to enlarge this to graphs with labeled local views and a family of square complexes, which is the largest possible in a certain sense. We show that the proposed generalization contains a family of asymptotically good quantum codes that are based on non-Cayley Schreier graphs, i.e., a new family of (generalized) quantum Tanner codes is provided. Moreover, we evaluate the performance of the generalized codes and compare with those based on Cayley graphs both in terms of minimum distance and logical error rate on the depolarizing channel, demonstrating that the proposed generalized codes based on Schreier graphs outperform those based on Cayley graphs.
在这项工作中,我们推广了最近由Leverrier和zacimmore提出的量子Tanner码,其中包含一个渐近良好量子低密度奇偶校验码的构造。到目前为止,量子坦纳码是由群、凯利图或由群构成的平方复合体等量构造的。我们展示了如何将其扩展到带有标记的局部视图和一组正方形复合体的图,这在某种意义上是最大的可能。我们证明了所提出的推广包含了一组基于非cayley Schreier图的渐近良好量子码,即提供了一组新的(广义)量子Tanner码。此外,我们还评估了基于Schreier图的广义码的性能,并将其与基于Cayley图的广义码在去极化信道上的最小距离和逻辑错误率进行了比较,证明了基于Schreier图的广义码优于基于Cayley图的广义码。
{"title":"Asymptotically Good Generalized Quantum Tanner Codes","authors":"Olai Å. Mostad;Eirik Rosnes;Hsuan-Yin Lin","doi":"10.1109/JSAIT.2025.3594310","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3594310","url":null,"abstract":"In this work, we present a generalization of the recently proposed quantum Tanner codes by Leverrier and Zémor, which contains a construction of asymptotically good quantum low-density parity-check codes. Quantum Tanner codes have so far been constructed equivalently from groups, Cayley graphs, or square complexes constructed from groups. We show how to enlarge this to graphs with labeled local views and a family of square complexes, which is the largest possible in a certain sense. We show that the proposed generalization contains a family of asymptotically good quantum codes that are based on non-Cayley Schreier graphs, i.e., a <italic>new</i> family of (generalized) quantum Tanner codes is provided. Moreover, we evaluate the performance of the generalized codes and compare with those based on Cayley graphs both in terms of minimum distance and logical error rate on the depolarizing channel, demonstrating that the proposed generalized codes based on Schreier graphs outperform those based on Cayley graphs.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"367-382"},"PeriodicalIF":2.2,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145210027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ramp Secret Sharing for Composite DNA 合成DNA的斜坡秘密共享
IF 2.2 Pub Date : 2025-07-28 DOI: 10.1109/JSAIT.2025.3593447
Wenkai Zhang;Zhiying Wang
Emerging DNA storage technologies use composite DNA letters, where information is represented by a probability vector, leading to higher information density and lower synthesis costs. However, it faces the problem of information leakage in sharing the DNA vessels among untrusted vendors. This paper introduces an asymptotic ramp secret sharing scheme (ARSSS) for secret information storage using composite DNA letters. This innovative scheme, inspired by secret sharing methods over finite fields and enhanced with a modified matrix-vector multiplication operation for probability vectors, achieves asymptotic information-theoretic data security for a large alphabet size. Moreover, this scheme reduces the number of reading operations for DNA samples compared to traditional schemes, and therefore lowers the complexity and the cost of DNA-based secret sharing. We further explore the construction of the scheme, starting with a proof of the existence of a suitable generator, followed by practical examples. Finally, we demonstrate efficient constructions to support large information sizes, which utilize multiple vessels for each secret share rather than a single vessel.
新兴的DNA存储技术使用复合DNA字母,其中信息由概率向量表示,导致更高的信息密度和更低的合成成本。然而,它在不可信的供应商之间共享DNA容器时面临着信息泄露的问题。介绍了一种利用复合DNA字母存储秘密信息的渐近斜坡秘密共享方案(ARSSS)。该创新方案受有限域上的秘密共享方法的启发,并通过改进的概率向量矩阵向量乘法运算进行增强,实现了大字母表大小的渐近信息论数据安全。此外,与传统方案相比,该方案减少了DNA样本的读取操作,从而降低了基于DNA的秘密共享的复杂性和成本。我们进一步探讨了该方案的构造,首先证明了一个合适的生成器的存在性,然后是实际的例子。最后,我们展示了支持大信息大小的有效结构,它为每个秘密共享使用多个容器而不是单个容器。
{"title":"Ramp Secret Sharing for Composite DNA","authors":"Wenkai Zhang;Zhiying Wang","doi":"10.1109/JSAIT.2025.3593447","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3593447","url":null,"abstract":"Emerging DNA storage technologies use composite DNA letters, where information is represented by a probability vector, leading to higher information density and lower synthesis costs. However, it faces the problem of information leakage in sharing the DNA vessels among untrusted vendors. This paper introduces an asymptotic ramp secret sharing scheme (ARSSS) for secret information storage using composite DNA letters. This innovative scheme, inspired by secret sharing methods over finite fields and enhanced with a modified matrix-vector multiplication operation for probability vectors, achieves asymptotic information-theoretic data security for a large alphabet size. Moreover, this scheme reduces the number of reading operations for DNA samples compared to traditional schemes, and therefore lowers the complexity and the cost of DNA-based secret sharing. We further explore the construction of the scheme, starting with a proof of the existence of a suitable generator, followed by practical examples. Finally, we demonstrate efficient constructions to support large information sizes, which utilize multiple vessels for each secret share rather than a single vessel.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"217-231"},"PeriodicalIF":2.2,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144887811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Error Exponents for DNA Storage Codes With a Variable Number of Reads 具有可变读取数的DNA存储代码的错误指数
IF 2.2 Pub Date : 2025-07-21 DOI: 10.1109/JSAIT.2025.3590758
Yan Hao Ling;Nir Weinberger;Jonathan Scarlett
In this paper, we study error exponents for an index-based concatenated coding based class of DNA storage codes in which the number of reads performed can be variable. That is, the decoder can sequentially perform reads and choose whether to output the final decision or take more reads, and we are interested in minimizing the average number of reads performed rather than a fixed pre-specified value. We show that this flexibility leads to a considerable reduction in the error probability compared to a fixed number of reads, not only in terms of constants in the error exponent but also in the scaling laws. This is shown via an achievability result for a suitably-designed protocol, and in certain parameter regimes we additionally establish a matching converse that holds for all protocols within a broader index-based concatenated coding based class.
在本文中,我们研究了一类基于索引的串联编码的DNA存储编码的错误指数,其中执行的读取次数可以是可变的。也就是说,解码器可以顺序地执行读取并选择是输出最终决定还是进行更多读取,我们感兴趣的是最小化执行的平均读取次数,而不是固定的预先指定值。我们表明,与固定次数的读取相比,这种灵活性不仅在误差指数常数方面,而且在标度定律方面,导致错误概率大大降低。这是通过适当设计的协议的可实现性结果来显示的,并且在某些参数制度中,我们还建立了一个匹配的逆,该逆适用于更广泛的基于索引的连接编码类中的所有协议。
{"title":"Error Exponents for DNA Storage Codes With a Variable Number of Reads","authors":"Yan Hao Ling;Nir Weinberger;Jonathan Scarlett","doi":"10.1109/JSAIT.2025.3590758","DOIUrl":"https://doi.org/10.1109/JSAIT.2025.3590758","url":null,"abstract":"In this paper, we study error exponents for an index-based concatenated coding based class of DNA storage codes in which the number of reads performed can be variable. That is, the decoder can sequentially perform reads and choose whether to output the final decision or take more reads, and we are interested in minimizing the average number of reads performed rather than a fixed pre-specified value. We show that this flexibility leads to a considerable reduction in the error probability compared to a fixed number of reads, not only in terms of constants in the error exponent but also in the scaling laws. This is shown via an achievability result for a suitably-designed protocol, and in certain parameter regimes we additionally establish a matching converse that holds for all protocols within a broader index-based concatenated coding based class.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"6 ","pages":"205-216"},"PeriodicalIF":2.2,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144814143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE journal on selected areas in information theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1