面向高精度3D NAND存算一体芯片的多晶硅晶界势垒工艺优化与验证

郑好; 刘慧雯; 许克志; 张宝通; 杨远程; 夏志良; 霍宗亮

doi:10.7498/aps.75.20260199

摘要

随着人工智能与边缘计算的快速发展，存算一体（Computing-In-Memory，简记为CIM）架构被认为是缓解冯·诺依曼瓶颈的重要技术路径1–3。基于3D NAND的CIM方案兼具高存储密度与工艺成熟度，但在执行矩阵-向量乘法（Matrix-Vector Multiplication，简记为MVM）等模拟计算任务时，串电流分布展宽会导致累加电流偏差，进而引起计算精度下降4–6问题。其中3D NAND顶部选择（Top Select Gate，简记为TSG）晶体管沟道的多晶硅晶界更是会直接影响到串电流的分布。因此本文采用计算机辅助设计软件 (Technology Computer-Aided Design, 简记为TCAD)建立TSG Deck器件模型，分析TSG多晶硅沟道晶界陷阱诱发的势垒对开态电流波动的影响规律。在此基础上，提出一种通过多晶硅前驱体组合调控实现等效氢钝化窗口优化的工艺方案，并对不同工艺分组条件下的开态电流分布进行单片晶圆尺度的统计评估。结果表明，最优工艺可使位线端电流分布的归一化标准差较优化前减小50%，并在CIM系统级仿真中使GPT-2 124M模型的INT8推理中的MVM计算误差相对基准工艺降低14.7%至66.8%。综上，本文工作为面向高精度3D NAND CIM芯片的工艺优化方案提供了可实现的设计依据。

关键词:

Abstract

With the rapid development of artificial intelligence and edge computing, the Computing-In-Memory (CIM) architecture is considered a crucial technical path to alleviate the von Neumann bottleneck. While 3D NAND-based CIM schemes offer distinct advantages in high storage density and process maturity, their execution of analog computing tasks, such as Matrix-Vector Multiplication (MVM), suffers from calculation accuracy degradation. This is primarily caused by the broadening of the string current distribution, which leads to accumulated current deviations. In particular, the polysilicon grain boundaries (GBs) within the Top Select Gate (TSG) channel of 3D NAND strings play a decisive role in determining this current distribution.
To address this challenge, this study utilizes Technology Computer-Aided Design (TCAD) to construct a mature TSG Deck device model, analyzing the influence mechanism of potential barriers induced by GB traps on on-state current fluctuations. Simulation results demonstrate that acceptor-like traps at grain boundaries induce local potential barriers, and the variance of these barriers is the dominant physical source of on-state current instability. Guided by these physical insights, a novel process optimization strategy is proposed to modulate the equivalent hydrogen passivation window by combining polysilicon precursors with distinct nucleation and hydrogen-content characteristics (denoted as NS, MS, and DS). Specifically, innovatively inserting a 9-nm DS precursor interlayer between the NS nucleation layer and the MS bulk-fill layer creates a low-defect buffer zone, achieving in-situ hydrogen passivation of deep-level traps without compromising interface smoothness.
Wafer-scale statistical analysis of on-state current distributions across different process splits confirms that the optimal precursor combination reduces the normalized standard deviation of the bit-line current by 50% compared to the baseline process. Furthermore, to evaluate the system-level impact, the measured current distributions were fitted to a skew-t distribution and injected as multiplication noise into a custom CIM simulation framework. System-level simulations of INT8 quantization inference for the GPT-2 124M model indicate that the optimized device characteristics significantly reduce MVM calculation errors by 14.7% to 66.8%, depending on the weight matrix dimensions. In conclusion, this work bridges device-level process optimization with system-level performance, providing a highly manufacturable design basis for high-precision 3D NAND CIM chips.

Keywords:

3D NAND, Computing-In-Memory, Top Select Gate (TSG), Polysilicon Grain Boundary, Current Distribution /
/
/
/

搜索

面向高精度3D NAND存算一体芯片的多晶硅晶界势垒工艺优化与验证

Process Optimization and Validation of the Polysilicon Grain-Boundary Barrier for High-Accuracy 3D NAND Compute-in-Memory Chips

摘要

Abstract

施引文献

目录