In this post: Although effective at improving the robustness of neural networks, adversarial training suffers from natural accuracy degradation, i.e., accuracy on natural samples drops significantly. Through quantitative and qualitative experiments, this study shows that the degradation is closely tied to the disruption of the natural-sample topology in the representation space. Based on this observation, the authors propose Topology-pReserving Adversarial traINing (TRAIN), which alleviates the problem by preserving, during adversarial training, the topology of natural samples learned by a standard model trained only on natural samples. As an additional regularization, the method can easily be combined with various popular adversarial training algorithms in a plug-and-play manner, taking advantage of both.

1. AI Security

(Adversarial training) Topology-Preserving Adversarial Training

Title: Topology-Preserving Adversarial Training
Link: https://arxiv.org/abs/2311.17607
Authors: Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu
Abstract: Despite the effectiveness in improving the robustness of neural networks, adversarial training has suffered from the natural accuracy degradation problem, i.e., accuracy on natural samples has reduced significantly. In this study, we reveal that natural accuracy degradation is highly related to the disruption of the natural sample topology in the representation space by quantitative and qualitative experiments. Based on this observation, we propose Topology-pReserving Adversarial traINing (TRAIN) to alleviate the problem by preserving the topology structure of natural samples from a standard model trained only on natural samples during adversarial training. As an additional regularization, our method can easily be combined with various popular adversarial training algorithms in a plug-and-play manner, taking advantage of both sides. Extensive experiments on CIFAR-10, CIFAR-100, and Tiny ImageNet show that our proposed method achieves consistent and significant improvements over various strong baselines in most cases. Specifically, without additional data, our proposed method achieves up to 8.78% improvement in natural accuracy and 4.50% improvement in robust accuracy.
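The abstract does not spell out the exact form of the regularizer, so the PyTorch sketch below only illustrates the general idea under an assumed formulation: the pairwise cosine-similarity structure of natural-sample features from a frozen standard model serves as the reference "topology", and the adversarially trained model is penalized for deviating from it on natural samples. All names (`robust_model`, `standard_model`, `topo_weight`) and the assumption that both models return `(logits, features)` are hypothetical.

```python
import torch
import torch.nn.functional as F

def topology_regularizer(feat_robust, feat_standard):
    """Match the pairwise cosine-similarity structure of natural-sample features
    between the model being adversarially trained and a frozen standard model."""
    zr = F.normalize(feat_robust, dim=1)
    zs = F.normalize(feat_standard, dim=1)
    return F.mse_loss(zr @ zr.T, zs @ zs.T)

def train_step(robust_model, standard_model, x_nat, x_adv, y, optimizer, topo_weight=1.0):
    """One plug-and-play training step: any adversarial-training loss on x_adv plus
    the topology-preservation term computed on natural samples only (names hypothetical)."""
    optimizer.zero_grad()
    logits_adv, _ = robust_model(x_adv)          # assumed to return (logits, features)
    _, feat_nat = robust_model(x_nat)
    with torch.no_grad():                        # the standard model stays frozen
        _, feat_ref = standard_model(x_nat)
    loss = F.cross_entropy(logits_adv, y) + topo_weight * topology_regularizer(feat_nat, feat_ref)
    loss.backward()
    optimizer.step()
    return float(loss)
```

The cross-entropy on adversarial examples is only a placeholder for whichever base adversarial training objective (PGD-AT, TRADES, etc.) the regularizer is plugged into, which is what makes the approach plug-and-play.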


(Adversarial attacks) Groupwise Sparse and Explainable Adversarial Attacks

Title: Groupwise Sparse and Explainable Adversarial Attacks
Link: https://arxiv.org/abs/2311.17434
Authors: Shpresim Sadiku, Moritz Wagner, Sebastian Pokutta
Abstract: Sparse adversarial attacks fool deep neural networks (DNNs) through minimal pixel perturbations, typically regularized by the $\ell_0$ norm. Recent efforts have replaced this norm with a structural sparsity regularizer, such as the nuclear group norm, to craft groupwise sparse adversarial attacks. The resulting perturbations are thus explainable and hold significant practical relevance, shedding light on an even greater vulnerability of DNNs than previously anticipated. However, crafting such attacks poses an optimization challenge, as it involves computing norms for groups of pixels within a nonconvex objective. In this paper, we tackle this challenge by presenting an algorithm that simultaneously generates groupwise sparse attacks within semantically meaningful areas of an image. In each iteration, the core operation of our algorithm involves the optimization of a quasinorm adversarial loss. This optimization is achieved by employing the $\ell_{1/2}$-quasinorm proximal operator for some iterations, a method tailored for nonconvex programming. Subsequently, the algorithm transitions to a projected Nesterov's accelerated gradient descent with $\ell_2$-norm regularization applied to perturbation magnitudes. We rigorously evaluate the efficacy of our novel attack in both targeted and non-targeted attack scenarios, on CIFAR-10 and ImageNet datasets. When compared to state-of-the-art methods, our attack consistently results in a remarkable increase in groupwise sparsity, e.g., an increase of 48.12% on CIFAR-10 and 40.78% on ImageNet (average case, targeted attack), all while maintaining lower perturbation magnitudes. Notably, this performance is complemented by a significantly faster computation time and a 100% attack success rate.
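The abstract only outlines the two-phase scheme, so the NumPy snippet below mirrors that structure on a toy linear classifier rather than reproducing the authors' method: phase 1 runs proximal-gradient steps in which the $\ell_{1/2}$ prox is solved per coordinate by enumerating the stationary points of the scalar subproblem (a brute-force stand-in for the closed-form half-thresholding operator), and phase 2 runs Nesterov-accelerated steps with an $\ell_2$ penalty on the perturbation magnitudes. The phase-2 projection (restriction to the phase-1 support plus an $\ell_\infty$ box), the toy loss, and all constants are assumptions; the paper additionally works with pixel groups in semantically meaningful regions, which this scalar toy problem does not model.

```python
import numpy as np

def prox_l_half(v, lam):
    """Elementwise prox of lam * sqrt(|x|): argmin_x 0.5*(x - v)^2 + lam*sqrt(|x|).
    On x > 0 the stationary points satisfy t^3 - |v|*t + lam/2 = 0 with t = sqrt(x);
    we enumerate them numerically (a stand-in for the closed-form half-thresholding
    operator) and compare against x = 0."""
    flat = []
    for vi in np.ravel(v):
        s, a = np.sign(vi), abs(vi)
        roots = np.roots([1.0, 0.0, -a, lam / 2.0])
        cands = [0.0] + [t.real ** 2 for t in roots if abs(t.imag) < 1e-8 and t.real > 0]
        flat.append(s * min(cands, key=lambda x: 0.5 * (x - a) ** 2 + lam * np.sqrt(x)))
    return np.array(flat).reshape(np.shape(v))

# Toy stand-in for the nonconvex adversarial loss: a CW-style hinge loss of a random
# linear classifier W on the perturbed input x0 + delta (not a real DNN or image).
rng = np.random.default_rng(0)
W, x0 = rng.normal(size=(10, 64)), rng.normal(size=64)
y_true = int(np.argmax(W @ x0))                      # clean prediction = "true" label

def adv_loss_grad(delta):
    """(Sub)gradient of max(0, z_true - max_{j != true} z_j) with respect to delta."""
    logits = W @ (x0 + delta)
    j = int(np.argmax(np.where(np.arange(10) == y_true, -np.inf, logits)))
    if logits[y_true] <= logits[j]:                  # already misclassified: zero gradient
        return np.zeros_like(delta)
    return W[y_true] - W[j]

step, lam, delta = 0.05, 0.5, np.zeros(64)

# Phase 1: proximal-gradient iterations; the l_1/2 prox thresholds weak coordinates to zero.
for _ in range(50):
    delta = prox_l_half(delta - step * adv_loss_grad(delta), step * lam)

# Phase 2: projected Nesterov-accelerated descent with an l2 penalty on magnitudes.
support = (delta != 0).astype(float)                 # assumed projection: keep phase-1 support
z, z_prev, momentum = delta.copy(), delta.copy(), 0.9
for _ in range(50):
    y_mom = z + momentum * (z - z_prev)
    z_prev = z
    z = y_mom - step * (adv_loss_grad(y_mom) + 2.0 * lam * y_mom)
    z = np.clip(z * support, -0.3, 0.3)              # plus an assumed l_inf box

pred = int(np.argmax(W @ (x0 + z)))
print("nonzero coordinates:", int(np.count_nonzero(z)), "/", z.size, "| fooled:", pred != y_true)
```

The point of the sketch is the interplay of the two phases: the quasinorm prox prunes coordinates, and the accelerated $\ell_2$-regularized phase then trims the magnitude of whatever perturbation survives.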


(Attacks on diffusion models) MMA-Diffusion: MultiModal Attack on Diffusion Models

Title: MMA-Diffusion: MultiModal Attack on Diffusion Models
Link: https://arxiv.org/abs/2311.17516
Authors: Yijun Yang, Ruiyuan Gao, Xiaosen Wang, Nan Xu, Qiang Xu
Abstract: In recent years, Text-to-Image (T2I) models have seen remarkable advancements, gaining widespread adoption. However, this progress has inadvertently opened avenues for potential misuse, particularly in generating inappropriate or Not-Safe-For-Work (NSFW) content. Our work introduces MMA-Diffusion, a framework that presents a significant and realistic threat to the security of T2I models by effectively circumventing current defensive measures in both open-source models and commercial online services. Unlike previous approaches, MMA-Diffusion leverages both textual and visual modalities to bypass safeguards like prompt filters and post-hoc safety checkers, thus exposing and highlighting the vulnerabilities in existing defense mechanisms.

Reference

Original article: https://blog.csdn.net/m0_38068876/article/details/134710406
