LLMs used tactical nuclear weapons in 95% of AI war games, launched strategic strikes three times


Abstract: This is a brief description of a project that has already autoformalized a large portion of the general topology from the Munkres textbook (which has in total 241 pages in 7 chapters and 39 sections). The project has been running since November 21, 2025, and has, as of January 4, 2026, produced 160k lines of formalized topology. Most of it (about 130k lines) was done in two weeks, from December 22 to January 4, for an LLM subscription cost of about $100. This includes a 3k-line proof of Urysohn's lemma, a 2k-line proof of Urysohn's Metrization theorem, a proof of the Tietze extension theorem of over 10k lines, and many more (in total over 1.5k lemmas/theorems). The approach is quite simple and cheap: build a long-running feedback loop between an LLM and a reasonably fast proof checker equipped with a core foundational library. The LLM is now instantiated as ChatGPT (mostly 5.2) or Claude Sonnet (4.5), run through the respective Codex or Claude Code command line interfaces. The proof checker is Chad Brown's higher-order set theory system Megalodon, and the core library is Brown's formalization of basic set theory and surreal numbers (including reals, etc.). The rest is some prompt engineering and technical choices, which we describe here. Based on the fast progress, low cost, virtually unknown ITP/library, and the simple setup available to everyone, we believe that (auto)formalization may become quite easy and ubiquitous in 2026, regardless of which proof assistant is used.
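The feedback loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual setup: `feedback_loop`, `propose`, `check`, `toy_propose`, and `toy_check` are hypothetical names, the toy proof strings are not Megalodon syntax, and a real run would replace `propose` with a call to an LLM command line interface and `check` with an invocation of the proof checker.

```python
from typing import Callable, Optional, Tuple


def feedback_loop(
    statement: str,
    propose: Callable[[str, Optional[str]], str],
    check: Callable[[str], Tuple[bool, str]],
    max_rounds: int = 10,
) -> Tuple[Optional[str], int]:
    """Request a proof, check it, and feed checker errors back until it passes.

    Returns (accepted_proof, rounds_used), or (None, max_rounds) on failure.
    """
    feedback: Optional[str] = None
    for round_no in range(1, max_rounds + 1):
        # Hypothetical LLM call: sees the statement and the last checker message.
        candidate = propose(statement, feedback)
        # Hypothetical checker call: returns (accepted?, message).
        ok, message = check(candidate)
        if ok:
            return candidate, round_no
        feedback = message  # the error text drives the next attempt
    return None, max_rounds


# Toy stand-ins so the sketch runs without an LLM or a proof checker.
def toy_propose(statement: str, feedback: Optional[str]) -> str:
    # First attempt leaves a hole; after any feedback it "fixes" the proof.
    return "exact H." if feedback else "admit."


def toy_check(candidate: str) -> Tuple[bool, str]:
    if "admit" in candidate:
        return False, "error: unfinished proof"
    return True, "ok"


if __name__ == "__main__":
    proof, rounds = feedback_loop("theorem t", toy_propose, toy_check)
    print(proof, rounds)
```

The point of the design is that the checker, not the LLM, decides when a proof is done, so a fast checker keeps each round cheap and the loop can run unattended for a long time.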

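“Since 2023, the number of server racks and the computing capacity of Zhongwei's computing industry have doubled every year, and its overall competitiveness has risen quickly,” said Ren Tao. “For foreign businesses investing here, electricity will also become an important factor.”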

This ant s

What alternatives to the Middle East do Russians have?

Southeast Asia

“All of Southeast Asia is open to us right now. True, a holiday there costs a little more, and the flight is a little longer, by an hour or two. These destinations are actively used as replacements for tours. People are booking both Vietnam and Thailand,” Yuri Barzykin, vice president of the Russian Union of Travel Industry (RST), told Lenta.ru.

To bridge the gap between metaprogramming and the type

Apple anno

*Listed salary range is for OTE