An intelligent, monolithic conversational assistant. Built for fast inference, unique personality replication, and seamless daily chatting.
From our legacy systems to our bleeding-edge GPT 3, we are committed to keeping all generations of WillMe alive and accessible.
Gen 3 • RWKV Architecture
Our launched monolithic flagship. Moving away from GRU, GPT 3.1 adopts the highly efficient RWKV architecture to ensure long-range style consistency. General world knowledge, masterfully tuned for chat.
Gen 2 • GRU Architecture
The stable Gen 2 model. GRU 2 stays online for legacy access and comparison, while GPT 3.1 is now the public flagship for most chats.
Available indefinitely alongside Gen 3.
Gen 1 • Original GRU
The model that started it all. Kept purely for legacy access, backwards compatibility, and historical fallback.
Available indefinitely alongside Gen 3.
We are actively exploring the limits of personality replication and efficiency. Our biggest live experiment is the transition to the RWKV architecture for WillMe GPT 3.1.
By training a lightweight 163m parameter model on roughly 10 billion tokens of English and Swedish text, followed by heavily tuning it on exactly 53,000 real conversations with William, we are testing how well an O(1) inference model can maintain a natural human rhythm, slang, and specific persona lock-in over long contexts without the heavy compute costs of traditional transformers.
May 28, 2026
The next generation is in development. GPT 3.2 builds on 3.1 with 11B training tokens and a new advanced patching fine-tuning stage for more accurate personality replication. No release date yet. See the model page →
May 14, 2026
WillMe GPT 3.1 is now public at /chat/. It is free to use, runs on Intel Arc hardware, and lives alongside GRU 1 and GRU 2 for comparison.
Apr 28, 2026
Launched the model comparison page at /compare. The chat page became fully mobile-compatible, and WillMe AI became a division of Infinity Intelligence under Infinity Productions.
Mar 2026
Moved to a clean light theme with indigo accents. Each model now has its own overview page at /models/gru1 and /models/gru2.
Oct 2025
~40M parameter model with 150k tokens of general pre-training before fine-tuning. First WillMe model capable of coherent back-and-forth conversation.
Aug – Sep 2025
15M parameters trained exclusively on 2,700 real conversations. Incoherent, chaotic, and somehow very William. Still live.