There’s a paper that landed on Hugging Face this week referred to as Memex(RL): Scaling Lengthy-Horizon LLM Brokers through Listed Expertise Reminiscence, out of Accenture’s Centre for Superior AI. It diagnoses one thing each staff delivery brokers in manufacturing has hit, and most are too embarrassed to speak about: brokers collapse on lengthy duties, not as a result of the mannequin is dumb, however as a result of the working context fills up.
The Memex framing is the correct lens right here. Current approaches truncate the trajectory or roll it into operating summaries. Each are lossy. The agent loses entry to the precise device output, the precise API response, and the precise row of information it pulled 5 steps in the past. By step 12, the agent is reasoning on a blurry compression of its personal previous. It then comes to a decision, will get it mistaken, and the ops staff blames the mannequin.
The mechanism Memex proposes is nearer to how people truly work. Hold a compact abstract plus a secure index within the working context. Retailer the full-fidelity artifact in an exterior database underneath that index. Let the agent resolve when to dereference and pull the unique again. The reinforcement studying piece, MemexRL, trains the agent on what to summarise, what to archive, easy methods to index it, and when to retrieve. That decomposition is the actual contribution. Reminiscence in brokers will not be one downside, it’s 4, and most manufacturing stacks are nonetheless fixing zero of them.
Additionally Learn:Â Generalist or specialist? Constructing future-proof abilities within the age of AI
This issues far past analysis benchmarks. In provide chain operations, the canonical process at a freight audit firm is: learn a service bill, match it towards a contract, towards a tariff sheet, towards historic patterns for a similar lane, then flag the overcharge. A single audit can contact 15 paperwork and ten device calls. With out listed reminiscence, by the point the agent will get to the dispute drafting step, it can’t bear in mind which clause within the contract justified the flag. So it both fabricates or provides up. Each are unacceptable when the output goes to a CFO.
The broader sign right here, for anybody constructing B2B AI in 2026, is that the moat is not the mannequin. Frontier capabilities are commoditising quick. The moat is the reminiscence structure, the info plumbing, and the retrieval self-discipline across the agent. Groups that deal with context as one thing to develop will maintain paying inference prices to push extra tokens on the downside. Groups that deal with context as one thing to control, with specific indices, will ship brokers that really end the duty.
The subsequent take a look at is whether or not reminiscence turns into a primitive in agent frameworks the way in which state is in databases. Proper now everyone seems to be rolling their very own. That won’t final one other 12 months.
—
Editor’s notice: e27 goals to foster thought management by publishing views from the neighborhood. You too can share your perspective by submitting an article, video, podcast, or infographic.
The views expressed on this article are these of the writer and don’t essentially mirror the official coverage or place of e27.
Be part of us on WhatsApp, Instagram, Fb, X, and LinkedIn to remain related.
The put up The moat is not the mannequin, it’s the reminiscence appeared first on e27.










