Interview

Engram emerges from stealth with continual learning AI that cuts inference costs by training models to know your world

Jun 29, 2026 with Jack Morris

Key Points

Engram emerges from stealth with a continual learning system that retrains AI models on customer-specific context daily, cutting enterprise inference costs by 96% on repetitive workflows.
Microsoft, Notion, and Harvey are design partners, chosen because their context-heavy repetitive operations compound cost savings fastest.
Morris frames near-term savings as a wedge toward the longer ambition of models that develop genuine organizational understanding over time and generalize across tasks.

Summary

Jack Morris is co-founder and head of research at Engram, which emerged from eight months of stealth last week with a continual learning AI system and backing from an undisclosed group of venture investors.

The core idea is that frontier labs compete to make one model smarter every month, but Morris argues that's the wrong axis for most enterprise use cases. What enterprises actually need is a model that knows their world better. Engram's system continuously retrains itself on a customer's specific context, effectively rewiring the model's knowledge daily rather than relying on generic capabilities.

“The model doesn't need to get smarter every month — it needs to know you better. We're working on a whole different stack: new ways of training, new ways of running the models that train themselves to know your world better and adjust to the things that you say. Our early enterprise partners that we've been working with are Microsoft, Notion, and Harvey. They're nice because they have these massive workspaces of context and they're early adopters of AI — these are the places where we can reduce costs the fastest because the workflows really are just that repetitive.”
— Jack Morris

The cost argument

The near-term commercial thesis isn't about smarter AI — it's about cheaper AI. Enterprise agents running repetitive workflows typically re-read large batches of context files from scratch on every run. Morris says Engram's approach cuts that dramatically: where an agent might read 100 files to generate a daily summary, a continually trained model might need only four. That reduction in inference cost is how Engram is finding early traction.

Design partners named at launch are Microsoft, Notion, and Harvey. Morris singles them out precisely because their workflows are highly repetitive and their workspaces context-heavy, which is where the cost savings compound fastest.

The longer bet

Morris frames the inference cost play as a near-term wedge. The longer ambition is a model that develops a genuinely deep understanding of a user or organization over time — the way a long-tenured colleague would — and can generalize across tasks rather than just retrieve known context faster. Whether Engram can sustain a technical edge as the frontier labs iterate on their own context-handling is an open question the transcript doesn't address.

Read full transcript →

Every deal, every interview. 5 minutes.

TBPN Digest delivers summaries of the latest fundraises, interviews and tech news from TBPN, every weekday.

You might also like...

Notion launches AI-native email client with inbox context from your workspace

Apr 15, 2025

GitHub COO Kyle Daigle on Copilot scale, platform openness, and why enterprise code fine-tuning rarely moves the needle

Oct 28, 2025

Operative: Lovable meets Retool for internal APIs — web app code generation for enterprise internal tooling

Jun 11, 2025