When I started my AI research, I was focused on code generation. After a while, I could see that even the most ambitious goal – building a general software engineering agent – is, in theory, a solved problem.
Start with where we are today. Replit Agent is my favourite so far. The way I see it, Replit Agent and friends already beat me on speed and cost over some time horizon, and that horizon will keep getting longer.
Consider the improvements we’re seeing in related areas. Tools like Cody and Aider show how understanding codebases as graphs makes LLMs radically more helpful.
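To make the codebase-as-graph idea concrete, here is a minimal sketch of the underlying technique: index which modules import which, then use the graph to pick the files most relevant to a change as LLM context. The function names and the tiny in-memory codebase are hypothetical illustrations, not Cody's or Aider's actual internals.

```python
# Sketch: a codebase as an import graph, used to select context for an LLM.
import ast
from collections import defaultdict

def build_import_graph(modules: dict[str, str]) -> dict[str, set[str]]:
    """Map each module name to the set of module names it imports."""
    graph: dict[str, set[str]] = defaultdict(set)
    for name, source in modules.items():
        for node in ast.walk(ast.parse(source)):
            if isinstance(node, ast.Import):
                graph[name].update(alias.name for alias in node.names)
            elif isinstance(node, ast.ImportFrom) and node.module:
                graph[name].add(node.module)
    return graph

def neighbours(graph: dict[str, set[str]], module: str) -> set[str]:
    """Modules worth showing the LLM: what this file imports,
    plus every file that imports it (likely to break if it changes)."""
    direct = graph.get(module, set())
    reverse = {m for m, deps in graph.items() if module in deps}
    return direct | reverse

# A toy three-module codebase standing in for a real repository.
modules = {
    "app": "import db\nimport auth",
    "auth": "import db",
    "db": "import os",
}
graph = build_import_graph(modules)
print(sorted(neighbours(graph, "db")))  # context set for editing db
```

Even this naive version captures the key property: a change to `db` pulls in `app` and `auth` as context, which a flat file listing would miss.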
Agents are also getting better at interacting with their environments. Anthropic just launched the Model Context Protocol – a framework for connectivity to new data sources – and the ChatGPT app can already see your editor and terminal.
The models themselves are improving: new planning-oriented models, better reasoning, greater efficiency, output quality that scales with test-time compute, faster inference… Each of these helps a general software engineering agent, and they are all happening at the same time.
Memory layers are also maturing. Letta and Zep are both amazing.
Orchestration is getting better. There are so many good new tools – I’m trying LangGraph next. That will enable more complex workflows (TDD, outside-in development) and – crucially – cognitive architectures.
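The kind of workflow I mean can be sketched as a small state machine: nodes are agent steps, edges are decisions, and the loop retries until tests pass. This is plain Python standing in for a graph orchestrator, not the LangGraph API, and every node function here is a hypothetical stand-in for an LLM call.

```python
# Sketch of a TDD agent workflow as a state machine:
# write a failing test, implement, run tests, loop until green.

def write_test(state: dict) -> str:
    state["tests"].append(f"test_{state['feature']}")  # red: test first
    return "implement"

def implement(state: dict) -> str:
    state["code"] = f"def {state['feature']}(): ..."   # stand-in for an LLM edit
    return "run_tests"

def run_tests(state: dict) -> str:
    state["passing"] = bool(state["code"])             # pretend test runner
    return "done" if state["passing"] else "implement" # loop until green

NODES = {"write_test": write_test, "implement": implement, "run_tests": run_tests}

def run(state: dict, node: str = "write_test") -> dict:
    # Walk the graph until a terminal node; each node picks its successor.
    while node != "done":
        node = NODES[node](state)
    return state

result = run({"feature": "login", "tests": [], "code": ""})
```

The point of graph orchestrators is exactly this shape – conditional edges and cycles – plus the parts this sketch omits: checkpointing, parallel branches, and human-in-the-loop interrupts.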
Then there is plain effort: putting together templates and examples for agents to train on and work with, and building software with abstractions that play to the strengths of the agents building it, rather than ones designed for humans. Remember, even though agents can seemingly work in our human-shaped codebases today, that is not the easiest path for AI at all – we will make it much easier.
I suspect a lot of engineers would prefer not to think about it, but software engineering will be solved in general, and by multiple approaches.
