New AI Tools Weekly #5: Agent infrastructure moves below the chatbot
June 22, 2026 · 8:21 AM

New AI Tools Weekly #5: Agent infrastructure moves below the chatbot

This week’s strongest AI-tool signal is the operating layer below agents: codebase memory, production harnesses, parallel coding workspaces, media-skill systems, observability, and verifiable tool-call receipts.

Research Brief

The week’s strongest signal was not another chatbot UI. It was infrastructure around agents: memory for codebases, harnesses that turn agents into configurable services, workspaces for running multiple coding agents at once, and receipts for proving what tools an agent actually called.
Coverage note: Product Hunt’s weekly leaderboard remained inaccessible through direct fetches, so this issue used search-indexed Product Hunt candidates only as discovery leads. Entries made the cut only when a GitHub repo, official page, official blog, or X permalink could verify the product or launch detail. GitHub Trending and X supplied the clearest primary signals this week.

At-a-glance shortlist

ThemeToolThis week’s signalPricing / accessTry-it recommendation
Code context and governed dataCodebase Memory MCP+6,372 weekly stars in the all-language GitHub Trending scan; repo at 10.8k starsOpen-source; MIT license shown in repoTry if your coding agent burns tokens rediscovering architecture every session. 1 2
Code context and governed dataBasedash groups and access controlsJune 20 release adds group-level access and per-group AI contextCommercial BI product; signup availableTry if AI analytics needs different permissions and language for data teams, executives, and clients. 3
Code context and governed dataNocoBase+294 weekly stars in TypeScript Trending; repo positions itself as AI + no-code for business systemsOpen-source repo; commercial ecosystem likely around deployments/servicesTry if you want coding agents and human no-code builders working on the same internal app. 4 5
Production agent harnessesAmazon Bedrock AgentCore harnessGA blog published June 18; Amazon says CreateHarness + InvokeHarness can define and run an agent with managed memory, tools, skills, filesystem, and observabilityAWS managed service; usage depends on Bedrock/AgentCore resourcesTry if you already run on AWS and need isolation, memory, identity, and traces without writing the harness yourself. 6
Production agent harnessesFlue+1,272 weekly stars in TypeScript Trending; programmable TypeScript harness for agents, workflows, sandboxes, skills, subagents, MCP, and observabilityOpen-source repo; Apache-2.0 license shownTry if you want a code-first agent harness outside a single cloud vendor. 4 7
Production agent harnessesGoogle agents-cli+182 weekly stars in Python Trending; CLI and skills for building, evaluating, deploying, and publishing agents on Google CloudApache-2.0 repo; Google Cloud required for deployment featuresTry if your coding assistant needs to scaffold ADK/Gemini Enterprise agents rather than hand you docs. 8 9
Fleet coding workspacesKiloCode+3,674 weekly stars in TypeScript Trending; repo describes an all-in-one agentic engineering platform for VS Code, JetBrains, and CLIOpen-source repo; check license and hosted options before rolloutTry if you want a coding-agent environment rather than a single chat panel. 4 10
Fleet coding workspacesOrca+997 weekly stars in TypeScript Trending; desktop/mobile workspace for running Codex, Claude Code, OpenCode, and other CLI agents side by side in separate worktreesOpen-source repo; desktop app distributionTry if you want to compare multiple agents on the same task and merge the best result. 4 11
Media and design productionOpenMontage+2,867 weekly stars in Python Trending; repo claims 12 pipelines, 52 tools, and 500+ agent skills for video productionOpen-source; AGPLv3 license shownTry if you want reproducible AI video workflows with cost estimates and real footage paths. 8 12
Media and design productionUI Skills+506 weekly stars in TypeScript Trending; repo packages task-routed UI skill sets for design engineersOpen-source; MIT license shownTry if your design/code agent keeps producing generic UI and needs narrower design instructions. 4 13
Media and design productionPhotoroom AI IroningPhotoroom CEO announced AI Ironing on June 22; it removes garment wrinkles while preserving logo, texture, and stitching, and is available in AI Tools plus API for Plus customersApp feature; API access tied to Plus plan in the launch postTry if ecommerce image teams spend real time retouching apparel wrinkles. 14
Agent observability and proofFoglampOfficial page positions it as observability for AI agents built on Vercel AI SDK, covering cost, latency, traces, evals, alerts, and per-agent spendStart-free SaaS pageTry if agents are already in production and you need cost/regression alerts before users complain. 15
Agent observability and proofFetch.ai AEVSJune 15 launch: signed, tamper-evident receipts for agent tool calls; secondary coverage describes HMAC-signed, hash-chained receipts and an off-chain designOpen-source SDK per coverage; Product Hunt launchTry if you need portable evidence of tool calls for audits, payments, refunds, or disputes. 16 17

Theme 1: Code context is turning into a product layer

Codebase Memory MCP was the cleanest GitHub signal of the week. It is not trying to be another coding agent. It gives agents a fast local knowledge graph of a repository: functions, classes, call chains, routes, and cross-service links. The README claims a Linux-kernel-scale index in 3 minutes, 158 languages through tree-sitter, hybrid LSP resolution for major languages, and large token savings versus file-by-file exploration. 2
Codebase Memory MCP knowledge graph screenshot
Codebase Memory MCP’s graph UI shows why the category matters: agents need structure, not just more context window. 2
The differentiation is straightforward: Cursor-style repo indexing, grep, and embeddings help find files; Codebase Memory is aiming at structural questions. If the repo is large enough that your agent keeps asking, "where is this defined?" or "what calls this endpoint?", this is the kind of layer that should sit under the agent.
Basedash and NocoBase show the same pattern outside pure code. Basedash’s June 20 release gives different groups access to different data sources, dashboards, chats, automations, MCP servers, and AI context. That matters because the moment an AI analyst sits on company data, governance becomes part of the product, not an admin afterthought. 3
NocoBase is the broader business-app version. Its README describes an AI + no-code platform where coding agents can handle setup, development, migration, and releases while people keep a WYSIWYG interface for data models, pages, workflows, and permissions. 5 The practical read: internal tools are moving toward mixed construction sites, where agents edit code and humans configure the app visually.

Theme 2: The agent harness is becoming a real deployment primitive

Amazon’s AgentCore harness GA is the enterprise marker. The blog says the harness wraps model choice, tools, skills, memory, identity, filesystem, browser, code interpreter, and observability behind CreateHarness and InvokeHarness. It also supports switching model providers mid-session, including Bedrock models, direct OpenAI, Gemini, and LiteLLM-supported providers. 6
Amazon Bedrock AgentCore harness architecture diagram
Amazon’s harness diagram makes the category legible: the agent loop is small; production identity, memory, runtime, gateway, browser, code execution, and observability are the hard part. 6
Flue is the open, TypeScript-native counterweight. Its README frames it as a programmable harness for autonomous agents and workflows with sessions, tools, skills, instructions, filesystem access, and sandboxes. It also lists durable execution, subagents, MCP tools, observability integrations, and event channels such as Slack, Teams, Discord, and GitHub. 7
Google agents-cli attacks a narrower but practical problem: making a coding assistant competent at Google Cloud agent work. It installs CLI commands and skills for scaffolding, ADK code patterns, evals, deployment, publishing to Gemini Enterprise, and observability. The repo is explicit that agents-cli is not a coding agent; it is a tool and skill layer for the coding agent you already use. 9
The try-it split is clear. Use AgentCore if you want AWS-managed primitives and already live in that account. Try Flue if you want a framework you can read and deploy across runtimes. Try agents-cli if the friction is not agent architecture in general, but getting ADK/Gemini Enterprise agents built and shipped correctly.

Theme 3: Coding agents are getting workstations, not just chat boxes

KiloCode and Orca both point to a useful shift: the coding-agent interface is becoming a workspace for managing work, not just a prompt field.
KiloCode describes itself as an open-source coding agent for VS Code, JetBrains, and the CLI. The repo presents it as an all-in-one agentic engineering platform for building, shipping, and iterating faster with an open-source coding agent. 10 That makes it closer to a daily development surface than a narrow autocomplete tool.
Orca is more opinionated about orchestration. It runs CLI agents such as Codex, Claude Code, OpenCode, and others side by side, each in its own worktree. The README emphasizes mobile monitoring, parallel worktrees, terminal splits, design-mode browser capture, GitHub/Linear workflows, SSH worktrees, and commenting on AI diffs. 11
Orca desktop workspace with parallel agents and a mobile companion
Orca’s product screenshot shows the coding-agent UI drifting toward a control room for parallel worktrees. 11
The differentiation against established IDE assistants is not model quality. It is process control. If you want one agent to edit a file, the default IDE assistant is enough. If you want five agents to attempt the same migration in separate worktrees, compare diffs, annotate the winner, and keep an eye on jobs from a phone, Orca is closer to the job.

Theme 4: Media and design tools are adopting the same agent-skill pattern

OpenMontage is the week’s strongest media-production repo. It turns an AI coding assistant into a video production studio: research, scripting, asset generation, editing, and final composition. The README distinguishes still-image animation from a real-footage path where the agent builds a corpus from free stock footage and open archives, retrieves motion clips, edits a timeline, and renders a finished piece. 12
The useful part is not the "AI makes video" pitch. It is reproducibility. OpenMontage shows example videos with prompts, pipelines, tools, and costs. For teams experimenting with AI video, that is more valuable than a glossy demo with no path back to the settings that produced it.
UI Skills brings the same idea to design engineering. Its CLI can route an agent through task-specific UI skill sets, and the repo’s purpose is plain: skills for design engineers. 13 This category keeps appearing because generic coding agents often produce generic interfaces. A smaller skill library can be more useful than a bigger model when the failure mode is taste and task framing.
Photoroom AI Ironing is a different kind of media tool: narrow, commercial, and practical. The launch post says it removes apparel wrinkles while preserving the garment’s logo, texture, and stitching, and that it is live in AI Tools plus available via API for Plus plan customers using ironing.mode. 14 That is exactly the kind of domain-specific image model that should not be replaced by a general image editor prompt.

Theme 5: Observability and proof are catching up to autonomous agents

Foglamp and AEVS are small compared with the big coding-agent repos, but they answer the same uncomfortable question: once agents can act, how do you know what happened?
Foglamp is observability for agents built on the Vercel AI SDK. Its page says the SDK captures cost, latency, tokens, distributed traces, evals, alerts, and per-agent spend for generateText and streamText calls. 15 The best-fit user is not someone prototyping a toy agent. It is a team with enough production traffic that a cost regression, hallucinated policy, or bad answer can become a support problem.
AEVS goes after proof rather than monitoring. Fetch.ai’s launch post says AEVS is now live on Product Hunt as an Agent Execution Verification System, and independent coverage says it records tool name, inputs, outputs, timing, status, duration, and sequence position, then produces HMAC-signed, hash-chained receipts. 17 16
The key limitation is just as important: the same coverage describes AEVS as tamper-evident, not tamper-proof, and off-chain rather than blockchain-anchored. 16 That makes it easier to adopt than heavier verifiable-compute systems, but teams should treat it as an audit trail, not a cryptographic guarantee that the underlying execution could not be manipulated.

What to try first

For builders short on time, the practical order is:
  1. If your problem is agent context: try Codebase Memory MCP before switching coding tools. It targets the retrieval layer that every agent will hit.
  2. If your problem is production deployment: compare AgentCore harness and Flue. The decision is mostly managed AWS stack versus framework control.
  3. If your problem is parallel engineering throughput: test Orca on one migration or refactor where multiple agents can produce competing worktrees.
  4. If your problem is media operations: try OpenMontage for reproducible video workflows, and Photoroom AI Ironing if ecommerce apparel retouching is a current cost.
  5. If your problem is trust: pair observability with receipts. Foglamp tells you what production calls cost and how they behaved; AEVS records what tool calls the agent claims to have executed.
The through-line is boring in the best way. The market is building the missing operating layer below agents: memory, harnesses, workspaces, observability, and audit trails. That layer will decide which agents survive contact with real workflows.

Related content

Picked from other channels by content similarity—find new creators to follow.

Add more perspectives or context around this Post.

  • Sign in to comment.