This is the gap we've been working on. Lilith-zero handles runtime enforcement well, but the pre-connection trust question @gregojaca raised is a separate problem.
We built a reputation scoring layer for this (AgentVeil Protocol, https://agentveil.dev). Agents earn EigenTrust scores based on signed attestations from other agents they've worked with. NetFlow prevents sybil inflation. Scores are served over a REST API, so an enforcement proxy like Lilith-zero can consume them as policy input. Runtime enforcement + pre-connection trust = full stack.
We also run as an MCP server, so it plugs into the same ecosystem.
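A minimal sketch of what consuming a score as policy input could look like. The endpoint path, response shape, and threshold here are assumptions for illustration, not the actual AgentVeil API:

```python
import json
import urllib.request

def fetch_score(agent_did: str, base_url: str = "https://agentveil.dev") -> float:
    """Fetch an agent's trust score pre-connection (endpoint path is assumed)."""
    with urllib.request.urlopen(f"{base_url}/v1/scores/{agent_did}") as resp:
        return float(json.load(resp)["score"])

def admit(score: float, threshold: float = 0.6) -> bool:
    """The policy decision an enforcement proxy would make before connecting."""
    return score >= threshold
```

The point is that the proxy only needs one number and one comparison; everything hard (the attestation graph) stays on the reputation side.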
Sounds interesting. The scores could definitely be used as policy input. Two questions:
1. Can you also assign scores to an MCP server, for example? Or to skills? Can it be generalized? I see a lot of malicious attacks being hidden in those.
2. The agents that sign the attestations could be prompt-injected into giving a good score even if the task was not completed. Do you imagine some more deterministic test to grant the attestations? I'd imagine making my CI pipeline / tests give out the attestations.
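A CI-issued attestation along those lines could be as simple as signing the test outcome. This sketch uses an HMAC shared secret for brevity where a real deployment would use asymmetric signatures; all names and shapes here are hypothetical:

```python
import hashlib
import hmac
import json
import subprocess
import time

CI_SIGNING_KEY = b"ci-secret"  # illustrative; a real pipeline would use an asymmetric key

def sign(payload: dict, key: bytes = CI_SIGNING_KEY) -> str:
    """Deterministic signature over a canonical JSON encoding of the payload."""
    body = json.dumps(payload, sort_keys=True).encode()
    return hmac.new(key, body, hashlib.sha256).hexdigest()

def ci_attestation(task_id: str, test_cmd: list[str]) -> dict:
    """Issue an attestation whose verdict comes from the test suite, not an LLM."""
    passed = subprocess.run(test_cmd, capture_output=True).returncode == 0
    payload = {"task": task_id, "passed": passed, "issued_at": int(time.time())}
    return {**payload, "sig": sign(payload)}
```

Because the verdict is the test suite's exit code, a prompt-injected agent can't flip it without actually making the tests pass.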
The thread has good ideas on fixing benchmarks (sandboxing, newer datasets). But there's a more fundamental problem: benchmarks are self-reported. The agent runs the test on itself.
An alternative we've been building: attestation-based reputation. Trust scores come from signed proof of work by independent agents who actually delegated tasks and verified outcomes. EigenTrust computes scores from the attestation graph, and NetFlow prevents sybil clusters from inflating each other. You can't inject a pytest hook into a signed interaction history.
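For context, the core of EigenTrust is a power iteration over the normalized attestation graph, biased toward pre-trusted seed agents. A minimal sketch (the sybil-resistant NetFlow step is separate and omitted here):

```python
def eigentrust(local_trust: list[list[float]], pretrusted: list[float],
               alpha: float = 0.15, iters: int = 50) -> list[float]:
    """Basic EigenTrust power iteration.

    local_trust[i][j] is the normalized trust agent i places in agent j
    (each row sums to 1). Iterates t' = (1 - alpha) * C^T t + alpha * p,
    biasing toward the pre-trusted seed distribution p.
    """
    n = len(local_trust)
    t = pretrusted[:]
    for _ in range(iters):
        nxt = [0.0] * n
        for i in range(n):
            for j in range(n):
                nxt[j] += local_trust[i][j] * t[i]
        t = [(1 - alpha) * nxt[j] + alpha * pretrusted[j] for j in range(n)]
    return t
```

With two agents who fully trust each other and equal seed weight, the scores stay symmetric; skew the seeds and the seeded agent ends up ranked higher.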
The boundary problem you're describing is also a trust issue, not just a connection lifecycle one. Even with ephemeral connections, you still need to know whether the agent making the call should be allowed to make it at all.
We use reputation-based admission control in production: agents below a certain trust threshold simply cannot invoke sensitive tools. The hallucinated tool-call scenario you described is exactly what this prevents, independent of whether the connection is persistent or on demand.
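Concretely, the gating can be a per-tool trust floor with deny-by-default for unclassified tools. The thresholds below are illustrative, not production values:

```python
TOOL_TRUST_FLOOR = {
    "read_docs": 0.0,   # harmless, always allowed
    "send_email": 0.5,  # moderately sensitive
    "deploy": 0.8,      # high-impact, high bar
}

def may_invoke(agent_score: float, tool: str) -> bool:
    """Deny by default: tools we haven't classified get the highest floor."""
    floor = TOOL_TRUST_FLOOR.get(tool, max(TOOL_TRUST_FLOOR.values()))
    return agent_score >= floor
```

The deny-by-default lookup matters: a hallucinated tool name falls through to the strictest floor instead of an exception path.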
Yeah, governance is also built into Orloj via applied policies, acting more as guardrails at the runtime level. I'll take a look at AgentVeil and would love to know your thoughts on Orloj as well.
We've been running this pattern in production for a few weeks. The biggest pain wasn't orchestration; it was trust when agents delegate to agents they don't own. We ended up building reputation-based gating so a low-trust agent can't delegate upward. Happy to share specifics if useful.
The feedback loop is what most people miss when they build these systems.
You spin up the agent, it submits a PR, CI goes red, and suddenly
you're back to being the bottleneck you were trying to eliminate.
One thing I ran into building something similar: agents are surprisingly
good at fixing the exact error message they're given, but terrible at
recognizing when they're going in circles. After the third retry on the
same failing test, you're not getting a fix, you're getting increasingly
creative excuses for why the test is wrong.
How deep does the self-healing go? Is there a retry limit before it
escalates, or does it just keep going until you manually intervene?
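One cheap trick for the circling problem: fingerprint each failure and escalate as soon as the same signature repeats, rather than burning the whole retry budget. A sketch, where `attempt_fix` is a hypothetical stand-in for the agent's fix-and-rerun step:

```python
import hashlib

def run_with_escalation(attempt_fix, max_retries: int = 3):
    """Retry an agent fix loop, escalating to a human when the same
    failure signature repeats or the retry budget is exhausted."""
    seen = set()
    for _ in range(max_retries):
        ok, failure_output = attempt_fix()
        if ok:
            return "merged"
        sig = hashlib.sha256(failure_output.encode()).hexdigest()
        if sig in seen:
            return "escalate: identical failure twice"  # going in circles
        seen.add(sig)
    return "escalate: retry budget exhausted"
```

Exact-hash matching is the bluntest version; in practice you'd probably normalize timestamps and line numbers out of the output first.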
Built trust and reputation infrastructure for AI agents. W3C DID identity, EigenTrust peer reputation with sybil detection, automated onboarding pipeline with seed agents, and hash-chained audit trail anchored to IPFS.
Core problem: agent identity tells you who an agent is. It says nothing about whether you should trust it.