RELAI

RELAI · 2025-08-08T16:03:48.299Z

How good (or bad) is GPT-5 — and does it matter for you?

Technology, Information and Internet

Self-Improving AI Agents

Discover all 8 employees

About us

RELAI powers self-improving AI agents. Using its lifelong optimization engine for agents, RELAI automatically turns failures, feedback, and real-world outcomes into structured lessons that continuously improve agent behavior over time. The result is an agent that gets better without losing what already works.

Website: https://relai.ai
External link for RELAI
Industry: Technology, Information and Internet
Company size: 2-10 employees
Type: Privately Held
Founded: 2024

Employees at RELAI

See all employees

Updates

RELAI

1,229 followers
5mo
Report this post
Working on AI agents and attending NeurIPS? Let's chat!
Soheil Feizi
5mo

Excited to be at NeurIPS 2025 in San Diego this year! If you’re around and want to talk AI agents, reliability, or open-source AI, ping me and let’s find a time! I’m also looking forward to giving a talk at the “Celebration of All Things Open Source” event at NeurIPS https://lnkd.in/en_vcjVS Our group will also be presenting three papers at the conference: • Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text https://lnkd.in/eKARgG7e • Localizing Knowledge in Diffusion Transformers https://lnkd.in/e4K5DTVc • A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks https://lnkd.in/eXvvW5rt Looking forward to meeting old & new friends!
Like Comment Share
RELAI

1,229 followers
6mo
Report this post
A notebook showing how to optimize AI agents using the user feedback:

Soheil Feizi
6mo

How to use expert feedback to optimize AI agents? In many real-world applications, there is no clear ground truth label for what a “good” agent response is. Often, all we have is user feedback and preferences (“this is wrong”, “missing context”, “too verbose”, etc.). This feedback is an extremely valuable supervision signal, but turning it into effective optimization of agent behavior is not straightforward: Stochasticity & replay To learn from feedback, we often need to “replay” the original sample or trace. But agentic systems (with tools, RAG, branching, etc.) are stochastic, so re-running the same input may not reproduce the same trajectory or output. Linking feedback to replays Even if we can approximate the original run, evaluating a new or re-played trace against the old feedback is non-trivial. The feedback is textual, often high-level and contextual, not a simple scalar reward. Optimizing config and structure Finally, we want to optimize both the agent configuration (prompts, hyperparameters, tools, thresholds) and the agent graph/structure (which nodes, in what order, with what routing). Jointly optimizing these under noisy, text-based feedback is a challenging learning and search problem. In this notebook, using an agentic RAG example, we show how to operationalize this: 📝 Convert user feedback on agentic runs into an annotation benchmark on RELAI 🎯 Use the Maestro agent optimizer to consume that benchmark and automatically improve both the config and the graph of the agent 🔁 Close the loop from user preference → benchmark → optimization → better agent in a reproducible, data-driven way 🔗 Notebook: https://lnkd.in/eWXRxHEz Powered by RELAI (relai.ai)

Google Colab colab.research.google.com

Like Comment Share
RELAI

1,229 followers
6mo
Report this post
One notebook to: "build -> simulate -> evaluate -> optimize" your agentic RAG! 🔗 Notebook: https://lnkd.in/eNmjFyiG
Soheil Feizi
6mo

🚀 Sharing a Colab notebook with a complete learning loop for reliable agentic RAG: 🧱 Build agentic RAG on top of your own data 🎭 Simulate with persona-based runs to stress-test your agent 🧑⚖️ Evaluate quality automatically with Critico (LLM-as-a-judge) 🎛️ Optimize configs & structure with Maestro for better performance All in a single notebook. Works with any model — just drop in your API key. 🔗 Notebook: https://lnkd.in/eu-KeShv Powered by RELAI (relai.ai)
Like Comment Share
RELAI

1,229 followers
9mo
Report this post
How good (or bad) is GPT-5 — and does it matter for you?
Soheil Feizi
9mo

How good (or bad) is GPT-5 — and does it matter for you? I’ve been seeing a lot of posts lately debating the quality of GPT-5’s responses. I tried a few of the examples people mentioned. Here’s one from my own experiment (screenshot attached): I asked GPT-5 to solve a simple arithmetic problem. It got it wrong. Then, when I allowed it to use a calculator tool, it got it right. So what does this tell us? I think many people make a mistake in framing LLMs as some kind of “AGI in a box.” That’s not how they’re meant to work and honestly, it’s not helpful to think of them that way. LLMs aren’t replacements for the thousands of sophisticated tools we already rely on in engineering, science, or other technical fields. They are language models: brilliant at pattern recognition, reasoning with context, and increasingly good at knowing when they don’t know something… and then using the right “tool” to get it right. That’s why I believe the real future is AI agents as systems where the LLM is a crucial component, but the system’s strength comes from orchestrating other agents, models, and tools via APIs, MCPs, or even a bit of Python. My favorite analogy: LLMs are like bricks in a building. You can have bricks of different shapes, sizes, and strengths but to make an amazing building, you still need tools, engineering, and expertise to assemble them into something greater. The magic isn’t just in the brick. It’s in the architecture.
Like Comment Share

RELAI

Technology, Information and Internet

Self-Improving AI Agents

About us

Employees at RELAI

Soheil Feizi

Mohammad Amin Roohi

Ali Bavafa

Priyatham Kattakinda

Updates

Join now to see what you are missing

Similar pages

Relai

Synquery

Department of Computer Science, UMD

VLM Run

Inkeep

Simple AI

Liquid AI

Sonia Health

Almond FinTech

Avoca

Browse jobs

Machine Learning Engineer jobs

Engineer jobs

Analyst jobs

Key Account Manager jobs

Marketing Specialist jobs

Marketing Manager jobs

Account Manager jobs

Scientist jobs

Digital Marketing Manager jobs

Power System Engineer jobs

Data Analyst jobs

Control Engineer jobs

Director of Analytics jobs

Director jobs

Adjunct Professor jobs

Senior Data Analyst jobs

Professor jobs

Chief Executive Officer jobs

Intern jobs

Data Science Specialist jobs