Synthetic datasets are an increasingly important part of developing with large language models.
September 5, 2024
While static guards are great at filtering out predefined content like NSFW language, they struggle when faced with sophisticated attacks like jailbreak attempts, prompt injection, and more.
August 8, 2024
Tracing is a powerful observability technique that offers developers an effective way to better see what goes on inside their LLM applications.
August 8, 2024
LLM-as-a-Judge provides a solid proxy for AI SQL generation performance, especially as a quick check on results.
August 1, 2024
This blog explores the different ways you can instrument your LLM application, comparing manual and automatic instrumentation techniques, and looking into the unique benefits that OpenTelemetry (OTEL) brings to the table.
July 25, 2024
People are trusting LLMs to parse some pretty complex data, but can they even do simple time series analysis?
May 5, 2024
Most RAG research focuses on retrieval, but what can you do to get the generative part right?
April 4, 2024
Not only are LLMs bad at math, they often fail at basic numerical reasoning.
March 3, 2024
Despite developers using context length as a model qualifier, LLMs often lose information buried in long prompts.
February 2, 2024
Prompt engineering from the ground up. No, we are not LinkedIn lunatics selling you a course.
December 12, 2023
There are a lot of independent variables in RAG systems, and thoroughly testing across configurations is the only way to figure out what setup might work best for you.
November 11, 2023
Confused by serial Tweeter RAG takes? This guide breaks down the basics.
August 8, 2023