Radiant Update | March 10

Product updates from Radiant AI: Metadata-based and model-based routing, new tagging capabilities, collaboration with SkyPilot

Nitish Kulkarni, Jakob Frick & Ben Cappellacci
March 09, 2024

Our product team has been busy this week and we’re excited to share some of the updates we’ve shipped. Thank you for your continued support and collaboration, which directly helps us ship great products for our customers.

Product Updates

Metadata-based routing

You can now set routing rules based on the metadata of a request, including:

user or group id
auth header information
and more
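
To make the idea concrete, here is a minimal sketch of metadata-based routing in Python. It is not Radiant's API: the `Rule` class, the `route_by_metadata` function, the metadata keys (`group_id`, `auth_header`) and the model names are all hypothetical, chosen only to illustrate how first-match rules over request metadata could work.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    """A hypothetical routing rule: a predicate over request metadata plus a target model."""
    name: str
    matches: Callable[[dict], bool]
    target_model: str


def route_by_metadata(metadata: dict, rules: list, default_model: str) -> str:
    """Return the target of the first rule that matches, else the default model."""
    for rule in rules:
        if rule.matches(metadata):
            return rule.target_model
    return default_model


# Illustrative rules only; keys and model names are assumptions.
RULES = [
    Rule("research-group", lambda m: m.get("group_id") == "research", "large-model"),
    Rule(
        "internal-token",
        lambda m: m.get("auth_header", "").startswith("Bearer internal-"),
        "local-model",
    ),
]

if __name__ == "__main__":
    print(route_by_metadata({"group_id": "research"}, RULES, "default-model"))
```

Rules are evaluated in order, so more specific rules would go first in a setup like this one.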

Using a model-based classifier to route

You can now use an LLM-based classifier to route model requests. Any generative model available in Radiant can be used to make a routing decision. Some examples of how this can be used include:

Using a small (and cheaper) locally-hosted LLM to route certain requests to a model based on cost or security considerations

Using an LLM to route to specific models fine-tuned for a certain task
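
The pattern described above can be sketched roughly as follows. This is an illustration under stated assumptions, not Radiant's implementation: `route_with_classifier`, the label set and the model names are hypothetical, and `stub_classifier` is a keyword-based stand-in for the call to a small LLM that would return a label in a real deployment.

```python
from typing import Callable

# Hypothetical mapping from classifier labels to target models.
ROUTES = {
    "code": "code-finetuned-model",
    "sensitive": "local-private-model",
}
DEFAULT_MODEL = "general-model"


def stub_classifier(prompt: str) -> str:
    """Stand-in for a small LLM that labels a prompt; here just keyword matching."""
    text = prompt.lower()
    if "ssn" in text or "password" in text:
        return "sensitive"
    if "def " in text or "function" in text:
        return "code"
    return "general"


def route_with_classifier(
    prompt: str,
    classify: Callable[[str], str],
    routes: dict = ROUTES,
    default: str = DEFAULT_MODEL,
) -> str:
    """Ask the classifier for a label and map it to a model, falling back to the default."""
    label = classify(prompt).strip().lower()
    return routes.get(label, default)


if __name__ == "__main__":
    print(route_with_classifier("Write a function to sort a list", stub_classifier))
```

Normalizing the label and falling back to a default model keeps the router usable even when the classifier returns something unexpected.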

Redesigned Provider Interface

We introduced a new provider interface that makes it easier to navigate across providers, view models, monitor usage and set budgets at the provider or model level.

The new provider interface lets administrators easily see which endpoints are consuming the most tokens for a specific model.

Refined Endpoint tagging workflow

This week we made tagging endpoints and workflows more powerful and intuitive.

The tagging capability allows administrators to characterize endpoints and filter models, logs and usage using any number of parameters.
The refined workflow streamlines the process of adding tags to an endpoint and of creating and renaming tags.
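
As a rough illustration of how tags can drive filtering, the sketch below keeps a set of tags per endpoint and uses it to filter endpoints and usage records. The data layout, the `key:value` tag format and the function names are assumptions made for this example, not Radiant's data model.

```python
from collections import defaultdict

# Hypothetical endpoints and their tags.
ENDPOINT_TAGS = {
    "support-chat": {"team:support", "env:prod"},
    "support-chat-staging": {"team:support", "env:staging"},
    "search-summarizer": {"team:search", "env:prod"},
}

# Hypothetical usage records, one per request.
USAGE_LOG = [
    {"endpoint": "support-chat", "tokens": 1200},
    {"endpoint": "search-summarizer", "tokens": 800},
    {"endpoint": "support-chat", "tokens": 300},
    {"endpoint": "support-chat-staging", "tokens": 50},
]


def endpoints_with_tags(endpoint_tags: dict, required: set) -> list:
    """Return the sorted names of endpoints that carry every required tag."""
    return sorted(name for name, tags in endpoint_tags.items() if required <= tags)


def tokens_by_endpoint(usage_log: list, endpoint_tags: dict, required: set) -> dict:
    """Sum token usage per endpoint, restricted to endpoints with all required tags."""
    selected = set(endpoints_with_tags(endpoint_tags, required))
    totals = defaultdict(int)
    for record in usage_log:
        if record["endpoint"] in selected:
            totals[record["endpoint"]] += record["tokens"]
    return dict(totals)


if __name__ == "__main__":
    print(endpoints_with_tags(ENDPOINT_TAGS, {"env:prod"}))
    print(tokens_by_endpoint(USAGE_LOG, ENDPOINT_TAGS, {"env:prod"}))
```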

Collaboration with SkyPilot

This week we announced a collaboration with SkyPilot. SkyPilot is a rapidly growing open source project for training and serving Generative AI models. By combining a SkyPilot-deployed inference server with Radiant’s control plane, enterprises can achieve significant performance, scale and cost reduction while ensuring that all model usage comes with unified data controls, model permissions and audit logging. Read more on our blog.

New GenAI model integrations

This week we added support for Claude 3 by Anthropic, which has turned out to be one of the first serious challengers to GPT-4 when it comes to capabilities in the enterprise context.
We also added support for Groq, a compute platform with highly optimized hardware for rapid LLM execution.

About Radiant

Radiant is the Enterprise AI platform to take you from idea to production deployment. From security and governance to scaling and resiliency, we make it simple to make AI a critical part of your business.

Try out a demo here, sign up here to get your own instance, or reach out to our founders directly at [email protected].

We’re also hiring. If you know someone great who is interested in helping every company build AI into their products and operations, we’d love an introduction.