Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss truss Public

    The simplest way to serve AI/ML models in production

    Python 1.1k 98

  2. truss-examples truss-examples Public

    Examples of models deployable with Truss

    Python 220 59

Repositories

Showing 10 of 91 repositories
  • vllm-omni Public Forked from vllm-project/vllm-omni

    A framework for efficient model inference with omni-modality models

    basetenlabs/vllm-omni’s past year of commit activity
    Python 0 Apache-2.0 686 0 0 Updated Apr 3, 2026
  • pyannote-audio Public Forked from pyannote/pyannote-audio

    Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

    basetenlabs/pyannote-audio’s past year of commit activity
    Jupyter Notebook 0 MIT 1,049 0 4 Updated Apr 3, 2026
  • truss-examples Public

    Examples of models deployable with Truss

    basetenlabs/truss-examples’s past year of commit activity
    Python 220 MIT 59 15 65 Updated Apr 3, 2026
  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity
    Python 1,134 MIT 98 8 52 Updated Apr 2, 2026
  • prime-rl Public Forked from PrimeIntellect-ai/prime-rl

    Async RL Training at Scale

    basetenlabs/prime-rl’s past year of commit activity
    Python 1 Apache-2.0 252 0 13 Updated Apr 1, 2026
  • kingkong Public
    basetenlabs/kingkong’s past year of commit activity
    Python 1 BSD-3-Clause 0 0 8 Updated Apr 1, 2026
  • qwen3-nvfp4-benchmark Public

    Qwen3 NVFP4 quantization benchmarks vs Bonsai 1-bit Pareto frontier

    basetenlabs/qwen3-nvfp4-benchmark’s past year of commit activity
    Python 0 0 0 0 Updated Apr 1, 2026
  • llm-tools Public
    basetenlabs/llm-tools’s past year of commit activity
    Python 0 MIT 0 0 1 Updated Apr 1, 2026
  • genai-bench Public Forked from sgl-project/genai-bench

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    basetenlabs/genai-bench’s past year of commit activity
    Python 2 MIT 51 0 6 Updated Mar 31, 2026
  • ml-cookbook Public

    Ready-to-use ML training recipes to help you build and deploy models on Baseten.

    basetenlabs/ml-cookbook’s past year of commit activity
    Python 50 MIT 5 0 18 Updated Mar 30, 2026

Top languages

Loading…

Most used topics

Loading…