  • Stanford University
  • Santa Clara
Organizations

@mlfoundations @RetroCode-Org

kobe0938/README.md

Hi, I'm Kobe 👋

🚀 Currently Maintaining/Contributing


πŸ› οΈ Previous Projects

Agents & Evaluation

  • Harbor: Agent evaluation framework and RL environment toolkit. Contributor.
  • lmcache-agent-trace: Agent application, benchmark, and workload traces for LLM serving research.
  • claude-code-tracing: Tracing tooling for Claude Code agent runs. [blog]

LLM Inference & Serving Infra

Others

  • Continuum: Multi-turn LLM agent scheduling with KV-cache time-to-live for efficient serving. Contributor. [paper]
  • VidGen: Diffusion + autoregressive models for interactive video/game generation (Diffusive AI).
  • LAG: Research experiments.
  • citation-verifier: Verifying citations produced by LLM agents (TypeScript).

Pinned

  1. LMCache/LMCache Public

    Supercharge Your LLM with the Fastest KV Cache Layer

    Python 8k 1.1k

  2. vllm-project/production-stack Public

    vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

    Python 2.3k 391

  3. LMCache/lmcache-agent-trace Public

    Agent application/benchmark/workload traces should be placed here.

    Python 10 5

  4. Inference-Engine-Arena/inference-engine-arena Public archive

    Postman & Chatbot Arena for inference benchmarking.

    Python 14