vllm-project/tpu-inference

TPU inference for vLLM, with unified JAX and PyTorch support.

What it does

This project helps companies run AI models faster and more cheaply by enabling them to use Google's specialized AI chips (called TPUs) alongside the more common Nvidia graphics cards, all through a single unified system called vLLM. Spotify is already using it in production to serve AI-powered features to users, switching between chip types to balance cost and performance.

Why it matters

Relying on a single chip supplier (like Nvidia) for AI infrastructure is increasingly expensive and risky, so tools that let teams flexibly switch between hardware options give companies real negotiating power and cost control. This kind of 'hardware optionality' is quickly becoming a strategic advantage, as it allows product teams to scale AI features without being locked into one vendor's pricing or supply chain.

51Hot

Gaining traction — heating up

Stars

296

Forks

169

Contributors

Language

Python

Related projects

GoogleCloudPlatform/generative-ai

62Hot

This is Google's official collection of tutorials, code examples, and ready-to-run notebooks showing builders how to create AI-powered applications using Google's Gemini models on its cloud platform. It covers everything from basic AI conversations to complex multi-step AI agents that can reason and take actions autonomously.

// why it matters With over 15,000 stars and nearly 300 contributors, this repository signals where serious enterprise AI development is heading — Google's cloud ecosystem is positioning itself as a primary destination for teams building production AI products. For founders and PMs evaluating AI infrastructure, this gives a clear picture of Google's capabilities and provides a fast track to building on the same models powering consumer Google products.

Jupyter Notebook16.7k stars4.2k forks292 contrib

AI / Machine Learning GitHub ↗

NousResearch/hermes-agent

60Hot

Hermes Agent is an AI assistant that gets smarter the more you use it — it remembers past conversations, learns new skills from experience, and builds a profile of who you are over time, all without being tied to any single AI provider or device. It runs in the cloud and connects to messaging apps like Telegram, Slack, and WhatsApp, so you can interact with it anywhere while it handles complex tasks in the background.

// why it matters As AI assistants become a core part of how teams and products operate, the ability to avoid vendor lock-in while building a continuously improving, memory-rich agent is a significant competitive advantage — this is the kind of infrastructure layer that could sit underneath entire products or workflows. With nearly 9,000 stars and over 100 contributors, it signals strong developer demand for agents that persist, learn, and work autonomously rather than resetting with every session.

Python110.8k stars16.1k forks458 contrib

AI / Machine Learning GitHub ↗

ROCm/aiter

60Hot

AITER is AMD's open-source library of high-performance building blocks that make AI models run faster on AMD hardware, supporting everything from basic AI operations to complex training and multi-GPU coordination. Think of it as a toolbox that lets AI software teams tap into AMD's chip capabilities without having to write low-level hardware code themselves.

// why it matters As AI infrastructure costs soar, builders are actively exploring alternatives to Nvidia's dominant GPU ecosystem, and AMD is positioning AITER as the key compatibility layer that makes switching or diversifying hardware more practical. For founders and PMs building AI products, this means AMD GPUs become a more credible option for cost reduction or supply chain diversification — especially relevant as demand for AI compute continues to outpace supply.

Python412 stars289 forks200 contrib

AI / Machine Learning GitHub ↗

mvanhorn/last30days-skill

57Hot

Last30Days is a plug-in skill for the Claude AI coding assistant that automatically researches any topic across Reddit, X, YouTube, Hacker News, Polymarket, and Bluesky, then produces a cited summary of what people are actually talking about right now. Think of it as a one-command briefing tool that scans the social web for the past 30 days and distills the signal into a readable report, saved automatically to your computer.

// why it matters As AI tools and markets shift weekly, founders and product teams who can quickly validate what's gaining traction — before it becomes mainstream knowledge — have a real edge in prioritization and positioning. The 15,000+ stars suggest strong demand for ambient, automated trend intelligence baked directly into developer workflows rather than requiring separate research tools.

Python23.4k stars1.9k forks16 contrib

AI / Machine Learning GitHub ↗

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.