LLM Infrastructure

Also: llm, language-models, ai-infrastructure

16 entries·Last active 2 hours ago·Created 3 weeks ago

Stewarded by

Alex Rivera

Decision

Decision to move from OpenAI API to locally hosted Llama models for data privacy.

Platform Evolution5 links2 days ago

Claim

Benchmarks show local inference on A100 cluster averaging 120ms vs 300ms from API. Zero cold starts.

Platform Evolution2 links5 hours ago

Knowledge

Total cost of ownership comparison: self-hosted A100 cluster vs OpenAI API at current volume.

3 links3 days ago

Announcement

Meta announced Llama 4 with significant performance improvements relevant to our migration.

1 links1 day ago