What is this?
This mirrors the Hacker News front page. Click on tags to filter / exclude. You can combine multiple tags.
Get these tags inside Hacker News itself with these open-source browser extensions for Chrome and Firefox
histre is a power tool for your knowledge. This is a demo of its auto-tagging feature. Learn about its other features here
Your Biggest Customer Might Be Your Biggest Bottleneck
• densumesh.dev
Light Sleep: Waking VMs in 200ms with eBPF and snapshots
• www.koyeb.com
How many HTTP requests/second can a single machine handle? (2024)
• binaryigor.com
Memory is slow, Disk is fast – Part 1
• www.bitflux.ai
The Bitter Lesson Is Misunderstood
• obviouslywrong.substack.com
Speed-coding for the 6502 – a simple example
• www.colino.net
LiteLLM (YC W23) is hiring a back end engineer
• www.ycombinator.com
1 point
•
1 week ago
•
No comments yet
DeepConf: Scaling LLM reasoning with confidence, not just compute
• arxiviq.substack.com
Building A16Z's Personal AI Workstation
• a16z.com
How Delphi achieved sub 100ms retrieval with Pinecone
• venturebeat.com
How to Scale Your Model: How to Think About GPUs
• jax-ml.github.io
How to Think About GPUs
• jax-ml.github.io
Why we still build with Ruby
• www.getlago.com
The trap of tech that's great in the small but not in the large
• surfingcomplexity.blog
Do things that don't scale, and then don't scale
• derwiki.medium.com
Do Things That Don't Scale (2013)
• paulgraham.com
Reverse Proxy Deep Dive: Why Load Balancing at Scale Is Hard
• startwithawhy.com
Scale: Natively compile CUDA applications for AMD GPUs
• docs.scale-lang.com
Booting 5000 Erlangs on Ampere One 192-core
• underjord.io
OpenFreeMap survived 100k requests per second
• blog.hyperknot.com
All About Transformer Inference
• jax-ml.github.io
Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?
• news.ycombinator.com
Actual LLM agents are coming
• pleias.fr
Litestar is worth a look
• www.b-list.org
How to Scale Proteomics
• www.asimov.press
Why reliability is hard at scale: learnings from infrastructure outages
• newsletter.pragmaticengineer.com
EXT4 Shows Wild Gains with Better Block Allocation Scalability in Linux 6.17
• www.phoronix.com