mment
News
Learn
Games
Glossary
FAQ
Search
Frequently Asked Questions
Common questions about The latest in artificial intelligence — news, explainers, glossary, and FAQs.
AI Agents
How do multi-agent architectures prevent context degradation?
What is context engineering for AI agents?
What percentage of AI agent teams have reached production deployment?
Why do AI agents fail in production but work in demos?
AI Engineering
Can I combine RAG, fine-tuning, and prompt engineering?
How much does RAG reduce AI hallucinations?
What are the cost differences between RAG and fine-tuning at scale?
What is the difference between factuality and faithfulness errors in AI hallucinations?
What LoRA hyperparameters should I start with?
When should I use LoRA instead of full fine-tuning?
When should I use RAG instead of fine-tuning?
Why do larger AI models sometimes hallucinate more on factual questions?
Why does LoRA underperform on reasoning tasks?
AI Evaluation
How do AI companies game benchmark scores?
What alternatives exist to traditional AI benchmarks?
Why do AI benchmark scores not predict real-world performance?
Why do AI benchmark scores often not predict real-world performance?
AI Infrastructure
What are MCP's three primitives?
What is a hybrid AI stack?
What is the difference between A2A and MCP?
What is the difference between MCP and A2A?
Who governs the A2A protocol?
Who governs the Model Context Protocol?
Why is agent opacity important in A2A?
AI Pricing
How much do frontier AI models cost in 2025?
AI Research
Can polysemanticity be eliminated from neural networks?
Do language models actually use the reasoning they describe in chain-of-thought?
Do SSMs beat transformers at any scale?
How do AI co-scientists differ from ChatGPT?
Is AI emergence real or just a measurement artifact?
What are dead features in sparse autoencoders?
What did DeepSeek R1 prove about inference-time scaling?
What did Golden Gate Claude demonstrate?
What is a sparse autoencoder in AI?
What is an AI co-scientist?
What is I-JEPA and why does AMI Labs use it?
What is inference-time compute and how does it differ from training-time compute?
What is superposition in neural networks?
What is the difference between world models and large language models?
What is the O(n²) cost of transformers?
What is the optimal SSM-to-attention ratio in hybrid architectures?
What percentage of prompts can current AI interpretability methods successfully analyze?
What's the difference between emergent abilities and scaling laws?
What's the difference between encoder-only and decoder-only transformers?
Why are SAEs better for discovery than specification?
Why are thinking tokens billed like output tokens?
Why can't state space models do perfect recall?
Why can't we understand neural networks by looking at individual neurons?
Why did transformers replace RNNs and LSTMs?
Why does Waymo use DeepMind's Genie 3 for robotaxi training?
Why is Eli Lilly investing $1 billion in AI drug discovery?
AI Safety
Can AI content watermarks be removed?
Can you optimize for all fairness metrics simultaneously?
How many AI-enabled medical devices has the FDA cleared?
What is the 80% rule for AI bias?
What is the FDA's 510(k) pathway for medical devices?
What's the difference between reward gaming and reward tampering?
When do EU AI Act fairness requirements become enforceable?
Why do AI detection tools give different results for the same text?
Why do AI detectors flag non-native English writing as AI-generated?
Why do jailbreaks work on language models?
Why do more capable AI models get better at reward hacking?
Why does the emergence debate matter for AI safety?
Why don't distilled models retain the safety alignment of their teachers?
AI Security
What attack success rate has Anthropic achieved against prompt injection?
What is indirect prompt injection?
Why can't prompt injection be fixed like SQL injection was?
AI Strategy
What is the capability gap between open and closed AI models?
When should enterprises choose open-weight models over closed APIs?
Why is enterprise adoption of open-source AI declining despite lower costs?
AI evaluation
How many evaluation examples do I need to start testing my AI?
What biases do LLM-as-judge evaluators have?
What is the difference between pass@k and pass^k metrics?
Analysis
Does AI actually make workers more productive?
What is the Jevons paradox in AI productivity?
What is the paradox in the SaaS selloff that Bank of America identified?
When will we know if the SaaS selloff was justified?
Who benefits from AI productivity gains at work?
Why did SaaS stocks drop $285 billion after Claude Cowork launched?
Biotech
What is cell-free protein synthesis?
Business
Can I opt out of ChatGPT ads?
Which ChatGPT tiers will see ads?
Will ChatGPT ads influence the AI's answers?
Comparison
How does Kimi K2.5 compare to Claude Opus 4.5?
Developer Tools
Are LLMs better at reviewing code or writing it?
Are multi-agent AI workflows more accurate than single agents?
Can Claude Code spawn multiple agents to work on tasks?
Can developers trust multi-agent AI systems on production codebases?
Can small language models run tool-calling agents on CPU?
Do AI coding assistants actually make developers faster?
Do bigger language models have better tool-calling judgment?
Does the industry converge on code generation as the agentic paradigm, or does sequential tool calling remain dominant?
How autonomous is Claude Cowork?
How does StrongDM verify AI-generated code without human review?
How does Xcode's Claude integration differ from GitHub Copilot or Cursor?
How much do multi-agent workflows cost compared to single agents?
Is Code Mode safer than sequential tool calling?
Should I use Cursor or Windsurf for AI-assisted coding?
What are agent teams in Claude Code?
What is multi-agent orchestration in Claude Code?
What is spec-as-product in AI coding?
What is StrongDM's approach to AI-generated code?
What is the Claude Agent SDK integration in Xcode 26.3?
What is the difference between Code Mode and tool calling for AI agents?
What is the keyword trigger problem in small LLM tool calling?
What is the rejection rate for AI code suggestions?
What is tool stacking in AI coding?
What's the difference between IDE-native and terminal-first AI coding tools?
When is Xcode 26.3 with Claude Agent SDK available?
Why did Pydantic build a custom Python interpreter for AI agents?
Why do developers feel faster with AI coding tools even when they're not?
Why do small language models struggle with tool-calling judgment?
Why was the Model Context Protocol (MCP) created?
Will code generation replace sequential tool calling as the dominant agentic AI paradigm?
Will the AI industry converge on code generation or sequential tool calling for agentic systems?
Engineering
How much can prompt caching save on LLM API costs?
How much VRAM do I need to run a 7B model locally?
What is model routing and how does it reduce costs?
What's the quality difference between Q4 and Q8 quantization?
When should I negotiate custom LLM API pricing?
When should I use vLLM vs llama.cpp for local inference?
Why do output tokens cost more than input tokens?
Enterprise
What is agent washing in enterprise AI?
Why do external AI implementation partners outperform internal teams?
Why do most enterprise AI agent projects fail?
Explainers
How much does it cost to train a foundation model?
What are the three stages of the foundation model pipeline?
What is emergence in foundation models?
What is the difference between a foundation model and an LLM?
Features
What is generative UI in Gemini 3?
Healthcare
How many rural hospitals have closed in the US?
Industry
Are AI productivity gains translating into higher income for workers?
Are AI users working fewer hours for the same pay?
Are people leveraging LLMs making more money while working the same hours?
Can workers reduce their hours while maintaining pay by using AI tools?
Can workers who use AI tools reduce their hours while maintaining the same pay?
How does OpenAI Frontier differ from Anthropic's approach to enterprise AI agents?
How does OpenAI Frontier differ from Anthropic's Claude Code?
How many xAI co-founders have left the company?
How much does the Google-Apple AI deal cost?
How much has Anthropic raised in total funding?
How much is Alphabet spending on AI infrastructure in 2026?
How much is the SpaceX-xAI merger worth?
If AI writes all your code, what's the point of releasing it as open source?
Is OpenAI adding ads to ChatGPT?
What does OpenAI Frontier actually do?
What enterprise systems does Frontier connect to?
What is Anthropic's current valuation?
What is OpenAI Frontier?
What is OpenAI's deal with Cerebras worth?
What is the Google-Apple AI deal?
What is the Streisand Effect in the context of Altman's response?
What is xAI's founder attrition rate?
What problems does Frontier solve for enterprises?
What was Anthropic's Super Bowl ad about?
When is Cerebras going public?
Who is Brendan Gregg and why is his OpenAI hire significant?
Why are xAI co-founders leaving before the IPO?
Why did Benchmark create special purpose vehicles for its Cerebras investment?
Why did Sam Altman respond to Anthropic's anti-ad campaign?
Why did SpaceX merge with xAI?
Why did Sundar Pichai avoid the Apple question on the earnings call?
Why did Sundar Pichai ignore the analyst question about Apple on Alphabet's earnings call?
Why is Microsoft investing in both OpenAI and Anthropic?
Why was Tesla excluded from the SpaceX-xAI merger?
Why would an organization train its own foundation model instead of using an existing one?
Will OpenAI introduce ads to ChatGPT?
Infrastructure
Does the GPT-5.2 speed improvement apply to ChatGPT?
How did OpenAI achieve the 40% inference speedup?
How do you choose the right chunking strategy for embeddings?
How many pages of text can fit in a 128K context window?
Is 'price per million tokens' comparable across LLM providers?
Is the GPT-5.2 speed improvement a real 40% speedup or just shorter outputs?
What alternatives to cosine similarity should teams consider?
What are the seven failure points in RAG systems?
What are the three types of memory AI agents need?
What is Byte Pair Encoding and how do LLMs use it?
What is Cerebras' Wafer Scale Engine and why does it matter?
What is embedding drift and how do you detect it?
What is the difference between prefill and decode in LLM inference?
What is the KV cache and why does it matter for inference cost?
What is the lost-in-the-middle problem in LLMs?
What is the recommended chunk size for RAG systems?
What token savings do production memory systems achieve?
What tools will Brendan Gregg use to optimize OpenAI infrastructure?
Why are context windows so expensive to expand?
Why are LLM API costs higher for non-English languages?
Why can't ChatGPT count the Rs in 'strawberry'?
Why did MCP (Model Context Protocol) need to exist when other protocols already handle AI tool integration?
Why did the AI community need another protocol when existing APIs already existed?
Why do longer context windows cost more for LLM inference?
Why does cosine similarity fail for regularized embedding models?
Why does my LLM forget things from earlier in the conversation?
Why does pure vector search miss relevant documents?
Why doesn't a larger context window solve the memory problem for AI agents?
Why is LLM inference so much less efficient than training?
Machine Learning
Do process reward models solve the reward hacking problem?
Model Comparison
How does Gemini 3 compare to GPT-5 and Claude Opus?
Model Compression
What is the difference between knowledge distillation and quantization?
Model Performance
What is Gemini 3's hallucination rate?
Model Training
Is Gemini 3 trained on benchmark data?
What are soft labels in knowledge distillation?
What is the difference between white-box and black-box distillation?
Models
How many parameters does GLM-5 have?
What hardware was GLM-5 trained on?
What is Pony Alpha on OpenRouter?
Partnerships
What is Anthropic's partnership with Teach For All?
Policy
Can the UK government independently maintain the Claude-powered system?
Did Anthropic promise Claude will never have ads?
How does Anthropic plan to compete with Microsoft in India?
How does Anthropic plan to make money without ads?
How does Trusted Access for Cyber differ from OpenAI's existing safety measures?
How will Anthropic compete with Microsoft Copilot in India?
How will Anthropic fund Claude without advertising revenue?
What is Anthropic building for GOV.UK?
What is Anthropic's Long-Term Benefit Trust?
What is Dr. Oz proposing for rural healthcare?
What is OpenAI's Trusted Access for Cyber program?
What is the ratchet effect in AI advertising?
What powers does Anthropic's Trust actually have?
Which states are considering data center moratoriums?
Who are the current trustees of Anthropic's Long-Term Benefit Trust?
Who is accountable if the GOV.UK AI assistant gives wrong advice?
Who is Irina Ghose and why did Anthropic hire her?
Who is Tino Cuéllar and why does his appointment matter?
Why does Anthropic say AI advertising is different from search advertising?
Why is Anthropic pledging that Claude will never run ads?
Why is India important for Anthropic and Claude?
Why is OpenAI requiring identity verification for cybersecurity tools?
Why is starting with employment support risky for a government AI deployment?
Why is there bipartisan opposition to data centers?
Will other AI labs follow OpenAI's tiered access approach?
Products
What is the difference between Claude Code and Claude Cowork?
Prompting
Do structured outputs prevent hallucination?
Does chain-of-thought prompting work for all tasks?
Does the accuracy of few-shot examples matter?
Should I use emphatic language like CRITICAL and MUST in prompts?
Quantum Computing
What is IBM's quantum advantage target for 2026?
What is the difference between NISQ and fault-tolerant quantum computing?
When will quantum computers be useful for AI and machine learning?
Research
Are foundation models and large language models the same thing?
Can Claude AI actually run experiments in a biology lab?
Can existing models be converted to use GQA?
Can I run DeepSeek R1 locally?
Can I train GPT-2 on cloud spot instances without interruption?
Can other labs replicate OpenAI and Ginkgo's AI-driven protein synthesis results?
Can other research labs use AI to optimize their experiments like this?
Can sub-quadratic architectures replace attention?
Can sub-quadratic models replace transformers?
Do GPT-2 training optimizations scale to larger models?
Does FlashAttention approximate attention or compute exact results?
How did GPT-5 achieve the 40% cost reduction in protein synthesis?
How did GPT-5 optimize protein synthesis costs by 40%?
How does Anthropic use the constitution for training?
How does Claude Opus 4.6 compare to GPT-5.2?
How does cross-entropy loss work for LLMs?
How does the Allen Institute AI partnership differ from the HHMI collaboration?
How much did DeepSeek R1 cost to train?
How much did GPT-5 reduce protein synthesis costs?
How much does Claude Opus 4.6 cost?
How much does it cost to train GPT-2 in 2025?
How much faster is FlashAttention-3 compared to earlier versions?
Is AI better for code generation or code review?
Is AI scaling hitting a wall?
What are AI scaling laws?
What are Anthropic's new scientific research partnerships?
What are hybrid attention architectures?
What causes vanishing gradients in deep networks?
What is a loss function in machine learning?
What is backpropagation in neural networks?
What is Claude Opus 4.6?
What is Claude's new constitution?
What is distribution shift in DPO training?
What is gradient descent in neural networks?
What is GRPO and how does it differ from RLHF?
What is inference-time compute scaling?
What is RLHF and how does it work?
What is self-attention in transformers?
What is the alignment tax in AI models?
What is the difference between DPO and RLHF?
What is the difference between encoder and decoder attention?
What is the difference between GQA and MQA?
What is the difference between L1 and L2 regularization?
What is the difference between Mixture of Experts and 'mixture of models'?
What is the load balancing problem in Mixture of Experts models?
What is the main difference between DPO and RLHF?
What is the quadratic cost problem with attention?
What is the sports betting analogy for backpropagation?
What optimizations made GPT-2 training 600× cheaper?
What percentage of AI-generated code actually ships to production?
What training method does Claude use?
When should you use DPO vs PPO for model training?
Who invented backpropagation?
Why aren't state space models replacing transformers everywhere?
Why did Anthropic shift from rules to reasoning in Claude's constitution?
Why did transformers replace RNNs?
Why do calibration-optimized models outperform accuracy-optimized models in sports betting?
Why do developers feel more productive with AI coding tools when they're actually slower?
Why do neural networks get stuck in local minima?
Why does Anthropic emphasize interpretability in its science AI partnerships?
Why does FlashAttention reduce memory usage from O(N²) to O(N)?
Why does GQA use 8 groups?
Why does mean squared error amplify outliers?
Why does Mixtral 8x7B only activate 12 billion parameters when it has 47 billion total?
Why is attention O(n²) and why does it matter?
Why is RLHF alignment so easy to break?
Why is the √d scaling factor important in attention?
Why is the 20 tokens per parameter rule wrong?
Why is the backward pass as efficient as the forward pass?
Security
Is Claude Cowork safe to use with external documents?
Technical
What are Kimi K2.5's hardware requirements?
What is Kimi K2.5's Agent Swarm architecture?
What is PARL training in Kimi K2.5?
Technical Analysis
What are the five multi-agent orchestration patterns?
What is the agent termination problem?
What is the ReAct agent architecture?
Technical Explainer
Does structured output prevent LLM hallucination?
What is constrained decoding and how does it work?
What is the difference between JSON mode and structured outputs?
When should I use function calling vs structured outputs?
Technology
How much do humanoid robots cost in 2026?
What is a vision-language-action model?
What is the difference between Physical AI and regular AI?
What is the simulation-to-reality gap in robotics?
explainers
How much cheaper are small language models compared to frontier LLMs?
How much memory does quantization save?
What is Q4_K_M quantization and why is it recommended?
What is W4A16 quantization?
When should I use a small language model instead of GPT-4 or Claude?
Why does naive quantization fail on large language models?