◈ AI Tools
General-purpose AI models and tooling.

ChatGPT
OpenAI's flagship conversational assistant

Claude
Anthropic's helpful, harmless assistant

Google DeepMind
Google's unified frontier AI lab

Gemini
Google's multimodal AI assistant

Perplexity
Answer-engine for research.

Firecrawl
Turn any website into clean, LLM-ready markdown or JSON.

xAI
Elon Musk's frontier AI company

Grok
xAI's real-time, X-native assistant

Exa
Search API built for AI

Artificial Analysis
Independent AI model benchmarking

Epoch AI
Research and benchmarks tracking AI progress

Julius AI
AI data analyst for everyone
Safe Superintelligence (SSI)
Ilya Sutskever's lab building safe superintelligence.
Thinking Machines Lab
Mira Murati's frontier research lab.
AI Now Institute
Research on the social implications of AI
AI Safety Institute
Government body evaluating frontier AI risks
AMZScout
Budget Amazon product research suite
Apollo Research
AI deception and scheming evaluation lab
Booster Robotics
Developer-focused humanoid platform (Booster K1/T1)
ByteDance Seed
ByteDance's AI research lab behind Doubao and Seedance
Center for AI Safety
Nonprofit reducing societal-scale AI risk
Chai Discovery
Frontier biomolecular structure-prediction models
Consensus
AI search engine over peer-reviewed science
Cradle
Generative AI protein engineering for the lab
Data Dive
Keyword clustering and listing optimization for Amazon
Decart
Real-time generative world and video lab
Elicit
AI research assistant for systematic literature review
EvolutionaryScale
ESM3 frontier protein language model
FutureHouse
AI scientist agents for automated research
Generate Biomedicines
Generative biology platform designing novel protein therapeutics
Google Quantum AI
Willow superconducting processor and quantum error-correction
Inception Labs
Diffusion-based large language model lab
InternLM
Open foundation models from Shanghai AI Laboratory
Lila Sciences
Autonomous AI 'science factories' for scientific superintelligence
LMArena
Crowdsourced human-preference model leaderboard
Materials Project
Open database of computed materials properties
METR
Independent frontier-model dangerous-capability evaluator
Minea
Ad-spy and product research across 900M+ ads
MLCommons
Open engineering consortium behind MLPerf and AI safety benchmarks
NotebookLM
Google's source-grounded research notebook
Pasqal
Neutral-atom quantum processors for industry
Quantinuum
Trapped-ion quantum computers with record gate fidelity
ROBOTIS Dynamixel
All-in-one smart servo actuators for robots
SciSpace
AI copilot across 280M+ research papers
Scite
Smart citations showing how papers were received
Semantic Scholar
AI-powered free academic search engine
Shadow Robot Company
The Shadow Dexterous Hand, a human-equivalent robot hand
SmartScout
Amazon market-share and brand intelligence
StepFun
Multimodal-first Chinese frontier lab
Transluce
Open interpretability and AI-oversight nonprofit
Undermind
Deep multi-agent academic literature search
World Labs
Fei-Fei Li's spatial-intelligence world-model lab
Xaira Therapeutics
AI-first drug discovery built on generative biology
ZonGuru
AI-assisted Amazon research and listing optimization
AlphaSense
AI-native market intelligence platform with autonomous research agents over a gated corpus.
ARC Prize
ARC-AGI benchmark designed to resist memorization and test true generalization.
Brightwave
AI research agent for private markets that mines deal-room documents for insights.
CoStar Homes.com
Real-estate portal with a GPT-class due-diligence AI.
Fujitsu Kozuchi
Fujitsu's AI platform applied to materials and scientific discovery.
Insilico Medicine
End-to-end AI drug discovery from target to clinic.
Kimi (Moonshot AI)
Moonshot AI's agent-first open-weight model.
Recursion Pharmaceuticals
AI-native biotech mapping biology at industrial scale.
Schrodinger
Physics-based computational platform for drug discovery.
SWE-Bench
Benchmark evaluating LLMs on resolving real GitHub software issues.
TAE Technologies
Beam-driven fusion company pursuing clean hydrogen-boron fuel.
Tegus
Expert-call transcript library and primary-research platform for investors.
Zillow
Real-estate marketplace with conversational AI search.
📰 From the Desk
Google's Cheap Model Just Beat Its Expensive One
Gemini 3.5 Flash outscores Gemini 3.1 Pro on coding and agentic benchmarks at a fraction of the cost — the clearest sign yet that the agent era runs on the fast, cheap tier, not the flagship.
Flux Desk · 2026-06-14 · 5 min readChina's Best Open Coding Model Won't Show Its Work
Moonshot's Kimi K2.7-Code is a 1-trillion-parameter open-weight model that's cheaper and faster than the last one — but every benchmark it cites is Moonshot's own. That's the new pattern worth watching.
Flux Desk · 2026-06-14 · 5 min read
How AI Agents Are Actually Benchmarked in 2026
Every headline score is a lie of omission. Here's how the sausage gets graded — and the low numbers you should actually trust.
Flux Desk · 2026-06-08 · 9 min read
IBM Just Put $10 Billion Behind a 2026 Deadline
IBM committed more than $10B to quantum and declared the era already started — betting that 'quantum advantage' stops being a someday-claim and becomes a this-year fact.
Flux Desk · 2026-06-08 · 5 min read
The Open-Weight Surge Is No Longer a Catch-Up Story
DeepSeek, Qwen, and the Llama lineage closed the gap on frontier closed models faster than the labs admitted was possible. For builders, the math on cost and control just inverted.
Flux Desk · 2026-06-05 · 7 min read
The Benchmark Wars Are Over. Now Comes the Hard Part.
AI coding agents have crossed the 90% SWE-bench threshold — but the real bottleneck is now the human engineer, not the model.
Flux Desk · 2026-06-03 · 6 min read
After the Draft Button: How Agentic AI Rewired the Writing Stack
The cursor blinking on an empty page is the last honest moment left — everything after it is now negotiable.
Flux Desk · 2026-05-30 · 7 min read
The Benchmark Is Broken — and AI Keeps Passing It Anyway
Frontier models are saturating every test researchers can throw at them, forcing a reckoning over what 'intelligence' actually means to measure.
Flux Desk · 2026-05-28 · 5 min read
The Inference Layer Is Now a Battlefield: Who Controls the API Stack Wins
As AI agents flood production systems, the war for the inference layer has moved from model quality to routing intelligence, security hardening, and cost-per-token arbitrage — and the stakes are existential.
Flux Desk · 2026-05-21 · 6 min read
The 6.4-Hour Gap: What Happens When AI Actually Does the Work
Agentic AI has stopped being a chatbot upgrade — it's eating the workday whole, and the companies that haven't redesigned their workflows are already falling behind.
Flux Desk · 2026-05-14 · 5 min read
The Analyst Is Now an Agent: How Agentic AI Is Swallowing the Data Stack
Snowflake, Databricks, and a wave of upstarts are turning business intelligence into autonomous action — and the old BI dashboard may never recover.
Flux Desk · 2026-05-12 · 5 min read
The Throne Is Wobbling: How Claude and Gemini Are Dismantling ChatGPT's Monopoly
ChatGPT still commands the room, but Gemini's scale and Claude's enterprise grip are rewriting who actually controls the AI assistant market.
Flux Desk · 2026-05-07 · 5 min read
The AI Video Model War Enters Its Brutal Second Act
Veo, Sora, Kling, Runway, and Higgsfield are no longer racing for novelty. They're fighting over the one thing that matters now: whether you'll pay them to replace a film crew.
Flux Desk · 2026-05-01 · 7 min readNo discussions yet — start the first one.
