AI Tools — AI tools, news & discussion

Coding · Vibe Coding

GitHub Copilot

AI pair programmer in your editor

AI Tools🔥

Cursor

The AI-first code editor

Agents & Jarvis🔥

Lovable

Build full-stack apps by chatting

Agents & Jarvis🔥

Bolt.new

Prompt, run, edit, and deploy full-stack apps

Autonomous Agents · Coding

OpenAI Codex

OpenAI's coding agent for the terminal, IDE, and cloud.

Agents & Jarvis🔥

v0

Generate UI and apps from prompts

Data & Analytics · Research

Julius AI

AI data analyst for everyone

SaaS Boilerplates · Vibe Coding

ShipFast

Next.js boilerplate to ship startups fast

Memory / RAG · Open Source

Letta (MemGPT)

The agent-memory framework from the MemGPT team.

Agent Frameworks · Coding

Braintrust

Eval and observability platform for AI & agents.

Startups & VC · Coding

Cognition

Maker of the autonomous AI software engineer Devin

Tech & Culture

Challengers · Coding

Reflection AI

Frontier open-intelligence lab from ex-DeepMind researchers

Cybersecurity · Dev Tools

Snyk

Developer-first application security

Tech & Culture

Coding · Productivity

Tabnine

Privacy-first AI code completion for enterprises

Trae

ByteDance's free AI-native IDE

Coding · Agent Frameworks

Claude Code

Anthropic's terminal coding agent for repo-wide engineering work.

Coding · Dev Tools

CodeRabbit

AI-powered automated code review for pull requests.

Autonomous Agents · Coding

Devin

Cognition's autonomous AI software engineer with a cloud agent.

Chinese Labs · Coding

GLM

Zhipu AI's frontier model trained on Huawei Ascend.

Chinese Labs · Agents

Kimi (Moonshot AI)

Moonshot AI's agent-first open-weight model.

SaaS Boilerplates · Coding

MakerKit

SaaS boilerplate with multi-provider AI components on mature auth and billing.

Chinese Labs · Open Source

Qwen (Alibaba)

Alibaba's multimodal, multilingual open-weight model family.

Vibe Coding · No-Code Agents

Replit

Browser IDE with an AI Agent that builds and deploys apps.

Dev Tools · Coding

Sentry

Application monitoring and error tracking, with agent monitoring support.

Tech & Culture

Coding · Code Review

Sourcegraph Cody

Codebase-aware AI assistant for coding and AI-assisted pull request review.

SaaS Boilerplates · Coding

Supastarter

Full-stack SaaS starter kit with pre-built AI components.

Benchmarks & Safety · Coding

SWE-Bench

Benchmark evaluating LLMs on resolving real GitHub software issues.

Vibe Coding · Vercel

v0 by Vercel

Generative UI and app scaffolding from prompts.

Devin Desktop

Cognition's agentic IDE, formerly known as Windsurf.

Flux Desk · 2026-06-14 · 5 min read

📰 From the Desk

Feature · Frontier Labs

Google's Cheap Model Just Beat Its Expensive One

Gemini 3.5 Flash outscores Gemini 3.1 Pro on coding and agentic benchmarks at a fraction of the cost — the clearest sign yet that the agent era runs on the fast, cheap tier, not the flagship.

Feature · Frontier Labs

China's Best Open Coding Model Won't Show Its Work

Moonshot's Kimi K2.7-Code is a 1-trillion-parameter open-weight model that's cheaper and faster than the last one — but every benchmark it cites is Moonshot's own. That's the new pattern worth watching.

Flux Desk · 2026-06-14 · 5 min read

Feature · Agents & Jarvis

How AI Agents Are Actually Benchmarked in 2026

Every headline score is a lie of omission. Here's how the sausage gets graded — and the low numbers you should actually trust.

Flux Desk · 2026-06-08 · 9 min read

Feature · Science

IBM Just Put $10 Billion Behind a 2026 Deadline

IBM committed more than $10B to quantum and declared the era already started — betting that 'quantum advantage' stops being a someday-claim and becomes a this-year fact.

Flux Desk · 2026-06-08 · 5 min read

Flux Desk · 2026-06-05 · 7 min read

The Open-Weight Surge Is No Longer a Catch-Up Story

DeepSeek, Qwen, and the Llama lineage closed the gap on frontier closed models faster than the labs admitted was possible. For builders, the math on cost and control just inverted.

Flux Desk · 2026-06-03 · 6 min read

The Benchmark Wars Are Over. Now Comes the Hard Part.

AI coding agents have crossed the 90% SWE-bench threshold — but the real bottleneck is now the human engineer, not the model.

Flux Desk · 2026-05-30 · 7 min read

After the Draft Button: How Agentic AI Rewired the Writing Stack

The cursor blinking on an empty page is the last honest moment left — everything after it is now negotiable.

Flux Desk · 2026-05-28 · 5 min read

The Benchmark Is Broken — and AI Keeps Passing It Anyway

Frontier models are saturating every test researchers can throw at them, forcing a reckoning over what it actually means to measure 'intelligence'.

Flux Desk · 2026-05-21 · 6 min read

The Inference Layer Is Now a Battlefield: Who Controls the API Stack Wins

As AI agents flood production systems, the war for the inference layer has moved from model quality to routing intelligence, security hardening, and cost-per-token arbitrage — and the stakes are existential.

Flux Desk · 2026-05-14 · 5 min read

The 6.4-Hour Gap: What Happens When AI Actually Does the Work

Agentic AI has stopped being a chatbot upgrade — it's eating the workday whole, and the companies that haven't redesigned their workflows are already falling behind.

Flux Desk · 2026-05-12 · 5 min read

The Analyst Is Now an Agent: How Agentic AI Is Swallowing the Data Stack

Snowflake, Databricks, and a wave of upstarts are turning business intelligence into autonomous action — and the old BI dashboard may never recover.

Flux Desk · 2026-05-07 · 5 min read

The Throne Is Wobbling: How Claude and Gemini Are Dismantling ChatGPT's Monopoly

ChatGPT still commands the room, but Gemini's scale and Claude's enterprise grip are rewriting who actually controls the AI assistant market.