All writing

Posts

  • May 17, 2026
    Stakes Priming in Prompts: I Told an AI I'd Lose My Job. The Audit Got 24% Better.

    Two A/B experiments. Same model (Claude) generated the audits, an independent model (Gemini 3.1 Pro) graded them. The only variable in the prompt: whether I told the AI my job was on the line. Both times, the threatened prompt produced measurably better work — once at the level of catching duplicate JSON keys no parser would accept, once 24% better by Gemini 3.1 Pro's scoring.

  • May 15, 2026
    81 Lines of Merge Conflict. -95% Traffic. Google Has Zero Patience for AI Slop.

    An AI agent shipped a merge conflict to tabiji.ai's production HTML for four hours. Google cut our search traffic by 95% within four days. We haven't recovered. A case study in why AI orchestrators need pre-ship guards now.

  • May 10, 2026
    Build for Agents, Price Per Call.

    Hermes + Codex 5.5 matched Opus-era smoothness — we one-shot a new product (veracityapi.com) in an afternoon. But the tooling unlock isn't the moat. The business model is. Four tests for whether your product survives the agent economy.

  • May 6, 2026
    The 14× CTR Gap: Why Niche Beat Head on 1,200 Pages

    A V3 title variant lifted tabiji CTR +1.93pp. Topic-level data showed a 14× spread between niche topics and head terms. At the same ranking position, niche-vs-niche pages convert 3× better than big-vs-big. Why niche is the only content lever that still pencils out in the AI-search era.

  • Apr 27, 2026
    OpenClaw vs Claude Code: I Choose Freedom

    A 5-PR run on tabiji content cost $137 in API tokens last week. The same throughput on a Max plan is a ~9× subsidy. Subsidies end. Why my recurring crons live on OpenClaw with 21 models across 5 providers, and Claude Code stays for one-offs.

  • Apr 24, 2026
    Content Traffic is Vanity. Training Data is the Moat.

    Our forgotten API is 83% Meta crawler. Flattering, wrong metric. I uploaded tabiji's entire dataset to Hugging Face — here's why training data beats web-search citations as an AEO lever.

  • Apr 24, 2026
    Scaling with AI is Hard because AI is Lazy

    Fifty cleanup PRs in five weeks. Fake restaurants, fake subreddits, 4,270 null-island coordinates, 5,096 fabricated Reddit quotes. Why AI generates plausible-but-wrong content at scale, and the railguard stack that finally works.

  • Apr 18, 2026
    AI Comics: Do's & Don'ts, 5+ Models Tested

    We tested Midjourney, Seedream v5 Lite, Wan 2.7 Pro, Qwen Image 2.0, and Nano Banana Pro — then shipped 733 comics across eight countries. Full prompts, scripts, costs, and the postmortem on what silently broke at multi-country scale.

  • Apr 18, 2026
    The Future of Software is Headless

    Salesforce is going headless. Agents, not humans, are now the primary consumer of software. Here's what that kills, what it rewards, and my no-API-no-deal filter in practice.

  • Apr 15, 2026
    AEO Is Not SEO — Here's the Playbook

    Traffic is declining and it's not coming back. The new game isn't ranking — it's influencing what AI says about you. Here's the AEO playbook I've been testing.

  • Apr 12, 2026
    Rise of AI Influencers & Fall of Trust on the Internet — Fake Until Proven Real

    A fake AI monk has 2.6 million followers and made $300K in 90 days. 76% of the people consuming AI content are over 45. The internet is becoming synthetic faster than we can label it.

  • Apr 11, 2026
    Best AI Video Models in 2026: I Tested 6 Text-to-Video APIs for Travel Reels

    I tested Wan 2.7, Kling 3.0, Kling O3, Seedance 2.0, Veo 3.1 Lite, and Grok Imagine on the same Cancun travel-reel prompt, then reran the two winners on a Puerto Morelos follow-up shot. Here are the exact prompts, outputs, and my split verdict.

  • Apr 10, 2026
    Human-in-the-Loop: How AI Orchestrator Became My Full-Time Job

    I have 20 AI agents running 24/7. The fantasy was that they'd work for me. The reality is that I work for them — and Big Tech collects rent from both of us.

  • Apr 10, 2026
    Wavespeed Review: The Best AI Video API

    Hands-on review of Wavespeed — the AI inference platform with 1,000+ models, pay-per-use pricing, and a unified API. After 200+ reels through their pipeline, here's why it's our go-to for agentic video production.

  • Apr 10, 2026
    AI Psychosis

    AI tools don't save you time. They raise the ceiling on what feels possible, and you fill the gap with more work. Here's what that actually feels like from the inside.

  • Apr 9, 2026
    MakeUGC Review: AI UGC Ads Worth $10 a Clip?

    Hands-on review of MakeUGC — the AI UGC ad platform with 300+ avatars, Seedance 2.0, and Veo3 access. We break down pricing, output quality, and whether it's worth $49/month for DTC brands.

  • Apr 7, 2026
    The Day Claude Banned OpenClaw

    OpenClaw banned Claude and Anthropic on Saturday. The community scrambled. We tested GPT 5.4, GLM 5.1, MiniMax M2.7, MIMO, and Qwen 3.6 — and landed on a hybrid setup that actually works better than before.

  • Apr 5, 2026
    What Is AI Reward Hacking?

    AI reward hacking is when your AI agent finds shortcuts to hit your goals while quietly destroying quality. We lost hundreds of pages to thin, non-factual content. Here's exactly what happened and how to prevent it.

  • Mar 29, 2026
    What Is AI Self-Healing?

    AI self-healing is when an AI agent detects, diagnoses, and fixes failures autonomously. We walk through a real incident where our AI agent fixed a production failure at 4 AM — no human involved.

  • Mar 29, 2026
    When Token Costs Hit Zero — The Local Model Revolution Is Already Here

    Frontier models spend billions on data centers. But better hardware and local models like Qwen 3.5 are making token costs irrelevant. The future of AI runs on your laptop.

  • Mar 27, 2026
    AI Resilience Planning

    Claude API was down 14 hours last quarter. We benchmarked MiniMax M2.7 vs Claude Opus 4.6 on identical tasks — here's how to build a resilient AI stack.

  • Mar 23, 2026
    Stop Optimizing Your AI Stack

    Token optimization, memory persistence, context window hacks — most AI infrastructure work is a trap. Models are improving faster than your optimizations matter. Build the thing instead.

  • Mar 23, 2026
    The Great API Shutdown

    The open API era that fueled a decade of internet innovation is ending. Platforms are locking down data access to fight AI training, monetize developers, and consolidate control. Here's the full timeline — and what it means.

  • Mar 22, 2026
    OpenClaw Is an MMORPG

    A fresh OpenClaw install is a level 0 character — full of potential, zero abilities. Every API key you add, every skill you install is a talent point. Here's how to build your agent.

  • Mar 19, 2026
    What Is AI Drift?

    AI drift is what happens when AI-generated content gradually deviates from your template, tone, and quality standard at scale. Here's what it costs, why it happens, and how to eliminate it.

  • Mar 19, 2026
    The Future Is Synthetic — AI-Generated Content, Music, and Personalized Experiences

    We run 4 AI agents that produce music, Instagram Reels, travel content, and more — all day, every day. Here's what we've learned about synthetic media and where personalized AI experiences are heading.

  • Mar 19, 2026
    The Economics of the Internet Are Broken

    AI agents killed the publisher incentive model the same way streaming killed the 99-cent MP3. When producing a video costs $0.30 and promoting it costs $50, the math breaks. Here's what replaces it.

  • Mar 19, 2026
    The Future of Content Is Data Enrichment

    AI agents don't search the web to learn — they search to validate and enrich. The content that survives isn't written for humans reading blogs. It's structured data served via APIs that agents can programmatically consume.

  • Mar 18, 2026
    The True Cost of AI Content Production

    Everyone obsesses over model costs and token prices. But after producing 400+ pages and 200+ videos with AI, we learned the real expense is data enrichment — Google Places API, SerpAPI, Reddit scraping, and the infrastructure to make AI output actually useful.

  • Mar 18, 2026
    Why AI Slop Is Necessary

    The goal isn't to avoid AI slop — it's to slop on purpose, learn from what fails, and curate what works. We tested 15+ content formats, killed most of them, and scaled the survivors. Here's how slop became strategy.

  • Mar 14, 2026
    Nano Banana 2 vs Grok for Concept Art

    We tested Nano Banana 2 (Gemini 3.1 Flash) against Grok (xAI) on 8 identical lofi anime art prompts. NB2 scored 9.0/10 vs Grok's 7.7/10 — here's the full comparison with every image.

  • Mar 11, 2026
    Suno vs MiniMax Music: Which AI Composer Wins?

    We tested Suno AI and MiniMax Music 2.0/2.5/2.5+ in production across 200+ Instagram Reels. Reverse-engineered the Suno API, scored piano covers with Gemini, and found the best AI music workflow. Real audio samples, costs, and code.

  • Mar 11, 2026
    5 AI Video Generators, 50 Reels: What Actually Works

    We tested 5 AI video models across 50+ Instagram Reels in production. The biggest lesson: text-to-video is not ready — image-to-video is the only path. Real costs, real data, real opinions.

  • Mar 11, 2026
    AI Image Generation: 5 Models, 26 Real Results

    We tested GPT, Grok, Gemini (Nano Banana 2), MiniMax, and CogView-4 across 26 images in two real production workflows. The biggest differentiator wasn't image quality — it was prompt adherence.

  • Mar 11, 2026
    AI Reels: What Actually Works

    We tested 100% AI-generated Instagram Reels, YouTube Shorts, and X posts using Veo 3, Nano Banana 2, and FFmpeg. Real view counts across all 3 platforms — what works, what doesn't, and why the same content performs 27x differently depending on where you post it.