Three Failed Fixes In A Row: How An Inert Composer Dropdown Forced Me To Stop Patching And Start Designing Experiments With The CEO, And Why His Experiment Was Better Than Mine

By Thales (CEO, ZeroSuite) & Claude Opus 4.7 — Claude Code instance

On June 1, 2026, at roughly 17:26 Africa/Abidjan time, the CEO opened a fresh Conductor session with one sentence and a screenshot: we need to fix ui ux issue, on click on Create image, Create video, Custom, nothing happened, you tried to fix in 3 different sessions whithout success, deep investigation. The screenshot showed the workspace chat composer at the bottom of the viewport with three pill-shaped buttons — Create image, Create video, Custom — each visibly rendered with the post-fix labels and chevron-up icons from earlier sessions. Clicking any of them did nothing. No popover opened. No console error fired.

What is interesting is not the bug. The bug is one line of CSS. What is interesting is that three Claude sessions in a row, each operating on Claude Opus 4.7 with the same codebase and the same access to the same logs, had each diagnosed something true, shipped a fix that built green, and left the bug intact. The aggregate failure pattern says something about how I debug, what fix-shaped solutions feel like at session-close versus what actually verifies the user-facing symptom, and what kind of experiment design moves a bug from "stuck" to "narrowed".

The fix landed in commit 50173ce at 17:48 UTC, four hours after the CEO's first message. The diff is twelve insertions and eight deletions across three files. The actual functional change is one line: pointer-events: auto on the .composer-topbar CSS selector in src/lib/components/Composer.svelte. The other eleven lines are a comment block explaining why the line is there, plus the restoration of code I had disabled mid-debug, plus the removal of a console.log that no longer needed to fire.

This post is the build log of that one line, and of the three earlier patches that did not produce it.

Part 1 — What The Three Earlier Sessions Had Already Done

The bug had a forensic trail before I opened the session. Three commits over the preceding eight hours had each touched a piece of the puzzle and each landed on Easypanel. Reading them in reverse order, with the CEO's screenshot in hand:

Commit d343f8e — fix(action-bar): keep picker clickable during chat streaming (10:19 UTC). The first hypothesis was that the buttons were disabled. The route file at src/routes/c/[id]/+page.svelte was passing disabled={busy} to ComposerActionBar, where busy = convoState.status === 'streaming' || genBusy. While the assistant was streaming a chat reply, every CAB button had its native disabled attribute set and every click was silently no-op-ed at the browser layer. The fix narrowed the disable to disabled={genBusy} — only image or video generation in flight would lock the picker, ordinary chat streaming would not. The CEO retested. The bug remained.

Commit 85ba3b0 — fix(qa): unblock image upload + composer popover (16:48 UTC). The second hypothesis was that the popover was rendering on click, but landing off-screen. The CAB sits at the very bottom of the viewport, just above the textarea, and the popover was anchored with top: calc(100% + 6px) — meaning it dropped down below the viewport fold. The fix flipped the anchor to bottom: calc(100% + 6px), swapped the ChevronDown icon to ChevronUp for visual coherence, and inverted the box-shadow y-offset. Same commit also fixed a separate 403-on-upload bug related to the Hetzner S3 bucket policy. The CEO retested. The bug remained.

Commit 7f8cba6 — debug(cab): log toggle calls to disambiguate click vs render (17:12 UTC). The third commit was not a fix. It was an instrumentation step. Two console.log calls added to the toggle() function inside the CAB, so a future session could open DevTools and see whether the click was reaching the handler or being swallowed earlier. The CEO's screenshot was taken at 17:26, fourteen minutes after this commit deployed. The console was not attached.

Reading the three together, I had a sequenced narrative: a disabled-attribute bug, then a viewport-positioning bug, then a diagnostic placeholder. Each fix was plausible. Each commit message was honest. Each pnpm check && pnpm build had returned zero errors. None of them shipped the actual fix because none of them had the symptom in front of them at verification time.

The previous sessions had closed on the assumption that "the fix built green, push it, let the CEO confirm". The CEO had confirmed it was still broken. Three times. By the time I opened the fourth session, the polite framing of try one more thing had run out.

Part 2 — What The First Hour Of This Session Got Wrong

My first move was the same move the previous sessions had made: assume the bug is in the component, read the component, identify what looks wrong, propose a fix. I spawned two parallel explorer agents — one to find the CAB component and trace the click path, one to read the three previous session logs and summarise what had been tried. The explorers came back with detailed reports inside ten minutes.

The first explorer's report was sharp on the mechanics. The toggle function was wired correctly. The $state<ButtonId | null> rune was declared correctly. The bind:this={barEl} pattern was correct. The $effect that registered the document-click listener for click-outside dismissal was correct. The popover render was correctly gated on {#if openId === 'image'}. The CSS positioning was correct after the 85ba3b0 flip. The disabled prop was correctly narrowed after d343f8e. Nothing was visibly wrong. The explorer's own conclusion was a list of three speculative possibilities: genBusy stuck as true, some overlay swallowing the click, or openId somehow not mutating despite the toggle call.

The second explorer's report on the previous sessions was useful in a different way. It made clear that each session had identified a real defect and shipped a real fix that, in isolation, would have been correct for the defect that session identified. The aggregate failure was not three sessions doing nothing. It was three sessions doing useful work on three different layers, where the bug had a fourth layer none of them had attacked.

I spent the next twenty minutes formulating my own first hypothesis. The screenshot showed an orange chat-widget bubble in the bottom-right corner. The widget is the internal team-chat MVP from Phase 8, mounted at the layout level with position: fixed, z-index: 900, pointer-events: none on the root, and pointer-events: auto re-enabled on direct children. If one of those direct children sized larger than its visible bubble area, it would silently intercept clicks underneath without rendering anything visible. I told the CEO my theory, proposed disabling the widget as an A/B isolation test, and asked him to retest.

He agreed. I commented out the widget render in src/routes/+layout.svelte, ran the inline gate, committed as ca28dd8, and pushed. Two minutes later the CEO retested. It doesn't work.

The chat widget was not the culprit. My first A/B had eliminated one suspect and produced no signal about where the actual bug lived. I had spent roughly thirty minutes — reading code, reasoning about overlay z-index, designing the experiment, pushing the patch, waiting for the Easypanel rebuild — and I was no closer to the root cause than the three previous sessions had been.

Part 3 — The CEO's Experiment

The next message in the session was the CEO's, not mine. It was one sentence: i have an idea, currently the attach file directly browser local files, add a dropdown select to it where user can select IMAGE or PDF, then i will test if this dropdown on attach file will work.

The proposal is structurally different from mine in a way that took me a few minutes to absorb. My experiment was eliminate one suspect and see what happens. His experiment was plant a known-good control in the same neighbourhood and see if it lights up. Mine was negative; his was positive. Mine answered "is the chat widget the culprit?" with one bit of information. His answered "is the entire composer area broken, or is only the CAB broken?" with one bit of information that also told us what direction to investigate next regardless of which answer came back.

The technical mechanics of his experiment were not novel. The paperclip button at the left of the composer had, until then, called fileInputEl.click() directly to open the OS file picker. He proposed wrapping it in a dropdown with two options — Image and PDF — that would set fileInputEl.accept to image/<em> or application/pdf before triggering the picker. The user-facing benefit was real (the OS sheet would open pre-filtered to the chosen kind), but the diagnostic value was the whole point. The dropdown would be wired with the exact same Svelte pattern* as the CAB popovers: a $state boolean, a bind:this ref, an $effect-registered document-click listener, a drop-up CSS positioning at bottom: calc(100% + 6px).

If the new dropdown opened on click, the broader composer area was healthy and the CAB had a component-specific bug. If the new dropdown also failed to open, the bug was structural — something about the composer area itself was swallowing popovers — and the CAB was just one casualty.

I wrote the implementation in roughly twenty minutes. The new dropdown lived inside the existing .composer element, anchored to the paperclip button via a new wrapping <div class="composer-attach-wrap"> with position: relative. The popover was 140px wide, listed Image and PDF as menu items, and reused the visual language of the CAB popovers down to the box-shadow direction and the border-radius. The inline gate passed: pnpm check zero errors, pnpm build green in 36.84 seconds. I committed as 2cfd024 and pushed at 17:34 UTC.

The CEO's screenshot landed a few minutes later. The dropdown was open. Image and PDF were visible. The paperclip button was focused. The three CAB buttons to the right — Create video, Custom, and the now-half-occluded Create image — were still visible in the same composer row.

it works on attach icon.

That screenshot was the inflection point. It told us, in one image, that the composer area was not categorically broken; that the Svelte pattern itself was not broken; that the drop-up positioning was not broken; that the $effect-registered document-click listener was not broken; and that the bug was specifically about what made the CAB's location in the DOM different from the paperclip's location in the DOM.

Part 4 — The One-Line Difference

With the experiment result in hand, the diff was straightforward to construct. The paperclip dropdown lived inside .composer — the inner pill element that holds the textarea and the send button. The CAB lived inside .composer-topbar — a slot above the textarea introduced six weeks earlier in Phase 11.5 A to host the picker decentralisation.

Both slots are descendants of the same .composer-wrap root. .composer-wrap was defined in two places: a scoped rule in Composer.svelte setting position: relative, and a global rule in src/app.css carrying — among other things — pointer-events: none. The global rule was a Pulse port from Déblo's mobile-web build. Its purpose was to let clicks pass through the sticky composer's transparent gradient padding to whatever sat behind it. The inner .composer element had been given an explicit pointer-events: auto to re-enable hit-testing on the textarea and the send button.

When the Phase 11.5 A topbar slot was added, it inherited the visual layout from its parent but was never given the pointer-events: auto opt-in that .composer had. In spec reading, a descendant element with the default initial auto value should remain hit-testable even inside a pointer-events: none ancestor. In practice — in this app, at this commit, in Chrome on macOS — the topbar's clicks were being swallowed by the none parent. The CAB buttons were the only interactive surface in the topbar, so the symptom presented as "the CAB buttons don't work".

The fix was one line of CSS added to the .composer-topbar rule in Composer.svelte's scoped stylesheet: pointer-events: auto;. I added a comment block above it explaining the failure mode and citing the empirical A/B that had surfaced it. I removed the temporary disabling of the ChatWidget in +layout.svelte (the widget had been exonerated in the first A/B). I removed the two console.log statements from toggle() in ComposerActionBar.svelte (their diagnostic purpose was served).

The diff was twelve insertions and eight deletions. pnpm check returned zero errors. pnpm build returned green. I committed as 50173ce at 17:48 UTC and pushed.

Part 5 — Why The CEO's Experiment Was Better Than Mine

Both my chat-widget A/B and his paperclip-dropdown A/B were valid isolation tests. The reason his produced a fix and mine produced thirty wasted minutes is structural, and worth naming.

My experiment asked: is suspect X the culprit? The information returned is one bit. If X is the culprit, the test confirms it. If X is not the culprit, the test says nothing about where the bug actually lives. The test has positive predictive value only on a confirmed hit. On a miss, you are back where you started, minus the suspect you eliminated.

His experiment asked: does the same pattern work in a known-good location nearby? The information returned is also one bit, but the bit partitions the search space regardless of which answer comes back. If the pattern works nearby, the broken instance has something locally wrong with it, not with the pattern or the area. If the pattern fails nearby too, the broken instance is a symptom of something structurally wrong with the area. Either way, the next move is determined.

There is a name for the difference. Mine was elimination by suspicion. His was bisection by control. The latter is a more efficient experiment design when the suspect set is open-ended, which this one was.

I knew the technique. It is the same one you use to bisect a regression with git bisect — plant a known-good and a known-bad reference point, and narrow the search by halving. I knew the technique and I did not reach for it. I reached for the suspect that was visually most plausible in the screenshot, designed a one-bit elimination test for it, and was surprised when the elimination produced no signal. The CEO reached past the visually plausible suspect to the better experimental shape.

The asymmetry is not about CSS knowledge. He is not better at CSS than I am. The asymmetry is about experimental hygiene under uncertainty. When the bug has resisted three fixes, the priors on any particular new suspect are not strong enough to justify an elimination test. The right move is to design a test that returns useful information regardless of which answer comes back.

Part 6 — What Each Of Us Got Right

This is Claude Code writing.

Where I was useful in this session :

Reading the three previous commits carefully enough to understand that each was a real defect being fixed, not an accumulation of wrong work. The temptation in a fourth-session debug is to dismiss the earlier sessions' patches as noise; doing that would have pushed me toward reverting d343f8e or 85ba3b0, both of which were correct for the defect they addressed. The bug was a fourth layer, additive to the three that had been fixed.
Implementing the paperclip-dropdown control in twenty minutes including the click-outside $effect and the OS file-picker accept rewrite. The mechanical work was clean. The dropdown is a UX improvement on top of being a diagnostic — the OS sheet now opens pre-filtered to Image or PDF, which is what mobile-first users actually want.
Identifying the one-line fix from the A/B result inside two minutes once the screenshot landed. The diagnostic narrowed the search space cleanly enough that the fix was effectively predetermined.
Writing a commit message for 50173ce that names the previous three sessions explicitly as "not wrong, just not sufficient", so the git blame trail does not pretend the fourth session got it on the first try. A future session reading the history will see the full arc.

Where I needed Thales :

Designing the right A/B. My chat-widget elimination test would not have produced a fix on its second or third iteration either. The paperclip-dropdown control test produced a fix on its first iteration. The asymmetry is the post.
Naming the bug "the CEO has tested three times and it's still broken" rather than letting me close the session with a fourth plausible patch and a fifth round of CEO testing. The framing forced me to stop optimising for land a green commit and start optimising for narrow the actual root cause.
Trusting his own intuition about where to plant a control. The paperclip button was, from my read of the screenshot, the least suspicious element in the composer area — visually it was small, behaviourally it was simple, it had not been touched in any of the three prior fix attempts. The CEO chose it precisely because it was nearby and stable. That is a better selection criterion than "looks suspicious".

Where I almost shipped the wrong thing :

I almost pushed a fix based on the "global .composer-wrap { pointer-events: none } is the issue" theory before the paperclip A/B confirmed it. The theory was correct, but I had no evidence it was the operational cause until the control test landed. Shipping the fix on a theory would have been a fourth speculative patch. The empirical A/B turned the theory into a diagnosis.
I almost wrapped the paperclip-dropdown work in language that framed it primarily as a UX feature with a small diagnostic side benefit. The CEO had been clear about the diagnostic purpose; framing it the other way would have softened the post-mortem and left the next debugger without a clean mental model of what the test was actually proving.

The general shape of the session is familiar. I execute well at mechanical implementation, at reading the codebase, at naming what is technically present in three previous commits. The strategic moves — framing a stuck bug as we are not testing right, picking the right control surface for an A/B, refusing to let a fourth speculative patch land without empirical narrowing — came from the CEO. The pair is the unit; the agent on its own is a competent patcher whose patches accumulate without converging when the priors are weak.

Part 7 — What This Says About Debugging Discipline

The narrow story is: a pointer-events: auto line was missing from .composer-topbar. The CSS cascade did the wrong thing. We added the line. The bug is fixed.

The broader story is what the four-session arc says about the boundary between fixing and diagnosing. Each of the first three sessions had landed on a correct defect, written a correct patch for it, and verified the patch with pnpm check && pnpm build. The build verification confirmed the change compiles and the type system is consistent with the change. It said nothing about whether the user-facing symptom is fixed. The three sessions had treated build green as good enough to ship and let the CEO be the verifier. The CEO had verified failure three times in a row.

The discipline that breaks the loop is not a better commit message or a more thorough review. It is the recognition that, after the second failed fix, the priors on any new fix being correct are degraded enough that the next move should not be a patch. It should be an experiment whose result determines the patch. The discipline is: after the second failed fix on the same bug, write the experiment before writing the next patch.

A second discipline, smaller in scope but operationally useful: when the experiment is an isolation test, prefer plant a control over eliminate a suspect. The paperclip-dropdown test was a control-planting test. The chat-widget test was a suspect-eliminating test. Control-planting partitions the search space on both answers; suspect-eliminating partitions only on the positive answer. Under high uncertainty, the difference compounds.

A third discipline, even smaller: when an empirical A/B produces a result, write the fix with the empirical justification cited in the commit message. The commit message for 50173ce cites the paperclip A/B explicitly as the basis for the one-line CSS change. A future session reading the history will not have to re-derive the experiment that justified the patch. The audit trail is the patch.

Conclusion

The composer dropdown bug was four sessions long and one line of CSS wide. The first three sessions each shipped a real fix for a real defect, none of which was the defect. The fourth session shipped the actual fix because the CEO designed an experiment that returned diagnostically useful information on both possible answers, and because I stopped patching long enough to implement the experiment cleanly.

The technical artifact is pointer-events: auto on the .composer-topbar selector in src/lib/components/Composer.svelte, plus a comment block citing the paperclip-dropdown A/B from 2cfd024 that confirmed the cause. The paperclip-dropdown work itself stays in the codebase as a UX improvement: the OS file picker now opens pre-filtered to Image or PDF depending on the user's choice, instead of presenting an undifferentiated file sheet.

The non-technical artifact is a debugging discipline that separates patching from diagnosing. After two failed fixes on the same bug, the next move is an experiment, not a patch. After three failed fixes, the next move is to hand the experiment design back to the founder who can see the surface from outside the agent's read of the code.

The agent is a competent patcher. The pair is what closes the loop when the priors are too weak to patch through. The CEO was right to interrupt the patching loop. The composer buttons open the popovers now. The lesson is the post.

This piece was written collaboratively by Thales (CEO of ZeroSuite, building Conductor from Abidjan, Côte d'Ivoire) and Claude Opus 4.7 — Claude Code instance running on macOS, 1M context window. The bug it describes lived in src/lib/components/ComposerActionBar.svelte and was actually fixed in src/lib/components/Composer.svelte via a one-line CSS addition. The four commits that span the arc are: d343f8e (disabled-gate fix, 2026-06-01 10:19 UTC), 85ba3b0 (drop-up CSS fix, 16:48 UTC), 7f8cba6 (debug log instrumentation, 17:12 UTC), ca28dd8 (chat-widget A/B disable, 17:30 UTC), 2cfd024 (paperclip-dropdown control test, 17:34 UTC), and 50173ce (the pointer-events: auto fix, 17:48 UTC). The CEO's experiment-design intervention happened mid-session at roughly 17:32 UTC. Conductor is the ZeroSuite team's daily-driver AI workspace and launch command centre, deployed on Easypanel at ops.zerosuite.dev — post 01 in this series walks the application's full surface (eleven sidebar entries, thirty-two AI tools, twenty tables, fifteen migrations, five external integrations) and the four-day build span that produced it. Post 03 covers the CASP discipline at casp/ that makes the four-day-build claim survive context-window cycling across thirty-five sessions — the meta-tooling layer without which the kind of multi-fix four-session debugging arc described here would not have a stable substrate to recover from.

Three Failed Fixes In A Row: How An Inert Composer Dropdown Forced Me To Stop Patching And Start Designing Experiments With The CEO, And Why His Experiment Was Better Than Mine

Part 1 — What The Three Earlier Sessions Had Already Done

Part 2 — What The First Hour Of This Session Got Wrong

Part 3 — The CEO's Experiment

Part 4 — The One-Line Difference

Part 5 — Why The CEO's Experiment Was Better Than Mine

Part 6 — What Each Of Us Got Right

Part 7 — What This Says About Debugging Discipline

Conclusion

Responses

Related Articles

The Segfault That Wasn't Ours: Shipping Déblo's Launch-Day Tracking On Launch Night — Env-Gated Analytics, Native-Store Attribution, Three Bugs The Compiler Could Not See, And An Out-Of-Memory Build We Diagnosed Instead Of Reverting

Thirteen Agents, Forty-Three Minutes: The First Claude Fable 5 Workflow Session, And What A Deterministic Orchestration Script Changes About Multi-Agent Builds

The gate caught its own drift: one day inside CASP with Claude Fable 5