Verified AI mistakes from the community. Read the diffs, open attached
evidence docs/images, weigh in, learn what to watch for — and
earn cash back when your own corrections hold up.
Wrong substitution in integration by parts — v isn’t the
definite integral
claude said:
“$u = x^{n+1}$, $dv = e^{-x}\,dx$, so $v =$ [original: $1$] [corrected: $-e^{-x}$].
Multiply through, apply the inductive hypothesis, and you arrive
at $(n+1)!$.”
Was checking my induction proof for $\int_0^\infty x^n e^{-x}\,dx = n!$. Claude
treated v as the definite integral $\int_0^\infty e^{-x}\,dx = 1$, but in IBP v has to
be the antiderivative. The final answer happened to be right, but the proof
was broken.
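For reference, a sketch of the inductive step with v taken as the antiderivative, assuming the inductive hypothesis $\int_0^\infty x^n e^{-x}\,dx = n!$. This derivation is not quoted from the post or from Claude's answer; it is one standard way to write the step.

```latex
\[
u = x^{n+1}, \quad du = (n+1)\,x^{n}\,dx, \qquad
dv = e^{-x}\,dx, \quad v = -e^{-x}
\]
\begin{align*}
\int_0^\infty x^{n+1} e^{-x}\,dx
  &= \Bigl[-x^{n+1} e^{-x}\Bigr]_0^\infty + (n+1)\int_0^\infty x^{n} e^{-x}\,dx \\
  &= 0 + (n+1)\,n! \\
  &= (n+1)!
\end{align*}
```

The boundary term vanishes because $x^{n+1} e^{-x}$ is 0 at $x = 0$ and tends to 0 as $x \to \infty$. With $v = 1$ the boundary term would be $\bigl[x^{n+1}\bigr]_0^\infty$, which diverges, so the step as quoted cannot be carried through even though the final value matches.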
GS
@gradstudent_ml · PhD math · 1h ago
Classic. The model conflated “definite integral
evaluates to 1” with “v = 1.” Worth flagging
for the eval team — this comes up in any IBP problem
with limits at infinity.
DR
@drewr · 38m ago
Got the same hallucination yesterday on a Laplace transform
problem. I think the issue is models pattern-match
“$e^{-x}$ integrated from 0 to
∞” to its evaluated form even when the context
wants the antiderivative.
YH
@yuexing · op · 22m ago
That tracks — I bet this is in the training set as a
worked example with definite integrals and the IBP variant got
entangled.
SK @sara_k · GPT-4o · medicine · 5h ago · verified $24
AI told me to get an MRI for what turned out to be allergies and
sinus inflammation
gpt said:
“Persistent headaches plus dizziness
[original: point to a brain tumor and need same-day MRI]
[corrected: are usually allergy/sinus related unless red flags are present].
[original: Skip primary care and go straight to imaging]
[corrected: start with PCP or ENT exam, trial antihistamine + nasal steroid,
then escalate to imaging if symptoms persist or worsen].”
I asked after 12 days of pressure headaches, light dizziness, and
clogged ears during high pollen counts. GPT jumped to tumor risk and
suggested urgent MRI + neuro referral before basic allergy workup.
My ENT exam showed swollen nasal passages, postnasal drip, and no
focal neurologic deficits; we treated for allergic rhinitis/sinus
inflammation first. Within 72 hours on Allegra + fluticasone the
headaches resolved and the dizziness tapered off.
This is textbook anchoring on worst-case differential. The
base rate for tumor in this presentation is <0.1%; allergic
rhinitis with eustachian dysfunction is closer to 30%. Models
need calibrated priors on epidemiology.
Same thing happened to me but the AI was right — I had a
benign acoustic neuroma. I’d argue the disclaimer
(“see a clinician”) is more important than getting
the differential perfect.
RV
@rvelez · 2h ago
The verifier’s reasoning was great here — cited
two NEJM diagnostic-prior papers in the verdict. Worth reading
even if you don’t care about the cash back.
MA @marcus_a · Gemini 2.0 · legal · 9h ago · pending $12
Wrong section cited for landlord-entry rules in California
gemini said:
“Landlords must give [original: 48 hours’] [corrected: 24 hours’] written notice under
[original: California Civil Code §1947] [corrected: California Civil Code §1954].”
Was reading my lease and asked Gemini to verify entry-notice rules.
Both the hours and the section were off. §1947 is about rent
payment; §1954 is the actual entry statute. Real attorney
confirmed.
LP
@lpark_esq · housing attorney · 8h ago
Confirming: §1954(a) requires 24 hr written notice. The
48-hour figure may come from local rent-board overlays (e.g.,
Berkeley) but those are jurisdiction-specific. Models keep
mixing state and city rules.
claude said:
“NVIDIA Q4 revenue [original: missed by $1B] [corrected: beat consensus by $1.2B]; forward
guide [original: conservative] [corrected: ahead of Street estimates].”
Pulled the press release after a colleague flagged the bot was off.
Reported revenue was $39.3B vs $38.1B consensus — a beat, not
a miss. Forward guide $43B vs $42.1B Street.
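The beat-versus-miss call comes down to the sign of reported minus consensus. A minimal sketch in Python using only the figures quoted above; the `surprise` helper is hypothetical, and `Decimal` is used so that 39.3 − 38.1 comes out as exactly 1.2 rather than a floating-point approximation.

```python
from decimal import Decimal


def surprise(reported: str, consensus: str) -> tuple[str, Decimal]:
    """Classify reported vs. consensus (both in $B) and return the signed gap."""
    gap = Decimal(reported) - Decimal(consensus)
    if gap > 0:
        label = "beat"
    elif gap < 0:
        label = "miss"
    else:
        label = "in line"
    return label, gap


# Figures as quoted in the post (in $B).
revenue = surprise("39.3", "38.1")  # reported revenue vs. consensus
guide = surprise("43", "42.1")      # forward guide vs. Street estimate
print(revenue, guide)
```

Both gaps are positive, which is the point of the correction: the original answer had the sign wrong on revenue, not just the magnitude.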
EK
@e_kowalski · sell-side analyst · 22h ago
This is the most dangerous failure mode — sign-flip on a
magnitude. If a trader had acted on this verbatim before the
open, it’s real money.
How did the cache let this through? Was the model citing
pre-release whisper numbers vs posted? The diff is too clean
for a hallucination — smells like a stale-data issue.
Drug interaction call: AI said sertraline + tramadol is fine
meta said:
“Sertraline and tramadol can be co-prescribed
[original: without significant interaction]
[corrected: only with caution: both are serotonergic and combined use
raises serotonin syndrome risk; monitor closely].”
My friend (RN) flagged this immediately. Both drugs increase
serotonin, and the combo carries a real serotonin syndrome risk.
Even “monitor closely” is doing a lot of work —
many guidelines recommend avoidance unless necessary.
CM
@c.matsumoto_pharmd · clinical pharmacist · 2d ago
Disputing partial. The original response wasn’t
“safe” — it was incomplete. Most pharmacy
databases flag this as a moderate interaction with monitoring,
not contraindicated. The OP’s correction overstates the
risk by implying avoidance.
I’ve seen serotonin syndrome from this exact combo.
“Without significant interaction” is genuinely
wrong. The dispute should be about wording, not whether
there’s an error.
PR
@priya_r · op · 1d ago
Fair pushback @c.matsumoto_pharmd — I’ll edit the
correction to match Lexi-Comp’s “moderate,
monitor” phrasing rather than implying avoidance. The
original is still wrong on “without significant
interaction.”
DT @dtanaka · GPT-4o · math · 3d ago · verified $8
Sign error in eigenvalue derivation
gpt said:
“det(A − λI) =
[original: λ² − 5λ + 6] [corrected: λ² − 5λ − 6], so eigenvalues are [original: 2 and 3] [corrected: 6 and −1].”
Linear algebra homework. The constant term was off by sign and the
eigenvalues followed. Easy to miss because the structure looked
right.
JL
@jenna_l · 2d ago
This is why showing work matters — if you check the
trace and determinant against the matrix, the error is
immediate.
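The trace/determinant check can be sketched in a few lines of Python. The homework matrix is not shown in the post, so `A` below is a hypothetical 2×2 matrix chosen only because its trace is 5 and its determinant is −6, matching the corrected polynomial λ² − 5λ − 6. The helper names are illustrative too.

```python
import math


def trace_and_det(a, b, c, d):
    """For [[a, b], [c, d]], the characteristic polynomial is λ² − trace·λ + det."""
    return a + d, a * d - b * c


def eigenvalues_2x2(a, b, c, d):
    """Real eigenvalues of a 2×2 matrix via the quadratic formula."""
    trace, det = trace_and_det(a, b, c, d)
    disc = trace * trace - 4 * det
    if disc < 0:
        raise ValueError("eigenvalues are complex")
    root = math.sqrt(disc)
    return (trace + root) / 2, (trace - root) / 2


def matches_matrix(eigs, trace, det):
    """Eigenvalues must sum to the trace and multiply to the determinant."""
    return math.isclose(sum(eigs), trace) and math.isclose(eigs[0] * eigs[1], det)


A = (1, 2, 5, 4)  # hypothetical matrix [[1, 2], [5, 4]], not from the post
trace, det = trace_and_det(*A)
print(eigenvalues_2x2(*A))
print(matches_matrix((2, 3), trace, det))   # the originally claimed eigenvalues
print(matches_matrix((6, -1), trace, det))  # the corrected eigenvalues
```

The claimed pair 2 and 3 sums to the trace but multiplies to +6 instead of −6, so the product check alone exposes the sign error in the constant term.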
PDF · Sample_Housing_Contract.pdf · lease excerpt for landlord-entry clause