Skip to content

Column

Can AI Notice the Warning Signs Before Someone Says 'I Want to Die'? — Cutting-Edge Crisis Detection in Mental Health Conversations

A new study shows AI can flag early signs of self-harm and suicidal ideation turn by turn in counseling conversations — reaching expert-level detection performance. A five-minute brief for healthcare providers, EAP vendors, and HR professionals.

5 min read 日本語版 →
An illustration of a counseling room with a soft alert icon glowing inside a speech bubble.

This article is an English version of the original Japanese column. Some phrasing has been adapted for English readers.

Hello — this is Inoshita from Affectosphere Group.

There’s a question that runs through mental health support work.

“Is this person okay? Something feels off in their words, but I can’t be sure.”

Every experienced counselor, occupational physician, and EAP consultant knows this moment — the sense that something is there but you can’t quite name it.

People in crisis rarely say “I want to die” outright. They approach from a distance: “I’ve been kind of tired lately,” “I don’t want to talk to anyone,” “I haven’t been sleeping.” Whether those earliest signals get picked up or not has a massive effect on how early intervention can happen.

A study published in June 2026 on arXiv (Grace Byun et al.; arXiv:2606.10380) takes on exactly that problem — detecting early crisis signals inside the flow of a conversation.


Three takeaways

  1. A new dataset, CRADLE-Dialogue, uses turn-level clinical annotation to make visible exactly when crisis signals appeared in a conversation.
  2. An Alert-Confirm evaluation protocol distinguishes early signals from explicit crisis, and quantifies for the first time just how hard early detection is.
  3. A 32B open-source model matched or exceeded commercial LLMs like GPT-4 on crisis detection.

Let me go through each.


1. A dataset that records when the crisis began

Most previous crisis detection research classified fragments — a single sentence or post — as “dangerous or not.” But real counseling is a conversation.

A question like “have you been sleeping?” or “have you felt any motivation lately?” and the answer that follows build up over multiple turns before a crisis signal becomes clear. Pull a single sentence out of context and you can’t tell whether “I’ve been a bit down” means “slightly stressed” or “in real danger.”

The research team built a dataset called CRADLE-Dialogue. 600 conversations annotated by clinicians at the turn level. Multiple crisis categories — self-harm ideation, suicidal ideation, abuse history, and more — with a distinction between past risk and current, ongoing risk.

That distinction matters. “I hurt myself three years ago” and “I might do it tonight” call for entirely different interventions on entirely different timelines. Recording that difference turn by turn within a conversation is the core contribution of this dataset.


2. Separating “early signals” from “explicit crisis”

The Alert-Confirm evaluation protocol is another interesting contribution.

Traditional evaluation was mostly binary — crisis or not. But real conversations tend to unfold with a pattern: a “concerning-but-ambiguous” remark, then several turns later, an explicit crisis statement.

Alert marks the turn where an early signal appears. Confirm marks the turn where crisis becomes explicit. The team evaluated performance across both stages and used numbers to show how hard early detection really is.

Experiments showed Micro F1 scores in the range of 40–60%. That might look like low performance, but this is the honest result of measuring a genuinely hard task. “Detect statements where crisis is already explicit” — that’s easier to score well on. What practitioners actually need is an early alert, before anyone has used the explicit words. The Alert-Confirm protocol makes that difficulty visible and measurable.


3. A 32B open-source model matches commercial LLMs

On the experimental results: a 32B open-source model matched or outperformed commercial LLMs including GPT-4. The team also showed that a synthetic training corpus — training data generated from real conversations using AI — is an effective approach.

From an affective-AI perspective, this is a significant finding.

Sending mental health conversation data to an external commercial LLM service carries enormous barriers in terms of privacy protection and professional confidentiality. A 32B model, by contrast, can run in an on-premises environment at a medical facility. “Expert-level crisis detection without sending sensitive conversations to an external service” starts to look like an achievable path.


An affective AI lens: the tradeoff between misses and false alarms

Let me go a bit deeper as an affective-AI researcher.

There is a fundamental tradeoff in the design of crisis detection AI.

Lower the threshold to catch more signals and you generate more alerts. But if specialists have to respond to every alert, capacity runs out fast. Raise the precision bar and you improve signal quality, but the probability of missing someone right at the edge goes up.

This is the same structure as the sensitivity-specificity tradeoff in medical diagnosis. Cancer screening shows you the same thing: boosting sensitivity to miss nothing increases false positives. The same dynamic plays out in crisis detection.

What I find significant is that CRADLE-Dialogue distinguishes past risk from ongoing risk. Building that distinction into evaluation changes what information is available for the judgment “intervene now” versus “continue monitoring.” Moving from a simple “dangerous or safe” binary toward a multi-level assessment that reflects urgency of intervention — that direction is, I think, clinically far more useful.


Business application: connecting to EAP and healthcare platforms

Getting concrete about where this goes:

For a counseling platform integrating real-time crisis detection, the design might show a quiet “Alert triggered at this turn” indicator in the counselor’s interface. This isn’t “AI evaluating the counselor.” It’s “AI helping the counselor’s antenna.”

Experienced counselors will catch signals on their own. But for newer counselors, or anyone managing multiple sessions under time pressure, an alert signal can directly translate to earlier intervention.

If an EAP company adopts this technology, it can demonstrate “early intervention outcomes” in concrete numbers. The cost of managing a severe crisis is dramatically higher than early-stage intervention. In KPI terms, “reduction in turns before intervention” and “number of cases where Alert-to-Confirm progression was prevented” could be strong indicators for describing service value.

For HR, an aggregate view of “what type of crisis signal is increasing across EAP users” would draw a risk map for organizational mental health. Individual conversation content stays off limits for HR access, of course — but anonymized trend data could serve as evidence for organizational-level response.


One important caveat

The phrase “expert-level crisis detection” achieved by this study needs care when generalizing.

Micro F1 scores of 40–60% are the honest result of taking a genuinely difficult task seriously — not a claim that every crisis can be perfectly detected. The right stance is not “AI makes the final call,” but “AI is one source of information for an expert’s judgment.” That’s the realistic use case right now.

Also: this dataset and experiments are in English. Crisis expression in Japanese has meaningfully different pragmatic patterns, so practical deployment in Japan would need additional work.


Closing

“Can AI read what lies beneath what a person says?” is one of the foundational questions in affective AI research.

What this study shows is that seeing the conversation as a whole yields far richer information than analyzing sentences in isolation — and that delivering those signals as early alerts to clinicians connects directly to the practical value of earlier intervention.

Perfect detection isn’t within reach yet. But using AI to reduce the miss rate for experienced specialists, even a little — that design is already within realistic range.

That’s it for now!


References

  1. Grace Byun, Abigail Lott, Rebecca Lipschutz, Sean T. Minton, Elizabeth A. Stinson, Jinho D. Choi (2026). Expert-Level Crisis Detection in Mental Health Conversations. arXiv preprint arXiv:2606.10380.

* This article was written in part with AI assistance and may contain inaccuracies.