Expectation Maximization Algorithm

10h

Research Shows Where Persona Prompting Works And When It Backfires

Research shows that persona prompting "reliably" damages accuracy for some types of tasks but works well in other categories.

Nature

Computer science articles from across Nature Portfolio

Large language models appear aligned, yet harmful pretraining knowledge persists as latent patterns. Here, the authors prove current alignment creates only local safety regions, leaving global ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Research Shows Where Persona Prompting Works And When It Backfires

Computer science articles from across Nature Portfolio

Trending now