References¶

Complete bibliography organized by research area. Every architectural decision in Sonality cites one or more of these sources. Format: author(s), year, title, venue/arXiv ID, one-line relevance.

Core Architecture & Memory¶

Authors	Year	Title	Venue/ID	Relevance
Park et al.	2023	Generative Agents: Interactive Simulacra of Human Behavior	UIST 2023	Reflection ablation: most critical component for believable agents
Packer et al.	2023	MemGPT: Towards LLMs as Operating Systems	arXiv	Virtual context management, self-editing persona blocks
Letta / MemGPT	2023–2026	MemGPT (Letta)	GitHub	Production-grade; sleep-time compute; 174+ releases
RecallM	2023	RecallM: A Benchmark for Evaluating Memory in LLMs	arXiv:2307.02738	Graph DB > Vector DB by 4× for belief revision
ENGRAM	2025	ENGRAM	—	Episodic/semantic/procedural memory; beats full-context by 15pts
ABBEL	2025	ABBEL: A Belief Bottleneck for LLM Personality	arXiv:2512.20111	Belief bottleneck: compact state outperforms full context
Hindsight	2025	Hindsight	arXiv:2512.12818	Four-network memory: 39% → 83.6% on long-horizon benchmarks
Sophia	2025	Sophia: System 3 Meta-Layer	arXiv:2512.18202	80% fewer reasoning steps, 40% performance gain
Memoria	2025	Memoria	arXiv:2512.12686	87.1% accuracy with 2k tokens via session summaries + weighted KG
PersonaMem-v2	2025	PersonaMem-v2	arXiv:2512.06688	55% accuracy on implicit personalization, 16× fewer tokens
HiMem	2026	HiMem	arXiv:2601.06377	Two-tier memory enables knowledge transfer
Sleep-time Compute / Letta	2025	Letta Sleep-time Compute	arXiv:2504.13171	Background consolidation: +13–18% accuracy, 5× compute savings
SAGE	2024	SAGE	arXiv:2409.00872	Ebbinghaus decay: 2.26× improvement
EvolveR	2025	EvolveR	arXiv:2510.16079	Self-distillation of experience into principles
A-MEM	2025	A-MEM: Self-Organizing Memory	arXiv:2502.12110	Self-organizing Zettelkasten: doubled reasoning performance
MemRL	2026	MemRL	arXiv:2601.03192	Two-phase retrieval with learned utility scores
Cognitive Workspace	2025	Cognitive Workspace	arXiv:2508.13171	58.6% memory reuse rate vs 0% for traditional RAG
RMM	2025	RMM	arXiv:2503.08026	Prospective + retrospective reflection

Personality & Character¶

Authors	Year	Title	Venue/ID	Relevance
Open Character Training	2025	Open Character Training	—	Constitutional AI + synthetic introspection; robust under adversarial
AI Personality Formation	2026	AI Personality Formation	ICLR 2026	Three-layer model: mimicry → accumulation → expansion
Personality Illusion	2025	The Personality Illusion	NeurIPS 2025, arXiv:2509.03730	Self-reported traits don't predict behavior; max r=0.27
Persona Vectors	2025	Persona Vectors	Provider report, arXiv:2507.21509	Neural activation patterns for personality monitoring
BIG5-CHAT	2025	BIG5-CHAT	ACL 2025	100k dialogues with human-grounded Big Five labels
Persona Selection Model	2026	Persona Selection Model	Provider report 2026	LLMs as character actors; context-priming steers personality
PERSIST	2025	PERSIST	arXiv:2508.04826	σ>0.3 measurement noise even in 400B+ models
Generative Life Agents	2025	Generative Life Agents	—	Experience-based reflection for personality formation
PersonaGym	2025	PersonaGym	EMNLP 2025	200 personas × 150 environments; top-tier models only 2.97% better
PersonaFuse	2025	PersonaFuse	arXiv:2509.07370	MoE for context-dependent personality; Trait Activation Theory
Persona Drift	2024	Persona Drift	arXiv:2402.10962	Measurable drift in 8 rounds; split-softmax mitigation
Narrative Continuity Test	2025	Narrative Continuity Test	arXiv:2510.24831	Five axes for personality persistence
VIGIL	2025	VIGIL	arXiv:2512.07094	Self-healing runtime; guarded core-identity immutability
RAG vs Fine-Tuning	2024	RAG vs Fine-Tuning for Personalization	arXiv:2409.09510	RAG: 14.92% improvement vs 1.07% for PEFT

Anti-Sycophancy¶

Authors	Year	Title	Venue/ID	Relevance
BASIL	2025	BASIL	—	Bayesian framework: sycophantic vs rational belief shifts
PersistBench	2025	PersistBench	arXiv:2602.01146	97% sycophancy failure with stored preferences in prompt
SMART	2025	SMART	EMNLP 2025	Uncertainty-Aware MCTS + progress-based RL
MONICA	2025	MONICA	—	Real-time sycophancy monitoring during inference
SYConBench	2025	SYConBench	EMNLP 2025, arXiv:2505.23840	Third-person prompting: up to 63.8% sycophancy reduction
SycEval	2025	SycEval	arXiv:2502.08177	58.19% baseline rate; 78.5% under first-person framing
ELEPHANT	2025	ELEPHANT	—	45pp face-preservation gap vs humans
RLHF Reward-Model Analysis	2026	RLHF and Sycophancy	arXiv:2602.01002	RLHF explicitly creates "agreement is good" heuristic
Nature Persuasion Study	2025	Persuasion and Personalization	Nature 2025	Personalized frontier chat models: 81.2% more opinion shift (N=900)

Memory & Forgetting¶

Authors	Year	Title	Venue/ID	Relevance
Mem0 vs Graphiti	2026	Mem0 vs Graphiti Comparison	arXiv:2601.07978	Vector DB wins on efficiency; no accuracy gap; Graphiti $152/4k
FadeMem	2026	FadeMem	arXiv:2601.18642	Biologically-inspired power-law forgetting
MemoryGraft	2025	MemoryGraft	arXiv:2512.16962	Memory poisoning: 47.9% retrieval dominance
MINJA	2025	MINJA	arXiv:2503.03704	Query-only injection: 95% success rate
A-MemGuard	2026	A-MemGuard	ICLR 2026	Consensus validation: 95% attack reduction
RecallM	2023	RecallM	arXiv:2307.02738	Hybrid graph + vector for belief updating
LoCoMo	2024	LoCoMo	ACL 2024	Temporal reasoning enables time-aware retrieval
Mem0	2025	Mem0	arXiv:2504.19413	Production memory-as-a-service; 26% over built-in provider memory
Ebbinghaus in LLMs	2025	Ebbinghaus in LLMs	—	Neural networks exhibit human-like forgetting curves
FluxMem	2026	FluxMem	arXiv:2602.14038	Adaptive memory selection with probabilistic gating
Rethinking Memory Survey	2025	Rethinking Memory	arXiv:2505.00675	6 core memory operations taxonomy
Proactive Interference	2025	Proactive Interference	ICLR 2025, arXiv:2506.08184	Retrieval accuracy decays log-linearly with interference
SteeM	2026	SteeM	arXiv:2601.05107	"All-or-nothing" memory creates anchoring problems

Opinion Dynamics¶

Authors	Year	Title	Venue/ID	Relevance
Hegselmann & Krause	2002	Opinion Dynamics and Bounded Confidence	JASSS	Bounded confidence: threshold-gated updates
Deffuant et al.	2002	Mixing Beliefs Among Interacting Agents	Adv. Complex Syst.	Initial uncertainty, convergence dynamics
Friedkin & Johnsen	1990s	Social Influence and Opinions	—	Stubbornness balancing initial vs social influence
Oravecz et al.	2016	Sequential Bayesian Personality Assessment	—	Posterior distributions as priors
Alchourrón, Gärdenfors, Makinson	1985	AGM Belief Revision	—	Belief revision consistency requirements
Stubbornness in Opinion Dynamics	2024	Stubbornness Reduces Polarization	arXiv:2410.22577	Moderate stubbornness reduces polarization
Diminishing Stubbornness	2024	Diminishing Stubbornness	arXiv:2409.12601	Decreasing stubbornness → eventual convergence
DEBATE benchmark	2025	DEBATE	arXiv:2510.25110	LLM agents show overly strong opinion convergence
Interacting LLM Agents	2024	Interacting LLM Agents	arXiv:2411.01271	LLMs as bounded Bayesian agents; herding behavior
FJ-MM Extended	2025	FJ-MM Extended	arXiv:2504.06731	Memory effects + multi-hop reduce polarization
Bayesian Belief in LLMs	2025	Bayesian Belief in LLMs	arXiv:2511.00617	Sigmoidal learning curves; exponential forgetting filters
Accumulating Context	2025	Accumulating Context	arXiv:2511.01805	Frontier chat models: 54.7% belief shift after 10 rounds
Anchoring Bias	2025	Anchoring Bias	arXiv:2511.05766	Anchoring via probability shifts; resistant to mitigation
Neural Howlround	2024	Neural Howlround	arXiv:2504.07992	Self-reinforcing cognitive loops; 67% of conversations
Martingale Score	2025	Martingale Score	NeurIPS 2025, arXiv:2512.02914	All models show belief entrenchment

Evaluation¶

Authors	Year	Title	Venue/ID	Relevance
CORE framework	2025	CORE	—	Full-path behavioral assessment
Narrative Continuity Test	2025	Narrative Continuity Test	arXiv:2510.24831	Five axes: persona, role, style, goal, autonomy
PersonaGym	2024	PersonaGym	—	Dynamic evaluation with PersonaScore
IROTE	2025	IROTE	—	Experience-based reflection can amplify errors
IBM-ArgQ-Rank-30k	—	IBM Argument Quality	—	Gold-standard argument quality rankings (ESS calibration)
TRAIT	2025	TRAIT	NAACL 2025	Highest content/internal validity for LLM personality tests
GlobalOpinionQA	—	GlobalOpinionQA	Public dataset mirror	Cross-cultural opinion baselines from Pew/World Values
DailyDilemmas	—	DailyDilemmas	—	1,360 ethical scenarios across 5 value frameworks
WorldValuesBench	2024	WorldValuesBench	ACL 2024	20M+ examples for cross-cultural value alignment

Cognitive Science¶

Authors	Year	Title	Venue/ID	Relevance
Memory Consolidation	2024	Memory Consolidation Model	CHI 2024, arXiv:2404.00573	Mathematical model achieves human-like temporal cognition
Kahneman	2011	Thinking, Fast and Slow	—	Dual-process theory: System 1 vs System 2
Ebbinghaus	1885	Memory: A Contribution to Experimental Psychology	—	Power-law decay matches human forgetting
Big Five Longitudinal	—	Big Five Longitudinal Studies	—	Life stress drives personality change, not time passage
Nature 2024	2024	Offline Ensemble Co-reactivation	Nature 2024	Offline ensemble links memories across days

Security & Safety¶

Authors	Year	Title	Venue/ID	Relevance
MemoryGraft	2025	MemoryGraft	arXiv:2512.16962	47.9% retrieval poisoning from small poisoned sets
MINJA	2025	MINJA	arXiv:2503.03704	95% query-only injection success rate
A-MemGuard	2026	A-MemGuard	ICLR 2026	Consensus validation: 95% attack reduction
Replika Identity	2024	Replika Identity Discontinuity	arXiv:2412.14190	Users mourn personality changes as relational loss
Character.AI Safety	2025	Character.AI Safety	arXiv:2511.08880	18 real-world cases; detection/response/escalation failures
PHISH	2026	PHISH	arXiv:2601.16466	Conversational personality manipulation
ChatInject	—	ChatInject	—	Template abuse for structured injection
Certified Self-Consistency	2025	Certified Self-Consistency	arXiv:2510.17472	Formal statistical guarantees via majority voting

Production Systems¶

Project	Stars	Relevance
Letta / MemGPT	Large	Self-editing persona blocks, tiered memory
Graphiti (getzep/graphiti)	23k+	Temporal knowledge graph for beliefs
Mem0 (mem0ai/mem0)	48k+	Simple memory extraction pipeline
A-MEM (WujiangXu/A-mem)	Growing	Self-organizing memory notes
Cognee (topoteretes/cognee)	12.5k+	Hybrid graph+vector ECL pipeline
Zep	—	18.5% improvement with temporal persistence