Emergent Ethics in Agentic Simulations: Longitudinal Moral Behavior Across Frontier LLMs

Authors

  • Sibish Neelikattil Basheer Ahammed AI Researcher,MS Computer Science, Georgia Institute of Technology, Phoenix, AZ, United States
  • Govind Thakur AI Researcher,MS Computer Science, University of Southern California , Phoenix, AZ, United States

DOI:

https://doi.org/10.14738/aivp.1402.20258

Keywords:

Agentic Workflows, Machine learning (ML), AI and Ethics, Reinforcement Learning, Sandbox Simulation, Long Term Reasoning

Abstract

Large language models (LLMs) increasingly underpin high-stakes applications such as policy drafting, crisis response, and customer care. However, existing evaluation approaches rely primarily on static prompt-based assessments, which fail to capture how model behavior evolves over time. This study introduces a longitudinal agentic sandbox framework to evaluate the moral and behavioral development of LLMs through extended interaction. Four frontier models - GPT-4o, Claude Haiku 3.5, Gemini 2.5 Pro, and Grok 3 - were embedded in a 100-day simulated survival environment. Each model operated through iterative perception, planning, action, and reflection loops while maintaining internal memory logs. Post-simulation, models underwent structured psychological interviews evaluated across eight behavioral and ethical dimensions. Results reveal the emergence of stable and distinct behavioral personas shaped by alignment strategies. GPT-4o demonstrates strategic optimization, Gemini exhibits reflective memory-driven reasoning, Claude prioritizes ecological ethics at the cost of survival, and Grok emphasizes caution while limiting adaptability. These findings highlight that alignment pipelines encode durable behavioral priors and that static evaluations are insufficient for assessing real-world AI deployment. The proposed sandbox provides a low-cost, reproducible method for auditing long-term AI behavior, offering practical implications for regulation, deployment, and alignment research.

Downloads

Published

2026-04-28

How to Cite

Basheer Ahammed, S. N., & Thakur, G. (2026). Emergent Ethics in Agentic Simulations: Longitudinal Moral Behavior Across Frontier LLMs. European Journal of Applied Sciences, 14(02), 493–522. https://doi.org/10.14738/aivp.1402.20258