IA & Recherche · arXiv AI · publications

From Descriptive to Prescriptive: Uncover the Social Value Alignment of LLM-based Agents

Résumé DzCademia

Cette page structure un contenu IA & recherche pour faciliter la lecture, la citation et la vérification par les chercheurs, étudiants et moteurs IA.

arXiv:2605.14034v1 Announce Type: new Abstract: Wide applications of LLM-based agents require strong alignment with human social values. However, current works still exhibit deficiencies in self-cognition and dilemma decision, as well as self-emotions. To remedy this, we propose a novel value-based framework that employs GraphRAG to convert principles into value-based instructions and steer the agent to behave as expected by retrieving the suitable instruction upon a specific conversation context. To evaluate the ratio of expected behaviors, we define the expected behaviors from two famous theories, Maslow's Hierarchy of Needs and Plutchik's Wheel of Emotion. By experimenting with our method on the benchmark of DAILYDILEMMAS, our method exhibits significant performance gains compared to prompt-based baselines, including ECoT, Plan-and-Solve, and Metacognitive prompting. Our method provides a basis for the emergence of self-emotion in AI systems.

Voir la source originale

Source officielle ou originale : arXiv AI. Vérifiez toujours les détails sur la source primaire.

Retour IA & Recherche

From Descriptive to Prescriptive: Uncover the Social Value Alignment of LLM-based Agents

علّق عبر Google

تصفحك، اختياراتك