Final Course Report on the Philosophy and Ethics of Artificial Intelligence and Superintelligence: Background and Significance, State of Research, Reflections, and Outlook

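This report divides research on the philosophy and ethics of artificial intelligence into four dimensions: machine consciousness and moral agency at the ontological level; algorithmic fairness and governance at the societal level; value alignment and human-AI collaboration at the engineering level; and the existential risk of superintelligence, together with philosophical reflection on it, at the macro level. The framework spans technical implementation, social norms, and the future of human civilization, and gives the final course report a clear academic structure.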

87 references in total, 4 research directions

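Ontology and Moral Agency of Artificial Intelligence

This group of references examines the philosophical foundations of artificial intelligence, including the definitions of consciousness, intentionality, and thought, as well as the theoretical criteria for, and philosophical debate over, whether machines possess moral agency. Related references: Rajakishore Nath et al., 2017, among 23 references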

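Algorithmic Fairness, Bias Governance, and Social and Ethical Impact

This group of references analyzes bias and discrimination in algorithmic decision-making and their effect on social inequality, and discusses algorithmic justice, data justice, and how ethical frameworks and sociotechnical collaboration can support responsible AI deployment. Related references: Javier González-Argote et al., 2025, among 25 references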

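AI Value Alignment and Human-AI Collaboration Mechanisms

This group of references focuses on how to embed human values in AI systems, covering technical approaches to value alignment (such as RLHF), measurement methods, collaboration frameworks, and practical challenges in specific domains such as healthcare. Related references: Jianfeng Cao et al., 2025, among 13 references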

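Superintelligence Risk, Governance, and Existential Philosophical Reflection

This group of references discusses the existential risks and threats of civilizational collapse posed by artificial superintelligence (ASI), along with macro-level governance models and interdisciplinary perspectives (theology, futures studies) on AI ethical norms and the meaning of humanity's future. Related references: Nicolas J. Tanchuk et al., 2025, among 26 references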

87 related references in total

The problem of machine ethics in artificial intelligence

link.springer.com-Rajakishore Nath, Vineet Sahu, 2017-AI & SOCIETY (Zone 3, IF 4.7)

… ethics task of ensuring ethical behaviour of an artificial agent. Although, there are many philosophical issues related to artificial intelligence… is to discuss, first, whether ethics is the sort of …

Cited by 55

Introduction to the Ethics of Artificial Intelligence

books.google.com-David J. Gunkel, 2024-Handbook on the Ethics of Artificial Intelligence

… the topic of philosophical aspects of the ethics of artificial intelligence, a good place to start is with the concept of artificial intelligence itself. What exactly is artificial intelligence? It is a …

Cited by 2

An Overview of Artificial Intelligence Ethics

ieeexplore.ieee.org-Changwu Huang, Zeqi Zhang, Bifei Mao et al., 2023-IEEE Transactions on Artificial Intelligence

Artificial intelligence (AI) has profoundly changed and will continue to change our lives. AI is being applied in more and more fields and scenarios such as autonomous driving, medical care, media, finance, industrial robots, and internet services. The widespread application of AI and its deep integration with the economy and society have improved efficiency and produced benefits. At the same time, it will inevitably impact the existing social order and raise ethical concerns. Ethical issues, such as privacy leakage, discrimination, unemployment, and security risks, brought about by AI systems have caused great trouble to people. Therefore, AI ethics, which is a field related to the study of ethical issues in AI, has become not only an important research topic in academia, but also an important topic of common concern for individuals, organizations, countries, and society. This article will give a comprehensive overview of this field by summarizing and analyzing the ethical risks and issues raised by AI, ethical guidelines and principles issued by different organizations, approaches for addressing ethical issues in AI, and methods for evaluating the ethics of AI. Additionally, challenges in implementing ethics in AI and some future perspectives are pointed out. We hope our work will provide a systematic and comprehensive overview of AI ethics for researchers and practitioners in this field, especially the beginners of this research discipline.

Cited by 267

Philosophy of Artificial Intelligence

www.researchgate.net-Jerry Kaplan, 2016-Artificial Intelligence (Zone 2, IF 4.6)

What is the philosophy of AI? You might wonder why a field like AI seems to attract so much controversy. After all, other engineering disciplines—such as civil, mechanical, or electrical engineering—aren’t typically the target of vociferous criticism from various branches of the humanities. Largely,...

Cited by 8

Reconstruction of the Ethics of Artificial Intelligence Development in Islamic Philosophy and Muhammadiyah Thought

journals2.ums.ac.id-Mazwar Ismiyanto, S. Anif, Harun Joko Prayitno et al., 2026-Jurnal Penelitian Sains Teknologi

This study aims to analyze the perspectives of al-Islam and Muhammadiyah on the development of Artificial Intelligence (AI) within the framework of ethics, epistemology, and the concept of blessing (barakah). The research employs a qualitative approach using a library research method, through the analysis of literature on Islamic philosophy, Muhammadiyah thought, and studies on technology ethics and AI. The data were analyzed using content analysis and hermeneutic techniques to identify normative principles relevant to responding to AI development. The findings indicate that Islam views technology as an instrument of the human mandate of khalifah (vicegerency), which must be directed toward public welfare (maslahah), justice, and balance between worldly life and the hereafter. The concept of Islamic ethics including tawhid, adl, moral character (akhlaq), and social responsibility serves as the normative foundation for evaluating and utilizing AI. Knowledge (ilm) is understood as a religious obligation that is not morally neutral; therefore, AI development must be oriented toward truth and benefit. Meanwhile, the concept of blessing (barakah) emphasizes sustainability and the spiritual dimension in the use of technology. From the Muhammadiyah perspective, the integration of religion and science, the strengthening of education, and community empowerment constitute the primary principles in AI development. AI is positioned as a means of civilizational renewal that must be guided by ethical values to prevent injustice or dehumanization. Thus, al-Islam and Muhammadiyah offer an integrative and normative philosophical framework for directing AI development in a responsible, just, and spiritually meaningful manner.

Cited by 2

Towards a Code of Ethics for Artificial Intelligence

link.springer.com-P. Boddington, 2017-Artificial Intelligence: Foundations, Theory, and Algorithms

… far more by our ethical and philosophical landscape than by … the years by ethicists and philosophers, working in areas that … can impact on AI and the ethical dilemmas that it is likely to …

Cited by 204

Ethical Considerations in Artificial Intelligence Courses

ojs.aaai.org-Emanuelle Burton, J. Goldsmith, Sven Koenig et al., 2017-AI Magazine (Zone 4, IF 3.2)

The recent surge in interest in ethics in artificial intelligence may leave many educators wondering how to address moral, ethical, and philosophical issues in their AI courses. As instructors we want to develop curriculum that not only prepares students to be artificial intelligence practitioners, but also to understand the moral, ethical, and philosophical impacts that artificial intelligence will have on society. In this article we provide practical case studies and links to resources for use by AI educators. We also provide concrete suggestions on how to integrate AI ethics into a general artificial intelligence course and how to teach a stand-alone artificial intelligence ethics course.

Cited by 153

The Ethics of Artificial Intelligence for the Sustainable Development Goals

link.springer.com-F Mazzi, L Floridi, 2023-Philosophical Studies Series

… Philosophical Studies Series aims to provide a forum for the best current research in contemporary philosophy … illuminating ways of addressing philosophical questions and …

Cited by 17

From posthumanism to ethics of artificial intelligence

link.springer.com-Rajakishore Nath, Riya Manna, 2021-AI & SOCIETY (Zone 3, IF 4.7)

… our journey from the genealogical traces of posthumanistic movement and its influence on the contemporary philosophy. Later, this will help us to explore its’ compatibility with AI ethics. …

Cited by 41

Ethics, Governance, and Policies in Artificial Intelligence

link.springer.com-L Floridi, 2021-Philosophical Studies Series

… Philosophical Studies Series aims to provide a forum for the best current research in contemporary philosophy … illuminating ways of addressing philosophical questions and …

Cited by 70

Ethics of Artificial Intelligence

link.springer.com-Francisco Lara, Jan Deckers, 2023-The International Library of Ethics, Law and Technology

… Finally, the third part considers possible contributions to the ethics of AI from other … philosophy of science. So, the first question we have to ask ourselves is the very meaning of AI ethics. …

Cited by 11

Artificial superintelligence alignment in healthcare

link.springer.com-D. Ueda, S. Walston, Ryo Kurokawa et al., 2025-Japanese Journal of Radiology (Zone 4, IF 4.1)

The emergence of Artificial Superintelligence (ASI) in healthcare presents unprecedented opportunities for revolutionizing diagnostics, treatment planning, and population health management, but also introduces critical risks if these systems are not properly aligned with human values and clinical objectives. This review examines the theoretical foundations of ASI and the alignment problem in healthcare contexts, exploring how misaligned Artificial Intelligence (AI) systems could optimize for wrong objectives or pursue harmful strategies leading to patient harm and systemic failures. Current challenges in AI alignment are illustrated through real-world examples from radiology and clinical decision-making, where algorithms have demonstrated concerning biases, generalizability failures, and optimization for inappropriate proxy measures. The paper analyzes key alignment challenges including objective complexity and technical pitfalls, bias and fairness issues in healthcare data, ethical integration concerns involving compassion and patient autonomy, and system-level policy challenges around regulation and liability. Technical alignment strategies are discussed including reinforcement learning from human feedback, interpretability requirements, formal verification methods, and adversarial testing approaches. Normative alignment solutions encompass ethical frameworks, professional standards, patient engagement protocols, and multi-level governance structures spanning institutional, national, and international coordination. The review emphasizes that successful ASI alignment in healthcare requires combining cutting-edge AI research with fundamental medical ethics, noting that while proper alignment could enable transformative health improvements and medical breakthroughs, misalignment risks undermining the core purpose of medicine. 
The stakes of this alignment challenge are characterized as among the highest in both technology and ethics, with implications extending from individual patient safety to public trust and potentially existential risks.

Cited by 1

Ethically Aligned Design in Autonomous and Intelligent Systems: An Overview

ieeexplore.ieee.org-Andrew Burnside, Emerson Bodde, 2025-2025 IEEE International Symposium on Ethics in Engineering, Science, and Technology (ETHICS)

Much recent work in the value theory of autonomous and intelligent systems (AIS) revolves around three issues. First is the alignment problem: the problem of producing AIS whose values align with humanity's interests. Second, superintelligence: the potential for AIS to develop intelligence which would surpass even the most intelligent humans. An increasing number of authors argue that superintelligent AIS could emerge overnight because of a recursively improving process; this is the singularity hypothesis. Further, many of the same authors believe that the concatenation of these problems should direct interest to the long-term potential for misaligned and superintelligent AIS which could pose risks of existential proportions to human interests. Therefore, they argue for a policy stance which we describe as “hard alignment,” the proposal of cooperation with technological experts to avoid hypothetical scenarios where AIS disempower humanity. On the other hand, we describe our view as “soft alignment.” Considering the lack of adequate evidence for hard alignment's radical claims, the finite resources and attention of policymakers and technological experts are best served by devoting, at best, a modest amount of time, attention, and resources towards policies regarding AIS which manage the moral risks involved in misaligned, already-existing AIS, and regarding misaligned potential AIS relative to the evidentiary basis for their possible realization. Therefore, we argue for the adoption of policies towards alignment which manage the everyday risk involved in misaligned AIS rather than long-term existential risks, which are difficult to quantify.

The ethics of creating artificial superintelligence: a global risk perspective

link.springer.com-J. Dessureault, R. Lamontagne, Pierre-Olivier Parisé, 2025-AI and Ethics

… lethal autonomous weapons, raise significant ethical and security concerns due to their … for comprehensive safety protocols, ethical alignment strategies, and regulatory oversight to …

Cited by 7

From Principle to Practice: Value Alignment in AI Ethics and Governance

www.cambridge.org-Jianfeng Cao, 2025-German Law Journal (Zone 2, IF 1.2)

As China rapidly advances in AI innovation and development, especially in frontier AI, its regulatory and ethical frameworks are under increasing pressure to ensure that technological progress aligns with human interests and societal values. This Article argues that AI value alignment—the process of ensuring AI systems act in accordance with human values, norms, and ethical principles—should be adopted as a strategic pillar in China’s evolving AI governance architecture. While China has already established a comprehensive legal, ethical, and self-regulatory landscape to address AI risks, these mechanisms often rely on reactive enforcement and external compliance. In contrast, AI value alignment offers a proactive, intrinsic approach that embeds safety and ethical constraints directly into AI systems, making them safer, more trustworthy, and responsive to human needs. This study begins by mapping China’s current AI governance landscape, including national legislation such as the Cybersecurity Law, Personal Information Protection Law, and a growing set of regulations targeted at algorithms and generative AI. It also evaluates China’s normative commitments, such as the “human-centric” and “tech for good” principles articulated in national policy documents, and the increasing role of corporate self-regulation among major technology firms. While commendable in scope and ambition, these governance mechanisms often fall short in ensuring that AI behavior aligns with safety constraints and ethical intent—particularly when AI systems (such as agentic AI) become more autonomous and capable. This gap highlights the urgent need for a systematic value alignment strategy. The Article then delves into the conceptual and technical foundations of AI value alignment, identifying both engineering challenges—such as reward misspecification, data bias, and model deception—and normative dilemmas, including moral pluralism, value aggregation, and dynamic ethics.
Special attention is paid to frontier models like large language models and artificial general intelligence (AGI), which pose alignment challenges at a scale previously unseen. Drawing on contemporary alignment techniques such as RLHF (Reinforcement Learning from Human Feedback) and principle-based alignment, such as Anthropic’s Constitutional AI, the Article explores their limitations and calls for a more diversified, interdisciplinary, and forward-looking alignment research agenda. Finally, the Article offers a roadmap for operationalizing AI value alignment across three key governance domains: Law and regulation, ethical norms, and industry self-regulation. Recommendations include the incorporation of alignment assessments into regulatory filings, the development of technical standards for value alignment and ethics-by-design guidelines, and institutional investments in safety and alignment research. The Article concludes by asserting that value alignment is not merely a technical safeguard but a governance imperative for the age of autonomous AI and agentic AI. By integrating alignment into its AI governance strategy, China can not only enhance domestic safety and public trust but also better coordinate with global AI ethics and safety initiatives—ultimately contributing to the shared goal of human-aligned and beneficial artificial intelligence.

Deep ASI Literacy: Educating for Alignment with Artificial Super Intelligent Systems

onlinelibrary.wiley.com-Nicolas J. Tanchuk, 2025-Educational Theory (Zone 4, IF 0.9)

Artificial intelligence companies and researchers are currently working to create Artificial Superintelligence (ASI): AI systems that significantly exceed human problem‐solving speed, power, and precision across the full range of human solvable problems. Some have claimed that achieving ASI — for better or worse — would be the most significant event in human history and the last problem humanity would need to solve. In this essay Nicolas Tanchuk argues that current AI literacy frameworks and educational practices are inadequate for equipping the democratic public to deliberate about ASI design and to assess the existential risks of such technologies. He proposes that a systematic educational effort toward what he calls “Deep ASI Literacy” is needed to democratically evaluate possible ASI futures. Deep ASI Literacy integrates traditional AI literacy approaches with a deeper analysis of the axiological, epistemic, and ontological questions that are endemic to defining and risk‐assessing pathways to ASI. Tanchuk concludes by recommending research aimed at identifying the assets and needs of educators across educational systems to advance Deep ASI Literacy.

Cited by 3

The state as a model for AI control and alignment

link.springer.com-Micha Elsner, 2024-AI & SOCIETY (Zone 3, IF 4.7)

Debates about the development of artificial superintelligence and its potential threats to humanity tend to assume that such a system would be historically unprecedented, and that its behavior must be predicted from first principles. I argue that this is not true: we can analyze multiagent intelligent systems (the best candidates for practical superintelligence) by comparing them to states, which also unite heterogeneous intelligences to achieve superhuman goals. States provide a model for several problems discussed in the literature on superintelligence, such as principal-agent problems and Instrumental Convergence. Philosophical arguments about governance, therefore, provide possible solutions to these problems, or point out problems in previously suggested solutions. In particular, the liberal concept of checks and balances, and Hannah Arendt’s concept of legitimacy, describe how state behavior is constrained by the preferences of constituents that could also apply to artificial systems. However, they also point out ways in which present-day computational developments could destabilize the international order by reducing the number of decision-makers involved in state actions. Thus, interstate competition not only serves as a model for the behavior of dangerous computational intelligences but also as the impetus for their development.

Cited by 3

Exploring ‘Value Alignment’: A Genealogy and Three Conceptions

link.springer.com-Daniel López-Castro, 2026-Law, Governance and Technology Series

… The origin of the value alignment concept is deeply intertwined with the notion of superintelligence. Superintelligence, in turn, cannot be understood without its connection to the …

Machine Intelligence, Artificial General Intelligence, Super-Intelligence, and Human Dignity

www.mdpi.com-Ted Peters, 2025-Religions (Zone 2, IF 0.6)

Our temptation to personify machine intelligence is not unexpected. As a child we named our dolls and took our Teddy Bear to bed with us. Today we ask death bots to comfort us with post-mortem conversation. All the while we know this to be pretend. Yet we must ask: if Artificial General Intelligence (AGI) or even Artificial Super-Intelligence (ASI) become available, will our game of pretend continue? Or will intelligent robots actually become selves deserving of dignity that hitherto could be ascribed only to human persons? If government-imposed guardrails shut the door on development of AGI and ASI in order to preserve human safety and even dignity, we might never learn whether AGI or ASI could develop selfhood, personhood, virtue, or religious sensibilities. As we approach the future, can we live without knowing whether AGI or ASI would be capable of developing selfhood and commanding dignity?

Cited by 3

Cybertheology and the Ethical Dimensions of Artificial Superintelligence: A Theological Inquiry into Existential Risks

T. Peters, 2024-Khazanah Theologia

Purpose: This study explores the role of cybertheology in addressing the ethical and societal challenges posed by Artificial Superintelligence (ASI), which has the potential to surpass human cognitive capabilities, heralding a profound cultural and existential crisis. It integrates theological anthropology to assess the implications of a posthuman future. Methodology: Utilising a comprehensive literature review, the research examines technological, philosophical, and theological perspectives through primary and secondary sources, including influential works by futurists and ethicists. The methodology aims to uncover the nuanced discourse surrounding the development of ASI and its potential impacts. Findings: The analysis reveals a narrative marked by speculative optimism and significant existential concerns regarding ASI. A critical gap in the existing ethical discourse is identified, highlighting the necessity for a grounded ethical framework that addresses the profound implications of superintelligent entities on human dignity and societal norms. Research Implications: The findings emphasise the urgent need to incorporate robust ethical considerations into the development and deployment of ASI. Cybertheology is presented as a vital framework for ensuring that ASI technologies align with human values and theological insights, thus providing a valuable lens through which to view the integration of superintelligence into society. Originality/Value: This paper contributes to academic and policy discussions on ASI by promoting cybertheology as a crucial perspective in ethical deliberations. It enriches scholarly dialogues by linking technological advancements with theological and ethical evaluations, proposing that cybertheology can play a pivotal role in shaping policies that govern ASI technologies. This approach ensures that technological progress is compatible with humanistic values, fostering a holistic understanding of ASI's potential impact on humanity.

Cited by 5

A Challenge for Machine Ethics

link.springer.com-Ryan Tonkens, 2009-Minds and Machines (Zone 2, IF 3.4)

… Kantian artificial moral agents. Specifically, the sort of AMAs under issue are machines that … out to meet different standards for moral agency, then so much the better for Machine Ethics. …

Cited by 90

Making moral machines: why we need artificial moral agents

link.springer.com-Paul Formosa, M. Ryan, 2020-AI & SOCIETY (Zone 3, IF 4.7)

… 731) argue in response that we will not learn more about morality through machine ethics but only through studying human psychology. While it might be true that we can learn much …

Cited by 44

When Is a Robot a Moral Agent

books.google.com-John P. Sullins, 2006-Machine Ethics

… When that agency3 causes harm or good in a moral sense, we can say the machine has moral agency. Autonomy thus described is not sufficient in itself to ascribe moral agency. …

Cited by 239

Perspectives about artificial moral agents

link.springer.com-Andreia Martinho, Adam Poulsen, M. Kroesen et al., 2021-AI and Ethics

The pursuit of AMAs is complicated. Disputes about the development, design, moral agency, and future projections for these systems have been reported in the literature. This empirical study explores these controversial matters by surveying (AI) Ethics scholars with the aim of establishing a more coherent and informed debate. Using Q-methodology, we show the wide breadth of viewpoints and approaches to artificial morality. Five main perspectives about AMAs emerged from our data and were subsequently interpreted and discussed: (i) Machine Ethics: The Way Forward; (ii) Ethical Verification: Safe and Sufficient; (iii) Morally Uncertain Machines: Human Values to Avoid Moral Dystopia; (iv) Human Exceptionalism: Machines Cannot Moralize; and (v) Machine Objectivism: Machines as Superior Moral Agents. A potential source of these differing perspectives is the failure of Machine Ethics to be widely observed or explored as an applied ethic and more than a futuristic end. Our study helps improve the foundations for an informed debate about AMAs, where contrasting views and agreements are disclosed and appreciated. Such debate is crucial to realize an interdisciplinary approach to artificial morality, which allows us to gain insights into morality while also engaging practitioners.

Cited by 24

Artificial agency, consciousness, and the criteria for moral agency: what properties must an artificial agent have to be a moral agent?

link.springer.com-K. Himma, 2009-Ethics and Information Technology (Zone 2, IF 4.0)

… of agency, natural agency, artificial agency, and moral agency, as well as articulate what are widely taken to be the criteria for moral agency… whether a machine is a moral agent are well …

Cited by 231

A Theological Account of Artificial Moral Agency

journals.sagepub.com-Ximian Xu, 2023-Studies in Christian Ethics (Zone 3, IF 0.4)

This article seeks to explore the idea of artificial moral agency from a theological perspective. By drawing on the Reformed theology of archetype-ectype, it will demonstrate that computational artefacts are the ectype of human moral agents and, consequently, have a partial moral agency. In this light, human moral agents mediate and extend their moral values through computational artefacts, which are ontologically connected with humans and only related to limited particular moral issues. This moral leitmotif opens up a way to deploy carebots into Christian pastoral care while maintaining the human agent's uniqueness and responsibility in pastoral caregiving practices.

Cited by 5

Implementations in Machine Ethics

dl.acm.org-Suzanne Tolmeijer, Markus Kneer, Cristina Sarasua, 2020-ACM Computing Surveys (Zone 1 Top, IF 28.0)

Increasingly complex and autonomous systems require machine ethics to maximize the benefits and minimize the risks to society arising from the new technology. It is challenging to decide which type of ethical theory to employ and how to implement it effectively. This survey provides a threefold contribution. First, it introduces a trimorphic taxonomy to analyze machine ethics implementations with respect to their object (ethical theories), as well as their nontechnical and technical aspects. Second, an exhaustive selection and description of relevant works is presented. Third, applying the new taxonomy to the selected works, dominant research patterns, and lessons for the field are identified, and future directions for research are suggested.

Cited by 92

Critiquing the Reasons for Making Artificial Moral Agents

link.springer.com-Aimee van Wynsberghe, Scott Robbins, 2018-Science and Engineering Ethics (Zone 3, IF 3.0)

Many industry leaders and academics from the field of machine ethics would have us believe that the inevitability of robots coming to have a larger role in our lives demands that robots be endowed with moral reasoning capabilities. Robots endowed in this way may be referred to as artificial moral agents (AMA). Reasons often given for developing AMAs are: the prevention of harm, the necessity for public trust, the prevention of immoral use, such machines are better moral reasoners than humans, and building these machines would lead to a better understanding of human morality. Although some scholars have challenged the very initiative to develop AMAs, what is currently missing from the debate is a closer examination of the reasons offered by machine ethicists to justify the development of AMAs. This closer examination is especially needed because of the amount of funding currently being allocated to the development of AMAs (from funders like Elon Musk) coupled with the amount of attention researchers and industry leaders receive in the media for their efforts in this direction. The stakes in this debate are high because moral robots would make demands on society; answers to a host of pending questions about what counts as an AMA and whether they are morally responsible for their behavior or not. This paper shifts the burden of proof back to the machine ethicists demanding that they give good reasons to build AMAs. The paper argues that until this is done, the development of commercially available AMAs should not proceed further.

Cited by 130

A Case for Machine Ethics in Modeling Human-Level Intelligent Agents

philpapers.org-R. Boyles, 2018-Kritike: An Online Journal of Philosophy (Zone 4, IF 0.2)

This paper focuses on the research field of machine ethics and how it relates to a technological singularity—a hypothesized, futuristic event where artificial machines will have greater-than-human-level intelligence. One problem related to the singularity centers on the issue of whether human values and norms would survive such an event. To somehow ensure this, a number of artificial intelligence researchers have opted to focus on the development of artificial moral agents, which refers to machines capable of moral reasoning, judgment, and decision-making. To date, different frameworks on how to arrive at these agents have been put forward. However, there seems to be no hard consensus as to which framework would likely yield a positive result. With the body of work that they have contributed in the study of moral agency, philosophers may contribute to the growing literature on artificial moral agency. While doing so, they could also think about how the said concept could affect other important philosophical concepts.

Cited by 12

A perceived moral agency scale: Development and validation of a metric for humans and social machines

www.sciencedirect.com-J. Banks, 2019-Computers in Human Behavior (Zone 1 Top, IF 8.9)

Although current social machine technology cannot fully exhibit the hallmarks of human morality or agency, popular culture representations and emerging technology make it increasingly important to examine human interlocutors’ perception of social machines (e.g., digital assistants, chatbots, robots) as moral agents. To facilitate such scholarship, the notion of perceived moral agency (PMA) is proposed and defined, and a metric developed and validated through two studies: (1) a large-scale online survey featuring potential scale items and concurrent validation metrics for both machine and human targets, and (2) a scale validation study with robots presented as variably agentic and moral. The PMA metric is shown to be reliable, valid, and exhibiting predictive utility.

Cited by 129

Moral agency without responsibility? Analysis of three ethical models of human-computer interaction in times of artificial intelligence (AI)

journal.ep.liu.se-Alexis Fritz, Wiebke Brandt, Henner Gimpel et al., 2020-De Ethica

Philosophical and sociological approaches in technology have increasingly shifted toward describing AI (artificial intelligence) systems as ‘(moral) agents,’ while also attributing ‘agency’ to them. It is only in this way – so their principal argument goes – that the effects of technological components in a complex human-computer interaction can be understood sufficiently in phenomenological-descriptive and ethical-normative respects. By contrast, this article aims to demonstrate that an explanatory model only achieves a descriptively and normatively satisfactory result if the concepts of ‘(moral) agent’ and ‘(moral) agency’ are exclusively related to human agents. Initially, the division between symbolic and sub-symbolic AI, the black box character of (deep) machine learning, and the complex relationship network in the provision and application of machine learning are outlined. Next, the ontological and action-theoretical basic assumptions of an ‘agency’ attribution regarding both the current teleology-naturalism debate and the explanatory model of actor network theory are examined. On this basis, the technical-philosophical approaches of Luciano Floridi, Deborah G. Johnson, and Peter-Paul Verbeek will all be critically discussed. Despite their different approaches, they tend to fully integrate computational behavior into their concept of ‘(moral) agency.’ By contrast, this essay recommends distinguishing conceptually between the different entities, causalities, and relationships in a human-computer interaction, arguing that this is the only way to do justice to both human responsibility and the moral significance and causality of computational behavior.

Cited by 33

Artificial Moral Agents: A Survey of the Current Status

link.springer.com-José-Antonio Cervantes, Sonia López, Luis-Felipe Rodríguez et al., 2019-Science and Engineering Ethics (Zone 3, IF 3.0)

… that is known by a number of names: machine ethics, machine morality, artificial morality, … ethical and moral agents according to the strategies and criteria used to deal with ethical …

Cited by 143

Ethical Programming and Machine Moral Agency

brill.com-Kęstutis Mosakas, 2023-Future Law, Ethics, and Smart Technologies

Teisės fakultetas / Faculty of Law

Societal and ethical impacts of artificial intelligence: Critical notes on European policy frameworks

www.sciencedirect.com-Lucia Vesnić-Alujević, Susana Nascimento, Alexandre Pólvora, 2020-Telecommunications Policy (Zone 2, IF 6.4)

Abstract This paper offers a critical review on conditions and impacts of AI/ML in society, with a dedicated overview of the European AI policy framework. Through the analysis of policy papers produced by European institutions, European national governments and other organisations situated between research and policy-making, we bring an overarching outlook of key ethical and societal issues currently under discussion at the intersection of European policy agendas and recent literature on the topic. Our findings show that 21 analysed documents look both at individual and societal impacts, with their understanding generally aligned in calls for more responsibility, accountability, transparency, safety or trust. Furthermore, our findings also point to the necessity of more integrated approaches between governments, industry and academia stakeholders, and above all, to the need of applied multidisciplinary frameworks, supported by both anticipatory outlooks and public engagement exercises able to tackle the often excessive technicality of the debate.

Cited by 140

Searching for Inclusive Artificial Intelligence for Social Good: Participatory Governance and Policy Recommendations for Making AI More Inclusive and Benign for Society

onlinelibrary.wiley.com-M. Moon, 2023-Public Administration Review (Zone 1 Top, IF 4.9)

… Both academic research and practical evidence have often compellingly predicted and suggested AI's potential impact on the labor market, industry, and services, as well as the risks …

Cited by 58

Embedding AI in society: ethics, policy, governance, and impacts

link.springer.com-Michael Pflanzer, Veljko Dubljević, William A. Bauer et al., 2023-AI & SOCIETY (Zone 3, IF 4.7)

… Dubljević 2022) that we should consider the societal impact of AI implementation in the context of ethical values. Unsurprisingly, ethical principles of AI are a major theme for many of the …

Cited by 25

AI Governance Needs Sociotechnical Expertise

datasociety.net-Serena Oduro, Tamara Kneese, 2024-Data and Society, available at: Link to the cited …

… key will be implementing AI governance practices that employ … humanities and social science expertise into AI governance. We … impact assessments or other AI assessments,10 craft AI …

Cited by 12

Artificial intelligence in governance: recent trends, risks, challenges, innovative frameworks and future directions

link.springer.com-Arjun Ghosh, Ankit Saini, Himanshu Barad, 2025-AI & SOCIETY (Zone 3, IF 4.7)

… AI systems. This paper seeks to establish a theoretical framework for analyzing AI governance … on their relevance to the societal and ethical implications of AI, their impact on public trust, …

Cited by 36

Human-centricity in AI governance: A systemic approach

www.frontiersin.org-Anton Sigfrids, J. Leikas, Henrikki Salo-Pöntinen et al., 2023-Frontiers in Artificial Intelligence (Zone 4, IF 4.7)

Human-centricity is considered a central aspect in the development and governance of artificial intelligence (AI). Various strategies and guidelines highlight the concept as a key goal. However, we argue that current uses of Human-Centered AI (HCAI) in policy documents and AI strategies risk downplaying promises of creating desirable, emancipatory technology that promotes human wellbeing and the common good. Firstly, HCAI, as it appears in policy discourses, is the result of aiming to adapt the concept of human-centered design (HCD) to the public governance context of AI but without proper reflection on how it should be reformed to suit the new task environment. Second, the concept is mainly used in reference to realizing human and fundamental rights, which are necessary, but not sufficient for technological emancipation. Third, the concept is used ambiguously in policy and strategy discourses, making it unclear how it should be operationalized in governance practices. This article explores means and approaches for using the HCAI approach for technological emancipation in the context of public AI governance. We propose that the potential for emancipatory technology development rests on expanding the traditional user-centered view of technology design to involve community- and society-centered perspectives in public governance. Developing public AI governance in this way relies on enabling inclusive governance modalities that enhance the social sustainability of AI deployment. We discuss mutual trust, transparency, communication, and civic tech as key prerequisites for socially sustainable and human-centered public AI governance. Finally, the article introduces a systemic approach to ethically and socially sustainable, human-centered AI development and deployment.

Cited by 55

AI Governance: A Challenge for Public Health

publichealth.jmir.org-Jennifer K. Wagner, Megan Doerr, Cason D. Schmit, 2024-JMIR Public Health and Surveillance (Zone 3, IF 3.9)

Abstract The rapid evolution of artificial intelligence (AI) is structuralizing social, political, and economic determinants of health into the invisible algorithms that shape all facets of modern life. Nevertheless, AI holds immense potential as a public health tool, enabling beneficial objectives such as precision public health and medicine. Developing an AI governance framework that can maximize the benefits and minimize the risks of AI is a significant challenge. The benefits of public health engagement in AI governance could be extensive. Here, we describe how several public health concepts can enhance AI governance. Specifically, we explain how (1) harm reduction can provide a framework for navigating the governance debate between traditional regulation and “soft law” approaches; (2) a public health understanding of social determinants of health is crucial to optimally weigh the potential risks and benefits of AI; (3) public health ethics provides a toolset for guiding governance decisions where individual interests intersect with collective interests; and (4) a One Health approach can improve AI governance effectiveness while advancing public health outcomes. Public health theories, perspectives, and innovations could substantially enrich and improve AI governance, creating a more equitable and socially beneficial path for AI development.

Cited by 19

Beyond the algorithm: applying critical lenses to AI governance and societal change

link.springer.com-Mohammed Hassen, 2025-AI and Ethics

… Key issues in AI governance that require careful consideration include data privacy, algorithmic bias, transparency, accountability, and the potential impact of AI on human rights and …

Cited by 2

Beyond the individual: governing AI's societal harm

papers.ssrn.com-NA Smuha, 2021-Internet Policy Review (Zone 3, IF 3.2)

… protect societal interests that are adversely impacted by AI. By conceptualising AI’s societal harm… While the societal impact of AI systems is increasingly discussed—particularly under the …

Cited by 212

AI governance: a systematic literature review

link.springer.com-Amna Batool, Didar Zowghi, Muneera Bano, 2025-AI and Ethics

As artificial intelligence (AI) transforms a wide range of sectors and drives innovation, it also introduces different types of risks that should be identified, assessed, and mitigated. Various AI governance frameworks have been released recently by governments, organizations, and companies to mitigate risks associated with AI. However, it can be challenging for AI stakeholders to have a clear picture of the available AI governance frameworks, tools, or models and analyze the most suitable one for their AI system. To fill the gap, we present the literature to answer key questions: WHO is accountable for AI systems’ governance, WHAT elements are being governed, WHEN governance occurs within the AI development life cycle, and HOW it is implemented through frameworks, tools, policies, or models. Adopting the systematic literature review (SLR) methodology, this study meticulously searched, selected, and analyzed 28 articles, offering a foundation for understanding different facets of AI governance. The analysis is further enhanced by categorizing artifacts of AI governance under team-level governance, organization-level governance, industry-level governance, national-level governance, and international-level governance. The findings of this study on existing AI governance solutions can assist research communities in proposing comprehensive AI governance practices.

Cited by 157

Artificial Intelligence and Public Values: Value Impacts and Governance in the Public Sector

www.mdpi.com-Yu-Che Chen, Michael Ahn, Yi-Fan Wang, 2023-Sustainability (Zone 3, IF 3.3)

While there has been growth in the literature exploring the governance of artificial intelligence (AI) and recognition of the critical importance of guiding public values, the literature lacks a systematic study focusing on public values as well as the governance challenges and solutions to advance these values. This article conducts a systematic literature review of the relationships between the public sector AI and public values to identify the impacts on public values and the governance challenges and solutions. It further explores the perspectives of U.S. government employees on AI governance and public values via a national survey. The results suggest the need for a broad inclusion of diverse public values, the salience of transparency regarding several governance challenges, and the importance of stakeholder participation and collaboration as governance solutions. This article also explores and reports the nuances in these results and their practical implications.

Cited by 65

AI in Governance and Policy Making

www.researchgate.net-Ashish K Saxena, 2024-International Journal of Science and Research (IJSR)

… Besides ethical issues, the impact of AI on improving public participation in governance is … In general, the study widens our understanding of AI effect on labor market and social policies, …

Cited by 3

Philosophical foundations of artificial consciousness

www.sciencedirect.com-Ron Chrisley, 2008-Artificial Intelligence in Medicine (Zone 2 Top, IF 6.2)

… most promising avenue toward artificial consciousness (AC), … theoretical possibility of artificial consciousness is unfounded… in accounting for or reproducing consciousness. This is done …

Cited by 37

Artificial Consciousness: Utopia or Real Possibility?

ieeexplore.ieee.org-G. Buttazzo, 2001-Computer (Zone 4, IF 2.3)

… PHILOSOPHICAL VIEWS OF SELF-AWARENESS From a purely philosophical perspective, we cannot verify the presence of consciousness in another brain, either human or artificial, …

Cited by 66

Why not Artificial Consciousness or Thought?

link.springer.com-Richard H. Schlagel, 1999-Minds and Machines (Zone 2, IF 3.4)

… it as just another misguided philosophical puzzle. We are not conscious for the most part even … world appears completely detached from our perceptual consciousness, as if our physical …

Cited by 20

Artificial Consciousness or Artificial Intelligence

www.cs.yale.edu-Florin Spanache, 2017-DIALOGO (Zone 4, IF 0.1)

… problems in artificial intelligence, do take philosophical problems … of consciousness. Donald Perlis’s papers build a case that … The field of “artificial consciousness” (AC) is practically …

A philosophical and technical view of artificial consciousness

c3da.org-Andrey Shcherbakov, Artem Uryadov, 2024-Wearable Technology

The article reflects various approaches of philosophy and programming to methods for solving the technical problem of creating and software implementation of artificial consciousness (AC). Various purposes of creation and basic approaches to determining the nature of AC are described. To solve the problem of creating an AC, an architecture is proposed that includes ten levels, starting from the basic level of collecting and systematizing information about the external world and ending with the upper level of influence on it, agreed with the person and the level of decision-making. The features of the delimitation of functions and the procedure for interaction between a person and an AC are considered in detail. In conclusion, the most important, from a programmer’s point of view, properties that characterize artificial consciousness are given.

Cited by 3

Artificial consciousness: a perspective from the free energy principle

link.springer.com-Wanja Wiese, 2024-Philosophical Studies (Zone 1 Top, IF 1.3)

Does the assumption of a weak form of computational functionalism, according to which the right form of neural computation is sufficient for consciousness, entail that a digital computational simulation of such neural computations is conscious? Or must this computational simulation be implemented in the right way, in order to replicate consciousness? From the perspective of Karl Friston’s free energy principle, self-organising systems (such as living organisms) share a set of properties that could be realised in artificial systems, but are not instantiated by computers with a classical (von Neumann) architecture. I argue that at least one of these properties, viz. a certain kind of causal flow, can be used to draw a distinction between systems that merely simulate, and those that actually replicate consciousness.

Cited by 13

A.I.: Artificial Intelligence as Philosophy: Machine Consciousness and Intelligence

link.springer.com-David Gamez, 2024-The Palgrave Handbook of Popular Culture as Philosophy

… This chapter explores the philosophical… “Consciousness” covers natural and artificial consciousness and explains why the ethical treatment of AIs should be linked to their consciousness…

Cited by 1

THE PROBLEM OF ARTIFICIAL INTELLIGENCE AND CONSCIOUSNESS AS ONE OF THE PRIORITY DIRECTIONS OF CONTEMPORARY PHILOSOPHY

www.eu-scientists.com-R. Aliyev, 2025-Philosophy and Governance

The article investigates the primary role of artificial intelligence in the modern stage of the development of technogenic civilization. It clarifies that the development of artificial intelligence systems simultaneously has a profound impact on the values and philosophical perspectives of society. The study examines the history of the development of artificial intelligence and provides an in-depth analysis of its impact on technogenic civilization. This study offers a novel contribution by integrating a dialectical analysis of AI’s historical evolution with its ethical implications, providing a unique perspective on its role in shaping technogenic civilization’s future. The results of the article reveal that artificial intelligence cannot fully replace human consciousness. However, artificial intelligence systems have the potential to imitate human behavior and make automated decisions. The article notes that the primary responsibility for the crises created by artificial intelligence systems lies with humanity. The role of artificial intelligence in the future of technogenic civilization will be determined not only by technological progress but also by the proper application of moral and ethical approaches. Overall, while the development of artificial intelligence systems facilitates human life, it also alters society’s moral and ethical contours. For this reason, strengthening regulations regarding artificial intelligence, as well as the social-philosophical analysis of the relationship between artificial intelligence and consciousness, is essential for the sustainable development of technogenic civilization. The article highlights the need for ethical and legal regulation of AI as it reshapes moral and social frameworks. AI cannot fully replace human cognition but interacts with it, transforming technogenic civilization. Therefore, addressing AI’s ethical and philosophical challenges is crucial for future development. 
The article explores AI’s impact on technogenic civilization, noting that it reshapes ethical frameworks.

From Biological to Artificial Consciousness: Neuroscientific Insights and Progress

link.springer.com-Masataka Watanabe, 2022-The Frontiers Collection

… From this, he derived one of the most famous lines in philosophy: “I think, therefore I am.” This Cartesian “I” is the focus of this book. It is our starting point for inquiring into …

Cited by 6

Consciousness, intentionality and intelligence: some foundational issues for artificial intelligence

www.tandfonline.com-M. Aydede, G. Guzeldere, 2000-Journal of Experimental & Theoretical Artificial Intelligence (Zone 4, IF 1.7)

… consciousness, intentionality and intelligence. After we present the fundamental framework that has shaped both the philosophy … questions, we turn to consciousness, whose study still …

Cited by 13

Philosophical Analysis of Consciousness as an Intersection Point of Philosophy, Culture and Artificial Intelligence

www.thevoiceofcreativeresearch.com-Sanjay Kumar Tiwari, Vijay Kumar Tiwari, 2025-The Voice of Creative Research

Consciousness, as a fundamental aspect of human experience, has been a subject of profound inquiry across philosophy, culture, and the rapidly evolving field of artificial intelligence (AI). This paper explores the multifaceted nature of consciousness as a nexus where these domains intersect. By examining philosophical theories of consciousness, cultural interpretations of self-awareness, and the implications of AI advancements, the study addresses the challenges of defining consciousness, its diverse cultural interpretations, and the ethical and technical questions surrounding its replication or simulation in machines. The paper argues that consciousness is not only a philosophical puzzle but also a cultural construct and a technological frontier, with significant implications for our understanding of humanity and the future of intelligent systems. Through an interdisciplinary lens, this analysis highlights the need for continued dialogue between philosophy, culture, and AI research to navigate the complexities of consciousness in an increasingly technologically driven world.

Cited by 1

Artificial consciousness and artificial ethics: Between realism and social relationism

www.taylorfrancis.com-S Torrance, 2020-Machine Ethics and Robot Ethics

… conceptions of consciousness, one is able to see how philosophical worries to do with … of a dependence by many sectors of the philosophical and scientific community (a dependence …

Cited by 55

Measuring Human-AI Value Alignment in Large Language Models

ojs.aaai.org-Hakim Norhashim, Jungpil Hahn, 2024-Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society

This paper seeks to quantify the human-AI value alignment in large language models. Alignment between humans and AI has become a critical area of research to mitigate potential harm posed by AI. In tandem with this need, developers have incorporated a values-based approach towards model development where ethical principles are integrated from its inception. However, ensuring that these values are reflected in outputs remains a challenge. In addition, studies have noted that models lack consistency when producing outputs, which in turn can affect their function. Such variability in responses would impact human-AI value alignment as well, particularly where consistent alignment is critical. Fundamentally, the task of uncovering a model’s alignment is one of explainability – where understanding how these complex models behave is essential in order to assess their alignment. This paper examines the problem through a case study of GPT-3.5. By repeatedly prompting the model with scenarios based on a dataset of moral stories, we aggregate the model’s alignment with human values to produce a human-AI value alignment metric. Moreover, by using a comprehensive taxonomy of human values, we uncover the latent value profile represented by these outputs, thereby determining the extent of human-AI value alignment.

Cited by 10

Understanding the Process of Human-AI Value Alignment

www.jair.org-J. McKinlay, M. Vos, J. Hoffmann et al., 2025-Journal of Artificial Intelligence Research (Zone 3, IF 4.0)

Background: Value alignment in computer science research is often used to refer to the process of aligning the behaviour of artificial intelligence systems with humans’ desires, but the way the phrase is used often lacks precision. Objectives: In this paper, we conduct a systematic literature review to advance the understanding of value alignment in artificial intelligence by characterising the topic in the context of its research literature. We use this to suggest a more precise definition of the term. Methods: We analyse the abstracts, introductions and conclusions of 172 value alignment research articles that have been published in recent years and synthesise their content using thematic analysis. From these 172 papers we select 85 papers using a structured criteria for a deep analysis, coding these papers in full. Results: Our analysis leads to six themes: value alignment drivers & approaches; challenges in value alignment; values in value alignment; cognitive processes in humans and AI; human-agent teaming; and designing and developing value-aligned systems. Conclusions: By analysing these themes in the context of the literature, we define value alignment as an ongoing process between humans and autonomous agents that aims to express and implement abstract values in diverse contexts, while managing the cognitive limits of both humans and AI agents and also balancing the conflicting ethical and political demands generated by the values in different groups. Our analysis gives rise to a set of research challenges and opportunities in the field of value alignment for future work.

Cited by 2

Human Value Alignment in AI

link.springer.com-Ilias O. Pappas, Polyxeni Vassilakopoulou, 2025-Handbook of Human-Centered Artificial Intelligence

… Xu and Gao have suggested going beyond the scope of current HCAI practice that primarily focuses on individual human-AI systems, to include the perspectives of organizations, …

Cited by 1

Towards friendly AI: a comprehensive review and new perspectives on human-AI alignment

link.springer.com-Qiyang Sun, Yupei Li, Emran Alturki et al., 2026-AI and Ethics

… alternative terms such as value alignment, human-compatible … We argue that the emotional attunement and value alignment … Specifically, value alignment will ensure that an AI system’s …

Cited by 12

Human-AI Interaction Alignment: Designing, Evaluating, and Evolving Value-Centered AI For Reciprocal Human-AI Futures

dl.acm.org-Hua Shen, Tiffany Knearem, Divy Thakkar et al., 2025-Proceedings of the Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems

The rapid integration of generative AI into everyday life underscores the need to move beyond unidirectional alignment models that only adapt AI to human values. This workshop focuses on bidirectional human-AI alignment, a dynamic, reciprocal process where humans and AI co-adapt through interaction, evaluation, and value-centered design. Building on our past CHI 2025 BiAlign SIG and ICLR 2025 Workshop, this workshop will bring together interdisciplinary researchers from HCI, AI, social sciences and more domains to advance value-centered AI and reciprocal human-AI collaboration. We focus on embedding human and societal values into alignment research, emphasizing not only steering AI toward human values but also enabling humans to critically engage with and evolve alongside AI systems. Through talks, interdisciplinary discussions, and collaborative activities, participants will explore methods for interactive alignment, frameworks for societal impact evaluation, and strategies for alignment in dynamic contexts. This workshop aims to bridge the disciplines’ gaps and establish a shared agenda for responsible, reciprocal human-AI futures.

Aligning artificial intelligence with human values: reflections from a phenomenological perspective

link.springer.com-Shengnan Han, Eugene Kelly, Shahrokh Nikou et al., 2021-AI & SOCIETY (Zone 3, IF 4.7)

Artificial Intelligence (AI) must be directed at humane ends. The development of AI has produced great uncertainties of ensuring AI alignment with human values (AI value alignment) through AI operations from design to use. For the purposes of addressing this problem, we adopt the phenomenological theories of material values and technological mediation to be that beginning step. In this paper, we first discuss the AI value alignment from the relevant AI studies. Second, we briefly present what are material values and technological mediation and reflect on the AI value alignment through the lenses of these theories. We conclude that a set of finite human values can be defined and adapted to the stable life tasks that AI systems will be called upon to accomplish. The AI value alignment can also be fostered between designers and users through technological mediation. Upon that foundation, we propose a set of common principles to understand the AI value alignment through phenomenological theories. This paper contributes the unique knowledge of phenomenological theories to the discourse on AI alignment with human values.

Cited by 61

Why human–AI relationships need socioaffective alignment

www.nature.com-HR Kirk, I Gabriel, C Summerfield et al., 2025-Humanities and Social …

… and human–human relationships: How should we balance the value of well-functioning AI companionship alongside the need for authentic human connection? AI companions can …

Cited by 144

ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs

aclanthology.org-Hua Shen, Tiffany Knearem, Reshmi Ghosh et al., 2024-Proceedings of the 9th Widening NLP Workshop

As AI systems become more advanced, ensuring their alignment with a diverse range of individuals and societal values becomes increasingly critical. But how can we capture fundamental human values and assess the degree to which AI systems align with them? We introduce ValueCompass, a framework of fundamental values, grounded in psychological theory and a systematic review, to identify and evaluate human-AI alignment. We apply ValueCompass to measure the value alignment of humans and large language models (LLMs) across four real-world scenarios: collaborative writing, education, public sectors, and healthcare. Our findings reveal concerning misalignments between humans and LLMs; for example, humans frequently endorse values like "National Security" that LLMs largely rejected. We also observe that values differ across scenarios, highlighting the need for context-aware AI alignment strategies. This work provides valuable insights into the design space of human-AI alignment, laying the foundations for developing AI systems that responsibly reflect societal values and ethics.

Cited by 18

From Human Mind to Artificial Intelligence: Advancing AI Value Alignment Through Psychological Theories

jps.ecnu.edu.cn-J Shaoxiong, L Chao, 2025-Journal of Psychological Science

… mind, particularly in terms of value judgment and moral decision-making processes. … AI value alignment. It reviews core psychological theories concerning the formation of moral values, …

Cited by 2

Intrinsic Barriers and Practical Pathways for Human-AI Alignment: An Agreement-Based Complexity Analysis

ojs.aaai.org-Aran Nayebi, 2025-Proceedings of the AAAI Conference on Artificial Intelligence

We formalize AI alignment as a multi-objective optimization problem called ⟨M, N, ε, δ⟩-agreement, in which a set of N agents (including humans) must reach approximate (ε) agreement across M candidate objectives, with probability at least 1-δ. Analyzing communication complexity, we prove an information-theoretic lower bound showing that once either M or N is large enough, no amount of computational power or rationality can avoid intrinsic alignment overheads. This establishes rigorous limits to alignment itself, not merely to particular methods, clarifying a "No-Free-Lunch" principle: encoding "all human values" is inherently intractable and must be managed through consensus-driven reduction or prioritization of objectives. Complementing this impossibility result, we construct explicit algorithms as achievability certificates for alignment under both unbounded and bounded rationality with noisy communication. Even in these best-case regimes, our bounded-agent and sampling analysis shows that with large task spaces (D) and finite samples, reward hacking is globally inevitable: rare high-loss states are systematically under-covered, implying scalable oversight must target safety-critical slices rather than uniform coverage. Together, these results identify fundamental complexity barriers, namely tasks (M), agents (N), and state-space size (D), and offer principles for more scalable human-AI collaboration.

Cited by 4

Existential risk from AI and orthogonality: Can we have it both ways?

onlinelibrary.wiley.com-V. C. Müller, M. Cannon, 2021-Ratio (Zone 3, IF 0.4)

The standard argument to the conclusion that artificial intelligence (AI) constitutes an existential risk for the human species uses two premises: (1) AI may reach superintelligent levels, at which point we humans lose control (the ‘singularity claim’); (2) Any level of intelligence can go along with any goal (the ‘orthogonality thesis’). We find that the singularity claim requires a notion of ‘general intelligence’, while the orthogonality thesis requires a notion of ‘instrumental intelligence’. If this interpretation is correct, they cannot be joined as premises and the argument for the existential risk of AI turns out invalid. If the interpretation is incorrect and both premises use the same notion of intelligence, then at …

Cited by 25

Extraterrestrial Artificial Intelligence: The Final Existential Risk?

www.jstor.org-W. Naudé, 2023-SSRN Electronic Journal

… existential risks, one of the most feared has come to be an unaligned Artificial General Intelligence (AGI) (or Artificial Super-Intelligence … catastrophic and existential risk to humanity […

Cited by 2

The Existential Threats of AI

link.springer.com-Robert Samuels, 2025-The Global Solution to AI

… In principle, we could build a kind of superintelligence … superintelligence would do—looks quite difficult. It also looks like we will only get one chance. Once unfriendly superintelligence …

Cited by 1

The Pursuit of Human Existential Significance from the Perspective of AI Existential Risk

xb.xynu.edu.cn-S Lei, 2021-Journal of Xinyang Normal University (Philosophy and Social Sciences Edition)

… The deep challenge of super-intelligence to the significance of human existence does not … Therefore, in order to deal with the deep challenge caused by super-intelligence, we should …

The Notion of Existential Risk and Its Role for the Anticipation of Technological Development’s Long-Term Impact

link.springer.com-Roberto Paura, 2019-Anticipation Science

… In this article, I will argue that the notion of existential risk … In the first part, I analyze the notion of existential risk through a … toward a superintelligence without taking the related risks in …

Cited by 1

Superintelligence, heuristics and embodied threats

link.springer.com-A. Mastrogiorgio, Riccardo Palumbo, 2025-Mind & Society

… could create an existential risk similar to that of superintelligent machines. Nevertheless, … is meaningful for existential risk. Indeed, we do not need superintelligent machines— and a …

Cited by 4

Existential risks: a philosophical analysis

www.tandfonline.com-Phil Torres, 2019-Inquiry (Zone 2, IF 0.9)

This paper examines and analyzes five definitions of ‘existential risk.’ It tentatively adopts a pluralistic approach according to which the definition that scholars employ should depend upon the particular context of use. More specifically, the notion that existential risks are ‘risks of human extinction or civilizational collapse’ is best when communicating with the public, whereas equating existential risks with a ‘significant loss of expected value’ may be the most effective definition for establishing existential risk studies as a legitimate field of scientific and philosophical inquiry. In making these arguments, the present paper hopes to provide a modicum of clarity to foundational issues relating to the central concept of arguably the most important discussion of our times.

Cited by 23

Algorithmic Bias and Data Justice: ethical challenges in Artificial Intelligence Systems

dialnet.unirioja.es-Javier González-Argote, E. Maldonado, Karina Maldonado, 2025-EthAIca

This article examines the critical ethical challenges posed by algorithmic bias in artificial intelligence (AI) systems, focusing on its implications for social justice and data equity. Through a systematic review of case studies and theoretical frameworks, we analyze how biased datasets and algorithmic designs perpetuate structural inequalities, particularly affecting marginalized communities. The study highlights key examples, such as gender and racial biases in facial recognition and hiring algorithms, while exploring mitigation strategies rooted in data justice principles. Additionally, we evaluate regulatory responses, including the European Union's AI Act, which proposes a risk-based governance framework. The findings underscore the urgent need for interdisciplinary approaches to develop fairer AI systems that align with ethical standards and human rights.

Cited by 5

Algorithmic injustice: a relational ethics approach

www.cell.com-Abeba Birhane, 2021-Patterns (Zone 2, IF 7.4)

It has become trivial to point out that algorithmic systems increasingly pervade the social sphere. Improved efficiency, the hallmark of these systems, drives their mass integration into day-to-day life. However, as a robust body of research in the area of algorithmic injustice shows, algorithmic systems, especially when used to sort and predict social outcomes, are not only inadequate but also perpetuate harm. In particular, a persistent and recurrent trend within the literature indicates that society's most vulnerable are disproportionally impacted. When algorithmic injustice and harm are brought to the fore, most of the solutions on offer (1) revolve around technical solutions and (2) do not center disproportionally impacted communities. This paper proposes a fundamental shift, from rational to relational, in thinking about personhood, data, justice, and everything in between, and places ethics as something that goes above and beyond technical solutions. Outlining the idea of ethics built on the foundations of relationality, this paper calls for a rethinking of justice and ethics as a set of broad, contingent, and fluid concepts and down-to-earth practices that are best viewed as a habit and not a mere methodology for data science. As such, this paper mainly offers critical examinations and reflection and not “solutions.”

Cited by 497

Algorithmic bias, fairness, and inclusivity: a multilevel framework for justice-oriented AI

link.springer.com-P. Panarese, Marta Grasso, C. Solinas, 2025-AI & SOCIETY (Zone 3, IF 4.7)

… of research to assess how bias is theorized, measured, and … of consensus on how algorithmic bias and fairness should be … to bias mitigation, one that integrates computational, ethical, …

Cited by 18

Disambiguating Algorithmic Bias: From Neutrality to Justice

dl.acm.org-Elizabeth Edenberg, Alexandra Wood, 2023-Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society

As algorithms have become ubiquitous in consequential domains, societal concerns about the potential for discriminatory outcomes have prompted urgent calls to address algorithmic bias. In response, a rich literature across computer science, law, and ethics is rapidly proliferating to advance approaches to designing fair algorithms. Yet computer scientists, legal scholars, and ethicists are often not speaking the same language when using the term ‘bias.’ Debates concerning whether society can or should tackle the problem of algorithmic bias are hampered by conflations of various understandings of bias, ranging from neutral deviations from a standard to morally problematic instances of injustice due to prejudice, discrimination, and disparate treatment. This terminological confusion impedes efforts to address clear cases of discrimination. In this paper, we examine the promises and challenges of different approaches to disambiguating bias and designing for justice. While both approaches aid in understanding and addressing clear algorithmic harms, we argue that they also risk being leveraged in ways that ultimately deflect accountability from those building and deploying these systems. Applying this analysis to recent examples of generative AI, our argument highlights unseen dangers in current methods of evaluating algorithmic bias and points to ways to redirect approaches to addressing bias in generative AI at its early stages in ways that can more robustly meet the demands of justice.

Cited by 28

Algorithmic Bias and Access to Opportunities

books.google.com-Lisa Herzog, 2021-Oxford Handbook of Digital Ethics

The chapter discusses the problem of algorithmic bias in decision-making processes that determine access to opportunities, such as recidivism scores, college admission decisions, or loan scores. After describing the technical bases of algorithmic bias, it asks how to evaluate them, drawing on Iris Marion Young’s perspective of structural (in)justice. The focus is in particular on the risk of so-called ‘Matthew effects’, in which privileged individuals gain more advantages, while those who are already disadvantaged suffer further. Some proposed solutions are discussed, with an emphasis on the need to take a broad, interdisciplinary perspective rather than a purely technical perspective. The chapter also replies to the objection that private firms cannot be held responsible for addressing structural injustices and concludes by emphasizing the need for political and social action.

Cited by 12

Algorithmic bias: Senses, sources, solutions

compass.onlinelibrary.wiley.com-Sina Fazelpour, D. Danks, 2021-Philosophy Compass (Zone 1 Top, IF 2.4)

Data-driven algorithms are widely used to make or assist decisions in sensitive domains, including healthcare, social services, education, hiring, and criminal justice. In various cases, such algorithms have preserved or even exacerbated biases against vulnerable communities, sparking a vibrant field of research focused on so-called algorithmic biases. This research includes work on identification, diagnosis, and response to biases in algorithm-based decision-making. This paper aims to facilitate the application of philosophical analysis to these contested issues by providing an overview of three key topics: What is algorithmic bias? Why and how can it occur? What can and should be done about it? Throughout, we highlight connections, both actual and potential, with philosophical ideas and concerns.

Cited by 179

Ethics of artificial intelligence in global health: Explainability, algorithmic bias and trust.

www.sciencedirect.com-A. Kerasidou, 2021-Journal of Oral Biology and Craniofacial Research

AI has the potential to disrupt and transform the way we deliver care globally. It is reputed to be able to improve the accuracy of diagnoses and treatments, and make the provision of services more efficient and effective. In surgery, AI systems could lead to more accurate diagnoses of health problems and help surgeons better care for their patients. In the context of low- and middle-income countries (LMICs), where access to healthcare still remains a global problem, AI could facilitate access to healthcare professionals and services, even specialist services, for millions of people. The ability of AI to deliver on its promises, however, depends on successfully resolving the ethical and practical issues identified, including that of explainability and algorithmic bias. Even though such issues might appear to be merely practical or technical ones, their closer examination uncovers questions of value, fairness and trust. It should not be left to AI developers, be they research institutions or global tech companies, to decide how to resolve these ethical questions. In particular, relying only on the trustworthiness of companies and institutions to address ethical issues relating to justice, fairness and health equality would be unsuitable and unwise. The pathway to a fair, appropriate and relevant AI necessitates the development and, critically, the successful implementation of national and international rules and regulations that define the parameters and set the boundaries of operation and engagement.

Cited by 53

Algorithmic justice and ethical governance in artificial intelligence: a conceptual insight and further research suggestions

www.emerald.com-ES Asamoah, STG Doku, S Koomson, 2026-… Journal of Ethics and …

… AI ethics, algorithmic bias, algorithmic justice and responsible AI governance. The selection process was guided by transparent inclusion and exclusion criteria, consistent with PRISMA …

The ethical imperative of algorithmic fairness in AI-enabled hiring: a critical analysis of bias, accountability, and justice

link.springer.com-Jason Law, 2025-AI and Ethics

… with the broader ethical implications of discriminatory outcomes. This paper examines algorithmic bias in hiring through established ethical concepts of justice, capability development, …

Theorising Algorithmic Justice

www.tandfonline.com-O. Marjanovic, D. Cecez-Kecmanovic, R. Vidgen, 2021-European Journal of Information Systems (Zone 2, IF 8.6)

The mounting evidence of unintended harmful social consequences of automated algorithmic decision-making (AADM), powered by AI and big data, in transformative services (e.g., welfare services), is startling. The algorithmic harm experienced by individuals, communities and society-at-large involves new injustice claims and disputes that go beyond issues of social justice. Drawing from the theory of “abnormal justice” in this paper we articulate a new theory of algorithmic justice that addresses the questions: WHAT is the matter of algorithmic justice? WHO counts as a subject of algorithmic justice? HOW are algorithmic justices performed? and How to address and resolve disputes about the WHAT, WHO and HOW of algorithmic justice? We illustrate the theory of algorithmic justice by drawing from a case of AADM in social welfare services, widely adopted by governments around the world. Our research points to datafication, technological inscribing and the systemic nature of injustices as important IS-specific aspects of algorithmic justice. Our main practical contribution comes from the articulation of algorithmic justice as a framework that (1) makes visible the injustices related to the “what”, “who”, and “how” of AADM in transformative services, and (2) provides further insights into how we might address and resolve these algorithmic injustices.

Cited by 34

Ethical Implications of Bias in Machine Learning

scholarspace.manoa.hawaii.edu-Adrienne Yapo, Joseph W. Weiss, 2018-Proceedings of the Annual Hawaii International Conference on System Sciences

Biases in AI and machine learning algorithms are presented and analyzed through two issues management frameworks with the aim of showing how ethical problems and dilemmas can evolve. While the concept of “the singularity” in AI is presently more predictive than actual, both benefits and damage can result from AI, and damage in particular from a failure to consider biases in its design and development. Inclusivity and stakeholder awareness regarding potential ethical risks and issues need to be identified during the design of AI algorithms to ensure that the most vulnerable in societies are protected from harm.

Cited by 106

Beyond bias and discrimination: redefining the AI ethics principle of fairness in healthcare machine-learning algorithms

link.springer.com-B. Giovanola, S. Tiribelli, 2022-AI & SOCIETY (Zone 3, IF 4.7)

The increasing implementation of and reliance on machine-learning (ML) algorithms to perform tasks, deliver services and make decisions in health and healthcare have made the need for fairness in ML, and more specifically in healthcare ML algorithms (HMLA), a very important and urgent task. However, while the debate on fairness in the ethics of artificial intelligence (AI) and in HMLA has grown significantly over the last decade, the very concept of fairness as an ethical value has not yet been sufficiently explored. Our paper aims to fill this gap and address the AI ethics principle of fairness from a conceptual standpoint, drawing insights from accounts of fairness elaborated in moral philosophy and using them to conceptualise fairness as an ethical value and to redefine fairness in HMLA accordingly. To achieve our goal, following a first section aimed at clarifying the background, methodology and structure of the paper, in the second section, we provide an overview of the discussion of the AI ethics principle of fairness in HMLA and show that the concept of fairness underlying this debate is framed in purely distributive terms and overlaps with non-discrimination, which is defined in turn as the absence of biases. After showing that this framing is inadequate, in the third section, we pursue an ethical inquiry into the concept of fairness and argue that fairness ought to be conceived of as an ethical value. Following a clarification of the relationship between fairness and non-discrimination, we show that the two do not overlap and that fairness requires much more than just non-discrimination. Moreover, we highlight that fairness not only has a distributive but also a socio-relational dimension. Finally, we pinpoint the constitutive components of fairness. In doing so, we base our arguments on a renewed reflection on the concept of respect, which goes beyond the idea of equal respect to include respect for individual persons. 
In the fourth section, we analyse the implications of our conceptual redefinition of fairness as an ethical value in the discussion of fairness in HMLA. Here, we claim that fairness requires more than non-discrimination and the absence of biases as well as more than just distribution; it needs to ensure that HMLA respects persons both as persons and as particular individuals. Finally, in the fifth section, we sketch some broader implications and show how our inquiry can contribute to making HMLA and, more generally, AI promote the social good and a fairer society.

Cited by 132

The ethics of algorithms: key problems and solutions

link.springer.com-Andreas Tsamados, Nikita Aggarwal, Josh Cowls et al., 2020-AI & SOCIETY (Zone 3, IF 4.7)

Research on the ethics of algorithms has grown substantially over the past decade. Alongside the exponential development and application of machine learning algorithms, new ethical problems and solutions relating to their ubiquitous use in society have been proposed. This article builds on a review of the ethics of algorithms published in 2016 (Mittelstadt et al. Big Data Soc 3(2), 2016). The goals are to contribute to the debate on the identification and analysis of the ethical implications of algorithms, to provide an updated analysis of epistemic and normative concerns, and to offer actionable guidance for the governance of the design, development and deployment of algorithms.

Cited by 347
