Abstract
In his book Superintelligence, Nick Bostrom points to several ways the development of Artificial Intelligence (AI) might fail, turn out to be malignant or even induce an existential catastrophe. He describes ‘Perverse Instantiations’ (PI) as cases, in which AI figures out how to satisfy some goal through unintended ways. For instance, AI could attempt to paralyze human facial muscles into constant smiles to achieve the goal of making humans smile. According to Bostrom, cases like this ought to be avoided since they include a violation of human designer’s intentions. However, AI finding solutions that its designers have not yet thought of and therefore could also not have intended is arguably one of the main reasons why we are so eager to use it on a variety of problems. In this paper, I aim to show that the concept of PI is quite vague, mostly due to ambiguities surrounding the term ‘intention’. Ultimately, this text aims to serve as a starting point for a further discussion of the research topic, the development of a research agenda and future improvement of the terminology.
Original language | English |
---|---|
Title of host publication | 34th Bled eConference |
Subtitle of host publication | Digital Support from Crisis to Progressive Change |
Editors | Andreja Puhicar, Mirjana Kljajić Borštnar, Roger Bons, Helen Cripps, Anand Sheombar, Doroteja Vidmar |
Place of Publication | Bled |
Pages | 67-73 |
ISBN (Electronic) | 978-961-286-485-9 |
DOIs | |
Publication status | Published - 1 Jun 2021 |
Event | 34th Bled eConference : Digital Support from Crisis to Progressive Change - Virtuell, Slovenia Duration: 27 Jun 2021 → 30 Jun 2021 |
Conference
Conference | 34th Bled eConference |
---|---|
Country/Territory | Slovenia |
City | Virtuell |
Period | 27/06/21 → 30/06/21 |