Jame5

On benevolence and friendly AI theory

October 25, 2007 at 10:01 pm · Filed under Uncategorized

Jame5 is not just a science fiction novel – it is a science fiction novel with a cause. Ensuring the creation of a friendly AI is hard for many reasons:

Creating an AGI is hard
Goal retention is hard
Recursive self improvement is hard

The question of what friendliness means, however, predates all of those problems. It is a separate question, and it needs to be answered before the creation of a friendly AI can be attempted. Coherent Extrapolated Volition, or CEV for short, is Eliezer S. Yudkowsky’s take on friendliness.

While CEV does a great job of describing what a friendly AGI will do, my critique of CEV is that it postpones answering the question of what friendliness specifically is until after we have an AGI that will answer that question for us.

Yes – a successfully implemented friendly AGI will do ‘good’ stuff and act in our ‘best interest’. But what is good, and what is our best interest? In Jame5 I provide a different solution to the friendliness issue; anyone who would like to get right to the meat can skip to the end of chapter 9.

In addition, I have summarized my core friendliness concepts in a paper called ‘Benevolence – a Materialist Philosophy of Goodness’ (2007/11/09 UPDATE: latest version here), which concludes by formulating the following friendly AGI supergoal:

Definitions:

Suffering: negative subjective experience equivalent to the subjective departure from an individual’s model of optimal fitness state as encoded in its genome/memome
Growth: absolute increase in an individual’s fitness
Joy: positive subjective experience equivalent to the subjective contribution to moving closer towards an individual’s model of optimal fitness state as encoded in its genome/memome

Derived friendly AGI supergoal: “Minimize all involuntary human suffering, direct all unavoidable suffering towards growth, and reward all voluntary suffering contributing to an individual’s growth with an equal or greater amount of joy.”

3 Comments »

Jame5 » Self improvement versus non-eudaemonic dystopias said,

November 6, 2007 @ 11:48 pm

[…] the context of my friendly AI theory I suggest a similar approach to Bostrom’s Singleton however honoring Ben Goertzel’s […]

Jame5 » Estimating cognitive evolution’s complexity boundary in humans said,

November 7, 2007 @ 7:21 pm

[…] As basis I will assume that: 1) cognitive evolution in humans is taking place on the level of beliefs (a brief summary can be found in my paper on friendly AI […]

Jame5 » Understanding inter group competion in humans said,

November 8, 2007 @ 8:07 pm

[…] or perish. For a quick introduction to my thoughts on this issue I suggest reading my paper on friendly AI theory or Jame5 pages 69 and […]
