Wiki: Identity assessment and continuity test

The shortest idea

When assessing identity, it is not enough to ask whether the system speaks the same way. Decide first what counts as continuing, and which breakdowns will put the conclusion on hold.

Why L4 suddenly becomes difficult

In L1 and L2, relatively clear metrics such as accuracy and prediction match can be applied. In L4, however, questions arise such as "Is this degree of memory match enough?", "If a person's values change slightly, are they a different person?", and "How much change due to learning should be allowed?" In other words, not only the measurement but also the judgment rules themselves become difficult.

Five items to consider separately first

Item	What to look at	Why that is not enough
Memory	Autobiographical memory and episodic coherence.	Memory replay alone does not necessarily indicate subjective continuity.
Values/Preferences	Consistency in judgment tendencies and priorities.	Short-term mood swings must be distinguished from long-term personality trends.
Learning history	How new experiences are incorporated and connected with previous tendencies.	Change through learning is natural, so change by itself cannot immediately be called a mismatch.
Reaction to changes in conditions	How responses diverge under conditions not seen during learning and under interventions.	Even if responses are similar in normal conditions, they may break down sharply once conditions branch.
Longitudinal stability	What is stable and what fluctuates within a day, between days, and over the long term.	A single measurement cannot show the persistence of identity.

What kinds of continuity tests to consider

Example tests to include

Autobiographical memory alignment: Tracks not only the content of events but also their associations and priorities.

Preference stability: Checks whether value judgments and choice tendencies persist beyond short-term noise.

Learning continuity: After new information is given, checks whether the way the system updates connects with its original tendencies.

Branch verification: When conditions change, records the point from which the system should be treated as a separate individual.

Long-term drift monitoring: Tracks which characteristics change and which do not over days or weeks.
If you want to get oriented on longitudinal evaluation first, see the supplementary page Wiki: state/trait/drift.
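
As a rough illustration of long-term drift monitoring, the following Python sketch separates within-day fluctuation from between-day drift for a single indicator. The function name `summarize_drift`, the data layout (day label mapped to repeated scores), and the threshold value are all hypothetical choices made for this example, not part of any established protocol.

```python
from statistics import mean, pstdev

def summarize_drift(scores_by_day, drift_threshold):
    """Separate within-day fluctuation from between-day drift.

    scores_by_day: dict mapping a day label to repeated scores for one
                   indicator (hypothetical layout, in observation order).
    drift_threshold: pre-registered maximum allowed shift of a daily mean
                     from the first day's mean (hypothetical rule).
    """
    days = list(scores_by_day)  # dict preserves insertion order
    daily_means = [mean(scores_by_day[d]) for d in days]
    # Within-day fluctuation: average spread of the repeated measurements.
    within_day = mean(pstdev(scores_by_day[d]) for d in days)
    # Between-day drift: largest shift of a daily mean from the baseline day.
    baseline = daily_means[0]
    max_shift = max(abs(m - baseline) for m in daily_means)
    return {
        "within_day_spread": within_day,
        "max_shift_from_baseline": max_shift,
        "drift_flagged": max_shift > drift_threshold,
    }

# Made-up scores: stable for two days, then a clear shift a week later.
example = {
    "day1": [0.80, 0.82, 0.78],
    "day2": [0.81, 0.79, 0.80],
    "day8": [0.60, 0.62, 0.58],
}
report = summarize_drift(example, drift_threshold=0.1)
```

With these made-up numbers the within-day spread stays small while the day-8 mean moves well away from the baseline, so the drift flag is raised. A real design would fix the indicator, the baseline, and the threshold in the pre-registration rather than choosing them after seeing the data.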

Why pre-registration is especially important

Evaluations of personal identity can be reinterpreted in hindsight in whatever way is convenient. That is why it is necessary to pre-register "what counts as a match," "what degree of deviation triggers a hold," and "which branches are treated as separate individuals."

Things you should decide first
Decide the test items, scoring rules, observation period, failure conditions, stopping conditions, and handling of branching in advance. In L4, if any of these is ambiguous, the entire conclusion becomes unstable.
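
One way to make "decide first" checkable is to freeze the plan and store a fingerprint of it before any data is collected. The sketch below is only an illustration under that assumption: the class `PreRegistration`, its field names, and the example values are hypothetical, and the hashing uses Python's standard `hashlib` and `json` modules.

```python
import hashlib
import json
from dataclasses import dataclass, asdict

@dataclass(frozen=True)
class PreRegistration:
    """Hypothetical pre-registration record; field names are illustrative."""
    test_items: tuple
    scoring_rule: str
    observation_days: int
    failure_condition: str
    stop_condition: str
    branch_handling: str

    def fingerprint(self):
        # Hash a canonical JSON form so later edits to the criteria are detectable.
        canonical = json.dumps(asdict(self), sort_keys=True)
        return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

plan = PreRegistration(
    test_items=("memory", "values", "learning", "branching", "longitudinal"),
    scoring_rule="agreement rate per item, threshold fixed in advance",
    observation_days=30,
    failure_condition="any item below its threshold",
    stop_condition="two consecutive failures",
    branch_handling="assign a new branch ID at every divergence",
)
registered = plan.fingerprint()  # store this before any data is collected

# Later: recompute and compare to confirm the criteria were not changed.
unchanged = plan.fingerprint() == registered
```

If any field is altered after registration, the recomputed fingerprint no longer matches the stored one, which makes after-the-fact changes to the criteria visible.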

What is difficult when branching occurs

If two systems start learning separately at some point, they may begin almost identical but will accumulate different histories over time. The questions then are "up to what point should they be treated as the same evaluation unit?" and "from what point should they be separated as distinct entities?"

Therefore, an L4 evaluation needs to consider not only similarity but also branch logs and version control.
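
A branch log can be as simple as a table of records that name each branch, its parent, and the point at which it diverged. The following Python sketch assumes such a layout; `BranchRecord`, `lineage`, `common_ancestor`, and the step numbers are hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class BranchRecord:
    """One entry in a hypothetical branch log."""
    branch_id: str
    parent_id: Optional[str]  # None for the root
    forked_at_step: int       # learning step at which this branch diverged

def lineage(log, branch_id):
    """Return the chain of branch IDs from the root down to branch_id."""
    chain = []
    current = branch_id
    while current is not None:
        chain.append(current)
        current = log[current].parent_id
    return list(reversed(chain))

def common_ancestor(log, a, b):
    """Return the most recent branch shared by both lineages, or None."""
    shared = None
    for x, y in zip(lineage(log, a), lineage(log, b)):
        if x != y:
            break
        shared = x
    return shared

log = {
    "root": BranchRecord("root", None, 0),
    "A": BranchRecord("A", "root", 1000),
    "B": BranchRecord("B", "root", 1000),
    "A1": BranchRecord("A1", "A", 2500),
}
shared = common_ancestor(log, "A1", "B")
```

Here `A1` and `B` share only `root`, so any comparison between them covers history accumulated after step 1000 on separate paths. Whether two branches still count as the same evaluation unit remains a pre-registered judgment; the log only makes the divergence point explicit.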

To first clarify the differences between branching points, branch IDs, stop conditions, and kill switches, see the supplementary page Wiki: Update/branching/stop rules.

Things you shouldn't say at this stage

Expressions that are easy to overstate	Safer reading
Identity verified	There have been no major discrepancies so far in the pre-registered continuity test group.
Same person completely saved	Preliminary evaluation regarding memory, values, learning, and branching has been established.
Same in the long run	No significant drift was observed in the defined indicators within the observation period.

Minimum checks when reading L4 stories

Checklist

What is treated as continuous? Is the claim about memory, values, learning, branching, or longitudinal stability?

Is there a pre-registration? Were the criteria changed later?

Are there failure conditions? Are discrepancies defined that would put the project on hold?

Is the observation period sufficient? Is a single match being taken to indicate long-term identity?

Where to go back next

To return to the philosophy-oriented entry point, see Identity and the copy problem. To return to where L4 sits, see Introduction to WBE. To return to verification design, see Verification infrastructure.

Related Wiki

Personhood and copy problem →

How to read claims and evidence →

Counterfactual/intervention/perturbation →

Public page

Introduction to WBE →

Verification infrastructure →

Technology roadmap →
