The shortest idea
When questioning someone's identity, it is not only necessary to ask whether they speak the same way. It is important to decide first what is going on and what will be put on hold if things break down.
Why L4 suddenly becomes difficult
In L1 and L2, you can put relatively clear metrics like accuracy and prediction match. However, in L4, questions such as ``Is this memory match enough?'' ``If a person's values change slightly, are they a different person?'' and ``To what extent should we allow changes due to learning?'' come into play. In other words, not only the measurement but also the judgment rules themselves become difficult.
First of all, 5 items to consider separately
| Item | What do you want to see | Why that's not enough |
|---|---|---|
| Memory | Autobiographical memory and episodic coherence. | Memory replay alone does not necessarily indicate subjective continuity. |
| Values/Preferences | Consistency in judgment tendencies and priorities. | It is necessary to distinguish between short-term mood swings and long-term personality trends. |
| Learning history | How to incorporate new experiences and connect with previous trends. | It is natural for things to change as they learn, and the change itself cannot be immediately called a mismatch. |
| Reaction to changes in conditions | How do responses diverge under unlearning conditions and interventions? | Even if they are similar during normal times, there is a possibility that they will break down drastically due to branching. |
| Longitudinal stability | What is stable and what fluctuates within the day, between days, and over the long term? | It is not possible to see the persistence of identity with just one measurement. |
What kind of continuity test do you want to consider
Tests you would like to include as examples
- Autobiographical memory alignment: Tracks not only the content of events, but also their associations and priorities. </li>
- Preference stability:Looks at whether value judgments and choice trends persist beyond short-term noise.
- Learning continuity:After giving new information, see if the update method connects with the original trend.
- Branch verification:When changing conditions, record the point at which it should be treated as a separate individual.
- Long-term drift monitoring: Track characteristics that change and characteristics that don't change over days or weeks.
</ul> </div>In particular, if you want to organize only the entrance of longitudinal evaluation first, Wiki: state/trait/drift is a supplementary lecture.
</section>Why pre-registration is especially important
Evaluations of a person's personality can be interpreted in any way that suits them in hindsight. That's why it is necessary to pre-register ``what to consider as a match,'' ``to what degree of deviation to suspend,'' and ``which branches to treat as separate individuals.''
Things you should decide firstThese are test items, scoring rules, observation period, failure conditions, stopping conditions, and handling of branching. In L4, if this part is ambiguous, the entire conclusion will be shaken.
What is difficult when branching occurs
If the two systems start learning separately at some point, they may start out almost the same, but over time they will have different histories. At this time, the question is ``to what point should they be treated as the same evaluation unit?'' and ``at what point should they be separated as separate entities?''
Therefore, when evaluating L4, it is important to consider not only similarity but also branch log and version control.
If you want to clarify the differences between branching points, branch IDs, stop conditions, and kill switches first, Wiki: Update/branching/stop rules is a supplementary lecture.
Things you shouldn't say at this stage
Expressions that are easy to overstate Safer reading Identity verified There have been no major discrepancies so far in the pre-registered continuity test group. Same person completely saved Preliminary evaluation regarding memory, values, learning, and branching has been established. Same in the long run No significant drift was observed in the defined indicators within the observation period. Minimum checks when reading L4 stories
Checklist
- What do you consider to be continuous:Whether you are looking at memory, values, learning, branching, or longitudinal.
- Are there pre-registrations?Are the criteria changed later?
- Are there any failure conditions?Are there any discrepancies that will cause the project to be put on hold?
- Is the observation period sufficient?Does a single match indicate long-term identity?
</article> </main>Where to go back next
If you want to go back to the philosophy-oriented entrance, please use Identity and the copy problem, if you want to go back to the L4 position, please use Introduction to WBE, and if you want to go back to verification design, please use Verification infrastructure.