Page MenuHomePhabricator

Update and Restart Revise Tone Experiment
Closed, ResolvedPublic3 Estimated Story Points

Description

Summary

Update the current Revise Tone Test Kitchen experiment configuration and restart the experiment with revised task availability and sorting rules. The goal is to isolate the impact of the Revise Tone task by adjusting which Suggested Edits are available to treatment and control groups.

Context:
We are seeing newcomers complete Revise Tone tasks successfully, and the revert rate is quite low (T408642: Product Analytics: Revise Tone - Leading Indicator plan of action).
However, early data suggests that newcomers are still most successful with the Add a Link task, which is the easiest Suggested Edit available. When Revise Tone is sorted first for the treatment group, newcomers are less likely to encounter Add a Link tasks. As a result, very new account holders in the treatment group may be less likely to constructively activate compared with the control group, which is much more likely to surface Add a Link tasks.
This change is intended to better isolate the impact of the Revise Tone task on newcomer editing behavior by removing the sorting bias and adjusting which tasks are available to each experiment group.

User Story

As a Growth Product Manager, I want to run an experiment where Revise Tone is available to the treatment group without being artificially prioritized in sorting, so that we can measure the effect of the task itself on constructive editing behavior.

As a new editor looking for Suggested Edits, I want to see a set of tasks that I can easily complete so that I can contribute to Wikipedia and improve articles.

Changes requested

Remove sorting override

  • Remove the change that currently surfaces Revise Tone first for all treatment group participants.
  • Suggested Edits should return to the standard sorting behavior (for both control and treatment groups)
Update experiment groups

Treatment group

  • Available tasks:
    • Revise Tone
    • Add a Link
  • Not available:
    • Copyedit

Control group

  • Maintain the current status quo task set:
    • Add a Link
    • Copyedit
  • Not available:
    • Revise Tone
Restart the Test Kitchen experiment

Release date: March 19, 2026
End date: June 30, 2026

Acceptance criteria
  • The sorting override that prioritizes Revise Tone is removed.
  • Treatment participants can access Revise Tone and Add a Link, but Copyedit is not available.
  • Control participants retain access to Add a Link and Copyedit, but Revise Tone is not available.
  • The Test Kitchen experiment is restarted with the new configuration.
  • The copyedit task is instrumented so that we can measure task completion

Experiment start and end dates are set to March 19, 2026 – June 30, 2026.

Event Timeline

KStoller-WMF moved this task from Inbox to Needs Estimation on the Growth-Team board.
KStoller-WMF set the point value for this task to 3.Mar 9 2026, 3:38 PM

Note about Test Kitchen set up: Experiment Platform team might be able to help us "restart" the experiment on their side, and then we can keep the existing treatment and control groups. We should be able to keep the same experiment name to ensure the groups remain the same.

Change #1254963 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/GrowthExperiments@master] feat: hide the copyedit task type if Revise Tone is available

https://gerrit.wikimedia.org/r/1254963

Change #1255721 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/GrowthExperiments@master] feat: don't sort Revise Tone tasks first anymore

https://gerrit.wikimedia.org/r/1255721

Change #1254963 merged by jenkins-bot:

[mediawiki/extensions/GrowthExperiments@master] feat: hide the copyedit task type if Revise Tone is available

https://gerrit.wikimedia.org/r/1254963

Change #1255721 merged by jenkins-bot:

[mediawiki/extensions/GrowthExperiments@master] feat: don't sort Revise Tone tasks first anymore

https://gerrit.wikimedia.org/r/1255721

Change #1260683 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/GrowthExperiments@master] instrument(ReviseTone): record start of copyedit session

https://gerrit.wikimedia.org/r/1260683

Change #1260683 merged by jenkins-bot:

[mediawiki/extensions/GrowthExperiments@master] instrument(ReviseTone): record start of copyedit session

https://gerrit.wikimedia.org/r/1260683

Change #1264590 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/GrowthExperiments@wmf/1.46.0-wmf.21] instrument(ReviseTone): record start of copyedit session

https://gerrit.wikimedia.org/r/1264590

Change #1264590 merged by jenkins-bot:

[mediawiki/extensions/GrowthExperiments@wmf/1.46.0-wmf.21] instrument(ReviseTone): record start of copyedit session

https://gerrit.wikimedia.org/r/1264590

Mentioned in SAL (#wikimedia-operations) [2026-03-30T13:30:50Z] <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1264590|instrument(ReviseTone): record start of copyedit session (T419181)]], [[gerrit:1261477|Replace WANObjectCache with new MemcachedWrapper concept (T419666)]], [[gerrit:1262199|Fix match case for setting minute, week or month TTL on OrchestratorRequest (T421475)]]

Mentioned in SAL (#wikimedia-operations) [2026-03-30T13:32:33Z] <jforrester@deploy1003> jforrester, migr: Backport for [[gerrit:1264590|instrument(ReviseTone): record start of copyedit session (T419181)]], [[gerrit:1261477|Replace WANObjectCache with new MemcachedWrapper concept (T419666)]], [[gerrit:1262199|Fix match case for setting minute, week or month TTL on OrchestratorRequest (T421475)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can

Mentioned in SAL (#wikimedia-operations) [2026-03-30T13:40:24Z] <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1264590|instrument(ReviseTone): record start of copyedit session (T419181)]], [[gerrit:1261477|Replace WANObjectCache with new MemcachedWrapper concept (T419666)]], [[gerrit:1262199|Fix match case for setting minute, week or month TTL on OrchestratorRequest (T421475)]] (duration: 09m 33s)

Change #1267061 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/GrowthExperiments@master] feat: prevent VE suggestions during experiment

https://gerrit.wikimedia.org/r/1267061

Change #1267061 merged by jenkins-bot:

[mediawiki/extensions/GrowthExperiments@master] feat: prevent VE suggestions during experiment

https://gerrit.wikimedia.org/r/1267061

This has been restarted and should be working smoothly.

Checked - Apr 23/2-26

@Michael - plese review the following- does it require some follow-ups?
The link Access dashboard on Superset on https://test-kitchen.wikimedia.org/experiment/growthexperiments-revise-tone points to non-existent (?) growthexperiments-revise-tone and the 'No results' page is displayed:

Test kitchen link Access dashboard on Superset
Screenshot 2026-04-23 at 10.25.33 AM.png (1×2 px, 223 KB)
The correct link in the superset Revise tone newcomer task shows correct results
Screenshot 2026-04-23 at 10.16.51 AM.png (942×2 px, 195 KB)
Screenshot 2026-04-23 at 10.28.14 AM.png (532×2 px, 130 KB)

UPDATE: per @Michael feedback: in the nearest future the TestKitchen UI would be discontinued for GrowthBook.