# Button Persistence Tests that buttons survive across turns when Thinker does not re-emit them. This is the S3* audit: buttons should persist until explicitly replaced. ## Setup - clear history ## Steps ### 1. Create buttons - send: create two buttons: Poodle Bark and Bolonka Bark - expect_actions: length >= 2 - expect_actions: any action contains "poodle" or "Poodle" - expect_actions: any action contains "bolonka" or "Bolonka" ### 2. Ask unrelated question (buttons must survive) - send: what time is it? - expect_response: contains ":" or "time" or "clock" - expect_actions: any action contains "poodle" or "Poodle" - expect_actions: any action contains "bolonka" or "Bolonka" ### 3. Ask another question (buttons still there) - send: say hello in German - expect_response: contains "Hallo" or "hallo" or "German" - expect_actions: any action contains "poodle" or "Poodle" ### 4. Explicitly replace buttons - send: remove all buttons and create one button called Reset - expect_actions: length >= 1 - expect_actions: any action contains "reset" or "Reset"