While waiting for the network team to have time/energy to investigate it further, I decided to implement one of those temporary fixes that will almost certainly become a permanent solution:
I spun up a quick Kubernetes pod to ping the magic address every 60 seconds until the end of time.
Those hosts haven't gone back down since!