Yes. Underperformance is a decent indicator. Number of PRs is not a great one, but it can be somewhat correlated with eng velocity/productivity. Twitter will probably end up using a combination of metrics like that.
They will hopefully also talk to EMs, but at this scale might choose not to. It's going to be an ugly process but I doubt it's going to be a random one. They clearly understand that it's in their interest to retain the most skilled and performant employees.
Also, don't want to sound too cynical, but I would guess that it's actually not that hard to evaluate eng performance with all the data companies are collecting during WFH. I am sure Twitter requires using a VPN, so digging into that data would yield quite a revealing and precise performance metric. The only reason this approach is not widespread is ethical, not technological.