This reminds me of how unfair it felt when the hotword detector for voice assistants was described as "recording", when it's really just parsing a byte stream and never storing it.
Voice ID systems I'm familiar with work on a similar premise.
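
To make the "parsing, not storing" distinction concrete, here is a rough sketch of what I mean. It is a toy, not how any real assistant works: real detectors run an acoustic model over audio frames, whereas this stand-in just matches a literal byte pattern. The function name `detect_hotword` and the byte-pattern approach are my own assumptions for illustration. The point is only that the detector holds a small fixed-size window and drops everything older.

```python
from collections import deque
from typing import Iterable, Iterator


def detect_hotword(stream: Iterable[bytes], pattern: bytes) -> Iterator[int]:
    """Yield the stream offset just past each occurrence of `pattern`.

    Hypothetical stand-in for a hotword detector: only the most recent
    len(pattern) bytes are held in memory. Older bytes fall out of the
    window as new ones arrive, so nothing is accumulated or written out.
    """
    if not pattern:
        raise ValueError("pattern must be non-empty")
    window = deque(maxlen=len(pattern))  # fixed-size rolling window
    target = tuple(pattern)
    offset = 0
    for chunk in stream:
        for byte in chunk:
            window.append(byte)  # the oldest byte is discarded automatically
            offset += 1
            if tuple(window) == target:
                yield offset
```

Feeding it chunks such as `[b"xxhe", b"y!zz", b"hey!"]` with the pattern `b"hey!"` finds matches that straddle chunk boundaries, while memory use stays bounded by the pattern length no matter how long the stream runs.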