Why don't AI humanizers always work the same way twice?

Why does the same AI humanizer give different results on different runs of the same text?

I ran the same paragraph through the same tool twice without changing any settings and got noticeably different output both times, with different detection scores afterward. Is this just inherent randomness in how these tools generate output, or is there a consistency setting I’m missing?