I don't mind at all, but A) it's been years since I worked in the field and B) my formal education dates back to 2000. Given that ...
That seems pretty natural to me. They might have read the lines separately and edited them together. That'd probably end up cutting the breaths out, and maybe shortening the spaces between the lines. We used to do this sometimes, though we tended to do it more to take out overlong pauses or extra breaths--both of which were mostly the result of edits. It is possible to time-compress audio without shifting the pitch, but unless they had some hard upper limit in terms of episode duration I can't see why they'd do that. More than a little bit tends to (or at least used to tend to) introduce artifacts unless you babysit the process.
EDIT: And now I have "Danse Macabre" in my head. There are worse things.