We rely on both, and I think we've got the mix about right. Text for relatively simple things, or when numbers and other identifiers need to be transmitted accurately; voice/video for complex discussions that require screen sharing and/or when there are multiple opinions that might need to be reconciled.
And as it happens... I do CRUD web apps too, officially about 20% of my time. Including that one. And for those we use text.