The ability of a machine to convert spoken language from audio input into text that a computer can then process further or simply use as an annotation. STT is a component of Natural Language Understanding, part of the conversational pattern of AI. The terms Automatic Speech Recognition (ASR) and Speech-to-Text (STT) are often used interchangeably.