Let the user see the text box holding the text that's been entered; pop
items off it one at a time and display them in a history view.
This might display the first few symbols before they're actually
heard, depending on the size of the audio buffers. Not sure what to do
about that, other than writing a real-time mixer.
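
One possible workaround short of a real-time mixer, sketched here in Python
under assumptions: delay each symbol's display by an estimate of the audio
output latency (buffer size times buffer count, divided by sample rate).
The names below (SymbolHistory, buffer_latency_seconds, queue_next, update)
are hypothetical and not part of any existing code; the estimate is a worst
case and may not match what a given audio backend really does.

```python
from collections import deque


def buffer_latency_seconds(buffer_frames: int, buffer_count: int, sample_rate: int) -> float:
    """Worst-case delay between queuing audio and hearing it (all buffers full)."""
    return buffer_frames * buffer_count / sample_rate


class SymbolHistory:
    """Hypothetical helper: symbols wait in a queue and are revealed once due."""

    def __init__(self, text: str, latency: float):
        self.pending = deque(text)  # symbols still sitting in the text box
        self.history = []           # symbols already shown in the history view
        self.latency = latency      # assumed audio output delay, in seconds
        self.due = deque()          # (display_time, symbol) pairs, oldest first

    def queue_next(self, now: float) -> None:
        """Call when the next symbol's audio is handed to the audio device."""
        if self.pending:
            symbol = self.pending.popleft()
            self.due.append((now + self.latency, symbol))

    def update(self, now: float) -> None:
        """Call once per UI tick; reveals symbols whose audio should be audible."""
        while self.due and self.due[0][0] <= now:
            self.history.append(self.due.popleft()[1])
```

Usage would be roughly: call queue_next() whenever a symbol's samples are
written to the audio device, and call update() from the UI timer with the
same clock. This only hides the lag; it does not make playback any tighter.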