A method for transcribing a conference call among a plurality of participants using a plurality of audio connections, the method comprising the steps of:
(a) capturing a plurality of portions of audio, each of the plurality of portions of audio being associated with at least one of the plurality of audio connections;
(b) forwarding each of the captured plurality of portions of audio to at least one of a plurality of speech recognition engines, whereby each of the plurality of speech recognition engines converts the audio to text; and
(c) re-assembling the text converted by the plurality of speech recognition engines.
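By way of illustration only, steps (a) through (c) might be sketched as follows. This is a minimal sketch and not the claimed implementation: the names `AudioPortion`, `TextSegment`, `transcribe_call`, and `fake_engine` are hypothetical, the round-robin assignment of portions to engines is an assumption, and re-assembly by a per-portion start time is likewise an assumption, since the claim does not specify how portions are distributed or ordered. The speech recognition engines are stubbed as plain callables so that the example runs without any external service.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical engine interface: raw audio bytes in, recognized text out.
Engine = Callable[[bytes], str]


@dataclass
class AudioPortion:
    """Step (a): a captured portion of audio tied to one audio connection."""
    connection_id: str   # the audio connection the portion was captured from
    start_time: float    # seconds from call start (assumed metadata)
    audio: bytes


@dataclass
class TextSegment:
    """Text produced by an engine for one portion, with its origin retained."""
    connection_id: str
    start_time: float
    text: str


def transcribe_call(portions: List[AudioPortion], engines: List[Engine]) -> str:
    if not engines:
        raise ValueError("at least one speech recognition engine is required")

    def convert(indexed):
        index, portion = indexed
        # Step (b): forward the portion to one of the engines.
        # Round-robin selection is an assumption made for this sketch.
        engine = engines[index % len(engines)]
        return TextSegment(portion.connection_id, portion.start_time,
                           engine(portion.audio))

    # Engines work on different portions concurrently.
    with ThreadPoolExecutor(max_workers=len(engines)) as pool:
        segments = list(pool.map(convert, enumerate(portions)))

    # Step (c): re-assemble the converted text, here in chronological order.
    segments.sort(key=lambda s: s.start_time)
    return "\n".join(f"[{s.connection_id}] {s.text}" for s in segments)


def fake_engine(audio: bytes) -> str:
    """Stand-in for a real engine: treats the bytes as already-known text."""
    return audio.decode("utf-8")


portions = [
    AudioPortion("line-2", 2.0, b"hello alice"),
    AudioPortion("line-1", 0.0, b"hello everyone"),
    AudioPortion("line-1", 4.5, b"shall we start"),
]
transcript = transcribe_call(portions, [fake_engine, fake_engine])
print(transcript)
```

In this sketch each text segment keeps the connection identifier and start time of the portion it came from, so the transcript can be re-assembled regardless of which engine finishes first. Other distribution or ordering schemes would satisfy the same three steps.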