Transcription Times

HOW LONG DOES A ONE HOUR RECORDING TAKE TO TRANSCRIBE?
It's a common misconception that one hour of recording takes one hour to transcribe. Far from it! We speak much faster than we can write or type; otherwise there would be no need for shorthand or stenographers.  It's generally accepted that we speak four times faster than we can type and seven times faster than we can write.

The professional industry standard allows ONE hour to transcribe 15 minutes of clearly recorded speech. It therefore takes a MINIMUM of 4 hours to transcribe a one hour recording depending on a number of factors detailed below.

 

WHAT INFLUENCES TRANSCRIPTION TIMES?

It's in the interests of both the transcriber and the client to deal with recordings of the highest possible quality.  No transcriber enjoys working with poor quality recordings, and why invest time, money and effort arranging an event only to scupper it at the recording stage? A poor recording will result in a high number of 'inaudibles' and take far longer to transcribe and will increase client costs. Producing a good quality, clearly audible recording is vital. If we can't hear it, we can't transcribe it! The less time it takes to transcribe your material, the lower your final costs will be.

It may also be useful to understand what factors influence how long a transcription will take to process.

The format and quality of the recording
Digital recordings will always produce a clearer recording than any analogue format such as standard audio tapes, mini tapes or micro cassettes. The use of a cheap recorder will be a false economy as the extra transcription costs involved far outweigh any savings made on the equipment.

Whether an external microphone is used
If the in-built microphone is used to make a recording of anything other than dictation, the results will be very poor. External microphones are essential for capturing a clearly audible recording. The position of the microphone is also important; if it's too far away from the speaker (or speakers), much will be inaudible.  

The clarity and number of voices
If the speaker's voice is hard to hear either because the speaker is too far away from the microphone, mumbles, speaks too fast or too quietly, this can mean it's difficult to decipher the words. With recordings of focus groups, meetings or roundtable discussions, transcription can be more difficult due to the multiple voices involved. Obviously, each voice has a different tone, pitch, and speed as well as accent. People in groups have a tendency to all speak at once, interrupt each other or raise their voices if they become animated. Distinguishing between different voices will always take longer than a one-to-one or single voice situation.

Whether you require speakers to be identified
With recordings of one-to-one interviews or small groups, identification of the speakers by name is usually fairly trouble free. With large focus groups or meetings where there may be a 'babble' of voices, this becomes more difficult - especially if the transcriber has never heard those voices before, which is the likely scenario. Unless clients provide a voice 'brief' or speakers identify themselves, either at the beginning or throughout the recording, it becomes almost impossible to match names to voices and takes much longer because the transcriber has to 'tune in' to the different voices.

Whether they speak in coherent sentences
Everyday speech is usually littered with verbal habits and quirks which we generally don't 'hear' in conversation. People switch thought in mid-sentence, add unnecessary 'you knows' and 'sort ofs' every few words, or sometimes don't speak in coherent sentences at all. We rarely speak in the same way as we write. In such situations, the transcriber must go back and work out where to insert the punctuation so as not to lose the thread of the whole piece. The more coherent the speakers are, the less time it takes to transcribe their words. The transcriber can 'type as they talk' and rarely needs to go back and puzzle out the meaning.

The level of background noise
Background noise can make or break a recording, so choosing the recording location is vital, preferably a quiet indoor environment. Our ears can filter out most of the extraneous noise which is constantly around us, from traffic noise, equipment interference, other voices, even background hiss from the recorder itself. Microphones are not so selective - they pick up every sound, giving each noise equal prominence (unless using noise cancelling microphones).

The degree of regional accents
Wherever a recording is made in the world, if the speaker has an accent which is difficult to understand, this will adversely affect the time it takes to transcribe the recording. Some accents are easier to decipher than others, but it will still take the transcriber time to 'tune their ears in' and necessitate listening several times in order to capture what is being said.

The amount of industry specific or technical terminology involved
Material which is full of technical, financial or specialised terminology may be unfamiliar to the transcriptionist. It may be necessary to re-listen to words, sentences, or even whole sections several times in order to distinguish the words. In such circumstances, it helps enormously if a glossary of keywords or some kind of brief about the topic involved can be provided in advance by the client, or if words can be spelt out at the time by the person dictating. We can do this research ourselves on the Internet but this all adds to the transcription time.