
HOW LONG DOES A
ONE HOUR RECORDING TAKE TO TRANSCRIBE?
It's a common misconception that one hour of recording takes one hour to
transcribe. Far from it! We speak much faster than we can write or type;
otherwise there would be no need for shorthand or stenographers.
It's generally accepted that we speak four times faster than we can type
and seven times faster than we can write.
The professional industry standard allows ONE hour to transcribe 15 minutes of clearly recorded speech. It therefore takes a MINIMUM of 4 hours to transcribe a one hour recording depending on a number of factors detailed below.
WHAT INFLUENCES TRANSCRIPTION TIMES?
It's in the interests of both the
transcriber and the client to deal with recordings of the highest
possible quality.
No transcriber enjoys
working with poor quality recordings, and why invest time, money and
effort arranging an event only to scupper it at the recording stage? A
poor recording will result in a high number of 'inaudibles' and take far
longer to transcribe and will increase client costs. Producing a good
quality, clearly audible recording is vital. If we can't hear it, we
can't transcribe it! The less time it takes to transcribe your material,
the lower your final costs will be
It may also be useful to understand what factors influence how long a transcription will take to process.
The format and
quality of the recording
Digital recordings will always produce a clearer recording than any
analogue format such as standard audio tapes, mini tapes or micro
cassettes. The use of a cheap recorder will be a false economy as the
extra transcription costs involved far outweigh any savings made on the
equipment.
Whether
an external microphone is used
If the
in-built microphone is used to make a recording of anything other than
dictation, the results will be very poor. External microphones are
essential for capturing a clearly audible recording. The position of the
microphone is also important; if it's too far away from the speaker (or
speakers), much will be inaudible.
The clarity and
number of voices
If the speaker's voice is hard to
hear either because the speaker is too far away from the microphone,
mumbles, speaks too fast or too quietly, this can mean it's difficult to
decipher the words. With recordings of focus groups, meetings or
roundtable discussions, transcription can be more difficult due to the
multiple voices involved. Obviously, each voice has a different tone,
pitch, and speed as well as accent. People in groups have a tendency to
all speak at once, interrupt each other or raise their voices if they
become animated. Distinguishing between different voices will always
take longer than a one-to-one or single voice situation.
Whether you
require speakers to be identified
With recordings of one-to-one
interviews or small groups, identification of the speakers by name is
usually fairly trouble free. With large focus groups or meetings where
there may be a 'babble' of voices, this becomes more difficult -
especially if the transcriber has never heard those voices before, which
is the likely scenario. Unless clients provide a voice 'brief' or
speakers identify themselves, either at the beginning or throughout the
recording, it becomes almost impossible to match names to voices and
takes much longer because the transcriber has to 'tune in' to the
different voices.
Whether they speak
in coherent sentences
Everyday speech is usually littered
with verbal habits and quirks which we generally don't 'hear' in
conversation. People switch thought in mid-sentence, add unnecessary
'you knows' and 'sort ofs' every few words, or sometimes don't speak in
coherent sentences at all. We rarely speak in the same way as we
write. In such situations, the transcriber must go back and work out
where to insert the punctuation so as not to lose the thread of the
whole piece. The more coherent the speakers are, the less time it takes
to transcribe their words. The transcriber can 'type as they talk' and
rarely needs to go back and puzzle out the meaning.
The level of
background noise
Background noise can make or break a
recording, so choosing the recording location is vital, preferably a
quiet indoor environment. Our ears can filter out most of the extraneous
noise which is constantly around us, from traffic noise, equipment
interference, other voices, even background hiss from the recorder
itself. Microphones are not so selective - they pick up every sound,
giving each noise equal prominence (unless using noise cancelling
microphones).
The degree of
regional accents
Wherever a recording is made in the world, if the speaker has an accent
which is difficult to understand, this will adversely affect the time it
takes to transcribe the recording. Some accents are easier to decipher
than others, but it will still take the transcriber time to 'tune their
ears in' and necessitate listening several times in order to capture
what is being said.
The amount of
industry specific or technical terminology involved
Material which is full of technical, financial or specialised
terminology may be unfamiliar to the transcriptionist. It may be
necessary to re-listen to words, sentences, or even whole sections
several times in order to distinguish the words. In such circumstances,
it helps enormously if a glossary of keywords or some kind of brief
about the topic involved can be provided in advance by the client, or if
words can be spelt out at the time by the person dictating. We can do
this research ourselves on the Internet but this all adds to the
transcription time.