Skip Navigation


Family Practice Advance Access originally published online on February 27, 2008
Family Practice 2008 25(2):127-131; doi:10.1093/fampra/cmn003
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrowOA All Versions of this Article:
25/2/127    most recent
cmn003v1
Right arrow E-letters: Submit a response
Right arrow E-letters: View responses
Right arrow Alert me when this article is cited
Right arrow Alert me when E-letters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Bailey, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bailey, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Authors
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (
http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

First steps in qualitative data analysis: transcribing

Julia Bailey

Level 2, Holborn Union Building, Highgate Hill, London N19 3UA, UK

Email: j.bailey{at}pcps.ucl.ac.uk

Received 5 June 2007; Revised 2 November 2007; Accepted 8 January 2008.


    Abstract
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Qualitative research in primary care deepens understanding of phenomena such as health, illness and health care encounters. Many qualitative studies collect audio or video data (e.g. recordings of interviews, focus groups or talk in consultation), and these are usually transcribed into written form for closer study. Transcribing appears to be a straightforward technical task, but in fact involves judgements about what level of detail to choose (e.g. omitting non-verbal dimensions of interaction), data interpretation (e.g. distinguishing ‘I don't, no’ from ‘I don't know’) and data representation (e.g. representing the verbalization ‘hwarryuhh’ as ‘How are you?’).

Representation of audible and visual data into written form is an interpretive process which is therefore the first step in analysing data. Different levels of detail and different representations of data will be required for projects with differing aims and methodological approaches. This article is a guide to practical and theoretical considerations for researchers new to qualitative data analysis. Data examples are given to illustrate decisions to be made when transcribing or assigning the task to others.

Keywords. Audio recording, data transcription, data analysis, qualitative research, video recording.


    Introduction
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Qualitative research can explore the complexity and meaning of social phenomena,1,2 for example patients' experiences of illness3 and the meanings of apparently irrational behaviour such as unsafe sex.4 Data for qualitative study may comprise written texts (e.g. documents or field notes) and/or audible and visual data (e.g. recordings of interviews, focus groups or consultations). Recordings are transcribed into written form so that they can be studied in detail, linked with analytic notes and/or coded.5

Word limits in medical journals mean that little detail is usually given about how transcribing is actually done. Authors' descriptions in papers convey the impression that transcribing is a straightforward technical task, summed up using terms such as ‘verbatim transcription’.6 However, representing audible talk as written words requires reduction, interpretation and representation to make the written text readable and meaningful.7,8 This article unpicks some of the theoretical and practical decisions involved in transcribing, for researchers new to qualitative data analysis.


    What are the aims of the research project?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Researchers' methodological assumptions and disciplinary backgrounds influence what are considered relevant data and how data should be analysed. To take an example, talk between hospital consultants and medical students could be studied in many different ways: the transcript of a teaching session could be analysed thematically, coding the content (topics) of talk. Analysis could also look at the way that developing an identity as a doctor involves learning to use language in particular ways, for example, using medical terminology in genres such as the ‘case history’.9 The same data could be analysed to explore the construction of ‘truth’ in medicine: for example, a doctor saying ‘the patient's blood pressure is 120/80’ frames this statement as an objective, quantifiable, scientific truth. In contrast, formulating a patient's medical history with statements such as ‘she reports a pain in the left leg’ or ‘she denies alcohol use’ frames the patient's account as less trustworthy than the doctor's observations.10 The aims of a project and methodological assumptions have implications for the form and content of transcripts since different features of data will be of analytic interest.7


    What level of detail is required?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Making recordings involves reducing the original data, for example, selecting particular periods of time and/or particular camera angles. Selecting which data have significance reflects underlying assumptions about what count as data for a particular project, for example, whether social talk at the beginning and end of an interview is to be included or the content of a telephone call which interrupts a consultation.

Visual data
Verbal and non-verbal interaction together shape communicative meaning.11 The aims of the project should dictate whether visual information is necessary for data interpretation, for example, room layout, body orientation, facial expression, gesture and the use of equipment in consultation.12 However, visual data are more difficult to process since they take a huge length of time to transcribe, and there are fewer conventions for how to represent visual elements on a transcript.5

Capturing how things are said
The meanings of utterances are profoundly shaped by the way in which something is said in addition to what is said.13,14 Transcriptions need to be very detailed to capture features of talk such as emphasis, speed, tone of voice, timing and pauses but these elements can be crucial for interpreting data.7

Example 1
The following example shows how the addition of pauses, laughter and body conduct to a transcript invites a different interpretation of an exchange between doctor and patient. The excerpt below is taken from near the end of a consultation, after the doctor has made the diagnosis of a viral infection which does not warrant antibiotics. Transcribing the verbal content alone, it appears that the patient is happily accepting the doctor's advice:

Dr 9: I would suggest yes paracetamol is a good symptomatic treatment, and you'll be fine

Pt K: fine, okay, well, thank you very much.

Representing (some) non-verbal features of the interaction on the transcript changes the interpretation of this two-line interaction (see Appendix, transcription conventions):

Dr 9: (.) I would suggest (.) yes paracetamol or ibuprofen is a good (.) symptomatic treatment (.) um (.) (slapping hands on thighs) and you'll be fine

Pt K: fine (.) okay (.) well (.) (shrugging shoulders and laughing) thank you very much

In the second representation of this interaction, both speakers pause frequently. The doctor slaps his thigh and uses the idiom ‘you'll be fine’ to wrap up his advice giving. In response, Patient K is hesitant and he uses the mitigation ‘well’, shrugs his shoulders and laughs, suggesting turbulence or difficulty in interaction.15 Although the patient's words seem to indicate agreement, the way these words are said seem to indicate the opposite.16

Example 2
In another example, the addition of non-verbal features again gives a deeper understanding of the doctor–patient interaction. This patient has consulted on a Saturday morning with sore throat symptoms. In the extract below, the doctor seeks clarification about Patient F's symptoms:

Dr 5: So let's just go back to this. So, so you've had this for a few weeks

Pt F: yes

Adding in non-verbal features conveys that this is a potentially problematic exchange:

Dr 5: .hhh so let's just go back to this (.) so (.) so you've had this for a few weeks

Pt F: yes (1.0) (left hand on throat, stroking with fingers)

The doctor starts with a prominent in-breath and stresses the word ‘weeks’ in her recapping of the duration of symptoms. Patient F responds, but there is then a prominent pause during which he strokes his throat with his fingers (the site of his sore throat). The 1-second pause is ‘accountable’, in other words something is expected in this space.17 Patient F does not expand on his answer, but his gesture visibly demonstrates his symptoms. The duration of the symptoms (a few weeks) appears therefore to be accountable, in other words to need explaining. The doctor addresses this accountability directly in her next turn:

Dr 5: I must ask you (.) why have you come in today because it is a Saturday morning (1.0) it's for urgent cases only that really have just started

Pt F: Yes because it has been troubling me since last last night (left hand still on neck)

This more detailed level of transcribing facilitates analysis of the social relationship between doctor and patient; in this example, the consequences for the doctor–patient interaction of consulting in an urgent surgery with ‘minor’ symptoms.16

Data must inevitably be reduced in the process of transcribing, since interaction is hugely complex. Decisions therefore need to be made about which features of interaction to transcribe: the level of detail necessary depends upon the aims of a research project, and there is a balance to be struck between readability and accuracy of a transcript.18


    Who should do the transcribing?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Transcribing is often delegated to a junior researcher or medical secretary for example, but this can be a mistake if the transcriber is inadequately trained or briefed. Transcription involves close observation of data through repeated careful listening (and/or watching), and this is an important first step in data analysis. This familiarity with data and attention to what is actually there rather than what is expected can facilitate realizations or ideas which emerge during analysis.1 Transcribing takes a long time (at least 3 hours per hour of talk and up to 10 hours per hour with a fine level of detail including visual detail)5 and this should be allowed for in project time plans, budgeting for researchers’ time if they will be doing the transcribing.


    What contextual detail is necessary to interpret data?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Recordings may be difficult to understand because of the recording quality (e.g. quiet volume, overlaps in speech, interfering noise) and differing accents or styles of speech. Utterances are interpretable through knowledge of their local context (i.e. in relation to what has gone before and what follows),8 for example, allowing differentiation between ‘I don't, no’ and ‘I don't know'. Interaction is also understood in wider context such as understanding questions and responses to be part of an ‘interview’ or ‘consultation’ genre with particular expectations for speaker roles and the form and content of talk.19 For example, the question ‘how are you?’ from a patient in consultation would be interpreted as a social greeting, while the same question from a doctor would be taken as an invitation to recount medical problems.14 Contextual information about the research helps the transcriber to interpret recordings (if they are not the person who collected the data), for example, details about the project aims, the setting and participants and interview topic guides if relevant.


    How should data be represented?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Written language is represented in particular standardized ways which are quite different from audible speech. For example, ‘hwaryuhh’ is much more easily read and understood if represented as separate words, with punctuation and capital letters, as ‘How are you?’.20 Choosing to use the grammar and spelling conventions of standard UK written English aids readability, but at the same time irons out the linguistic variety which is an important feature of cultural and subcultural identity.20 For example, the following extract represents a patient speaking a Cockney English dialect (typically spoken by working class Londoners), in consultation with a doctor speaking English with Received Pronunciation (typically spoken by educated, middle class English people):

Dr 1: so what are your symptoms since yesterday (.) the aches

Pt B: aches ere (.) in me arm (.) sneezing (.) edache

Dr 1: ummm (.) okay (.) and have you tried anything for this (.) at all?

Pt B: no (.) I ain't a believer of me- (.) medicine to tell you the truth

Although this attempts to represent linguistic variety, using a more literal spelling is difficult to read and runs the risk of portraying respondents as inarticulate and/or uneducated.20 Even using standard written English, transcribed talk appears faltering and inarticulate. For example, verbal interaction includes false starts, repetitions, interruptions, overlaps, in- and out-breaths, coughs, laughs and encouraging noises (such as ‘mm’), and these features may be omitted to avoid cluttering the text.18

If talk is mediated via an interpreter, decisions must be made about how to represent translation on a transcript,8 for example, whether to translate ‘literally’, and then to interpret the meaning in terms of the second language and culture. For example, from French to English, ‘j'ai mal au coeur’ translates literally as ‘I have bad in the heart’, interpreted in English as ‘I feel sick’. Translation therefore adds an additional layer of interpretation to the transcribing process.

Written representations reflect researchers’ interpretations. For example, laughter could be transcribed as ‘he he he', ‘laughter (2 seconds)’, ‘nervous laughter’, ‘quiet laughter’ or ‘giggling’ and these representations convey different interpretations. The layout on paper and labelling also reflect analytic assumptions about data.20 For example, labelling speakers as ‘patient’ and ‘doctor’ implies that their respective roles in a medical encounter are more salient than other attributes such as ‘man’, ‘mother’, ‘Spanish speaker’ or ‘advice giver’. Talk is often presented in speech turns, with a new line for the next speaker (as in the data examples given), but could also be laid out in a timeline, in columns or in stanzas like poetry, for example.7 Transcripts are not therefore neutral records of events, but reflect researchers’ interpretations of data.

Presenting quotations in a research paper involves further steps in reduction and representation through the choice of which data to present and what to highlight. There is debate about what counts as relevant context in qualitative research.21,22 For example, studies usually describe the setting in which data were collected and demographic features of respondents such as their age and gender, but relevant contextual information could also include historical, political and policy context, participants’ physical appearance, recent news events, details of previous meetings and so on.23 Authors’ decisions on which data and what contextual information to present will lead to different framing of data.


    What equipment is needed?
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Decisions about the level of detail needed for a project will inform whether video or audio recordings are needed.24 Taking notes instead of making recordings is not sufficiently accurate or detailed for most qualitative projects. Digital audio and video recorders are rapidly replacing analogue equipment: digital recordings are generally better quality, but require computer software to store and process, and digital video files take up huge quantities of computer memory. It is usually necessary to playback recordings repeatedly: a foot-controlled transcription machine facilitates this for analogue audio tapes (see Fig. 1) and transcribing software is recommended for digital audio or video files, since this allows synchronous playback and typing (see Fig. 2).


Figure 1
View larger version (75K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIGURE 1 Analogue audio recording equipment: dictaphone with microphone and mini-cassette tape and foot-pedal controlled transcription machine with headphones

 


Figure 2
View larger version (67K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIGURE 2 Digital video recording equipment: video camera with firewire computer lead, mini DV cassette and Transana transcribing software

 

    Conclusion
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Representation of audible and visible data into written form is an interpretive process which involves making judgments and is therefore the first step in analysing data. Decisions about transcribing are guided by the methodological assumptions underpinning a particular research project, and there are therefore many different ways to transcribe the same data. Researchers need to decide which level of transcription detail is required for a particular project and how data are to be represented in written form.

Transcribing is an interpretive act rather than simply a technical procedure, and the close observation that transcribing entails can lead to noticing unanticipated phenomena. It is impossible to represent the full complexity of human interaction on a transcript and so listening to and/or watching the ‘original’ recorded data brings data alive through appreciating the way that things have been said as well as what has been said.


    Declaration
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Funding: Primary Care Researcher Development award, Department of Health National Coordinating Centre for Research Capacity Development.

Ethical approval: East London and the City Ethical Committee.

Conflict of interest: None.


    Appendix:
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
Transcription Conventions
(?) talk too obscure to transcribe.

Hhhhh audible out-breath

.hhh in-breath

[ overlapping talk begins

] overlapping talk ends

(.) silence, less than half a second

(.) silence, less than one second

(2.8) silence measured in 10ths of a second

:::: lengthening of a sound

Becau- cut off, interruption of a sound

he says. Emphasis

= no silence at all between sounds

LOUD sounds

? rising intonation

(left hand on neck) body conduct

[notes, comments]


    Acknowledgments
 
This paper derives from a PhD thesis written by Julia Bailey entitled ‘Doctor-patient consultations for upper respiratory tract infections: a discourse analysis’, which was supervised by Celia Roberts, Roger Jones and Jane Barlow. Thanks are due to doctors and patients who participated in the project, to practice staff, and to Anne Rouse for her advice on the practicalities of transcribing.


    Notes
 
Bailey J. First steps in qualitative data analysis: transcribing. Family Practice 2008; 25: 127–131.


    References
 Top
 Abstract
 Introduction
 What are the aims...
 What level of detail...
 Who should do the...
 What contextual detail is...
 How should data be...
 What equipment is needed?
 Conclusion
 Declaration
 Appendix:
 References
 
1 Pope C, Mays N. Qualitative Research in Health Care (2000) 2nd edn. London: BMJ Books.

2 Britten N, Jones R, Murphy E, Stacy R. Qualitative research methods in general practice and primary care. Fam Pract. (1995) 12:104–114.[Free Full Text]

3 Lawton J. Lay experiences of health and illness: past research and future agendas. Soc Health Illn. (2003) 25:23–40.

4 Flowers P, Smith JA, Sheeran P, Beail N. Health and romance: understanding unprotected sex in relationships between gay men. Br J Health Psychol. (1997) 2:73–86.[Web of Science]

5 ten Have P. Transcribing Talk in Interaction. Doing Conversation Analysis. A Practical Guide (1999) London: Sage. 75–98.

6 Kumar S, Little P, Britten N. Why do general practitioners prescribe antibiotics for sore throat? Grounded theory interview study. BMJ. (2003) 326:138.[Abstract/Free Full Text]

7 Roberts C. Qualitative Data Analysis. Transcribing Spoken Discourse. FDTL Data Project 2004. http://www.kcl.ac.uk/schools/sspp/education/research/projects/dataqual.html. (accessed on November, 2007).

8 Green J, Franquiz M, Dixon C. The myth of the objective transcript: transcribing as a situated act. Tesol Q. (1997) 31:172–176.[CrossRef]

9 Becker HS, Geer B, Hughes EC, Strauss AL. Boys in White: Student Culture in Medical School (1961) Chicago, IL: University of Chicago Press.

10 Fleischman S. Language and medicine. In: The Handbook of Discourse Analysis.—Schiffrin D, Tannen D, Hamilton HE, eds. (2001) Malden, MA: Blackwell. 470–502.

11 Heath C. Body Work: The Collaborative Production of the Clinical Object. Communication in Medical Care: Interaction between Primary Care Physicians and Patients. (2006) 1st edn. Cambridge: Cambridge University Press. 185–213.

12 Greatbatch D, Heath C, Campion P, Luff P. How do desk-top computers affect the doctor-patient interaction? Fam Pract. (1995) 12:32–36.[Abstract/Free Full Text]

13 Sarangi S, Roberts C. Talk, Work and Institutional Order (1999) Berlin, Germany: Mouton de Gruyter.

14 Robinson J. Soliciting patients’ presenting concerns. In: Communication in Medical Care: Interaction between Primary Care Physicians and Patients.—Heritage J, Maynard DW, eds. 1st edn. Cambridge: Cambridge University Press, 2006: 22–47.

15 Silverman D. The construction of delicate objects in counselling. In: Discourse Theory and Practice (2001) London: Sage. 119–137.

16 Bailey JV. Doctor-Patient Communication in Consultations for Upper Respiratory Tract Infections: A Discourse Analysis (2007) London: University of London. PhD Thesis.

17 Sacks H. Lecture 1: rules of conversational sequence. In: Harvey Sacks. Lectures on Conversation.—Jefferson G, ed. (1992) Oxford: Blackwell.

18 Tilley SA. "Challenging" research practices: turning a critical lens on the work of transcription. Qual Inquiry. (2003) 9:750–773.[CrossRef]

19 Maybin J. Language, struggle and voice: the Bakhtin/Volosinov writings. In: Discourse Theory and Practice.—Wetherell M, Taylor S, Yates SJ, eds. (2001) London: Sage. 64–71.

20 Roberts C. The politics of transcription. Transcribing talk: issues of representation. Tesol Q. (1997) 31:167–172.[CrossRef]

21 Schegloff EA. Whose text? Whose context? Discourse Soc. (1997) 8:165–187.[Abstract]

22 Wetherell M. Debates in discourse research. In: Discourse Theory and Practice (2001) London: Sage. 380–399.

23 Gesler WM. Words in wards: language, health and place. Health Place. (1999) 5:13–25.[CrossRef][Web of Science][Medline]

24 Heath C. The analysis of activities in face to face interaction using video. In: Qualitative Research: Theory, Method and Practice.—Silverman D, ed. (1998) London: Sage. 183–200.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Fam PractHome page
S. E Shaw and J. Bailey
Discourse analysis: what is it and why is it relevant to family practice?
Fam. Pract., June 25, 2009; (2009) cmp038v1.
[Abstract] [Full Text] [PDF]

E-letters:

Read all E-letters

Transcription of Qualitative Research Data
Christian T. K.-H. Stadtländer
Family Practice, 30 Jun 2008 [Full text]

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrowOA All Versions of this Article:
25/2/127    most recent
cmn003v1
Right arrow E-letters: Submit a response
Right arrow E-letters: View responses
Right arrow Alert me when this article is cited
Right arrow Alert me when E-letters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Bailey, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bailey, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?