• No results found

Analysis of domestic dog mitochondrial DNA sequence variation for forensic investigations

N/A
N/A
Protected

Academic year: 2021

Share "Analysis of domestic dog mitochondrial DNA sequence variation for forensic investigations"

Copied!
40
0
0

Loading.... (view fulltext now)

Full text

(1)

Helen Angleby (2005): Analysis of domestic dog mitochondrial DNA sequence variation for forensic investigations. Department of Biotechnology, Royal Institute of Technology, Stockholm, Sweden.

ISBN 91-7283-995-3

Abstract

The first method for DNA analysis in forensics was presented in 1985. Since then, the

introduction of the polymerase chain reaction (PCR) has rendered possible the analysis of small amounts of DNA and automated sequencing and fragment analysis techniques have facilitated the analyses. In most cases short tandemly repeated regions (STRs) of nuclear DNA are analysed in forensic investigations, but all samples cannot be successfully analysed using this method. For samples containing minute amounts of DNA or degraded DNA, such as shed hairs, analysis of mitochondrial DNA (mtDNA) is generally more successful due to the presence of thousands of copies of mtDNA molecules per cell.

In Sweden, ~40 % of all households have cats or dogs. With ~9 million humans shedding ~100 scalp hairs per day, and ~1.6 million cats and ~1 million dogs shedding hairs it is not surprising that shed hairs are one of the most common biological evidence found at crime scenes. However, the match probability for domestic dog mtDNA analysis has only been investigated in a few minor studies. Furthermore, although breed –sequence correlations of the noncoding mtDNA control region (CR) have been analysed in a few studies, showing limited correlations, no large-scale studies have been performed previously. Thus, there have not been any comprehensive studies of forensic informativity of dog mtDNA. In the two papers presented in this thesis we have tried to lay a foundation for forensic use of analysis of domestic dog mtDNA. In the first paper, CR sequences were analysed and the exclusion capacity was investigated for a number of different populations. This is also the first comprehensive study of the correlation between mtDNA CR type and breed, type, and geographic origin of domestic dogs. Since the exclusion capacity for analysis of domestic dog CR sequences is relatively low, it was investigated in the second paper to what extent the discrimination power is improved by analysis of coding sequence. The exclusion capacity improved considerably when 3,000 base pairs of coding sequences where analysed in addition to CR sequences. This study will hopefully work as a basis for future development of analysis of dog mtDNA for forensic purposes.

(2)

LIST OF PUBLICATIONS

This thesis is based on the following papers, referred to by the corresponding Roman numerals in the text:

I. Angleby H, Savolainen P. (2005) Forensic informativity of domestic dog mtDNA control region sequences. Forensic Sci Int. In press.

II. Angleby H, Oskarsson M, Pang J, Zhang Y-p, Leitner T, Arvestad L, Lundeberg J, Savolainen P. Forensic informativity of 3,000 of coding sequence of domestic dog mtDNA. Manuscript.

(3)

Never do today, what can be better

done tomorrow

(4)

Contents

INTRODUCTION 1

Forensic science prior to the introduction of DNA analysis 1

Analysis of human nuclear DNA as a tool for forensic investigations 2

Mitochondrial DNA 5

Mitochondria and mtDNA 5

The mitochondrial control region 7

Analysis of human mtDNA 8

DNA analysis of hair 9

Analysis of non-human DNA in forensic casework 11

Domestic dog populations 15

Debates 16 Population databases 16 Contamination 17 Heteroplasmy 17 Present investigation 18 Conclusions 22 Acknowledgements 23

(5)

References 24

(6)
(7)

Errata

LIST OF PUBLICATIONS: Paper II reads “…of 3,000 of …”, should read “…of 3,000 bp of…”

INTRODUCTION:

Page Paragr Line Reads Should read

1 and 4, resp. 1 2 4 5 9 11 12 13 16 18 18 2 and 1, resp. 2 2 1 1 2 1 1 2 2 1 1 16 and 9, resp. 17,18,19 19 10 3 1 6 22 14 16 21 23 …(Barbaro et al, 2005)… …ABO… ...murder in 1987. ...al, 2004; Barbaro et al, 2005)… …2001; Brion et al, 2005). …inheritance imitates… …(Pertraco… …for nuclear DNA…to 1 ng nuclear DNA,… …where of… …; Bandelt et al, 2004;… …(2000a)… …to an erratum (Grzybowski, 2000b). …(Barbaro et al, 2004). …AB0… …murder in 1988. …al, 2004)… …2001). …inheritance limits… …(Petraco… …of nuclear DNA…to 1 ng,… …where… …; Bandelt et al, 2004b;… …(2000)… …to revision (Grzybowski, 2003).

(8)

Introduction

Forensic science prior to the introduction of DNA analysis

The term “forensic” originates from the Latin words forensis, “in open court, public”, and forum, (“what is out of doors”). Forum was a public square or market place where the technical evidence was presented in open trials in ancient Roman cities (Compact Oxford English Dictionary, www.askoxford.com).

Forensic science includes a wide range of disciplines within the fields of biology, physics, chemistry, geology and mathematics, such as botany, fingerprinting, microscopy, ballistics, handwriting analysis, forensic pathology, psychology and odontology (Saferstein, 1990; Jobling and Gill, 2004). A great deal of time and effort is usually spent during a forensic investigation to identify individuals relevant for a specific forensic inquiry, such as a perpetrator of a crime or a victim of a mass disaster. The first scientific system of personal identification, anthropometry, was developed by Alphonse Bertillon in 1879. The method utilises a series of body measurements as a means of distinguishing one individual from another. For nearly 20 years anthropometry was considered the most accurate method of personal identification and Bertillon’s contribution to forensic science has earned him the epithet “the father of criminal identification” (Saferstein, 1990). It generally takes some time before a newly developed method is applied extensively and although Francis Galton

published the method for classification of fingerprints in 1892 in the book Finger Prints it was not until the early 1900’s that fingerprinting replaced anthropometry as a means for identification of individuals (Saferstein, 1990). Even to this date crime-scene investigators use fingerprint analysis as part of everyday criminal investigation (Barbaro et al, 2005) At the turn of the century Karl Landsteiner published his discovery of variation in human ABO blood groups and the first antibody test for ABO blood groups was introduced by Leone Lattes in 1915 (Jobling and Gill, 2004). The ABO blood group analysis method could be used to exclude an individual from being the source of the trace evidence but the level of variation was not high enough to enable personal identification. A number of different techniques and blood group/protein systems were introduced during the coming years for distinguishing

(9)

between individuals (Jobling and Gill, 2004). However, these techniques where gradually phased out after DNA analysis methods were introduced in the 1980s. The disadvantages with the methods of blood group and protein analysis were that the polymorphic protein and blood group markers were not reliable for old bloodstains and that there were problems with

resolving mixed stains, such as female/male mixtures in cases of rape. In addition to this, the likelihood of two unrelated individuals sharing the same profile, the Match Probability (Pm), was rather high, 0.01-0.001, which means that 10-100 in 10,000 share the same profile (Jobling and Gill, 2004).

Analysis of human nuclear DNA as a tool for forensic investigations

In 1985, the first method for DNA analysis in forensics was presented. The method was based on the detection of hypervariable “minisatellite” regions in human nuclear DNA (Jeffreys et al, 1985a). Minisatellites are loci made up of ~10-1,000 tandemly repeated sequences, each typically 10-100 base pairs in length. These loci are hypervariable and vary in length between individuals. In minisatellite analysis, restriction enzymes are used for cutting the nuclear DNA into fragments. After separation by electrophoresis the fragments are transferred to a membrane by Southern blotting and detected by hybridisation of DNA probes. Using non-specific DNA probes many minisatellite regions can be detected at the same time and the result of the analysis is a fragment pattern, distinct for different individuals, resembling a barcode called “DNA fingerprints” (Jeffreys et al, 1985b; Gill et al, 1985). The

discrimination between individuals is high, giving a match probability of less than 5 x 10-19 (Jeffreys et al, 1985b). In 1985, a method was introduced for separating sperm nuclei from vaginal cellular debris, Differential Lysis (Gill et al, 1985), which facilitated positive identification of the male suspect in male/female DNA mixtures in cases of rape. Another method for DNA analysis was soon introduced, “single-locus probes” (SLP), which targeted single loci that are highly polymorphic instead of many minisatellite regions at the same time. Both SLP analysis and DNA-fingerprinting was used to convict the perpetrator in the first criminal case utilising DNA analysis, Colin Pitchfork in Leicestershire, England, who was sentenced to prison for life for double rape and murder in 1987. At the same time, DNA analysis could clear another suspect for one of the two murders. Ever since these first cases DNA analysis has been used extensively in criminal investigations. This has resulted not only in the conviction of thousands of criminals but innocent people have also been cleared – in some cases after years in prison.

(10)

Although the early methods for nuclear DNA analysis were very useful in many cases, the DNA had to be undegraded for these methods to work (Gill et al, 1985), and amounts of at least 0.5-1 microgram of DNA were needed (Jeffreys et al, 1985b; Benecke 1997a). After introduction and development of the PCR method (polymerase chain reaction) (Saiki et al, 1986; Mullis et al, 1986; Mullis and Faloona, 1987) the sensitivity of DNA analysis increased considerably as samples containing smaller amounts of DNA could now be analysed after amplification. Even partially degraded DNA could be amplified to some extent. The

introduction of automated sequencing and fragment length analysis techniques also facilitated the analysis of the amplified DNA. One of the first DNA analysis methods in forensic science based on the PCR was analysis of polymorphisms in the HLA-DQA1 gene (a gene encoding a human leukocyte antigen cell surface protein) (Saiki et al, 1986; Helmuth et al, 1990;

Hochmeister et al, 1991; Allen et al, 1995). However, this method has mostly been replaced by the so-called short tandem repeat (STR) analysis of “microsatellites”. Microsatellites resemble minisatellites, but are generally shorter. They consist of short sequences ~2-6 base pairs long which are tandemly repeated, giving a total length for the loci of ~80-400 base pairs (Edwards et al, 1991). In STR analysis, a PCR is performed using primers on each side of the microsatellite, followed by electrophoresis and detection of fragment lengths. The STRs analysed in forensic investigations are generally 150-400 base pair long regions of tetranucleotide repeats (Gill, 2002). STR analysis of human nuclear DNA has become a widely used method for discrimination between individuals in contemporary forensic casework (Gill, 2002). Commercial kits are generally used, giving a probability of random match between two unrelated individuals of ~1 in 10 trillion, 1 x 10-13 for a kit of 10 STR loci and amelogenin (a locus for gender identification). However, the practice is to give a

conservative figure and for a full DNA profile the match probability is generally reported as less than 1 in 1 billion(Gill, 2002). STR analysis is now the dominating method and has been used to identify individuals in a wide range of forensic investigations, such as after mass disasters (Whitaker et al, 1995; Clayton et al, 1995a, b) and in cases of rape (da Silva et al, 2004; Csete et al, 2005) and murder (Sivolap et al; 2001), but also in determination of

paternity, (Csete et al, 2005). For kinship testing and mixed male/female cell admixture, STR analysis of the Y chromosome can also be used (Sibille et al, 2002; Daniels et al, 2004; Csete et al, 2005).

(11)

With improved sensitivity of the PCR technique small amounts of DNA are required for a successful analysis and samples containing 250 pg-1 ng of nuclear DNA are routinely

analysed in STR analysis (Gill, 2002). However, in some cases as little as a single cell can be typed using STR analysis (Benecke, 1997a; Findlay et al, 1997). The increase in sensitivity has led to an increase in the range of evidence types that can be analysed for identification purposes with improved success, and STR analysis has been applied to for example severely degraded tissue and bone samples (Clayton et al, 1995a, b; Sivolap et al, 2001), hair roots (Moss et al, 2003), saliva on cigarette ends and stamps (Moss et al, 2003; Barbaro et al, 2005), dried chewing gum (Fregeau et al, 1999), fingerprints (Balogh et al, 2003; Schulz et al, 2004; Barbaro et al, 2005) and urine (Benecke, 1997b). The increased sensitivity also has a major drawback; non-target DNA molecules can contaminate the sample and thereby have a negative effect on the final result. Contamination will be discussed further in the section Debates. Ever since the STR analysis method was introduced, many countries have established national databases to store analysed DNA profiles from convicted individuals, profiles from trace evidence, and in some cases persons suspected of crimes, to facilitate screening for matches between profiles (Benecke, 1997a; Jobling and Gill, 2004). Using a database, screening can be made automatically for matches between profiles of different persons, persons and crime scenes, and between profiles from different crime scenes.

Although STR analysis is useful in many cases, not all samples can be analysed successfully with this method. In, for example, the disaster of Waco (Clayton et al, 1995b) the success rate was 66 % - 83 % for the STR analysis. DNA degradation results in small fragments of DNA (Benecke, 1997a). When the average DNA fragment length is reduced to less than 300 base pairs this could lead to problems such as allelic dropout (complete loss of one allele in a heterozygous genotype), making interpretation of the STR profiles difficult or unsuccessful (Whitaker, 1995). In cases of severe DNA degradation there will be no profiles to interpret. Further, in samples containing only small amounts of intact DNA the STR profiles can become uninterpretable as they approach background levels (caused by low amounts of contamination) (Ladd et al, 1999). SNP (single nucleotide polymorphism) analysis of the human genome has been suggested as an alternative to analysis of STRs, since for this analysis smaller DNA fragments can be analysed than in analysis of STRs. However, to obtain a similar level of discrimination power as for a multiplex of 9-15 STR loci, 50-100 SNP loci would need to be analysed (Chakraborty et al, 1999; Gill et al, 2004). Large, balanced multiplexes, using less than 1 ng of genomic template, corresponding to ~150 cells,

(12)

will need to be developed for PCR amplification of these SNP loci (Gill, 2001). For the Y chromosome, SNP analysis of the non-recombinant part of the Y chromosome (NRY) is an alternative to STR analysis (Jobling, 2001; Brion et al, 2005). SNP analysis could be an alternative method to STR analysis for nuclear DNA in the future. For samples containing minute amounts of nuclear DNA or partially degraded DNA molecules, another DNA molecule than the nuclear DNA molecule can often be more successfully analysed – the mitochondrial DNA molecule (mtDNA) – as there are many more copies per cell as described below (Allen et al, 1998; Alonso et al, 2004). mtDNA analysis is a method that is currently in use in many forensic laboratories.

Mitochondrial DNA

Mitochondria and mtDNA

Mitochondria are a type of organelles that can be found in the cell cytoplasm of most eucaryotes. The mitochondria generate energy for cellular processes by producing ATP through oxidative reactions (in the β-oxidation, citric acid cycle and oxidative

phosphorylation pathways). Mitochondria have their own genome, mitochondrial DNA (mtDNA), which encodes a limited number of proteins and RNAs that are vital for

mitochondrial function (Figure 1). In each vertebrate cell there are 100-1,000 mitochondria, each containing 2-10 copies of mitochondrial DNA (Nass, 1969; Bogenhagen and Clayton, 1974). Among multicellular animals, mtDNA is generally a circular molecule (with few exceptions in Cnidaria) (Wolstenholme, 1992). The circular mtDNA is replicated via unidirectional synthesis from separate origins for the two strands.

The mtDNA molecule ranges in size from ~14,000 to ~42,000 bp in multicellular animals; it is generally 16,000-17,000 bp long in vertebrates (Wolstenholme, 1992, Nosek et al, 1998), which is small in comparison to nuclear DNA molecules. The two strands of mammalian mtDNA molecules differ in guanine and thymine content enough for them to be separated in alkaline cesium chloride gradients, and the two strands have been denoted the heavy (H) strand and the light (L) strand (Wolstenholme, 1992). The mtDNA includes about 37 genes in most vertebrates - 22 tRNAs, 2 rRNAs and 13 protein coding genes (Figure 1). However, the gene arrangements can differ among vertebrates (Wolstenholme, 1992). All 13 protein-coding

(13)

genes code for components in the respiratory chain complexes. ND1-ND4, ND4L, ND5 and ND6 are subunits of complex 1, cytochrome b is a subunit of complex 3, CO1-CO3 are subunits of complex 4 and ATPase 6 and ATPase 8 are subunits of complex 5 (Andersson et al, 1981; Chomyn et al, 1985; Chomyn et al, 1986). The nuclear genome codes for the remainder of the subunits in the respiratory chain complexes and it also codes for the other proteins needed for the structure and function of the mitochondrion (e.g. replication, transcription, and translation). These proteins are transferred to and imported into the mitochondrion from the cytoplasm. The 24 RNAs are needed for the translation of the protein-coding units within the mitochondrion (Andersson et al, 1981). There are generally few or no nucleotides between genes in vertebrate mtDNA. There is however a single

noncoding region in vertebrate DNA, that contains sequences that are needed for initiation of replication and transcription (Wolstenholme, 1992), known as the control region or the D-loop (Figure 1). D-LOOP 12S 16S ND1 ND2 COI COII ATP6 COIII ND4 ND5 ND6 CYT b OL HSP OH LSP ND4L ND3 ATP8 D-LOOP

Fig. 1 The figure represents a general vertebrate mtDNA molecule. The shaded segments represent the 22 tRNA genes. The 12S and 16 S rRNA genes and the 13 protein coding genes cytochrome b (Cyt b), ND1-ND4, ND4L, ND5, ND6, Cytochrome c oxidase subunits COI-COIII and ATPase subunits ATP6 and ATP8 are noted in the figure. OH and OL are the respective origins of H- and L-strand synthesis, arrows indicate the direction of synthesis. HSP respectively LSP are the promoters for transcripts copied from the H and L template strands, respectively.

(14)

The mitochondrial control region

The major reason for differences in size of the mtDNA molecule among multicellular animals is differences in the length of the control region. The length can vary both between

individuals and between different mtDNA molecules within an individual, the latter is called length heteroplasmy (Walberg and Clayton, 1981; Clayton, 1991; Kim et al, 1998; Fridez et al, 1999; Savolainen et al, 2000a,b). Length heteroplasmy is characteristic for several mammal mtDNAs. It is caused by a variable number of tandem repeats, often between Conservative Sequence Block 1 (CSB I) and CSB II. This kind of repeats is absent in humans, and size differences between human mtDNAs is therefore limited to a few bp, in short homopolymeric stretches. However, dog mtDNA contains an extremely heteroplasmic region consisting of imperfect 10 bp repeats. The length of this region can vary between 110 and 460 bp for different molecules within a single individual, and up to 85 mtDNA types have been recorded in one individual (Savolainen et al, 2000b). In humans the control region is ~ 1120 bp and in domestic dogs ~970 bp, not counting the tandem repeat region (Andersson et al, 1981; Kim et al, 1998). The control region is also known as the displacement-loop (D-loop). The replication of the H-strand often pauses after only a few hundred base pairs, generating a structure of a partially synthesised H-strand associated with the template and a third, displaced, single strand. The resulting structure is the D-loop. As the control region is a noncoding region, mutations are generally not as deleterious in this region as mutations in regions coding for proteins or RNAs. Therefore, mutations tend to accumulate at a higher rate than in the rest of the mitochondrial genome. Furthermore, apparent lack of mtDNA repair mechanisms and low fidelity of mtDNA polymerase has led to a higher mutation rate in mitochondrial genomes compared with nuclear genomes (Budowle et al, 2003). Within mtDNA, the control region has been the primary target for analysis of variation between individuals for forensic purposes. It has also been the primary target for the study of relatively recent evolutionary events, such as evolution among populations within species.

(15)

Analysis of human mtDNA

As previously mentioned, analysis of mitochondrial DNA is an alternative to analysis of nuclear DNA in forensic investigations when biological evidence lacks enough nuclear DNA for successful STR analysis, or when nuclear DNA is too fragmented due to degradation. The higher degree of success for mtDNA in these cases is due to a higher copy number of mtDNA in cells compared to nuclear DNA. There can be as many as 1,000 mitochondria in a somatic cell and each mitochondrion can contain 2-10 copies of mtDNA, in comparison with a single copy of each nuclear DNA molecule in the same cell (Nass, 1969; Bogenhagen and Clayton, 1974; Budowle et al, 2003). The likelihood of finding an intact copy of mitochondrial DNA in a small or severely degraded sample is therefore much greater than the likelihood of finding an intact copy of nuclear DNA.

The two routinely analysed mtDNA hypervariable regions (HVI and HVII) of the human control region are generally analysed by automated DNA sequencing (Hopgood et al, 1992; Sullivan et al, 1992; Allen et al, 1998; Seo et al, 2000, Alonso et al, 2002) using Sanger sequencing, and all positions in the sequenced blocks are inspected in an electropherogram. There are some characteristics of mitochondrial DNA that reduces the possibility to

discriminate between individuals in comparison with STR analysis of nuclear DNA. The mtDNA molecule is relatively small compared to nuclear DNA molecules, and thereby contains fewer positions that can mutate. Unlike nuclear DNA, mtDNA is maternally inherited, and is therefore identical for all maternal relatives and all siblings except for sporadic mutations. In nuclear DNA, recombination is a source of variation between individuals. The predominant view is that mitochondrial DNA molecules do not recombine (Hauswirth and Lapis, 1982; Piganeau and Eyre-Walker, 2004) but this is a continuously debated question (D’Aurelio et al, 2004). For these reasons, the result of an mtDNA analysis cannot be used to positively identify an individual. However, it can be used to exclude suspects, and in the case of an inclusion, it can be used as circumstantial evidence, in combination with other evidence.

The exclusion capacity, which is 1 minus match probability, for the two hypervariable regions in human mtDNA (HV1 and HVII ) is generally between 0.975 and 0.995 (Jobling and Gill, 2004). However, significance of a match is usually calculated based on the number of times a

(16)

specific sequence has been observed in a population database (Budowle et al, 2003; see also the database discussion under the heading Debates). Analysis of coding regions of mtDNA should be able to improve the exclusion capacity for mtDNA analysis (Parsons and Coble, 2001; Lutz-Bonengel et al; 2003; Vallone et al, 2004; Coble et al 2004) but it would not be practical to sequence and analyse all positions in the coding region in the same way. Methods for analysing point mutations, SNPs, in coding regions of mtDNA are being developed for forensic purposes, but are not yet routinely used (Andreasson et al, 2002; Brandstätter et al, 2003; Quintáns et al, 2004).

Although the mode of inheritance imitates the usefulness of mtDNA it can be an aid in some cases. For example, known maternal relatives of missing persons can provide reference samples for direct comparison with samples from recovered remains (Ginther et al, 1992; Holland et al, 1993). Mitochondrial DNA analysis has been a useful aid for human identification based on mtDNA in skeletal fragments, very old materials and severely decomposed or burned bodies (Ginther et al.1992; Sullivan et al, 1992; Holland et al, 1993; Gill et al, 1994; Seo et al, 2000). As previously mentioned, STR analysis is performed routinely on hair roots. However, hair shafts without a viable hair root are generally analysed by mtDNA analysis (Wilson et al, 1995; Pfeiffer et al, 1999; Alonso et al, 2002). The reason for this methodological difference will be explained in the following section.

DNA analysis of hair

Hair is a characteristic for one vertebrate class in particular, Mammalia, although hair cells are also present in other vertebrates, for example in lateral line organs of fishes and

amphibians where they detect movement relative to the environment (Campbell, 1996). The hairs that are found at most crime scenes are mostly shed hairs of human origin, or from pets (Menotti-Raymond et al, 1997; Savolainen et al, 1999). In Sweden, for example, ~40 % of all households have either cats or dogs (Manimalis, 2005). With ~9 million humans shedding ~100 scalp hairs per day, and ~1,6 million cats and ~1 million dogs shedding hairs it is not surprising that shed hairs are one of the most common biological evidence found at crime scenes. Experiments on transfer of animal hair during simulated criminal behaviour has shown that if somebody enters an apartment where a dog or a cat lives it is almost certain that

(17)

he will take some animal hairs with him when he leaves (D’Andrea et al, 1998). In cases of robbery, articles of clothing such as robbery caps are often found discarded in near vicinity of the crime and they can be a source of evidence material such as shed human hairs (Allen et al, 1998). Persistence of human scalp hair on different clothing fabrics have been analysed and speed of hair loss varied considerably between different fabrics (Dachs et al, 2003). Hairs tend to fall off more readily from materials such as polyester or cotton than from wool, indicating that the best chance of recovering hair from fabrics would be from articles of clothing, blankets, and other products made of wool. Formerly, the principal methods used in forensic hair analysis were morphological comparison and ABO grouping. Generally,

microscopical hair analysis is now primarily used prior to DNA analysis to investigate properties that can provide the examiner with details for determination of species but also details of pigmentation and artificial coloration are noted.

Hairs can be subdivided into two parts, root and shaft. Most of the DNA in a hair is located in the root and surrounding sheath cells (Hukkelhoven et al, 1981). As previously mentioned, STR analysis can often be successfully performed on hair roots. However, STR analysis of shed hairs is generally not successful (Vigilant, 1999; Alonso et al, 2004). Hairs should normally be treated as separate evidence materials and individual hairs must therefore be analysed separately. The reason that a single plucked hair often contain sufficient amount of intact nuclear DNA for successful STR analysis, when a single shed hair generally does not, can be traced back to changes that take place during the development of a hair. When a hair follicle is actively producing hair it is in an ~2-5 year long growth phase, the anagen phase, and the cells in and around the base of the follicle divide rapidly. The cells situated beside the growing point form the layers of the inner and outer root sheaths and the hair itself is formed of more centrally placed cells (Teerink, 1991). There is a high turnover of cells in the

developed hair bulb, and it is rich in nuclear DNA and mtDNA (Linch et al, 1998). When the cells are pushed further from the growing point, their nuclear bodies gradually become

smaller and formation of horn or keratin starts in the cell. The hair shaft gradually increases in length; it reaches the skin surface and pushes through. Numerous intact nuclei can be seen in the root bulb with the aid of a light microscopy during the growth phase. After the growth phase, the follicle enters the catagen phase, a transition period of about 2 to 4 weeks. During the catagen phase cell division stops. Programmed cell death (apoptosis) results in double strand cleavage of DNA in the root bulb. The follicle begins to degenerate, the bulb shrinks and the outer root sheath collapses. A club formation, a product of cell deletion and

(18)

keratinisation, is gradually developed above the hair bulb during transition from the catagen phase to the telogen phase. In the telogen phase, the hair follicle enters a quiescent period of about 2 to 4 months. The telogen club rises towards the skin surface as the root bulb and its adherent structure shrinks and descends into the skin. The telogen hair is subsequently shed (Linch et al, 1998). Although the majority (~ 80 %) of hairs on an adult human scalp is in the growth phase (Pertraco et al, 1988), ~95 % of all hairs encountered in casework are in the final stage, the telogen phase (Robertson, 1999). The molecules are to a large degree degraded in the dead cells of the hair, and a shed telogen hair is not associated with any viable tissue such as a root bulb. Therefore, the likelihood of finding enough intact nuclear DNA in a shed hair is smaller than for a plucked hair with still viable tissue at the root. For this reason, hairs are generally investigated with microscopic methods prior to DNA analysis in forensic cases and nuclear DNA analysis is often not even attempted unless the hair has a root

(www.skl.polisen.se). Instead, mtDNA analysis is performed, as the copy number is higher for mtDNA than nuclear DNA in cells in general.

Analysis of non-human DNA in forensic casework

As mentioned in the previous section, microscopy can be used to get an indication of the species origin of a hair found in association with a crime. For other evidence materials of unknown species origin other techniques are required. Currently used forensic methods for determining if a particular sample is of human or non-human origin are for example Ag-Ab immuno-diffusion techniques or DNA sequence analysis of Cytochrome b, 12S-RNA and 16S-RNA regions of the mitochondrial genome followed by BLAST search

(www.ncbi.nlm.nih.gov/BLAST) for species identification (Parson et al, 2000; Prieto et al, 2003). For animal evidence materials that contain enough nuclear DNA, additional analysis can be carried out by STR analysis to further investigate the origin of the DNA at the level of the individual. Analysis of non-human DNA for forensics can be a valuable aid not only in crime investigations involving a human perpetrator and a human victim but also in a wide range of other cases. Applications of non-human DNA analysis include investigations of illegal trade in endangered species or poaching (Guglich et al, 1994; Miller et al, 1995; Savolainen et al, 1999; Singh et al, 2004; Wetton et al, 2004), animal-attacks (Brauner et al, 2001; Eichmann et al, 2004), non-declared, wrongly declared or forbidden animal products in

(19)

human food and animal feed (Meyer et al, 1996; Dalmasso et al, 2004), traffic accidents (Schneider et al, 1999); fishing competition fraud (Primmer et al, 2000) and medical negligence such as HIV transmission (Ou et al, 1992). There are many cases where plant or animal DNA has been found in association with murder. For example, mosses found on a suspect and a crime scene were analysed by PCR-based DNA fingerprinting (random amplified polymorphic DNA, RAPD) in an attempt to tie a suspect to the scene of the crime (Korpelainen and Virtanen, 2003). Post-mortem intervals from one day to several weeks can be estimated by calculating the age of immature insect stages feeding on a corpse and noting the species present, but species identification can be difficult. In these cases, analysis of nuclear or mitochondrial DNA sequences can be an aid in species identification (Amendt, et al, 2004). As previously mentioned hairs originating from a perpetrator’s pet can be found in association with a crime. By analysing STR loci in domestic species, individual animals can be identified using the same methods as for human identification of individuals. Using 10 feline-specific STRs, cat hairs stuck on the lining of a bloodstained jacket found in the vicinity of the scene of a murder could be matched to hairs from the suspect’s pet cat (Menotti-Raymond et al, 1997). STRs have been developed also for identification of

individual domestic dogs and has been used in a number of cases (Müller et al, 1999; Shutler et al; 1999; Pádár et al, 2001a,b; Pádár et al, 2002; Eichmann et al, 2004;

www.questgen.biz/CV.htm) However, the same problems are encountered in analysis of shed animal hairs as for shed human hairs – when hairs are naturally shed they generally need to be analysed using mtDNA analysis methods. The cat hairs in the above mentioned case included a hair with a root, facilitating STR analysis. The minimal amount for nuclear DNA required for STR genotyping using the Stockmarks for Dogs Canine Genotyping Kit range from 0.35 to 1 ng nuclear DNA, and for a full profile approximately 5-10 dog hairs with roots are needed (Pfeiffer et al, 2004).

Analysis of domestic dog mtDNA for forensics has so far generally entailed sequencing of the control region with the exemption of the tandem repeat region. This region has been analysed for its potential use for forensic casework (Fridez et al, 1999; Savolainen et al, 2000a). Two tandemly repeated regions in mtDNA control region have also been analysed for the domestic cat (Fridez et al, 1999). However, although the regions were shown to be highly variable in length between individuals they also varied within different individuals (Fridez et al, 1999; Savolainen et al, 2000a). This variation is not currently utilised in analysis of domestic cat and dog mtDNA for forensic purposes, as high intra-individual variation causing mosaic

(20)

distribution of tandem repeat types would give inconsistent results in analysis of separate samples from the same individual. Also heteroplasmy in one or more base positions (point heteroplasmy) can be found in analysis of mtDNA. This will be dealt with in more detail in a following section, Debates, as there has been differences of opinions regarding interpretation issues when point heteroplasmy has been detected. Within the tandemly repeated region of domestic dog mtDNA, there are two different 10-bp repeat types, the different order of which results in different “repeat type sequences” in different molecules. These sequences could potentially be analysed to increase exclusion capacity, but cloning techniques for analysis of single molecules would be needed to make use of this information (Savolainen et al, 2000a).

The match probability that can be obtained by analysis of the domestic dog mtDNA CR (excluding the tandemly repeated region) has been investigated in a few dog populations (Savolainen et al, 1997; Wetton et al, 2003; Paper I; Paper II), and was for example between 0.90 and 0.95 for a 582 bp region in dogs from Sweden, the UK, China and Japan. This is lower than the exclusion capacity for approximately the same mtDNA region in human populations (between 0.975 and 0.995; Jobling and Gill, 2004). This is probably due to the domesticated dog having a more recent origin from its wolf ancestors (~15,000 yr. ago; Savolainen et al, 2002) than humans have from their common origin (~150,000 yr. ago; Ingman et al, 2000), giving much less time for variation to evolve among domestic dogs than among humans. Very few forensic cases involving analysis of domestic dog mtDNA have been published in scientific journals (Savolainen et al, 1999; Schneider et al; 1999) although there are other cases where the method was used (for example, see

www.questgen.biz/CV.htm). Six cases from Sweden (Savolainen et al, 1999) are addressed below to illustrate different situations where of mtDNA analysis of shed hairs from domestic dogs can be useful for forensic casework:

1. Dog hairs found in a suspect’s car. A female went missing and foul play was suspected. Dog hairs were discovered in the suspect’s car and as the female’s family had a dog the hairs were subjected to mtDNA analysis. None of the evidence hairs were of the same mtDNA CR type as the reference hairs from the family’s dog. However, the mtDNA CR types only differed in one position and the evidence hairs could therefore not be excluded as having the same origin as the reference hairs as point heteroplasmy must be taken into consideration in evaluation of the evidence.

(21)

2. Dog hairs found in stolen property. A medieval bible was stolen and later retrieved. Dog hairs were found in the book cover and an mtDNA CR sequence was retrieved from one of the hairs. The suspects had resided in two apartments were dogs were present. Reference hairs were collected from the dogs and the analysed mtDNA CR sequences were compared with that of the evidence hairs. One of the reference hairs matched the mtDNA sequence of the evidence hair and it could therefore not be excluded that those hairs were from the same dog individual.

3. Dog hairs found on suspect’s clothes. A series of bank robberies had been carried out by a group of men. A car was stolen in connection with the crime. The car belonged to a family who owned a dog. When the suspects were apprehended, dog hairs were found on one of the suspect’s clothes. The mtDNA CR was analysed and compared with the corresponding sequence in reference hairs collected from the family’s dog and the mtDNA types matched. The hairs could therefore not be excluded to have originated from the same dog.

4. Suspected poaching. Hairs were analysed that were found on items in association with a person suspected of poaching wolves, a protected species in Sweden. Since all Swedish wolves share a single mtDNA type, which has not been found among domestic dogs, it could be concluded that the evidence hairs, which had normal dog mtDNA types, most probably originated from domestic dogs.

5-6. Dog hairs found on, or in near vicinity, of victims. A young female was found raped and murdered in 1989 and dog hairs were retrieved from her body. Another female was found murdered a few months after the first body was found and dog hairs were found on objects connected with the crime. There were similarities between the two murders that indicated that the same individual murdered both females. The evidence hairs were analysed and were found to be of the same mtDNA CR type, a rare type found only so far in a group of closely related Scandinavian spitz breeds (Finnish Spitz, Jämthund,

Norwegian Elkhound). Reference hairs were obtained from dogs in association with seven suspects and the mtDNA types were compared with the type found in the evidence hairs. None of the sequences from the reference hairs matched that of the evidence hairs.

(22)

Since the article was published in 1999 (Savolainen et al) the British laboratory Forensic Science Service was able to secure a DNA profile from a small amount of sperm found in the body of the first murdered female. In 2004 a suspect was found to have a matching DNA profile and has been found guilty of kidnapping, rape and murder of the first victim. An STR profile matching the second victim has been found in bloodstains from a house where the suspect previously lived but the trial involving the second victim is pending as the court is awaiting additional test results.

Domestic dog populations

As mentioned in the section about human mtDNA analysis, significance of a match is usually calculated based on the number of times a specific sequence has been observed in a

population database. It is therefore important to have an understanding of how the reference population is composed. As mitochondrial DNA molecules are uniparentally inherited there will be some degree of population substructure. The domestic dog has a rather recent origin from its common ancestor. Analysis of mtDNA CR sequences shows that they probably originate from wolves domesticated in East Asia ~15,000 years ago (Savolainen et al, 2002). The dog mtDNA types are distributed into six phylogenetic groups, called clades A, B, C, D, E and F, indicating an origin from at least six female wolf lines. However, there is no clear division between morphological variants of dogs (such as greyhound or spitz) among these six phylogenetic groups of mtDNA sequences. This indicates that dog breeds are not the result of geographically distinct domestications of wolves. In modern dog breeds, artificial genetic barriers have been created between dogs by the practices of dog breeders. Therefore, even though all dogs share a relatively recent common origin there is a substantial genetic sub-structuring between different breeds within dog populations. There is consequently a very large difference in the composition of human populations and dog populations. It is probably more relevant to compare a Swedish Collie with a database of English Collies than a Swedish Collie with a database of Swedish German Shepherds. The creation of a population database for dogs is therefore much more complicated than creating one for humans. It is relevant to take into consideration that all major subgroups (breeds) should be well represented in a high quality mtDNA database for forensic investigations in the future. A database for domestic dog mtDNA sequences is under development that will include most of the published sequences (Pereira et al, 2004).

(23)

Debates

A thorough review of all questions that are under debate in regards to forensic casework would be too time- and space consuming for this dissertation, but some important topics are especially important and must be addressed.

Population databases

DNA profiles from biological evidence found at crime scenes are generally entered into a database where DNA profiles from suspects and persons convicted of a crime are also entered. When a search generates a hit between a forensic profile and a suspect/offender profile the statistical significance of a match is calculated. This calculation is based on a population database of the frequencies of alleles or DNA types in the population. The data in the population database must be of a high quality in order to get calculations of significance that are as correct as possible. There has been a debate about the reliability of population databases, and mtDNA databases in particular, since errors have been discovered among published human mtDNA sequences and among those entered in the population database used by the FBI (Bandelt et al, 2001; Forster, 2003; Dennis, 2003; Bandelt et al, 2004a, b;

Budowle et al, 2004; Yao et al, 2004; Budowle and Polanskey, 2005). Most errors are caused by clerical errors when interpreted data is manually transcribed. These clerical mistakes lead to errors such as base shifts from one position to another in the same or another table column, missed variations in regards to the reference sequence, base misscoring, and even artificial recombinants when two or more sequenced regions from different individuals are erroneously recombined (Bandelt et al, 2001; Bandelt et al, 2004; Parson et al, 2004). Another source of artificial recombination can be contamination. It has been suggested that phylogenetic

analysis of mtDNA data should be used as a tool to discover errors in mtDNA data before the data is entered in any population databases. However, it has been argued that although

correction of errors is advocated, the impact of errors on forensic estimates calculated using the FBI database is nominal before and after the errors are corrected (Budowle and Polanskey, 2005). In Europe, the European DNA Profiling (EDNAP) Group is developing a web-based mtDNA database (EMPOP) which will hold high quality mtDNA population data from diverse populations. Only international forensic DNA laboratories which have successfully participated in a number of different collaborative exercises will be supplying data to the database (Parson et al, 2004).

(24)

Contamination

The DNA analysed from biological evidence from a crime scene can have been transferred in a number of different ways, before, during or after the crime. The source of the DNA could be one or more innocent individuals that have been at the scene of the crime before the crime took place. The perpetrator can shed DNA during the crime. The crime scene investigators, pathologists, laboratory staff, plastic-ware and aerosols from other sample could have

contaminated the sample (Gill, 2002). The forensic laboratory Forensic Science Service (FSS) has recently described how they work to minimise contamination (Sullivan et al, in press, published online 2004 ahead of print in Forensic Sci. Int.). To reduce the risk of

contamination and to detect it when it occurs they have taken a number of preventive measures. These measures range from protective clothing, strict sample handling, extensive process automation, use of contamination-detection software and access to elimination databases that contain DNA profiles from police personnel, FSS staff and from contaminants of unknown source in negative controls and consumables. In spite of preventive measures, contamination was still detected in some cases. In a murder case where the victim’s

toothbrush was analysed a mixed profile was generated and the minor component was a male profile. As the profile matched those from four unsolved minor crimes committed 100’s of miles away contamination was taken into consideration. It was found that the profile yielded a match with profiles stored in the database of profiles from negative controls of processed casework samples – the same profile was linked to nine different occasions over a 3-year period and was also the contaminant of a batch of microfuge tubes. The manufacturer was contacted and one of his production line staff gave a full match with the contaminating profile. A further check showed that 20 of the staff had been the source of sporadic

contaminants in negative controls during a 3-year period, and 14 matched either partial or full profiles in casework samples. The FSS has now developed a way of effectively removing DNA from consumables and has applied for a patent. It will be interesting to find out what this approach entails.

Heteroplasmy

Heteroplasmy is the occurrence of more than one mtDNA type within the same individual. Length heteroplasmy in human mtDNA is commonly observed in relatively short segments of

(25)

sequence that are homopolymeric (i.e., C-streches) (Bär et al, 2000). Long tandemly repeated regions, as described for domestic cats and dogs, are not present in human mtDNA. Point heteroplasmy was previously thought to be rare in humans (Monnat and Loeb, 1985; Monnat and Reay, 1986; Budowle et al, 1990). A large number of cases of detected point

heteroplasmy in not only human mtDNA but also in other mammalian mtDNA has altered this view (for example, Hauswirth and Lapis, 1982; Holt et al, 1990; Koehler et al; 1991). As heteroplasmy can lead to different sequences in different tissues from a single individual, such as in different hairs, this is taken into consideration when sequences are interpreted (Budowle et al, 2003; Tully et al, 2004). When the mtDNA sequence from an evidence sample is compared with that of a reference sample and the sequences are identical, the conclusion will be that it can not be excluded that the samples originate from the same individual (or the same maternal lineage if this is investigated). The same conclusion will be drawn if both sequences are heteroplasmic in the same positions or if one sequence is homoplasmic and the other is heteroplasmic and they share the same bases. However, if both samples differ in one position although they are homoplasmic, there is not enough evidence for an inclusion or exclusion. Two or more differences will generally lead to exclusion (Budowle et al, 2003). Point

heteroplasmy can in some cases aid an investigation, as in the case of the putative remains of Tsar Nicholas II of Russia where the remains of the brother was shown to have the same type of point heteroplasmy (Gill et al, 1994; Ivanov et al, 1996). In some cases the detected heteroplasmy can be a false heteroplasmy caused by contamination. One such case is probably encountered in Grzybowski (2000a) where a very high level of heteroplasmy was reported and one individual was reported to have six heteroplasmic positions. A reanalysis lead to an erratum (Grzybowski, 2000b). Point heteroplasmy has been described in mtDNA from blood and hairs in one dog in a study of a dog population in the UK (Wetton et al, 2003). The individual was heteroplasmic in one position, having a deletion of A in ~50 % of the molecules in blood and a range of variation in 12 hairs from A in majority to almost complete deletion of A.

Present investigation

In this work, domestic dog mtDNA was analysed to increase the knowledge of the information available for forensic investigations. The methods used in these studies were amplification of DNA, using the PCR method, DNA sequence analysis and analysis of mtDNA types using a phylogenetic approach (Paper I, II). A minimum spanning network

(26)

shows the shortest evolutionary connections between sequences and can be used to illustrate patterns such as differences in the composition of sequence types between different

geographic regions (Paper I). For example, if a particular mtDNA type is very common in dogs from East Asia but has never been found in Europe and a new type is found which has its shortest connection to the Asian type, it is probable that the DNA originates from a dog of Asian evolutionary origin. Minimum spanning networks can also be used to aid the discovery of erroneous sequences, for example if two or more sequenced regions from different

individuals are erroneously recombined (Paper II and Debates).

A few Swedish cases have been described, in which analysis of mtDNA CR sequences from shed dog hairs have provided useful information for forensic investigations (Savolainen et al, 1999). In one of the cases it was found that the mtDNA was of a rare CR type previously found only in a group of Scandinavian spitzes. It would of course be valuable if one could pinpoint the breed-origin of a dog hair not only in this case, but also in general. An indication of the breed based on DNA is possible using STR analysis of nuclear DNA (Koskinen, 2003, Parker et al, 2004), but STR analysis works reliably only when the hair root is available. When the hair is shed naturally the root is generally no longer attached to the hair shaft. The high copy number of mtDNA, thousands of copies per cell (Nass, 1969; Bogenhagen and Clayton; 1974), makes mtDNA analysis a more reliable method than nuclear DNA analysis for materials containing only minute amounts of DNA or degraded DNA, such as shed hairs (Allen et al, 1998; Vigilant, 1999; Alonso et al, 2004). Breed - sequence correlations of the mtDNA CR have been analysed in some dog breeds prior to the investigation presented in Paper I (Okumura et al, 1996; Savolainen et al, 1997; Tsuda et al, 1997), showing limited correlation, but no large-scale studies have been performed previously. Furthermore, the match probability for domestic dog mtDNA analysis has only been investigated in a few minor studies (Savolainen et al, 1997; Wetton et al, 2003). Thus, there have not been any comprehensive studies of dog populations for forensic purposes. In order to find out the forensic informativity of domestic dog mtDNA, two different studies were performed (Paper I, II). In Paper I the mtDNA CR was analysed for 867 dogs to investigate exclusion

capacities for mtDNA analysis and the breed – sequence correlations. The exclusion capacities for a 573 bp sequence of the mitochondrial control region was between 0.90 and 0.95 for dogs in Sweden, the UK, Japan and China. This shows that the genetic variation of the dog mtDNA CR is high enough to be a useful tool in many investigations, especially for

(27)

for example that of human mtDNA, implies that an inclusive result normally has a limited value. In Paper I was furthermore investigated correlations not only between mtDNA type and breed, but also between mtDNA type and the geographic origin of the breed and the morphological type of dog. This type of information may be used as an indication of the breed and, with some degree of probability, to include or exclude certain breeds from being the source of evidence materials. This resembles, to a certain degree, the indications one can get in forensic studies of the mtDNA CR region of human DNA where population groups, such as Caucasians and Africans, are studied to get an indication of the most likely ethnical

background of the perpetrator of a crime (Budowle et al, 1999). The direct correlation

between mtDNA type and breed, type of dog, and geographical origin of breed was generally low, but in some cases certain mtDNA types were found in one breed only, and for wider groupings such as morphologically similar breeds, some mtDNA types were in many cases found in a distinct group of breeds, often originating from the same geographic region. For example, the only two relatively well-represented breeds having a single mtDNA-type were Rottweiler and Alaskan Malamute, both of which have probably experienced population bottlenecks. However, also less clear-cut correlations between mtDNA-type and breed or type of dog may be of value in many cases. If a genetic type is found in several different breeds, but is especially common in a certain breed and found at a low frequency in the other breeds, it is probable that a hair with this mtDNA-type derive from a dog of the most common breed, and if the type in question has not been found in dogs of another well-sampled breed, it is not probable that the hair derives from a dog of that breed. Thus, although there was more than one mtDNA type among the German Shepherds, which is the most popular breed in Sweden, it is valuable to know that type sA19 is particularly frequent among German Shepherds and was not found in breeds of Scandinavian or British origin. For a number of groups of breeds, particular mtDNA types were characteristic, such as type sA31 which was found only in a group of sled dogs of North American/Greenland origin and the phylogenetic cluster of four mtDNA types, sD1-sD4, in spitzes of Scandinavian breed origin. Furthermore, some mtDNA CR types were particularly frequent in certain breed groups although they were not exclusive for the group, such as mtDNA type B2 that was found in a large proportion of the

sighthounds. It was found also in a few Middle-Eastern dogs of other morphological types, but not in European sighthound breeds. Thus, B2 appears to be characteristic both of dogs from the Middle Eastern region and for Middle Eastern sighthounds, and the absence of B2 in the European sighthound breeds could reflect a bottleneck in the forming of these breeds, or possibly a separate origin for the European sighthounds. Using phylogenetic analysis of the

(28)

data, some larger geographic patterns could also be suggested. Comparing Southern and Northern Europe, one subcluster of clade A, which was represented at a large frequency in the North, was absent among South European dogs. Furthermore, comparing East Asian and European breeds, large parts of phylogenetic clade A was absent for European breeds. From the European view it is especially interesting to see that a number of dogs of East Asian breeds, sampled in Europe, had types grouping with these parts of clade A. If a hair with one of these types are found at a European crime scene it can with some certainty be thought to derive from a dog of an East Asian rather than a European breed.

Since the discrimination power for analysis of the dog mtDNA CR is relatively small, giving a limited value for inclusive results, it was in Paper II investigated to what degree analysis of a larger part o the mitochondrial genome can increase the exclusion capacity. In Paper I the majority of the sequences were obtained from other investigations and not collected for forensic purposes, giving for example overrepresentation of rare breeds (Okumura et al, 1996; Tsuda et al, 1997; Savolainen et al, 2002). Therefore, in the study of 100 dogs described in Paper II, the dogs were sampled to represent dog breeds in accordance with their frequency in the Swedish population. Many mtDNA CR types were relatively rare in the sample of 867 dogs from around the world (Paper I), but as many as 51 % of the sampled dogs had one of the eight most common CR types. In the second investigation (Paper II) of breeds in the Swedish population, 60 % of the 100 studied individuals had one of these common CR types. By analysing three additional regions totalling ~3,000 bp of coding part of the mitochondrial genome (Paper II), six of these common types were divided into subtypes and the total number of mtDNA types among the 100 dogs was increased from 32 to 55. With the analysis of the three coding regions the value of inclusions increased. This is exemplified by mtDNA-type A11, the most frequent CR mtDNA-type. While an inclusion based on A11, which had a

frequency of 17 % in this data set, has little value, an inclusion based on one of the seven subtypes of A11 would be a useful piece of circumstantial evidence, especially for the five subtypes having frequencies of 1 or 2 %.Analysis of the three coding regions increased the overall discrimination power considerably, compared to analysis of the CR only. The exclusion capacity was increased from 0.9198 for the CR to 0.9638 for all four regions, indicating that the match probability (the chance of two unrelated individuals sharing the same mtDNA type) can be halved by the extended analysis. In some cases, dogs of a specific breed that shared the same mtDNA CR type could be distinguished as the analysis of coding

(29)

same CR type could in some cases be distinguished from each other as they were separated into different mtDNA types by analysis of the coding regions. Of particular interest for Swedish forensic investigations is that one CR region type that is commonly found in

Scandinavian spitzes was divided into three different mtDNA type variants. The mtDNA CR type is the one found in dog hairs in the forensic cases 5-6 discussed in the section Analysis of non-human DNA in forensic casework. A reanalysis of remaining samples from the two cases in question could show if the dog hairs have the same mtDNA variant. This could further strengthen the link between the two cases.

Conclusions

Methods are continuously being developed to aid identification of individuals in forensic investigations. In the two papers in this thesis we have laid a foundation to further studies by showing the level of forensic informativity of domestic dog mtDNA sequences. In the first paper, CR sequences were analysed and the exclusion capacity was investigated for a number of different populations (Paper I). This is also the first comprehensive study of the correlation between mtDNA CR type and breed, type, and geographic origin of domestic dogs. Since the exclusion capacity for analysis of domestic dog CR sequences is relatively low, it was

investigated in the second paper how much the exclusion capacity for analysis of dog mtDNA is improved by analysis of 3,000 bp of coding sequence (Paper II). The increase from 0.9198 for the CR to 0.9638 for all four regions, and the subdivision of mtDNA CR types, imply a considerable increase in the value of inclusions. To achieve a comprehensive picture of the correlation between mtDNA types and breeds and more reliable statistical analyses, and to make full use of this forensic method, further large-scale studies of more geographical regions and breeds, with samples collected in a more strictly random fashion, are necessary. Large databases are needed to allow that population substructure due to artificial genetic barriers between dog breeds is fully accommodated. This study will hopefully work as a basis for future development of analysis of dog mtDNA for forensic purposes.

(30)

Acknowledgements

I would like to thank my supervisors Peter Savolainen and Joakim Lundeberg, and of course also Mattias Uhlén, for giving me the opportunity to carry out the studies in this thesis. I have really learnt a lot of new things during the 2+ years at AlbaNova, and that’s what science is all about really. I especially would like to thank Peter for spending a lot of time these last days so that the thesis could be given to the printers on time.

I also want to thank the other co-authors of the two papers for good cooperation. Thank you Mattias Oskarsson for working so fast and hard during the time we analysed the sequences for Paper II.

Thank you Danko for being such a good neighbour next to me in the Rookie room, and all the other people at the department for being so nice to work with.

I especially want to thank my mother, my sister, my brother and my good friends for being so understanding under stressful times. Thank you also Marie for all help with dog information, and blood and hair samples from The Best Dog in the World – Springmist´s Carmen.

(31)

References

Allen M, Saldeen T, Gyllensten U. (1995) Allele-specific HLA-DRB1 amplification of forensic evidence samples with mixed genotypes. Biotechniques 19(3):454-63.

Allen M, Engstrom AS, Meyers S, Handt O, Saldeen T, von Haeseler A, Paabo S, Gyllensten U. (1998) Mitochondrial DNA sequencing of shed hairs and saliva on robbery caps:

sensitivity and matching probabilities. J Forensic Sci. 43(3):453-64.

Alonso A, Salas A, Albarran C, Arroyo E, Castro A, Crespillo M, di Lonardo AM, Lareu MV, Cubria CL, Soto ML, Lorente JA, Semper MM, Palacio A, Paredes M, Pereira L, Lezaun AP, Brito JP, Sala A, Vide MC, Whittle M, Yunis JJ, Gomez J. (2002) Results of the 1999-2000 collaborative exercise and proficiency testing program on mitochondrial DNA of the GEP-ISFG: an inter-laboratory study of the observed variability in the heteroplasmy level of hair from the same donor. Forensic Sci Int. 125(1):1-7

Alonso A, Martin P, Albarran C, Garcia P, Garcia O, de Simon LF, Garcia-Hirschfeld J, Sancho M, de La Rua C, Fernandez-Piqueras J.(2004) Real-time PCR designs to estimate nuclear and mitochondrial DNA copy number in forensic and ancient studies. Forensic Sci Int. 139(2-3):141-9.

Amendt J, Krettek R, Zehner R. (2004) Forensic entomology. Naturwissenschaften. 91(2):51-65.

Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG (1981). Sequence and organization of the human mitochondrial genome. Nature 290: 457-465. Andreasson H, Asp A, Alderborn A, Gyllensten U, Allen M. (2002) Mitochondrial sequence analysis for forensic identification using pyrosequencing technology. Biotechniques.

32(1):124-6, 128, 130-3.

Balogh MK, Burger J, Bender K, Schneider PM, Alt KW. (2003) STR genotyping and mtDNA sequencing of latent fingerprint on paper.Forensic Sci Int. 137(2-3):188-95.

Bandelt HJ, Lahermo P, Richards M, Macaulay V. (2001) Detecting errors in mtDNA data by phylogenetic analysis. Int J Legal Med. 115(2):64-9.

Bandelt HJ, Salas A, Bravi C. (2004a) Problems in FBI mtDNA database. Science. 305(5689):1402-4.

Bandelt HJ, Salas A, Lutz-Bonengel S. (2004b) Artificial recombination in forensic mtDNA population databases. Int J Legal Med. 118(5):267-73.

Barbaro A, Cormaci P, Teatino A, La Marca A, Barbaro A. (2004) Anonymous letters? DNA and fingerprints technologies combined to solve a case. Forensic Sci Int. 146 Suppl:S133-4. Benecke M. (1997a) DNA typing in forensic medicine and in criminal investigations: a current survey. Naturwissenschaften 84, 181-188.

(32)

Benecke M, Schmitt C, Staak M. (1997b) In: Proceedings of the first European Symposium on Human Identification, Toulouse, 29-31 May 1996. Madison: Promega Corporation 1997. 148.

Bogenhagen D, Clayton DA. (1974) The number of mitochondrial deoxyribonucleic acid genomes in mouse L and human HeLa cells. Quantitative isolation of mitochondrial deoxyribonucleic acid. J Biol Chem 249: 7991-7995.

Brandstätter A, Parsons TJ, Parson W. (2003) Rapid screening of mtDNA coding region SNPs for the identification of west European Caucasian haplogroups. Int J Legal Med. 117(5):291-8.

Brauner P, Reshef A, Gorski A. (2001) DNA profiling of trace evidence--mitigating evidence in a dog biting case. J Forensic Sci. 46(5):1232-4.

Budowle B, Adams DE, Comey CC, Meril CR. (1990) Mitochondrial DNA: a possible genetic material suitable for forensic analysis. In Advances in Forensic Science ed.H Lee, RE Gaensslen, 76-97. Chigago, IL: Year Book Med. Publ. 278 pp.

Budowle B, Wilson MR, DiZinno JA, Stauffer C, Fasano MA, Holland MM, Monson KL. (1999 ) Mitochondrial DNA regions HVI and HVII population data. Forensic Sci Int. 103(1):23-35.

Budowle B, Allard MW, Wilson MR, Chakraborty R. (2003) Forensics and mitochondrial DNA: applications, debates, and foundations. Annu Rev Genomics Hum Genet. 4:119-41. Budowle B, Polanskey D, Allard MW, Chakraborty R. (2004) Addressing the use of

phylogenetics for identification of sequences in error in the SWGDAM mitochondrial DNA database. J Forensic Sci. 49(6):1256-61.

Budowle B, Polanskey D. (2005) FBI mtDNA database: a cogent perspective. Science. 307(5711):845-7.

Bär W, Brinkmann B, Budowle B, Carracedo A, Gill P, Holland M, Lincoln PJ, Mayr W, Morling N, Olaisen B, Schneider PM, Tully G, Wilson M. (2000) DNA Commission of the International Society for Forensic Genetics: guidelines for mitochondrial DNA typing. Int J Legal Med. 113(4):193-6.

Campbell, N A (1996). Biology. 4th ed. Benjamin/Cummings Publ. ISBN 0-8053-1957-3. Chakraborty R, Stivers DN, Su B, Zhong Y, Budowle B. (1999) The utility of short tandem repeat loci beyond human identification: implications for development of new DNA typing systems. Electrophoresis. 20(8):1682-96.

Chomyn A, Mariottini P, Cleeter MW, Ragan CI, Matsuno-Yagi A, Hatefi Y, Doolittle RF, Attardi G. (1985) Six unidentified reading frames of human mitochondrial DNA encode components of the respiratory-chain NADH dehydrogenase.

Nature. 314(6012):592-7

Chomyn A, Cleeter MW, Ragan CI, Riley M, Doolittle RF, Attardi G. (1986) URF6, last unidentified reading frame of human mtDNA, codes for an NADH dehydrogenase subunit.

(33)

Science. 234(4776):614-8.

Clayton DA. (1991) Nuclear gadgets in mitochondrial DNA replication and transcription. Trends Biochem Sci. 16(3):107-11

Clayton TM, Whitaker JP, Maguire CN. (1995a) Identification of bodies from the scene of a mass disaster using DNA amplification of short tandem repeat (STR) loci. Forensic Sci Int. 76(1):7-15.

Clayton TM, Whitaker JP, Fisher DL, Lee DA, Holland MM, Weedn VW, Maguire CN, DiZinno JA, Kimpton CP, Gill P. (1995b) Further validation of a quadruplex STR DNA typing system: a collaborative effort to identify victims of a mass disaster. Forensic Sci Int. 76(1):17-25.

Coble MD, Just RS, O'Callaghan JE, Letmanyi IH, Peterson CT, Irwin JA, Parsons TJ. (2004) Single nucleotide polymorphisms over the entire mtDNA genome that increase the power of forensic testing in Caucasians. Int J Legal Med. 118(3):137-46.

Csete K, Beer Z, Varga T. (2005) Prenatal and newborn paternity testing with DNA analysis. Forensic Sci Int. 147 Suppl:S57-60.

Dachs J, McNaught IJ, Robertson J. (2003 ) The persistence of human scalp hair on clothing fabrics. Forensic Sci Int. 138(1-3):27-36.

Dalmasso A, Fontanella E, Piatti P, Civera T, Rosati S, Bottero MT. (2004) A multiplex PCR assay for the identification of animal species in feedstuffs. Mol Cell Probes. 18(2):81-7. Daniels DL, Hall AM, Ballantyne J. (2004) SWGDAM developmental validation of a 19-locus Y-STR system for forensic casework. J Forensic Sci. 49(4):668-83.

da Silva DA, Goes AC, de Carvalho JJ, de Carvalho EF. (2004) DNA typing from vaginal smear slides in suspected rape cases. Sao Paulo Med J. 122(2):70-2.

D’Andrea F, Fridez F, Coqouz R. (1998). Preliminary experiments on the transfer of animal hair during simulated criminal behaviour. J Forensic Sci. 43:1257-1258.

D'Aurelio M, Gajewski CD, Lin MT, Mauck WM, Shao LZ, Lenaz G, Moraes CT, Manfredi G. (2004) Heterologous mitochondrial DNA recombination in human cells.

Hum Mol Genet. 13(24):3171-9.

Dennis C. (2003) Error reports threaten to unravel databases of mitochondrial DNA. Nature 421(6925):773-4.

Eichmann C, Berger B, Reinhold M, Lutz M, Parson W. (2004) Canine-specific STR typing of saliva traces on dog bite wounds. Int J Legal Med. 118(6):337-42.

Edwards A, Civitello A, Hammond HA, Caskey CT. (1991) DNA typing and genetic mapping with trimeric and tetrameric tandem repeats. Am J Hum Genet. 49, 746-756.

(34)

Findlay I, Taylor A, Quirke P, Frazier R, Urquhart A. (1997) DNA fingerprinting from single cells. Nature 389(6651):555-6.

Forster PM. To error is human. (2003) Ann Hum Genet. 67:2-4

Fregeau CJ, Bowen KL, Fourney RM. (1999) Validation of highly polymorphic fluorescent multiplex short tandem repeat systems using two generations of DNA sequencers.

J Forensic Sci. 44(1):133-66.

Fridez F, Rochat S, Coquoz R. (1999) Individual identification of cats and dogs using mitochondrial DNA tandem repeats? Sci Justice. 39(3):167-71

Gill P, Jeffreys AJ, Werrett DJ (1985) Forensic application of DNA “fingerprints”. Nature 318, 577-579.

Gill P, Ivanov PL, Kimpton C, Piercy R, Benson N, Tully G, Evett I, Hagelberg E, Sullivan K. (1994) Identification of the remains of the Romanov family by DNA analysis. Nat Genet. 6(2):130-5.

Gill P. (2001) An assessment of the utility of single nucleotide polymorphisms (SNPs) for forensic purposes.Int J Legal Med. 114(4-5):204-10.

Gill P. (2002) Role of short tandem repeat DNA in forensic casework in the UK – past, present, and future perspectives. BioTechniques 32 (2), 366-385.

Gill P, Werrett DJ, Budowle B, Guerrieri R. (2004) An assessment of whether SNPs will replace STRs in national DNA databases--joint considerations of the DNA working group of the European Network of Forensic Science Institutes (ENFSI) and the Scientific Working Group on DNA Analysis Methods (SWGDAM).Sci Justice. 44(1):51-3.

Ginther C, Issel-Tarver L, King MC. (1992) Identifying individuals by sequencing mitochondrial DNA from teeth. Nat Genet. 2(2):135-8.

Grzybowski T. (2000) Extremely high levels of human mitochondrial DNA heteroplasmy in single hair roots. Electrophoresis. 21(3):548-53.

Grzybowski T, Malyarchuk BA, Czarny J, Miscicka-Sliwka D, Kotzbach R. (2003) High levels of mitochondrial DNA heteroplasmy in single hair roots: reanalysis and revision. Electrophoresis. 24(7-8):1159-65.

Guglich EA, Wilson PJ, White BN. (1994) Forensic application of repetitive DNA markers to the species identification of animal tissues. J Forensic Sci. 39(2):353-61.

Hauswirth WW, Laipis PJ. (1982) Mitochondrial DNA polymorphism in a maternal lineage of Holstein cows. Proc Natl Acad Sci U S A. 79(15):4686-90

Helmuth R, Fildes N, Blake E, Luce MC, Chimera J, Madej R, Gorodezky C, Stoneking M, Schmill N, Klitz W, et al. (1990) HLA-DQ alpha allele and genotype frequencies in various

References

Related documents

• The genotype distributions of CYP2D6 and CYP2C19 have been studied in three different autopsy materials: fatal intoxication cases, suicide cases (intoxications excluded)

In active magnetic regenerative cycle, the magnetic material serves not only as a refrigerant providing temperature change as a result of adiabatic magnetization

The increase in resistance (resulting in lower current) is in agreement with the estimation that amphetamine would behave as an electron donor and due to opposite doping

The research focuses on road transports by truck into Sweden from Norway, concentrating on two commodity groups, chemicals and metals, in the analysis of redistribution of flows

Following this simulation experiment people can analysis the sunlight how to affect the solar cells, when we research the sunlight power conversion on the solar array it

only incidences of domestic water cooperation initiated by the state, but such events can occur between the government and non-state actors (e.g. grassroots organizations, firms,

The routinely used technologies in forensic DNA typing (both for nuclear and mtDNA analysis) are based on fluorescence detection using CE 52-55,103.. As previously discussed,

Files that are interesting to Forensic investigators and that could serve as potential evidence are Contacts, Messages, Pictures, Videos, Internet History, Location data, Social